www.dmmlg.uh.edu
November 22, 2009
Dataset Repository

If your work uses data sets from this repository, please acknowledge the repository so that others can obtain the same data sets and replicate your experimental results.

Title: Wyoming Datasets
Description: 2D spatial datasets. Four Wyoming datasets based on income, age, race, and poverty status.
Source: United States Census Bureau

Title: Complex9, Complex8, Diamond9
Description: 2D spatial datasets. Three 2D datasets with x, y coordinates and a class label.
Source: Salvador, S. and Chan, P., "Determining the Number of Clusters/Segments in Hierarchical Clustering/Segmentation Algorithms", ICTAI 2004, 576-584.
Download: Complex&Diamond.zip

Title: Complex 9 with noise
Description: Complex 9 data with different percentages of noise.
Source: The UH-DMML Group created the datasets based on Complex 9.
Download: Complex9_noise.zip

Title: Earthquake data, Volcanoes data, and Population data
Description: 2D spatial datasets of earthquakes, volcanoes, and population.
Source: Geosciences Department of the University of Houston
Download: Geosciences Data.zip

Title: Earthquake data with depth as class label
Description: Modified version of the earthquake dataset; the class label is defined by the depth of the epicenter of each earthquake at its location.
Source: The UH-DMML Group created the datasets based on the earthquake data.
Download: Earthquake_depth.zip

Title: Arsenic datasets of Texas water supply
Description: 2D spatial datasets of water wells in Texas (longitude, latitude, arsenic).
Source: The UH-DMML Group created the datasets based on information retrieved from the Texas Ground Water Database in March 2006.
Download:
  1. arsenic_nitrate_fluoride_wellDepth.arff (all numeric attributes)
  2. arsenic
  3. arsenic, nitrate, fluoride, vanadium, iron, molybdenum, and selenium complete dataset
  4. arsenic, nitrate, and fluoride complete dataset
  5. arsenic in nominal (extract from 2)
  6. arsenic in numeric (extract from 2)

Title: Earthquake data set
Description: The data set covers global earthquakes from 01/01/1976 to 12/31/2002.
Source: U.S. Geological Survey Earthquake Hazards Program, http://earthquake.usgs.gov/
Download: click here for dataset files

Title: Oval10 Data set
Download: click here for dataset files

Download: volcanoes_point.csv

Title: Cougar^2
Description: An open source library for data mining and machine learning. It features test-first development and promotes more open development standards.
Source: The Data Mining and Machine Learning Group
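
Several of the 2D datasets listed above are described as x, y coordinates with a class label. As a rough illustration of how such data might be read, the following Python sketch groups points by label. It assumes a comma-separated layout with one "x,y,class" row per point, which this page does not confirm; the function name load_labeled_points and the three sample rows are hypothetical and are not taken from any file in the repository.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample in an ASSUMED "x,y,class" comma-separated layout.
# These rows are made up for illustration only.
SAMPLE = """\
1.0,2.0,0
1.5,2.5,0
10.0,12.0,1
"""


def load_labeled_points(stream):
    """Read rows of x, y, class label and group the (x, y) points by label."""
    groups = defaultdict(list)
    for row in csv.reader(stream):
        if len(row) < 3:
            continue  # skip blank or short rows
        try:
            x, y = float(row[0]), float(row[1])
        except ValueError:
            continue  # skip header lines or rows with non-numeric coordinates
        groups[row[2].strip()].append((x, y))
    return dict(groups)


groups = load_labeled_points(io.StringIO(SAMPLE))
for label, points in sorted(groups.items()):
    print(label, len(points))
```

To read an actual file, open it and pass the file object in place of the io.StringIO wrapper; if the real files use a different delimiter, the delimiter argument of csv.reader would need to change accordingly.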
 
573 Philip G Hoffman Hall, University of Houston, 4800 Calhoun Road, Houston, Texas 77204
Telephone 713-743-3361 | FAX 713-743-3376