Data Libraries: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
Line 37: | Line 37: | ||
*[http://numbrary.com/ Numbrary] Numbrary is a free online service dedicated to finding, using and sharing numbers on the web. | *[http://numbrary.com/ Numbrary] Numbrary is a free online service dedicated to finding, using and sharing numbers on the web. | ||
*[http://infochimps.org/ Infochimps] Free Redistributable Rich Data Sets | *[http://infochimps.org/ Infochimps] Free Redistributable Rich Data Sets | ||
*[http://dss.ucar.edu/catalogs/free.html] CISL Research Data Archive (RDA) - large and diverse collection of meteorological and oceanographic observations, operational and reanalysis model outputs, and remote sensing datasets | *[http://dss.ucar.edu/catalogs/free.html CISL Research Data Archive] CISL Research Data Archive (RDA) - large and diverse collection of meteorological and oceanographic observations, operational and reanalysis model outputs, and remote sensing datasets | ||
*[http://search.dss.ucar.edu/cgi-bin/rdabrowse?nb=true&c=list&cv=All+RDA+Datasets] CISL Research Data Archive (RDA) | *[http://search.dss.ucar.edu/cgi-bin/rdabrowse?nb=true&c=list&cv=All+RDA+Datasets CISL Research Data Archive] CISL Research Data Archive (RDA) dataset selection according to variables, time resolution, etc. | ||
== InfoVis Contest Datasets == | == InfoVis Contest Datasets == |
Revision as of 10:21, 24 February 2009
- UCI Machine Learning Repository
- UCI KDD Archive UCI Knowledge Discovery in Databases Archive
- UW XML Repository University of Washington XML Data Repository
- StatLib Data, Software and News from the Statistics Community
- Time Series Analysis and Its Applications Book Example Data
- Time Series Data Library
- TimeWeb Web-based time series databanks
- PKDD 2000 Challenge Challenges of European Conferences on Principles and Practice of Knowledge Discovery in Databases
- PhysioNet the research resource for complex physiologic signals
- MatrixMarket A visual repository of test data for use in comparative studies of algorithms for numerical linear algebra, featuring nearly 500 sparse matrices from a variety of applications, as well as matrix generation tools and services.
- LifeLine project Visualising Migrations, Transitions and Trajectories
- 10x10 Every hour, 10x10 gathers the 100 most important words and pictures in the world, based on what's happening in the news.
- InfoVis Cyberinfrastructure Data Bases
- Enron Email Dataset
- Historical Stock Data Historical Data for S&P 500 Stocks
- Ensembl Provides sequence databases of gene, transcript and protein predictions.
- EPA air quality AirData Web site gives you access to air pollution data for the entire United States.
- Network intrusion dataset
- Internet Backbone Data Internet Mapping Project
- UCR Time Series Data Mining Archive A resource for researchers interested in the clustering, classification, indexing, segmentation, change point detection and rule extraction of time series. (by Eamonn Keogh)
- Deutsche Bundesbank Zeitreihen
- Source data sources of the Worldmapper project (Worldmapper project)
- Worldbank Development Data & Statistics
- Business Intelligence Network 2006 Data Visualization Competition (Excel spreadsheet)
- Timeseries by Eamonn Keogh et al.
- U.S. Department of Labor Bureau of Labor Statistics
- National Atlas
- UK Air Quality Archive Air Quality data in the UK from the present back to 1960
- Mitsubishi Electric Research Labs (MERL) Motion sensor data from a network of over 200 sensors for a year (WMD 2007).
- European Road Safety Observatory Road Safety Data
- CARE Community database on road accidents resulting in death or injury (EU)
- IRTAD international database that gathers data on traffic and road accidents from 28 out of the 30 OECD Member countries
- Trends Online A Compendium of Data on Global Change
- Finder! Finder is a browser-based application for finding, organizing and sharing GeoData in common formats.
- ICWSM 2009 Data Challenge data set containing 44 million blog posts; suitable for link analysis, social network extraction, analysis of influence among bloggers, ...
- CKAN Comprehensive Knowledge Archive Network
- Numbrary Numbrary is a free online service dedicated to finding, using and sharing numbers on the web.
- Infochimps Free Redistributable Rich Data Sets
- CISL Research Data Archive CISL Research Data Archive (RDA) - large and diverse collection of meteorological and oceanographic observations, operational and reanalysis model outputs, and remote sensing datasets
- CISL Research Data Archive CISL Research Data Archive (RDA) dataset selection according to variables, time resolution, etc.
InfoVis Contest Datasets
- InfoVis 2003 Contest Tree data
- InfoVis 2004 Contest Meta Data of publications
- InfoVis 2005 Contest Technology Trends in the United States
- InfoVis 2006 Contest 1% public use microdata sample from the 2002 Census
- InfoVis 2007 Contest Movie Database