From 3ac95867c623390a4fdafdc11c0b0fd3cd7abc54 Mon Sep 17 00:00:00 2001 From: Alexandre Pinto Date: Tue, 6 Jan 2015 00:28:31 +0000 Subject: [PATCH 01/99] New image processing data sets --- README.rst | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.rst b/README.rst index d4d2ce8..120f212 100644 --- a/README.rst +++ b/README.rst @@ -195,6 +195,11 @@ Image Processing * `2GB of photos of cats `_ * `Face Recognition Benchmark `_ * `ImageNet `_ +* `SUN database `_ +* `10k US Adult Faces Database `_ +* `Affective Image Classification `_ +* `International Affective Picture System `_ +* `Massive Visual Memory Stimuli `_ Machine Learning From 0f850530464e6cae74c75375abeba21280d6e193 Mon Sep 17 00:00:00 2001 From: David Dao Date: Fri, 18 Mar 2016 09:36:16 -0400 Subject: [PATCH 02/99] Adding Broad Bioimage Benchmark Collection (BBBC) The Broad Bioimage Benchmark Collection (BBBC) is a large curated collection of published data sets in bio imaging. It includes all the images, metadata and ground truths. The BBBC resource is described in the following publication: Ljosa V, Sokolnicki KL, Carpenter AE (2012). Annotated high-throughput microscopy image sets for validation. Nature Methods 9(7):637 / doi. PMID: 22743765 PMCID: PMC3627348. Available at http://dx.doi.org/10.1038/nmeth.2083 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index dee2bf2..0aa0cdd 100755 --- a/README.rst +++ b/README.rst @@ -27,6 +27,7 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* `Broad Bioimage Benchmark Collection (BBBC) `_ * `Cell Image Library `_ * `Collaborative Research in Computational Neuroscience (CRCNS) `_ * `Complete Genomics Public Data `_ From 8a09814e7778b54bb1ea5ed70e9c2fca242c6143 Mon Sep 17 00:00:00 2001 From: Xiaming Date: Fri, 15 Apr 2016 14:02:08 +0800 Subject: [PATCH 03/99] Add EMPIAR to bio. 
cat #215 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 0aa0cdd..d63864e 100755 --- a/README.rst +++ b/README.rst @@ -33,6 +33,7 @@ Biology * `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ * `EBI Protein Data Bank in Europe `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ * `ENCODE project `_ * `Ensembl Genomes `_ * `Gene Expression Omnibus (GEO) `_ From b59f3bbb6503e9bfca3d2611a8cd512bcc3e320f Mon Sep 17 00:00:00 2001 From: Pierre Fenoll Date: Tue, 26 Apr 2016 20:54:35 +0200 Subject: [PATCH 04/99] Add NYSE --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index d63864e..11748fc 100755 --- a/README.rst +++ b/README.rst @@ -208,6 +208,7 @@ Finance * `Quandl `_ * `St Louis Federal `_ * `Yahoo Finance `_ +* `NYSE Market Data `_ Geology From 4400bf5a80b1b81e06acfcdbdf6fdac4c5e2dd05 Mon Sep 17 00:00:00 2001 From: Jack Kelly Date: Wed, 8 Jun 2016 13:19:18 +0100 Subject: [PATCH 05/99] Update README.rst Adding more Energy datasets. And fixing capitalisation for UK-DALE and PLAID --- README.rst | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 11748fc..247a1e0 100755 --- a/README.rst +++ b/README.rst @@ -187,13 +187,18 @@ Energy * `BLUEd `_ * `COMBED `_ * `Dataport `_ +* `DRED `_ * `ECO `_ * `EIA `_ +* `HES `_ - Household Electricity Study, UK * `HFED `_ * `iAWE `_ -* `Plaid `_ +* `PLAID `_ - the Plug Load Appliance Identification Dataset * `REDD `_ -* `UK-Dale `_ +* `Tracebase `_ +* `UK-DALE `_ - UK Domestic Appliance-Level Electricity +* `WHITED `_ + Finance From 2f40e980d27a8ced2274bdbb2244f25d026b9fe2 Mon Sep 17 00:00:00 2001 From: John Pellman Date: Thu, 23 Jun 2016 05:24:21 -0400 Subject: [PATCH 06/99] Added Brain Catalogue. 
--- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc..f337771 100755 --- a/README.rst +++ b/README.rst @@ -26,6 +26,7 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ +* `Brain Catalogue `_ * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Broad Bioimage Benchmark Collection (BBBC) `_ * `Cell Image Library `_ From 7e00e1a52b09d80d59a99bc3144cae4f3e9e0da4 Mon Sep 17 00:00:00 2001 From: John Pellman Date: Mon, 4 Jul 2016 11:05:14 -0400 Subject: [PATCH 07/99] Neuroscience data added; new section for neuroscience --- README.rst | 21 +++++++++++++++++---- 1 file changed, 17 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index f337771..25de67c 100755 --- a/README.rst +++ b/README.rst @@ -26,11 +26,9 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ -* `Brain Catalogue `_ * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Broad Bioimage Benchmark Collection (BBBC) `_ * `Cell Image Library `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ * `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ * `EBI Protein Data Bank in Europe `_ @@ -49,7 +47,6 @@ Biology * `MIT Cancer Genomics Data `_ * `NCBI Proteins `_ * `NCBI Taxonomy `_ -* `NeuroData `_ * `NIH Microarray data `_ or `FTP `_ * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ @@ -63,7 +60,6 @@ Biology * `Stanford Microarray Data `_ * `Stowers Institute Original Data Repository `_ * `Systems Science of Biological Dynamics (SSBD) Database `_ -* `Temple University Hospital EEG Database `_ * `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ @@ -352,6 +348,23 @@ Natural Language * `Wikipedia Links data - 40 Million Entities in Context `_ * `WordNet databases and tools `_ +Neuroscience +------------- + +* `Allen Institute Datasets `_ +* `Brain Catalogue `_ +* `Brainomics `_ +* 
`CodeNeuro Datasets `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `FCP-INDI `_ +* `Human Connectome Project `_ +* `NDAR `_ +* `NIMH Data Archive `_ +* `NeuroData `_ +* `OASIS `_ +* `OpenfMRI `_ +* `Neuroelectro `_ +* `Study Forrest `_ Physics ------- From a3bde36abbb7192bc27b64849dc051218c35ee3c Mon Sep 17 00:00:00 2001 From: Alexandre Rademaker Date: Tue, 5 Jul 2016 05:34:44 -0300 Subject: [PATCH 08/99] wordnet and the corpora from UD project --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 11748fc..bf567ac 100755 --- a/README.rst +++ b/README.rst @@ -349,8 +349,10 @@ Natural Language * `USENET postings corpus of 2005~2011 `_ * `Wikidata - Wikipedia databases `_ * `Wikipedia Links data - 40 Million Entities in Context `_ +* `Universal Dependencies `_ * `WordNet databases and tools `_ - +* `Open Multilingual Wordnet `_ + Physics ------- From af605c3869628da629ec19b6d4605fe8fec4718f Mon Sep 17 00:00:00 2001 From: handmadeby Date: Thu, 7 Jul 2016 14:33:06 +0100 Subject: [PATCH 09/99] Updated TFL to current API link. The Transport for London API link was pointing to a legacy page - I updated to the current valid page. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 11748fc..d24f52b 100755 --- a/README.rst +++ b/README.rst @@ -532,7 +532,7 @@ Transportation * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ * `Toronto Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ * `Travel Tracker Survey (TTS) for Chicago `_ * `U.S. Bureau of Transportation Statistics (BTS) `_ * `U.S. 
Domestic Flights 1990 to 2009 `_ From 21ffee83e3926fbf3d397d8bb230985e06c1dc4a Mon Sep 17 00:00:00 2001 From: Haochi Kiang Date: Wed, 20 Jul 2016 10:39:51 +0800 Subject: [PATCH 10/99] Added Uppsala Conflict Data Program "The Uppsala Conflict Data Program (UCDP) offers a number of datasets on organised violence and peacemaking, all of which can be downloaded for free through the links below." --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc..549b3f7 100755 --- a/README.rst +++ b/README.rst @@ -475,6 +475,7 @@ Social Sciences * `Texas Inmates Executed Since 1984 `_ * `Titanic Survival Data Set `_ * `UCB's Archive of Social Science Data (D-Lab) `_ +* `Uppsala Conflict Data Program `_ * `UCLA Social Sciences Data Archive `_ * `UN Civil Society Database `_ * `Universities Worldwide `_ From 2bf5f661f48801bcbcd5ffa4e160d1bd606b5500 Mon Sep 17 00:00:00 2001 From: Scott Sievert Date: Fri, 22 Jul 2016 10:52:48 -0500 Subject: [PATCH 11/99] adds caption contest dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc..75624b5 100755 --- a/README.rst +++ b/README.rst @@ -307,6 +307,7 @@ Machine Learning * `Machine Learning Data Set Repository `_ * `Million Song Dataset `_ * `More Song Datasets `_ +* `New Yorker caption contest ratings `_ * `MovieLens Data Sets `_ * `RDataMining - "R and Data Mining" ebook data `_ * `Registered Meteorites on Earth `_ From 9bb6ab1e8919e0aefb9a4c33fa3b95fcbf09b95c Mon Sep 17 00:00:00 2001 From: jeremie Date: Wed, 10 Aug 2016 11:04:50 +0200 Subject: [PATCH 12/99] Fix broken link: Netflix prize --- README.rst | 7 +++---- 1 file changed, 3 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 11748fc..6e64370 100755 --- a/README.rst +++ b/README.rst @@ -126,7 +126,7 @@ Computer Networks * `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * `Criteo click-through data `_ * `Open Mobile Data by MobiPerf `_ -* `Rapid7 Sonar Internet Scans `_ +* `Rapid7 Sonar Internet Scans `_ * `UCSD Network Telescope, IPv4 /8 net `_ @@ -147,7 +147,7 @@ Data Challenges * `Kaggle Competition Data `_ * `KDD Cup by Tencent 2012 `_ * `Localytics Data Visualization Challenge `_ -* `Netflix Prize `_ +* `Netflix Prize `_ * `Space Apps Challenge `_ * `Telecom Italia Big Data Challenge `_ * `Yelp Dataset Challenge `_ @@ -268,7 +268,7 @@ Healthcare * `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ -* `OpenPaymentsData, Healthcare financial relationship data `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ * `World Health Organization Global Health Observatory `_ @@ -550,4 +550,3 @@ Complementary Collections * Quora: `Where can I find large datasets open to the public? 
`_ * RS.io: `100+ Interesting Data Sets for Statistics `_ * StaTrek: `Leveraging open data to understand urban lives `_ - From 71d9c2466db3704a409d43cbebc6f43c6da18230 Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Thu, 11 Aug 2016 10:45:55 +0800 Subject: [PATCH 13/99] add International Economics Database --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc..8c2f9f2 100755 --- a/README.rst +++ b/README.rst @@ -160,6 +160,7 @@ Economics * `EconData from UMD `_ * `Economic Freedom of the World Data `_ * `Historical MacroEconomc Statistics `_ +* `International Economics Database `_ and `various data tools `_ * `International Trade Statistics `_ * `Internet Product Code Database `_ * `Joint External Debt Data Hub `_ From 86fe0cf6dcc5f4c1c1ad5fd628dbd0ba91dfdeae Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Thu, 11 Aug 2016 10:51:08 +0800 Subject: [PATCH 14/99] add AWC --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8c2f9f2..835e9b3 100755 --- a/README.rst +++ b/README.rst @@ -75,6 +75,7 @@ Climate/Weather --------------- * `Australian Weather `_ +* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ * `Brazilian Weather - Historical data (In Portuguese) `_ * `Canadian Meteorological Centre `_ * `Climate Data from UEA (updated monthly) `_ From e2e48c39a080f8538c8d9d8d2013585a694513fc Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Aug 2016 11:18:24 +0800 Subject: [PATCH 15/99] #230 --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 1513444..7265c24 100755 --- a/README.rst +++ b/README.rst @@ -154,7 +154,7 @@ Data Challenges Economics --------- -* `American Economic Ass (AEA) `_ +* `American Economic Association (AEA) `_ * `EconData from UMD `_ * `Economic Freedom of the World Data `_ * `Historical MacroEconomc Statistics `_ @@ -485,6 
+485,7 @@ Social Sciences * `International Studies Compendium Project `_ * `James McGuire Cross National Data `_ * `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `Minnesota Population Center `_ * `MIT Reality Mining Dataset `_ * `Open Crime and Policing Data in England, Wales and Northern Ireland `_ * `Paul Hensel General International Data Page `_ From 87df786d26266a95ba09e2a3f52ed10aa1c8414e Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Aug 2016 11:26:55 +0800 Subject: [PATCH 16/99] Disable fake reports of links --- .travis.yml | 20 ++++++++++---------- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/.travis.yml b/.travis.yml index d4709b6..066e607 100644 --- a/.travis.yml +++ b/.travis.yml @@ -1,10 +1,10 @@ -language: ruby -rvm: - - 2.2 -before_script: - - gem install awesome_bot -script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi - - site503=datamob.org,research.microsoft.com - - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 +# language: ruby +# rvm: +# - 2.2 +# before_script: +# - gem install awesome_bot +# script: +# - 
site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ +# - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi +# - site503=datamob.org,research.microsoft.com +# - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 From 9d1f4fb10d6a2944a60012bd668e02fe094b1971 Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Mon, 15 Aug 2016 13:59:28 +0800 Subject: [PATCH 17/99] Add AQUASTAT and category Earth Science Earth Science maintains data from geoscience and earth related fields, like environment, water etc. --- README.rst | 34 +++++++++++++++++----------------- 1 file changed, 17 insertions(+), 17 deletions(-) diff --git a/README.rst b/README.rst index c0f7ff4..c04cf75 100755 --- a/README.rst +++ b/README.rst @@ -3,8 +3,6 @@ Awesome Public Datasets .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg :alt: Awesome :target: https://github.com/sindresorhus/awesome -.. image:: https://travis-ci.org/caesar0301/awesome-public-datasets.svg - :target: https://travis-ci.org/caesar0301/awesome-public-datasets `This list of public data sources `_ are collected and tidied from blogs, answers, and user responses. 
@@ -151,6 +149,20 @@ Data Challenges * `Yelp Dataset Challenge `_ * `Bruteforce Database `_ + +Earth Science +------------- + +* `AQUASTAT - Global water resources and uses `_ +* `BODC - marine data of ~22K vars `_ +* `Earth Models `_ +* `EOSDIS - NASA's earth observing system data `_ +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ +* `Marinexplore - Open Oceanographic Data `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `USGS Earthquake Archives `_ + + Economics --------- @@ -215,20 +227,10 @@ Finance * `NYSE Market Data `_ -Geology -------- +GIS +--- -* `Earth Models `_ -* `Smithsonian Institution Global Volcano and Eruption Database `_ -* `USGS Earthquake Archives `_ - - -GIS/Environment ---------------- - -* `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ -* `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -236,11 +238,8 @@ GIS/Environment * `GeoNames Worldwide `_ * `Global Administrative Areas Database (GADM) `_ * `Homeland Infrastructure Foundation-Level Data `_ -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ -* `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ * `List of all countries in all languages `_ -* `Marinexplore - Open Oceanographic Data `_ * `National Weather Service GIS Data Portal `_ * `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ @@ -254,6 +253,7 @@ GIS/Environment * `World boundaries from the U.S. 
Department of State `_ * `World countries in multiple formats `_ + Government ---------- From 2530bbf1338df2deed7cbd7caf0c942f89e18415 Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Mon, 15 Aug 2016 14:04:32 +0800 Subject: [PATCH 18/99] Update README.rst --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index c04cf75..bc0b84b 100755 --- a/README.rst +++ b/README.rst @@ -45,7 +45,7 @@ Biology * `MIT Cancer Genomics Data `_ * `NCBI Proteins `_ * `NCBI Taxonomy `_ -* `NIH Microarray data `_ or `FTP `_ +* `NIH Microarray data `_ or `FTP `_ (see FTP link on `RAW `_) * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ * `Protein Data Bank `_ @@ -224,7 +224,7 @@ Finance * `Quandl `_ * `St Louis Federal `_ * `Yahoo Finance `_ -* `NYSE Market Data `_ +* `NYSE Market Data `_ (see FTP link on `RAW `_) GIS From 0954d9aa6b21f61782358fb0debd6aad65aad2e9 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 11 Nov 2016 09:48:18 +0800 Subject: [PATCH 19/99] Add Kaggle link to Titanic data --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index bc0b84b..7146d69 100755 --- a/README.rst +++ b/README.rst @@ -500,7 +500,7 @@ Social Sciences * `StackExchange Data Explorer `_ * `Terrorism Research and Analysis Consortium `_ * `Texas Inmates Executed Since 1984 `_ -* `Titanic Survival Data Set `_ +* `Titanic Survival Data Set `_ or `on Kaggle `_ * `UCB's Archive of Social Science Data (D-Lab) `_ * `Uppsala Conflict Data Program `_ * `UCLA Social Sciences Data Archive `_ From 57d9c7bff7eb0ac17b8963c4ef4e9578f909cc2f Mon Sep 17 00:00:00 2001 From: Samuel Taylor Date: Sat, 12 Nov 2016 09:41:05 -0600 Subject: [PATCH 20/99] Remove dead link to GetGlue --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 7146d69..dc2029d 100755 --- a/README.rst +++ b/README.rst @@ -449,7 +449,6 @@ Social Networks * `Facebook 
Data Scrape (2005) `_ * `Facebook Social Networks from LAW (since 2007) `_ * `Foursquare from UMN/Sarwat (2013) `_ -* `GetGlue - users rating TV shows `_ * `GitHub Collaboration Archive `_ * `Google Scholar citation relations `_ * `High-Resolution Contact Networks from Wearable Sensors `_ From 80ecc66409f548ab4d8e2a607b94ece1dbb74300 Mon Sep 17 00:00:00 2001 From: Diomidis Spinellis Date: Sun, 27 Nov 2016 10:47:59 +0200 Subject: [PATCH 21/99] Add Microsoft's Data Science for Research --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 7146d69..ae62485 100755 --- a/README.rst +++ b/README.rst @@ -408,6 +408,7 @@ Public Domains * `Infochimps `_ * `KDNuggets Data Collections `_ * `Microsoft Azure Data Market Free DataSets `_ +* `Microsoft Data Science for Research `_ * `Numbray `_ * `Open Library Data Dumps `_ * `Reddit Datasets `_ From 6b7120dad2cfa2a28966ee5cf3c06bd42e6170f8 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Arturo=20Filast=C3=B2?= Date: Thu, 8 Dec 2016 18:44:01 +0000 Subject: [PATCH 22/99] Add OONI data Add a link to data provided by the Open Observatory of Network Interference on internet censorship --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 7146d69..5706729 100755 --- a/README.rst +++ b/README.rst @@ -121,6 +121,7 @@ Computer Networks * `CommonCrawl Web Data over 7 years `_ * `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * `Criteo click-through data `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ * `Open Mobile Data by MobiPerf `_ * `Rapid7 Sonar Internet Scans `_ * `UCSD Network Telescope, IPv4 /8 net `_ From 4dc886ac006ecf418ad49d4e4f54416fe973025a Mon Sep 17 00:00:00 2001 From: Maxwell Rebo Date: Sun, 11 Dec 2016 15:17:54 +0400 Subject: [PATCH 23/99] Update README.rst --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 7146d69..b771d33 100755 --- a/README.rst +++ b/README.rst @@ -357,6 +357,7 @@ Natural Language * `Universal Dependencies `_ * `WordNet databases and tools `_ * `Open Multilingual Wordnet `_ +* `Automatic Keyphrase Extracttion `_ Neuroscience From 0d0117a88a7f8ba4d8053b4305e834dea25c2ad6 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 18 Dec 2016 16:08:36 +0800 Subject: [PATCH 24/99] Update new image sets and three NLP sets Images: Chars74K dataset and MNIST, NLP: Google MC-AFP, MS-MACRO, and MDST --- README.rst | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.rst b/README.rst index 7146d69..e971eba 100755 --- a/README.rst +++ b/README.rst @@ -284,11 +284,13 @@ Image Processing * `2GB of Photos of Cats `_ or `Archive version `_ * `Affective Image Classification `_ * `Animals with attributes `_ +* `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ * `ImageNet (in WordNet hierarchy) `_ * `Indoor Scene Recognition `_ * `International Affective Picture System, UFL `_ * `Massive Visual Memory Stimuli, MIT `_ +* `MNIST database of handwritten digits, near 1 million examples `_ * `Several Shape-from-Silhouette Datasets `_ * `Stanford Dogs Dataset `_ * `SUN database, MIT `_ @@ -343,11 +345,14 @@ Natural Language * `Flickr Personal Taxonomies `_ * `Freebase.com of people, places, and things `_ * `Google Books Ngrams (2.2TB) `_ +* `Google MC-AFP, generated based on the public available 
Gigaword dataset using Paragraph Vectors `_ * `Google Web 5gram (1TB, 2006) `_ * `Gutenberg eBooks List `_ * `Hansards text chunks of Canadian Parliament `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ * `Machine Translation of European languages `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ From d5a61529bc585d4d11889cef03098d3e0309fc45 Mon Sep 17 00:00:00 2001 From: Victor Laerte Oliveira Date: Sun, 18 Dec 2016 20:57:22 -0300 Subject: [PATCH 25/99] Adding TravisTorrent MSR2017 Mining Challenge. TravisTorrent, a GHTorrent partner project, provides free and easy-to-use Travis CI build analyses to the masses through its open database. --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a07f976..a119174 100755 --- a/README.rst +++ b/README.rst @@ -148,6 +148,7 @@ Data Challenges * `Telecom Italia Big Data Challenge `_ * `Yelp Dataset Challenge `_ * `Bruteforce Database `_ +* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ Earth Science From 606189b55c1f628b0fa6c815f0496756cd3efc15 Mon Sep 17 00:00:00 2001 From: ghazy ben ahmed Date: Wed, 28 Dec 2016 20:56:27 +0100 Subject: [PATCH 26/99] Added Tunisia government data site --- Government.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/Government.rst b/Government.rst index 26555da..db7f229 100644 --- a/Government.rst +++ b/Government.rst @@ -85,6 +85,7 @@ Government * `Texas Open Data `_ * `The World Bank `_ * `Toronto, ON, Canada `_ +* `Tunisia `_ * `U.K. Government Data `_ * `U.S. American Community Survey `_ * `U.S. 
CDC Public Health datasets `_ @@ -100,4 +101,4 @@ Government * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ * `Victoria, BC, Canada `_ -* `Vienna, Austria `_ \ No newline at end of file +* `Vienna, Austria `_ From 3ba773df2de068da80e495437f1b8663a1f6939f Mon Sep 17 00:00:00 2001 From: Daniel Darabos Date: Thu, 5 Jan 2017 17:07:31 +0100 Subject: [PATCH 27/99] Fix typo. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 25186ef..6a03677 100755 --- a/README.rst +++ b/README.rst @@ -113,7 +113,7 @@ Complex Networks Computer Networks ----------------- -* `3.5B Web Pages from CommonCraw 2012 `_ +* `3.5B Web Pages from CommonCrawl 2012 `_ * `53.5B Web clicks of 100K users in Indiana Univ. `_ * `CAIDA Internet Datasets `_ * `ClueWeb09 - 1B web pages `_ From cddb768b860c18928e35b5ffc4b13cea481986e9 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Fran=C3=A7ois=20Pelletier?= Date: Sun, 8 Jan 2017 14:17:45 -0500 Subject: [PATCH 28/99] Update Government.rst --- Government.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/Government.rst b/Government.rst index db7f229..85f5efd 100644 --- a/Government.rst +++ b/Government.rst @@ -96,6 +96,7 @@ Government * `U.S. Food and Drug Administration (FDA) `_ * `U.S. National Center for Education Statistics (NCES) `_ * `U.S. 
Open Government `_ +* `Uganda Bureau of Statistics `_ * `UK 2011 Census Open Atlas Project `_ * `United Nations `_ * `Uruguay `_ From 6ea30d09b4f01d27ac433062df457aabac5c66d2 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Fran=C3=A7ois=20Pelletier?= Date: Sun, 8 Jan 2017 14:23:43 -0500 Subject: [PATCH 29/99] Update README.rst --- README.rst | 11 +++++++---- 1 file changed, 7 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 6a03677..05a5b8e 100755 --- a/README.rst +++ b/README.rst @@ -68,7 +68,7 @@ Biology Climate/Weather --------------- - +* `Actuaries Climate Index `_ * `Australian Weather `_ * `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ * `Brazilian Weather - Historical data (In Portuguese) `_ @@ -151,7 +151,6 @@ Data Challenges * `Bruteforce Database `_ * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ - Earth Science ------------- @@ -259,7 +258,8 @@ GIS Government ---------- -* `OpenDataSoft's list of 1,600 open data portals `_ +* `OpenDataSoft's list of 1,600 open data `_ +* `Open Data for Africa `_ * `A list of cities and countries contributed by community `_ @@ -487,11 +487,13 @@ Social Sciences * `Datacards `_ * `European Social Survey `_ * `FBI Hate Crime 2013 - aggregated data `_ +* `Fragile States Index `_ * `GDELT Global Events Database `_ * `General Social Survey (GSS) since 1972 `_ * `German Social Survey `_ * `Global Religious Futures Project `_ * `Humanitarian Data Exchange `_ +* `INFORM Index for Risk Management `_ * `Institute for Demographic Studies `_ * `International Networks Archive `_ * `International Social Survey Program ISSP `_ @@ -500,6 +502,7 @@ Social Sciences * `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ * `Minnesota Population Center `_ * `MIT Reality Mining Dataset `_ +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ * `Open Crime and Policing Data in England, Wales and Northern Ireland `_ * `Paul Hensel General 
International Data Page `_ * `PewResearch Internet Survey Project `_ @@ -515,7 +518,7 @@ Social Sciences * `UN Civil Society Database `_ * `Universities Worldwide `_ * `UPJOHN for Labor Employment Research `_ -* `World Bank Data `_ +* `World Bank Open Data `_ * `WorldPop project - Worldwide human population distributions `_ From e07bb6ccc26ed59f0680ffd45cd28d2d9dd6266a Mon Sep 17 00:00:00 2001 From: Katherine Schinkel Date: Sun, 15 Jan 2017 19:41:14 -0800 Subject: [PATCH 30/99] Add College Scorecard https://collegescorecard.ed.gov/data/ --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 05a5b8e..a003f47 100755 --- a/README.rst +++ b/README.rst @@ -189,6 +189,7 @@ Economics Education ------------ +* `College Scorecard Data `_ * `Student Data from Free Code Camp `_ From ff5ed076f4cef7ec935fd7ff444eaa8d38c15fee Mon Sep 17 00:00:00 2001 From: Raul Jimenez Ortega Date: Fri, 27 Jan 2017 08:10:21 +0100 Subject: [PATCH 31/99] Adding ArcGIS Open Data portal --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 05a5b8e..fee51aa 100755 --- a/README.rst +++ b/README.rst @@ -231,6 +231,7 @@ Finance GIS --- +* `ArcGIS Open Data portal `_ * `Cambridge, MA, US, GIS data on GitHub `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ From 1c940529b037528433049fdc0e9d6e0d5d0d7b2a Mon Sep 17 00:00:00 2001 From: Jad Chaar Date: Sat, 28 Jan 2017 23:43:32 -0500 Subject: [PATCH 32/99] Added links to SURFRAD data --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 05a5b8e..8b03172 100755 --- a/README.rst +++ b/README.rst @@ -80,6 +80,7 @@ Climate/Weather * `NOAA Bering Sea Climate `_ * `NOAA Climate Datasets `_ * `NOAA Realtime Weather Models `_ +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ * `The World Bank Open Data Resources for Climate Change `_ * `UEA Climatic Research Unit `_ * `WorldClim - Global Climate Data `_ From 
92ede117e165d4e2883bcb8c8b696d74a23b49a6 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sat, 4 Feb 2017 13:24:06 +0800 Subject: [PATCH 33/99] fix link issue #276 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 3596eda..c300e61 100755 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Computer Networks Contextual Data --------------- -* `Context-aware data sets from five domains `_ or `GitHub `_ +* `Context-aware data sets from five domains `_ Data Challenges From 20ad345175ca9e16ed7c6896448e8c2e813305e2 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sat, 4 Feb 2017 13:25:54 +0800 Subject: [PATCH 34/99] Fix link issue #277 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c300e61..10da14f 100755 --- a/README.rst +++ b/README.rst @@ -156,7 +156,7 @@ Earth Science ------------- * `AQUASTAT - Global water resources and uses `_ -* `BODC - marine data of ~22K vars `_ +* `BODC - marine data of ~22K vars `_ * `Earth Models `_ * `EOSDIS - NASA's earth observing system data `_ * `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ From cb41229790348825ded701259413459cac920591 Mon Sep 17 00:00:00 2001 From: Philip Fung Date: Tue, 7 Feb 2017 12:24:59 -0800 Subject: [PATCH 35/99] adding National Cancer Institute - Genomic Data Commons --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 10da14f..202d181 100755 --- a/README.rst +++ b/README.rst @@ -45,6 +45,7 @@ Biology * `MIT Cancer Genomics Data `_ * `NCBI Proteins `_ * `NCBI Taxonomy `_ +* `NCI Genomic Data Commons `_ * `NIH Microarray data `_ or `FTP `_ (see FTP link on `RAW `_) * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ From 64fe2cc8c35d8765bfe0735890e18ff409e1cfcd Mon Sep 17 00:00:00 2001 From: Alex Date: Mon, 13 Feb 2017 14:49:11 +1300 Subject: [PATCH 36/99] added 
youtube 8 and visual genome --- README.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.rst b/README.rst index 10da14f..47eff1f 100755 --- a/README.rst +++ b/README.rst @@ -304,6 +304,7 @@ Image Processing * `Adience Unfiltered faces for gender and age classification `_ * `The Action Similarity Labeling (ASLAN) Challenge `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ +* `Visual genome `_ Machine Learning ---------------- @@ -325,6 +326,7 @@ Machine Learning * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! Ratings and Classification Data `_ +* `Youtube 8m `_ Museums From e5cea9a18422088a4f641d9d21e6b323f9fd6526 Mon Sep 17 00:00:00 2001 From: Alex Date: Mon, 13 Feb 2017 14:57:38 +1300 Subject: [PATCH 37/99] Update README.rst --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 47eff1f..6b57705 100755 --- a/README.rst +++ b/README.rst @@ -304,7 +304,7 @@ Image Processing * `Adience Unfiltered faces for gender and age classification `_ * `The Action Similarity Labeling (ASLAN) Challenge `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ -* `Visual genome `_ +* `Visual genome `_ Machine Learning ---------------- @@ -326,7 +326,7 @@ Machine Learning * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! 
Ratings and Classification Data `_ -* `Youtube 8m `_ +* `Youtube 8m `_ Museums From 5587d232b599a2b9dc23ab4b1c99bc2bc19ed399 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 13 Feb 2017 11:30:01 +0800 Subject: [PATCH 38/99] Add EveryPolitician, #280 --- Government.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/Government.rst b/Government.rst index 85f5efd..1df8d04 100644 --- a/Government.rst +++ b/Government.rst @@ -1,6 +1,8 @@ Government ---------- +* `EveryPolitician, ongoing project collating and sharing data on every politician. `_ + * `Alberta, Province of Canada `_ * `Antwerp, Belgium `_ * `Argentina (non official) `_ From 49e07e34c284b9292cd68fb590affeb57756194e Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 13 Feb 2017 11:34:21 +0800 Subject: [PATCH 39/99] Add data.world #279 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 6b57705..2bee928 100755 --- a/README.rst +++ b/README.rst @@ -417,6 +417,7 @@ Public Domains * `CMU StatLab collections `_ * `Data360 `_ * `Datamob.org `_ +* `Data.World `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ From 7ac9f9e367cdc5d47d897fc788b68ead5135d827 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 13 Feb 2017 11:45:07 +0800 Subject: [PATCH 40/99] Add Tennis database from Jeff Sackmann #278 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 2bee928..b59d440 100755 --- a/README.rst +++ b/README.rst @@ -544,6 +544,7 @@ Sports * `Lahman's Baseball Database `_ * `Pinhooker: Thoroughbred Bloodstock Sale Data `_ * `Retrosheet Baseball Statistics `_ +* `Tennis database of rankings, results, and stats for ATP `_, `WTA `_, `Grand Slams `_ and `Match Charting Project `_ Time Series From 6141e30d29e36a90eeaddc756f08f7164f351b74 Mon Sep 17 00:00:00 2001 From: Emre Bolat Date: Thu, 23 Feb 2017 10:26:22 +0200 Subject: [PATCH 41/99] New addition to Agriculture category U.S. 
Department of Agriculture's Nutrient Database link added. --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index aba0efe..b59c6c7 100755 --- a/README.rst +++ b/README.rst @@ -17,6 +17,7 @@ Other amazingly awesome lists can be found in the Agriculture ------------ * `U.S. Department of Agriculture's PLANTS Database `_ +* `U.S. Department of Agriculture's Nutrient Database `_ Biology From e746ff23857f0550d47ad3074af00d597446188a Mon Sep 17 00:00:00 2001 From: Alex Date: Fri, 24 Feb 2017 14:20:01 +1300 Subject: [PATCH 42/99] added comp vision dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index aba0efe..d5a2910 100755 --- a/README.rst +++ b/README.rst @@ -306,6 +306,7 @@ Image Processing * `The Action Similarity Labeling (ASLAN) Challenge `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ * `Visual genome `_ +* `Caltech Pedestrian Detection Benchmark `_ Machine Learning ---------------- From dc1f51b3263d700596603c4a52c54dd9b44d0955 Mon Sep 17 00:00:00 2001 From: Martin Linkov Date: Wed, 1 Mar 2017 11:14:10 +0100 Subject: [PATCH 43/99] CoolDatasets The twitter account upgraded to a website, the collection grows, I think it is worth including in the Complementary List --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index aba0efe..79ac468 100755 --- a/README.rst +++ b/README.rst @@ -592,6 +592,7 @@ Complementary Collections * `Data Packaged Core Datasets `_ * `Database of Scientific Code Contributions `_ * DataWrangling: `Some Datasets Available on the Web `_ +* A growing collection of public datasets: `CoolDatasets. `_ * Inside-r: `Finding Data on the Internet `_ * OpenDataMonitor: `An overview of available open data resources in Europe `_ * Quora: `Where can I find large datasets open to the public? 
`_ From aff0331e4e2dcbfc259b92a464c734ad73ffcd28 Mon Sep 17 00:00:00 2001 From: owkwen Date: Thu, 9 Mar 2017 13:54:36 -0500 Subject: [PATCH 44/99] Resurrected link Montreal BIXI Bike Share link is dead. Updated with new link and in english. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index aba0efe..de92f21 100755 --- a/README.rst +++ b/README.rst @@ -568,7 +568,7 @@ Transportation * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ * `Marine Traffic - ship tracks, port calls and more `_ -* `Montreal BIXI Bike Share `_ +* `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * `NYC Uber trip data April 2014 to September 2014 `_ From 1633901880b97b47194c97f6abd896a5dbe14e8f Mon Sep 17 00:00:00 2001 From: Clement Michaud Date: Tue, 28 Mar 2017 22:04:21 +0200 Subject: [PATCH 45/99] Fix broken link to Transport for London open datasets --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index aba0efe..4a57290 100755 --- a/README.rst +++ b/README.rst @@ -579,7 +579,7 @@ Transportation * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ * `Toronto Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ * `Travel Tracker Survey (TTS) for Chicago `_ * `U.S. Bureau of Transportation Statistics (BTS) `_ * `U.S. 
Domestic Flights 1990 to 2009 `_ From 863c2c831100a9d03eb6fba2b0644f068edf4d91 Mon Sep 17 00:00:00 2001 From: shagun Sodhani Date: Thu, 6 Apr 2017 14:00:41 +0530 Subject: [PATCH 46/99] Added webhose datasets - related to News/Blogs in multiple languages --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index b59c6c7..87071ab 100755 --- a/README.rst +++ b/README.rst @@ -372,6 +372,7 @@ Natural Language * `WordNet databases and tools `_ * `Open Multilingual Wordnet `_ * `Automatic Keyphrase Extracttion `_ +* `News/Blogs in multiple languages `_ Neuroscience From e53e99c4c468cb6528cc4993ba40cfaf58467114 Mon Sep 17 00:00:00 2001 From: Katherine Schinkel Date: Thu, 6 Apr 2017 21:09:07 -0700 Subject: [PATCH 47/99] Create PULL_REQUEST_TEMPLATE.md --- PULL_REQUEST_TEMPLATE.md | 3 +++ 1 file changed, 3 insertions(+) create mode 100644 PULL_REQUEST_TEMPLATE.md diff --git a/PULL_REQUEST_TEMPLATE.md b/PULL_REQUEST_TEMPLATE.md new file mode 100644 index 0000000..4690fa4 --- /dev/null +++ b/PULL_REQUEST_TEMPLATE.md @@ -0,0 +1,3 @@ +# Overview +Dataset Description:
+[link to dataset](putlinkhere.com) From f96c461782a6d899e21046de3d4a7b622b19e598 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 7 Apr 2017 16:47:40 +0800 Subject: [PATCH 48/99] Clear format and fix #291 --- README.rst | 69 +++++++++++++++++++++++++++--------------------------- 1 file changed, 34 insertions(+), 35 deletions(-) diff --git a/README.rst b/README.rst index da3a2e7..4068950 100755 --- a/README.rst +++ b/README.rst @@ -25,8 +25,8 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Broad Bioimage Benchmark Collection (BBBC) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Cell Image Library `_ * `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ @@ -64,12 +64,13 @@ Biology * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ * `UCSC Public Data `_ -* `Universal Protein Resource (UnitProt) `_ * `UniGene `_ +* `Universal Protein Resource (UnitProt) `_ Climate/Weather --------------- + * `Actuaries Climate Index `_ * `Australian Weather `_ * `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ @@ -95,6 +96,7 @@ Complex Networks * `AMiner Citation Network Dataset `_ * `CrossRef DOI URLs `_ * `DBLP Citation dataset `_ +* `DIMACS Road Networks Collection `_ * `NBER Patent Citations `_ * `Network Repository with Interactive Exploratory Analysis Tools `_ * `NIST complex networks data collection `_ @@ -111,7 +113,7 @@ Complex Networks * `UCI Network Data Repository `_ * `UFL sparse matrix collection `_ * `WSU Graph Database `_ -* `DIMACS Road Networks Collection `_ + Computer Networks ----------------- @@ -130,15 +132,10 @@ Computer Networks * `UCSD Network Telescope, IPv4 /8 net `_ -Contextual Data ---------------- - -* `Context-aware data sets from five domains `_ - - Data Challenges --------------- +* `Bruteforce Database `_ * `Challenges in Machine Learning `_ * `CrowdANALYTIX dataX `_ * `D4D 
Challenge of Orange `_ @@ -150,9 +147,9 @@ Data Challenges * `Netflix Prize `_ * `Space Apps Challenge `_ * `Telecom Italia Big Data Challenge `_ -* `Yelp Dataset Challenge `_ -* `Bruteforce Database `_ * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* `Yelp Dataset Challenge `_ + Earth Science ------------- @@ -216,7 +213,6 @@ Energy * `WHITED `_ - Finance ------- @@ -224,12 +220,12 @@ Finance * `Google Finance `_ * `Google Trends `_ * `NASDAQ `_ +* `NYSE Market Data `_ (see FTP link on `RAW `_) * `OANDA `_ * `OSU Financial data `_ * `Quandl `_ * `St Louis Federal `_ * `Yahoo Finance `_ -* `NYSE Market Data `_ (see FTP link on `RAW `_) GIS @@ -263,9 +259,9 @@ GIS Government ---------- -* `OpenDataSoft's list of 1,600 open data `_ -* `Open Data for Africa `_ * `A list of cities and countries contributed by community `_ +* `Open Data for Africa `_ +* `OpenDataSoft's list of 1,600 open data `_ Healthcare @@ -289,10 +285,13 @@ Image Processing * `10k US Adult Faces Database `_ * `2GB of Photos of Cats `_ or `Archive version `_ +* `Adience Unfiltered faces for gender and age classification `_ * `Affective Image Classification `_ * `Animals with attributes `_ +* `Caltech Pedestrian Detection Benchmark `_ * `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ +* `GDXray: X-ray images for X-ray testing and Computer Vision `_ * `ImageNet (in WordNet hierarchy) `_ * `Indoor Scene Recognition `_ * `International Affective Picture System, UFL `_ @@ -301,17 +300,17 @@ Image Processing * `Several Shape-from-Silhouette Datasets `_ * `Stanford Dogs Dataset `_ * `SUN database, MIT `_ -* `The Oxford-IIIT Pet Dataset `_ -* `YouTube Faces Database `_ -* `Adience Unfiltered faces for gender and age classification `_ * `The Action Similarity Labeling (ASLAN) Challenge `_ +* `The Oxford-IIIT Pet Dataset `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ * `Visual genome 
`_ -* `Caltech Pedestrian Detection Benchmark `_ +* `YouTube Faces Database `_ + Machine Learning ---------------- +* `Context-aware data sets from five domains `_ * `Delve Datasets for classification and regression (Univ. of Toronto) `_ * `Discogs Monthly Data `_ * `eBay Online Auctions (2012) `_ @@ -322,8 +321,8 @@ Machine Learning * `Machine Learning Data Set Repository `_ * `Million Song Dataset `_ * `More Song Datasets `_ -* `New Yorker caption contest ratings `_ * `MovieLens Data Sets `_ +* `New Yorker caption contest ratings `_ * `RDataMining - "R and Data Mining" ebook data `_ * `Registered Meteorites on Earth `_ * `Restaurants Health Score Data in San Francisco `_ @@ -347,6 +346,7 @@ Museums Natural Language ---------------- +* `Automatic Keyphrase Extracttion `_ * `Blogger Corpus `_ * `CLiPS Stylometry Investigation Corpus `_ * `ClueWeb09 FACC `_ @@ -361,37 +361,36 @@ Natural Language * `Hansards text chunks of Canadian Parliament `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ * `Machine Translation of European languages `_ -* `Multi-Domain Sentiment Dataset (version 2.0) `_ * `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Open Multilingual Wordnet `_ * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ +* `Universal Dependencies `_ * `USENET postings corpus of 2005~2011 `_ +* `Webhose - News/Blogs in multiple languages `_ * `Wikidata - Wikipedia databases `_ * `Wikipedia Links data - 40 Million Entities in Context `_ -* `Universal Dependencies `_ * `WordNet databases and tools `_ -* `Open Multilingual Wordnet `_ -* `Automatic Keyphrase Extracttion `_ -* `News/Blogs in multiple languages `_ - + Neuroscience ------------- * `Allen Institute Datasets `_ * `Brain Catalogue `_ -* `Brainomics `_ -* `CodeNeuro Datasets `_ +* `Brainomics `_ +* `CodeNeuro Datasets `_ * 
`Collaborative Research in Computational Neuroscience (CRCNS) `_ * `FCP-INDI `_ -* `Human Connectome Project `_ +* `Human Connectome Project `_ * `NDAR `_ -* `NIMH Data Archive `_ * `NeuroData `_ +* `Neuroelectro `_ +* `NIMH Data Archive `_ * `OASIS `_ * `OpenfMRI `_ -* `Neuroelectro `_ * `Study Forrest `_ @@ -419,9 +418,9 @@ Public Domains * `Archive.org Datasets `_ * `CMU JASA data archive `_ * `CMU StatLab collections `_ +* `Data.World `_ * `Data360 `_ * `Datamob.org `_ -* `Data.World `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ @@ -477,8 +476,8 @@ Social Networks * `Skytrax' Air Travel Reviews Dataset `_ * `Social Twitter Data `_ * `SourceForge.net Research Data `_ -* `Twitter Data for Sentiment Analysis `_ * `Twitter Data for Online Reputation Management `_ +* `Twitter Data for Sentiment Analysis `_ * `Twitter Graph of entire Twitter site `_ * `Twitter Scrape Calufa May 2011 `_ * `UNIMI/LAW Social Network Datasets `_ @@ -523,11 +522,11 @@ Social Sciences * `Texas Inmates Executed Since 1984 `_ * `Titanic Survival Data Set `_ or `on Kaggle `_ * `UCB's Archive of Social Science Data (D-Lab) `_ -* `Uppsala Conflict Data Program `_ * `UCLA Social Sciences Data Archive `_ * `UN Civil Society Database `_ * `Universities Worldwide `_ * `UPJOHN for Labor Employment Research `_ +* `Uppsala Conflict Data Program `_ * `World Bank Open Data `_ * `WorldPop project - Worldwide human population distributions `_ @@ -594,8 +593,8 @@ Complementary Collections * `Data Packaged Core Datasets `_ * `Database of Scientific Code Contributions `_ -* DataWrangling: `Some Datasets Available on the Web `_ * A growing collection of public datasets: `CoolDatasets. `_ +* DataWrangling: `Some Datasets Available on the Web `_ * Inside-r: `Finding Data on the Internet `_ * OpenDataMonitor: `An overview of available open data resources in Europe `_ * Quora: `Where can I find large datasets open to the public? 
`_ From 68088197e998355435117ec3a660d8ad96bf4aad Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 7 Apr 2017 16:59:02 +0800 Subject: [PATCH 49/99] Modify pull_request_template --- PULL_REQUEST_TEMPLATE.md | 3 --- PULL_REQUEST_TEMPLATE.rst | 3 +++ 2 files changed, 3 insertions(+), 3 deletions(-) delete mode 100644 PULL_REQUEST_TEMPLATE.md create mode 100644 PULL_REQUEST_TEMPLATE.rst diff --git a/PULL_REQUEST_TEMPLATE.md b/PULL_REQUEST_TEMPLATE.md deleted file mode 100644 index 4690fa4..0000000 --- a/PULL_REQUEST_TEMPLATE.md +++ /dev/null @@ -1,3 +0,0 @@ -# Overview -Dataset Description:
-[link to dataset](putlinkhere.com) diff --git a/PULL_REQUEST_TEMPLATE.rst b/PULL_REQUEST_TEMPLATE.rst new file mode 100644 index 0000000..1014736 --- /dev/null +++ b/PULL_REQUEST_TEMPLATE.rst @@ -0,0 +1,3 @@ +# Overview + +* `Dataset Description `_ From e3dcb1c503e792d692f64a179f8ee1a81a75ce1b Mon Sep 17 00:00:00 2001 From: Cameron Date: Fri, 28 Apr 2017 15:00:28 -0700 Subject: [PATCH 50/99] add flickr logo dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 47e51b5..6e50e87 100755 --- a/README.rst +++ b/README.rst @@ -291,6 +291,7 @@ Image Processing * `Caltech Pedestrian Detection Benchmark `_ * `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ +* `Flickr: 32 Class Brand Logos `_ * `GDXray: X-ray images for X-ray testing and Computer Vision `_ * `ImageNet (in WordNet hierarchy) `_ * `Indoor Scene Recognition `_ From dac0811dc28755fa5101613f31bbcbf01f887d05 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Micha=C3=ABl=20Defferrard?= Date: Wed, 10 May 2017 15:54:12 +0200 Subject: [PATCH 51/99] Add Free Music Archive --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 47e51b5..761f7e2 100755 --- a/README.rst +++ b/README.rst @@ -319,6 +319,7 @@ Machine Learning * `Labeled Faces in the Wild (LFW) `_ * `Lending Club Loan Data `_ * `Machine Learning Data Set Repository `_ +* `Free Music Archive `_ * `Million Song Dataset `_ * `More Song Datasets `_ * `MovieLens Data Sets `_ From 2f651a452a3f617a9a9cff4ee8f8dfd4c4fbf35a Mon Sep 17 00:00:00 2001 From: EngineerEmily Date: Fri, 23 Jun 2017 21:35:34 -0700 Subject: [PATCH 52/99] Adding local data portals --- Government.rst | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/Government.rst b/Government.rst index 1df8d04..7c7758a 100644 --- a/Government.rst +++ b/Government.rst @@ -51,20 +51,24 @@ Government * `London, ON, Canada `_ * `Los Angeles Open 
Data `_ * `MassGIS, Massachusetts, U.S. `_ +* `Metropolitain Transportation Commission (MTC), California, US `_ * `Mexico `_ * `Missisauga, ON, Canada `_ * `Moldova `_ * `Moncton, NB, Canada `_ +* `Mountain View, California, US (GIS) `_ * `Montreal, QC, Canada `_ * `Netherlands `_ * `New Zealand `_ * `NYC betanyc `_ * `NYC Open Data `_ +* `Oakland, California, US `_ * `OECD `_ * `Oklahoma `_ * `Open Government Data (OGD) Platform India `_ * `Oregon `_ * `Ottawa, ON, Canada `_ +* `Palo Alto, California, US `_ * `Portland, Oregon `_ * `Portugal - Pordata organization `_ * `Puerto Rico Government `_ @@ -75,6 +79,8 @@ Government * `Romania `_ * `Russia `_ * `San Francisco Data sets `_ +* `San Jose, California, US `_ +* `San Mateo County, California, US `_ * `Saskatchewan, Province of Canada `_ * `Seattle `_ * `Singapore Government Data `_ @@ -102,6 +108,7 @@ Government * `UK 2011 Census Open Atlas Project `_ * `United Nations `_ * `Uruguay `_ +* `Valley Transportation Authority (VTA), California, US `_ * `Vancouver, BC Open Data Catalog `_ * `Victoria, BC, Canada `_ * `Vienna, Austria `_ From 0bde4fd8edcf044131d5669fd22a1ac10f1b2ee3 Mon Sep 17 00:00:00 2001 From: Ryan Barrett Date: Thu, 29 Jun 2017 07:36:48 -0700 Subject: [PATCH 53/99] Add Indie Map --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index edab464..b169fcb 100755 --- a/README.rst +++ b/README.rst @@ -472,6 +472,7 @@ Social Networks * `GitHub Collaboration Archive `_ * `Google Scholar citation relations `_ * `High-Resolution Contact Networks from Wearable Sensors `_ +* `Indie Map: social graph and crawl of top IndieWeb sites `_ * `Mobile Social Networks from UMASS `_ * `Network Twitter Data `_ * `Reddit Comments `_ From 1c57e245bd11f2f6d650ad07a4c3b4d92bc6d087 Mon Sep 17 00:00:00 2001 From: Tom Morris Date: Tue, 11 Jul 2017 10:37:39 -0400 Subject: [PATCH 54/99] Datamob is gone --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst 
index edab464..1a33385 100755 --- a/README.rst +++ b/README.rst @@ -422,7 +422,6 @@ Public Domains * `CMU StatLab collections `_ * `Data.World `_ * `Data360 `_ -* `Datamob.org `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ From 76ee6a0012c8d5d835581928e15b3f8416b71383 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 10 Aug 2017 10:54:22 +0800 Subject: [PATCH 55/99] Fix #308 --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 1a33385..f631ee5 100755 --- a/README.rst +++ b/README.rst @@ -269,6 +269,7 @@ Healthcare * `EHDP Large Health Data Sets `_ * `Gapminder World demographic databases `_ +* `GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ * `Medicare Coverage Database (MCD), U.S. `_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ @@ -276,7 +277,7 @@ Healthcare * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ * `OpenPaymentsData, Healthcare financial relationship data `_ -* `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ +* The Cancer Genome Atlas project (TCGA) (refer to `GDC `_ and `BigQuery table `_) * `World Health Organization Global Health Observatory `_ From a12a3b41693047128bda88552ad1543950c4bb32 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 10 Aug 2017 10:55:40 +0800 Subject: [PATCH 56/99] Fix #307 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index f631ee5..8155e1e 100755 --- a/README.rst +++ b/README.rst @@ -349,7 +349,7 @@ Museums Natural Language ---------------- -* `Automatic Keyphrase Extracttion `_ +* `Automatic Keyphrase Extraction `_ * `Blogger Corpus `_ * `CLiPS Stylometry Investigation Corpus `_ * `ClueWeb09 FACC `_ From 853dbff93781b301cc4af8249927c505192d1d41 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 10 Aug 2017 11:06:01 +0800 Subject: [PATCH 57/99] 
#306 --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 8155e1e..9472dc3 100755 --- a/README.rst +++ b/README.rst @@ -4,7 +4,7 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome -`This list of public data sources `_ +`This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in the @@ -270,6 +270,7 @@ Healthcare * `EHDP Large Health Data Sets `_ * `Gapminder World demographic databases `_ * `GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* `PhysioBank Databases - a large and growing archive of physiological data `_ * `Medicare Coverage Database (MCD), U.S. `_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ From 15d70df85e958cec172ddd7c39ef5183b9fa2b38 Mon Sep 17 00:00:00 2001 From: Fabio D'Elia Date: Mon, 21 Aug 2017 10:59:02 +0200 Subject: [PATCH 58/99] changed Registered Meteorites on Earth to new link --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index f6b6bde..5ea3cc0 100755 --- a/README.rst +++ b/README.rst @@ -328,7 +328,7 @@ Machine Learning * `MovieLens Data Sets `_ * `New Yorker caption contest ratings `_ * `RDataMining - "R and Data Mining" ebook data `_ -* `Registered Meteorites on Earth `_ +* `Registered Meteorites on Earth `_ * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! 
Ratings and Classification Data `_ From 39dab15b605b1c93a77a185ab019e6348264b39f Mon Sep 17 00:00:00 2001 From: Muhammad Faheem Akhtar Date: Sat, 26 Aug 2017 17:34:12 +0500 Subject: [PATCH 59/99] Fixed a broken link The link to "Caltech Pedestrian Detection Benchmark" was broken - issue 315 by sentientmachine --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index f6b6bde..ef7fc93 100755 --- a/README.rst +++ b/README.rst @@ -290,7 +290,7 @@ Image Processing * `Adience Unfiltered faces for gender and age classification `_ * `Affective Image Classification `_ * `Animals with attributes `_ -* `Caltech Pedestrian Detection Benchmark `_ +* `Caltech Pedestrian Detection Benchmark `_ * `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ * `Flickr: 32 Class Brand Logos `_ From 0822a7840965d68e4ed773fd02fe2768f7c8c3ac Mon Sep 17 00:00:00 2001 From: Leonardo Taccari Date: Thu, 31 Aug 2017 11:35:11 +0200 Subject: [PATCH 60/99] Broken link The link is broken. The pages http://www.draftexpress.com/stats/nba,http://www.draftexpress.com/stats/ncaa, http://www.draftexpress.com/stats/euroleague exist, but it looks like there's no downloadable dataset. --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index f6b6bde..8054e7c 100755 --- a/README.rst +++ b/README.rst @@ -543,7 +543,6 @@ Software Sports ------ -* `Basketball (NBA/NCAA/Euro) Player Database and Statistics `_ * `Betfair Historical Exchange Data `_ * `Cricsheet Matches (cricket) `_ * `Ergast Formula 1, from 1950 up to date (API) `_ From 713e56ad6c83e73c0716a85c907af82391043adc Mon Sep 17 00:00:00 2001 From: Keith Stolte Date: Mon, 16 Oct 2017 21:24:22 -0400 Subject: [PATCH 61/99] Update of a few US Gov Links Looks like some of the pages may have been moved around since this was started. Updated a few. 
--- Government.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/Government.rst b/Government.rst index 1df8d04..7b7f26d 100644 --- a/Government.rst +++ b/Government.rst @@ -89,8 +89,8 @@ Government * `Toronto, ON, Canada `_ * `Tunisia `_ * `U.K. Government Data `_ -* `U.S. American Community Survey `_ -* `U.S. CDC Public Health datasets `_ +* `U.S. American Community Survey `_ +* `U.S. CDC Public Health datasets `_ * `U.S. Census Bureau `_ * `U.S. Department of Housing and Urban Development (HUD) `_ * `U.S. Federal Government Agencies `_ From 1de47f3ed06b1362b9d8f9e38c168ad09468540c Mon Sep 17 00:00:00 2001 From: Kostas Christidis Date: Tue, 31 Oct 2017 19:23:37 -0400 Subject: [PATCH 62/99] Fix Dataport URL Closes #331. Signed-off-by: Kostas Christidis --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 0c556f0..7f28f55 100755 --- a/README.rst +++ b/README.rst @@ -199,7 +199,7 @@ Energy * `AMPds `_ * `BLUEd `_ * `COMBED `_ -* `Dataport `_ +* `Dataport `_ * `DRED `_ * `ECO `_ * `EIA `_ From f6381e21f3457b2f9035363efe6af2087ff250d6 Mon Sep 17 00:00:00 2001 From: Kostas Christidis Date: Fri, 3 Nov 2017 05:37:40 -0400 Subject: [PATCH 63/99] Remove Dataport URL Dataport no longer offers public datasets. Closes #331. 
Signed-off-by: Kostas Christidis --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 7f28f55..60b10b0 100755 --- a/README.rst +++ b/README.rst @@ -199,7 +199,6 @@ Energy * `AMPds `_ * `BLUEd `_ * `COMBED `_ -* `Dataport `_ * `DRED `_ * `ECO `_ * `EIA `_ From 1c1bd03b4d4de1a93d34f0b923a2962288f38e31 Mon Sep 17 00:00:00 2001 From: Tom Morris Date: Fri, 10 Nov 2017 17:29:24 -0500 Subject: [PATCH 64/99] Remove commercial marinetraffic.com - fixes #333 --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 0c556f0..740d59a 100755 --- a/README.rst +++ b/README.rst @@ -575,7 +575,6 @@ Transportation * `GeoLife GPS Trajectory from Microsoft Research `_ * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ -* `Marine Traffic - ship tracks, port calls and more `_ * `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ From 7e881ea669743f4095b24151a5800e271f834c9d Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 26 Nov 2017 19:13:09 +0800 Subject: [PATCH 65/99] Fix #333. Remove Marine Traffic It turns non-open any more --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 60b10b0..0296190 100755 --- a/README.rst +++ b/README.rst @@ -574,7 +574,6 @@ Transportation * `GeoLife GPS Trajectory from Microsoft Research `_ * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ -* `Marine Traffic - ship tracks, port calls and more `_ * `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ From 23b406d5370b3032df09a4e9b5869be0688bc3b9 Mon Sep 17 00:00:00 2001 From: Min Date: Mon, 18 Dec 2017 14:13:25 +1300 Subject: [PATCH 66/99] Added Stanford Question Answering Dataset (SQuAD) In right alphabetical order. 
--- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 0296190..c34ccea 100755 --- a/README.rst +++ b/README.rst @@ -373,6 +373,7 @@ Natural Language * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ +* `Stanford Question Answering Dataset (SQuAD) `_ * `Universal Dependencies `_ * `USENET postings corpus of 2005~2011 `_ * `Webhose - News/Blogs in multiple languages `_ From 5254acc97cf631e260c8306b1af447f0c5546957 Mon Sep 17 00:00:00 2001 From: eveah Date: Fri, 5 Jan 2018 12:01:45 -0500 Subject: [PATCH 67/99] Adding Enigma Public Adding Enigma Public to the public domain section. Public Domains Enigma Public _ --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index c34ccea..d24d4d0 100755 --- a/README.rst +++ b/README.rst @@ -427,6 +427,7 @@ Public Domains * `CMU StatLab collections `_ * `Data.World `_ * `Data360 `_ +* `Enigma Public `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ From 036e5b32bfd0bdc129c66d1a21c1a1f76d0f981d Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:04:07 +0800 Subject: [PATCH 68/99] Update README from APD2 --- Government.rst | 114 --- PULL_REQUEST_TEMPLATE.rst | 3 - README.rst | 1579 ++++++++++++++++++++++++++----------- 3 files changed, 1099 insertions(+), 597 deletions(-) delete mode 100644 Government.rst delete mode 100644 PULL_REQUEST_TEMPLATE.rst mode change 100755 => 100644 README.rst diff --git a/Government.rst b/Government.rst deleted file mode 100644 index eb14f30..0000000 --- a/Government.rst +++ /dev/null @@ -1,114 +0,0 @@ -Government ----------- - -* `EveryPolitician, ongoing project collating and sharing data on every politician. 
`_ - -* `Alberta, Province of Canada `_ -* `Antwerp, Belgium `_ -* `Argentina (non official) `_ -* `Argentina `_ -* `Austin, TX, US `_ -* `Australia (abs.gov.au) `_ -* `Australia (data.gov.au) `_ -* `Austria (data.gv.at) `_ -* `Baton Rouge, LA, US `_ -* `Belgium `_ -* `Brazil `_ -* `Buenos Aires, Argentina `_ -* `Calgary, AB, Canada `_ -* `Cambridge, MA, US `_ -* `Canada `_ -* `Chicago `_ -* `Chile `_ -* `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ -* `Denver Open Data `_ -* `Durham, NC Open Data `_ -* `Edmonton, AB, Canada `_ -* `England LGInform `_ -* `EuroStat `_ -* `FedStats `_ -* `Finland `_ -* `France `_ -* `Fredericton, NB, Canada `_ -* `Gatineau, QC, Canada `_ -* `Germany `_ -* `Ghent, Belgium `_ -* `Glasgow, Scotland, UK `_ -* `Greece `_ -* `Guardian world governments `_ -* `Halifax, NS, Canada `_ -* `Helsinki Region, Finland `_ -* `Hong Kong, China `_ -* `Houston Open Data `_ -* `Indian Government Data `_ -* `Indonesian Data Portal `_ -* `Ireland's Open Data Portal `_ -* `Japan `_ -* `Laval, QC, Canada `_ -* `Lexington, KY `_ -* `London Datastore, UK `_ -* `London, ON, Canada `_ -* `Los Angeles Open Data `_ -* `MassGIS, Massachusetts, U.S. 
`_ -* `Metropolitain Transportation Commission (MTC), California, US `_ -* `Mexico `_ -* `Missisauga, ON, Canada `_ -* `Moldova `_ -* `Moncton, NB, Canada `_ -* `Mountain View, California, US (GIS) `_ -* `Montreal, QC, Canada `_ -* `Netherlands `_ -* `New Zealand `_ -* `NYC betanyc `_ -* `NYC Open Data `_ -* `Oakland, California, US `_ -* `OECD `_ -* `Oklahoma `_ -* `Open Government Data (OGD) Platform India `_ -* `Oregon `_ -* `Ottawa, ON, Canada `_ -* `Palo Alto, California, US `_ -* `Portland, Oregon `_ -* `Portugal - Pordata organization `_ -* `Puerto Rico Government `_ -* `Quebec City, QC, Canada `_ -* `Quebec Province of Canada `_ -* `Regina SK, Canada `_ -* `Rio de Janeiro, Brazil `_ -* `Romania `_ -* `Russia `_ -* `San Francisco Data sets `_ -* `San Jose, California, US `_ -* `San Mateo County, California, US `_ -* `Saskatchewan, Province of Canada `_ -* `Seattle `_ -* `Singapore Government Data `_ -* `South Africa `_ -* `South Africa Trade Statistics `_ -* `State of Utah, US `_ -* `Switzerland `_ -* `Taiwan `_ -* `Taiwan g0v `_ -* `Texas Open Data `_ -* `The World Bank `_ -* `Toronto, ON, Canada `_ -* `Tunisia `_ -* `U.K. Government Data `_ -* `U.S. American Community Survey `_ -* `U.S. CDC Public Health datasets `_ -* `U.S. Census Bureau `_ -* `U.S. Department of Housing and Urban Development (HUD) `_ -* `U.S. Federal Government Agencies `_ -* `U.S. Federal Government Data Catalog `_ -* `U.S. Food and Drug Administration (FDA) `_ -* `U.S. National Center for Education Statistics (NCES) `_ -* `U.S. 
Open Government `_ -* `Uganda Bureau of Statistics `_ -* `UK 2011 Census Open Atlas Project `_ -* `United Nations `_ -* `Uruguay `_ -* `Valley Transportation Authority (VTA), California, US `_ -* `Vancouver, BC Open Data Catalog `_ -* `Victoria, BC, Canada `_ -* `Vienna, Austria `_ diff --git a/PULL_REQUEST_TEMPLATE.rst b/PULL_REQUEST_TEMPLATE.rst deleted file mode 100644 index 1014736..0000000 --- a/PULL_REQUEST_TEMPLATE.rst +++ /dev/null @@ -1,3 +0,0 @@ -# Overview - -* `Dataset Description `_ diff --git a/README.rst b/README.rst old mode 100755 new mode 100644 index d24d4d0..e16a1ae --- a/README.rst +++ b/README.rst @@ -1,608 +1,1227 @@ Awesome Public Datasets ======================= + .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg :alt: Awesome :target: https://github.com/sindresorhus/awesome -`This list of a topic-centric public data sources `_ in high quality. They -are collected and tidied from blogs, answers, and user responses. + +**NOTICE**: This repo is automatically generated by `APD2 `_. +Please **DO NOT** modify this file directly. We now provide +`a new way `_ +to contribute to Awesome Public Datasets. + + +`This list of a topic-centric public data sources `_ +in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and `sindresorhus's awesome `_ list. + .. contents:: Table of Contents + Agriculture ------------- -* `U.S. Department of Agriculture's PLANTS Database `_ +----------- contribute + * `U.S. Department of Agriculture's Nutrient Database `_ - - + +* `U.S. 
Department of Agriculture's PLANTS Database `_ + Biology -------- - -* `1000 Genomes `_ -* `American Gut (Microbiome Project) `_ -* `Broad Bioimage Benchmark Collection (BBBC) `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* `Cell Image Library `_ -* `Complete Genomics Public Data `_ -* `EBI ArrayExpress `_ -* `EBI Protein Data Bank in Europe `_ -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* `ENCODE project `_ -* `Ensembl Genomes `_ -* `Gene Expression Omnibus (GEO) `_ -* `Gene Ontology (GO) `_ -* `Global Biotic Interactions (GloBI) `_ -* `Harvard Medical School (HMS) LINCS Project `_ -* `Human Genome Diversity Project `_ -* `Human Microbiome Project (HMP) `_ -* `ICOS PSP Benchmark `_ -* `International HapMap Project `_ -* `Journal of Cell Biology DataViewer `_ -* `MIT Cancer Genomics Data `_ +------- contribute + * `NCBI Proteins `_ -* `NCBI Taxonomy `_ -* `NCI Genomic Data Commons `_ -* `NIH Microarray data `_ or `FTP `_ (see FTP link on `RAW `_) -* `OpenSNP genotypes data `_ -* `Pathguid - Protein-Protein Interactions Catalog `_ -* `Protein Data Bank `_ -* `Psychiatric Genomics Consortium `_ -* `PubChem Project `_ -* `PubGene (now Coremine Medical) `_ -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* `Sequence Read Archive(SRA) `_ -* `Stanford Microarray Data `_ -* `Stowers Institute Original Data Repository `_ -* `Systems Science of Biological Dynamics (SSBD) Database `_ -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ -* `The Catalogue of Life `_ -* `The Personal Genome Project `_ or `PGP `_ -* `UCSC Public Data `_ + +* `Gene Expression Omnibus (GEO) `_ + * `UniGene `_ + +* `Gene Ontology (GO) `_ + +* `UCSC Public Data `_ + +* `EBI Protein Data Bank in Europe `_ + +* `OpenSNP genotypes data `_ + +* `The Personal Genome Project `_ + +* `Stowers Institute Original Data Repository `_ + +* `American Gut (Microbiome Project) `_ + +* `Systems 
Science of Biological Dynamics (SSBD) Database `_ + +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ + +* `Broad Bioimage Benchmark Collection (BBBC) `_ + +* `Journal of Cell Biology DataViewer `_ + +* `NCI Genomic Data Commons `_ + +* `Protein Data Bank `_ + +* `Pathguid - Protein-Protein Interactions Catalog `_ + +* `International HapMap Project `_ + +* `Global Biotic Interactions (GloBI) `_ + +* `NCBI Taxonomy `_ + +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ + +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ + +* `Ensembl Genomes `_ + +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ + +* `ICOS PSP Benchmark `_ + +* `PubChem Project `_ + +* `Psychiatric Genomics Consortium `_ + +* `Human Microbiome Project (HMP) `_ + +* `Stanford Microarray Data `_ + +* `EBI ArrayExpress `_ + +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ + +* `PubGene (now Coremine Medical) `_ + +* `Harvard Medical School (HMS) LINCS Project `_ + +* `ENCODE project `_ + +* `Complete Genomics Public Data `_ + +* `Cell Image Library `_ + * `Universal Protein Resource (UnitProt) `_ - - -Climate/Weather ---------------- - -* `Actuaries Climate Index `_ -* `Australian Weather `_ -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ -* `Brazilian Weather - Historical data (In Portuguese) `_ -* `Canadian Meteorological Centre `_ -* `Climate Data from UEA (updated monthly) `_ -* `European Climate Assessment & Dataset `_ + +* `MIT Cancer Genomics Data `_ + +* `The Catalogue of Life `_ + +* `NIH Microarray data `_ + +* `Sequence Read Archive(SRA) `_ + +* `Human Genome Diversity Project `_ + +* `1000 Genomes `_ + +Climate+Weather +--------------- contribute + * `Global Climate Data Since 1929 `_ -* `NASA Global Imagery Browse Services `_ -* `NOAA Bering Sea Climate `_ -* `NOAA Climate Datasets `_ -* `NOAA Realtime Weather Models `_ -* `NOAA SURFRAD Meteorology and Radiation Datasets 
`_ + * `The World Bank Open Data Resources for Climate Change `_ -* `UEA Climatic Research Unit `_ -* `WorldClim - Global Climate Data `_ + +* `Brazilian Weather - Historical data (In Portuguese) `_ + +* `NOAA Bering Sea Climate `_ + * `WU Historical Weather Worldwide `_ - - -Complex Networks ----------------- - -* `AMiner Citation Network Dataset `_ -* `CrossRef DOI URLs `_ -* `DBLP Citation dataset `_ + +* `Climate Data from UEA (updated monthly) `_ + +* `Actuaries Climate Index `_ + +* `WorldClim - Global Climate Data `_ + +* `Australian Weather `_ + +* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ + +* `NASA Global Imagery Browse Services `_ + +* `NOAA Realtime Weather Models `_ + +* `UEA Climatic Research Unit `_ + +* `European Climate Assessment & Dataset `_ + +* `Canadian Meteorological Centre `_ + +* `NOAA Climate Datasets `_ + +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ + +ComplexNetworks +--------------- contribute + * `DIMACS Road Networks Collection `_ -* `NBER Patent Citations `_ -* `Network Repository with Interactive Exploratory Analysis Tools `_ -* `NIST complex networks data collection `_ -* `Protein-protein interaction network `_ -* `PyPI and Maven Dependency Network `_ -* `Scopus Citation Database `_ -* `Small Network Data `_ -* `Stanford GraphBase (Steven Skiena) `_ -* `Stanford Large Network Dataset Collection `_ -* `Stanford Longitudinal Network Data Sources `_ -* `The Koblenz Network Collection `_ -* `The Laboratory for Web Algorithmics (UNIMI) `_ -* `The Nexus Network Repository `_ -* `UCI Network Data Repository `_ + * `UFL sparse matrix collection `_ + +* `Stanford GraphBase `_ + +* `DBLP Citation dataset `_ + +* `Small Network Data `_ + +* `CrossRef DOI URLs `_ + +* `The Nexus Network Repository `_ + +* `Stanford Longitudinal Network Data Sources `_ + +* `PyPI and Maven Dependency Network `_ + +* `Stanford Large Network Dataset Collection `_ + * `WSU Graph 
Database `_ - - -Computer Networks ------------------ - -* `3.5B Web Pages from CommonCrawl 2012 `_ + +* `The Koblenz Network Collection `_ + +* `The Laboratory for Web Algorithmics (UNIMI) `_ + +* `Network Repository with Interactive Exploratory Analysis Tools `_ + +* `UCI Network Data Repository `_ + +* `Scopus Citation Database `_ + +* `NBER Patent Citations `_ + +* `Protein-protein interaction network `_ + +* `NIST complex networks data collection `_ + +* `AMiner Citation Network Dataset `_ + +ComputerNetworks +---------------- contribute + * `53.5B Web clicks of 100K users in Indiana Univ. `_ -* `CAIDA Internet Datasets `_ -* `ClueWeb09 - 1B web pages `_ -* `ClueWeb12 - 733M web pages `_ -* `CommonCrawl Web Data over 7 years `_ -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ -* `Criteo click-through data `_ -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ + * `Open Mobile Data by MobiPerf `_ -* `Rapid7 Sonar Internet Scans `_ + +* `ClueWeb12 - 733M web pages `_ + +* `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ + +* `CAIDA Internet Datasets `_ + +* `ClueWeb09 - 1B web pages `_ + * `UCSD Network Telescope, IPv4 /8 net `_ - - -Data Challenges ---------------- - -* `Bruteforce Database `_ -* `Challenges in Machine Learning `_ -* `CrowdANALYTIX dataX `_ -* `D4D Challenge of Orange `_ -* `DrivenData Competitions for Social Good `_ -* `ICWSM Data Challenge (since 2009) `_ -* `Kaggle Competition Data `_ -* `KDD Cup by Tencent 2012 `_ -* `Localytics Data Visualization Challenge `_ + +* `Criteo click-through data `_ + +* `3.5B Web Pages from CommonCrawl 2012 `_ + +* `Rapid7 Sonar Internet Scans `_ + +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ + +* `CommonCrawl Web Data over 7 years `_ + +DataChallenges +-------------- contribute + * `Netflix Prize `_ + * `Space Apps Challenge `_ -* `Telecom Italia Big Data Challenge `_ -* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ + +* `ICWSM Data Challenge (since 2009) `_ + +* `DrivenData Competitions for Social Good `_ + +* `CrowdANALYTIX dataX `_ + +* `Bruteforce Database `_ + +* `Kaggle Competition Data `_ + * `Yelp Dataset Challenge `_ - - -Earth Science -------------- - + +* `Localytics Data Visualization Challenge `_ + +* `D4D Challenge of Orange `_ + +* `Telecom Italia Big Data Challenge `_ + +* `KDD Cup by Tencent 2012 `_ + +* `Challenges in Machine Learning `_ + +* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ + +EarthScience +------------ contribute + * `AQUASTAT - Global water resources and uses `_ -* `BODC - marine data of ~22K vars `_ -* `Earth Models `_ -* `EOSDIS - NASA's earth observing system data `_ -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ + * `Marinexplore - Open Oceanographic Data `_ + +* `EOSDIS - NASA's earth observing system data `_ + +* `BODC - marine data of ~22K vars `_ + +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ + * `Smithsonian Institution Global Volcano and 
Eruption Database `_ + +* `Earth Models `_ + * `USGS Earthquake Archives `_ - - + Economics ---------- - -* `American Economic Association (AEA) `_ -* `EconData from UMD `_ -* `Economic Freedom of the World Data `_ -* `Historical MacroEconomc Statistics `_ -* `International Economics Database `_ and `various data tools `_ -* `International Trade Statistics `_ -* `Internet Product Code Database `_ -* `Joint External Debt Data Hub `_ -* `Jon Haveman International Trade Data Links `_ -* `OpenCorporates Database of Companies in the World `_ -* `Our World in Data `_ -* `SciencesPo World Trade Gravity Datasets `_ -* `The Atlas of Economic Complexity `_ +--------- contribute + * `The Center for International Data `_ + +* `Historical MacroEconomc Statistics `_ + +* `International Economics Database `_ + +* `Internet Product Code Database `_ + +* `American Economic Association (AEA) `_ + +* `Jon Haveman International Trade Data Links `_ + * `The Observatory of Economic Complexity `_ + +* `The Atlas of Economic Complexity `_ + +* `SciencesPo World Trade Gravity Datasets `_ + +* `Our World in Data `_ + * `UN Commodity Trade Statistics `_ + +* `OpenCorporates Database of Companies in the World `_ + +* `International Trade Statistics `_ + +* `Joint External Debt Data Hub `_ + +* `EconData from UMD `_ + * `UN Human Development Reports `_ - - + +* `Economic Freedom of the World Data `_ + Education ------------- - -* `College Scorecard Data `_ +--------- contribute + * `Student Data from Free Code Camp `_ - - + +* `College Scorecard Data `_ + Energy ------- - -* `AMPds `_ -* `BLUEd `_ -* `COMBED `_ +------ contribute + * `DRED `_ -* `ECO `_ -* `EIA `_ -* `HES `_ - Household Electricity Study, UK -* `HFED `_ + +* `COMBED `_ + * `iAWE `_ -* `PLAID `_ - the Plug Load Appliance Identification Dataset -* `REDD `_ -* `Tracebase `_ -* `UK-DALE `_ - UK Domestic Appliance-Level Electricity + +* `AMPds `_ + +* `ECO `_ + * `WHITED `_ - - + +* `HES - Household Electricity Study, UK `_ + +* 
`PLAID - The Plug Load Appliance Identification Dataset `_ + +* `BLUEd `_ + +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ + +* `HFED `_ + +* `Tracebase `_ + +* `EIA `_ + +* `REDD `_ + Finance -------- - -* `CBOE Futures Exchange `_ -* `Google Finance `_ -* `Google Trends `_ +------- contribute + * `NASDAQ `_ -* `NYSE Market Data `_ (see FTP link on `RAW `_) -* `OANDA `_ -* `OSU Financial data `_ -* `Quandl `_ -* `St Louis Federal `_ + +* `Google Finance `_ + * `Yahoo Finance `_ - - + +* `NYSE Market Data `_ + +* `CBOE Futures Exchange `_ + +* `St Louis Federal `_ + +* `Quandl `_ + +* `Google Trends `_ + +* `OANDA `_ + +* `OSU Financial data `_ + GIS ---- - -* `ArcGIS Open Data portal `_ -* `Cambridge, MA, US, GIS data on GitHub `_ -* `Factual Global Location Data `_ -* `Geo Spatial Data from ASU `_ -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* `GeoNames Worldwide `_ -* `Global Administrative Areas Database (GADM) `_ -* `Homeland Infrastructure Foundation-Level Data `_ -* `Landsat 8 on AWS `_ -* `List of all countries in all languages `_ -* `National Weather Service GIS Data Portal `_ -* `Natural Earth - vectors and rasters of the world `_ -* `OpenAddresses `_ -* `OpenStreetMap (OSM) `_ -* `Pleiades - Gazetteer and graph of ancient places `_ -* `Reverse Geocoder using OSM data `_ & `additional high-resolution data files `_ -* `TIGER/Line - U.S. boundaries and roads `_ -* `TwoFishes - Foursquare's coarse geocoder `_ +--- contribute + * `TZ Timezones shapfiles `_ -* `UN Environmental Data `_ + +* `Pleiades - Gazetteer and graph of ancient places `_ + +* `OpenStreetMap (OSM) `_ + +* `Factual Global Location Data `_ + * `World boundaries from the U.S. 
Department of State `_ + +* `GeoNames Worldwide `_ + +* `Landsat 8 on AWS `_ + +* `Global Administrative Areas Database (GADM) `_ + +* `Natural Earth - vectors and rasters of the world `_ + +* `Geo Spatial Data from ASU `_ + +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ + +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ + +* `Cambridge, MA, US, GIS data on GitHub `_ + +* `ArcGIS Open Data portal `_ + +* `OpenAddresses `_ + +* `UN Environmental Data `_ + +* `TwoFishes - Foursquare's coarse geocoder `_ + +* `TIGER/Line - U.S. boundaries and roads `_ + +* `Reverse Geocoder using OSM data `_ + +* `Homeland Infrastructure Foundation-Level Data `_ + +* `List of all countries in all languages `_ + +* `National Weather Service GIS Data Portal `_ + * `World countries in multiple formats `_ - - + Government ----------- - -* `A list of cities and countries contributed by community `_ +---------- contribute + +* `New Zealand `_ + +* `Glasgow, Scotland, UK `_ + +* `Puerto Rico Government `_ + +* `Vienna, Austria `_ + +* `Missisauga, ON, Canada `_ + +* `Open Government Data (OGD) Platform India `_ + +* `Montreal, QC, Canada `_ + +* `Indian Government Data `_ + +* `U.S. Food and Drug Administration (FDA) `_ + +* `MassGIS, Massachusetts, U.S. `_ + +* `Los Angeles Open Data `_ + +* `Vancouver, BC Open Data Catalog `_ + +* `U.S. Federal Government Agencies `_ + +* `State of Utah, US `_ + +* `Buenos Aires, Argentina `_ + +* `Texas Open Data `_ + +* `Baton Rouge, LA, US `_ + +* `Netherlands `_ + +* `Uganda Bureau of Statistics `_ + +* `Palo Alto, California, US `_ + +* `Victoria, BC, Canada `_ + +* `U.S. CDC Public Health datasets `_ + +* `NYC Open Data `_ + +* `U.S. 
American Community Survey `_ + +* `Finland `_ + +* `Guardian world governments `_ + +* `Japan `_ + +* `Portland, Oregon `_ + +* `Uruguay `_ + +* `Australia (data.gov.au) `_ + +* `Laval, QC, Canada `_ + +* `Lexington, KY `_ + +* `Helsinki Region, Finland `_ + +* `Mexico `_ + +* `Romania `_ + +* `Singapore Government Data `_ + +* `Chile `_ + +* `U.K. Government Data `_ + +* `Canada `_ + +* `Cambridge, MA, US `_ + +* `San Francisco Data sets `_ + +* `San Jose, California, US `_ + +* `FedStats `_ + +* `Germany `_ + +* `DataBC - data from the Province of British Columbia `_ + +* `U.S. Federal Government Data Catalog `_ + * `Open Data for Africa `_ + +* `Toronto, ON, Canada `_ + +* `Ghent, Belgium `_ + +* `Saskatchewan, Province of Canada `_ + +* `Gatineau, QC, Canada `_ + +* `Dallas Open Data `_ + +* `South Africa `_ + +* `Quebec City, QC, Canada `_ + +* `OECD `_ + +* `Denver Open Data `_ + +* `Portugal - Pordata organization `_ + +* `Metropolitain Transportation Commission (MTC), California, US `_ + +* `France `_ + +* `London, ON, Canada `_ + +* `San Mateo County, California, US `_ + +* `Houston Open Data `_ + +* `Edmonton, AB, Canada `_ + +* `Argentina (non official) `_ + +* `Chicago `_ + +* `Durham, NC Open Data `_ + +* `Alberta, Province of Canada `_ + +* `Oklahoma `_ + +* `Belgium `_ + +* `Moldova `_ + +* `Austria (data.gv.at) `_ + +* `Greece `_ + +* `U.S. National Center for Education Statistics (NCES) `_ + +* `Brazil `_ + +* `Austin, TX, US `_ + +* `Moncton, NB, Canada `_ + +* `Mountain View, California, US (GIS) `_ + * `OpenDataSoft's list of 1,600 open data `_ - - + +* `England LGInform `_ + +* `Valley Transportation Authority (VTA), California, US `_ + +* `Switzerland `_ + +* `U.S. 
Department of Housing and Urban Development (HUD) `_ + +* `Antwerp, Belgium `_ + +* `Ireland's Open Data Portal `_ + +* `UK 2011 Census Open Atlas Project `_ + +* `Rio de Janeiro, Brazil `_ + +* `Russia `_ + +* `Australia (abs.gov.au) `_ + +* `Taiwan g0v `_ + +* `Halifax, NS, Canada `_ + +* `Argentina `_ + +* `Hong Kong, China `_ + +* `U.S. Open Government `_ + +* `Calgary, AB, Canada `_ + +* `EuroStat `_ + +* `Seattle `_ + +* `NYC betanyc `_ + +* `London Datastore, UK `_ + +* `The World Bank `_ + +* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ + +* `U.S. Census Bureau `_ + +* `Tunisia `_ + +* `Indonesian Data Portal `_ + +* `Oregon `_ + +* `Fredericton, NB, Canada `_ + +* `South Africa Trade Statistics `_ + +* `Ottawa, ON, Canada `_ + +* `Regina SK, Canada `_ + +* `United Nations `_ + +* `Oakland, California, US `_ + +* `Quebec Province of Canada `_ + +* `Taiwan `_ + Healthcare ----------- - -* `EHDP Large Health Data Sets `_ -* `Gapminder World demographic databases `_ -* `GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ -* `PhysioBank Databases - a large and growing archive of physiological data `_ -* `Medicare Coverage Database (MCD), U.S. `_ -* `Medicare Data Engine of medicare.gov Data `_ -* `Medicare Data File `_ +---------- contribute + +* `PhysioBank Databases - A large and growing archive of physiological data. 
`_ + * `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ + +* `Gapminder World demographic databases `_ + * `Open-ODS (structure of the UK NHS) `_ + +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ + +* `EHDP Large Health Data Sets `_ + +* `Medicare Data Engine of medicare.gov Data `_ + +* `Medicare Data File `_ + * `OpenPaymentsData, Healthcare financial relationship data `_ -* The Cancer Genome Atlas project (TCGA) (refer to `GDC `_ and `BigQuery table `_) + * `World Health Organization Global Health Observatory `_ - - -Image Processing ----------------- - -* `10k US Adult Faces Database `_ -* `2GB of Photos of Cats `_ or `Archive version `_ -* `Adience Unfiltered faces for gender and age classification `_ -* `Affective Image Classification `_ -* `Animals with attributes `_ -* `Caltech Pedestrian Detection Benchmark `_ -* `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ -* `Face Recognition Benchmark `_ -* `Flickr: 32 Class Brand Logos `_ -* `GDXray: X-ray images for X-ray testing and Computer Vision `_ -* `ImageNet (in WordNet hierarchy) `_ -* `Indoor Scene Recognition `_ -* `International Affective Picture System, UFL `_ -* `Massive Visual Memory Stimuli, MIT `_ -* `MNIST database of handwritten digits, near 1 million examples `_ + +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ + +* `Medicare Coverage Database (MCD), U.S. 
`_ + +* `The Cancer Genome Atlas project (TCGA) `_ + +ImageProcessing +--------------- contribute + * `Several Shape-from-Silhouette Datasets `_ + * `Stanford Dogs Dataset `_ -* `SUN database, MIT `_ -* `The Action Similarity Labeling (ASLAN) Challenge `_ -* `The Oxford-IIIT Pet Dataset `_ -* `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ -* `Visual genome `_ + +* `Flickr: 32 Class Brand Logos `_ + +* `Indoor Scene Recognition `_ + * `YouTube Faces Database `_ - - -Machine Learning ----------------- - -* `Context-aware data sets from five domains `_ -* `Delve Datasets for classification and regression (Univ. of Toronto) `_ + +* `MNIST database of handwritten digits, near 1 million examples `_ + +* `Visual genome `_ + +* `Affective Image Classification `_ + +* `Adience Unfiltered faces for gender and age classification `_ + +* `The Oxford-IIIT Pet Dataset `_ + +* `2GB of Photos of Cats `_ + +* `The Action Similarity Labeling (ASLAN) Challenge `_ + +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ + +* `10k US Adult Faces Database `_ + +* `Caltech Pedestrian Detection Benchmark `_ + +* `Massive Visual Memory Stimuli, MIT `_ + +* `International Affective Picture System, UFL `_ + +* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ + +* `SUN database, MIT `_ + +* `GDXray - X-ray images for X-ray testing and Computer Vision `_ + +* `ImageNet (in WordNet hierarchy) `_ + +* `Face Recognition Benchmark `_ + +* `Animals with attributes `_ + +MachineLearning +--------------- contribute + * `Discogs Monthly Data `_ -* `eBay Online Auctions (2012) `_ -* `IMDb Database `_ -* `Keel Repository for classification, regression and time series `_ -* `Labeled Faces in the Wild (LFW) `_ -* `Lending Club Loan Data `_ -* `Machine Learning Data Set Repository `_ + * `Free Music Archive `_ -* `Million Song Dataset `_ -* `More Song Datasets `_ -* `MovieLens Data Sets `_ -* `New Yorker 
caption contest ratings `_ -* `RDataMining - "R and Data Mining" ebook data `_ -* `Registered Meteorites on Earth `_ -* `Restaurants Health Score Data in San Francisco `_ -* `UCI Machine Learning Repository `_ + +* `Delve Datasets for classification and regression `_ + * `Yahoo! Ratings and Classification Data `_ + +* `Restaurants Health Score Data in San Francisco `_ + +* `Context-aware data sets from five domains `_ + +* `More Song Datasets `_ + +* `Lending Club Loan Data `_ + +* `MovieLens Data Sets `_ + +* `Labeled Faces in the Wild (LFW) `_ + +* `eBay Online Auctions (2012) `_ + +* `UCI Machine Learning Repository `_ + * `Youtube 8m `_ - - + +* `RDataMining - "R and Data Mining" ebook data `_ + +* `IMDb Database `_ + +* `Keel Repository for classification, regression and time series `_ + +* `Registered Meteorites on Earth `_ + +* `Million Song Dataset `_ + +* `New Yorker caption contest ratings `_ + +* `Machine Learning Data Set Repository `_ + Museums -------- - -* `Canada Science and Technology Museums Corporation's Open Data `_ -* `Cooper-Hewitt's Collection Database `_ -* `Minneapolis Institute of Arts metadata `_ -* `Natural History Museum (London) Data Portal `_ +------- contribute + * `Rijksmuseum Historical Art Collection `_ + * `Tate Collection metadata `_ + +* `Canada Science and Technology Museums Corporation's Open Data `_ + +* `Natural History Museum (London) Data Portal `_ + * `The Getty vocabularies `_ - - -Natural Language ----------------- - -* `POS/NER/Chunk annotated data `_ -* `Automatic Keyphrase Extraction `_ -* `Blogger Corpus `_ -* `CLiPS Stylometry Investigation Corpus `_ -* `ClueWeb09 FACC `_ -* `ClueWeb12 FACC `_ -* `DBpedia - 4.58M things with 583M facts `_ -* `Flickr Personal Taxonomies `_ -* `Freebase.com of people, places, and things `_ -* `Google Books Ngrams (2.2TB) `_ -* `Google MC-AFP, generated based on the public available Gigaword dataset using Paragraph Vectors `_ -* `Google Web 5gram (1TB, 2006) `_ -* `Gutenberg eBooks 
List `_ -* `Hansards text chunks of Canadian Parliament `_ -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* `Machine Translation of European languages `_ -* `Making Sense of Microposts 2013 - Concept Extraction `_ -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ -* `Multi-Domain Sentiment Dataset (version 2.0) `_ -* `Open Multilingual Wordnet `_ -* `Personae Corpus `_ -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ -* `SMS Spam Collection in English `_ -* `Stanford Question Answering Dataset (SQuAD) `_ -* `Universal Dependencies `_ -* `USENET postings corpus of 2005~2011 `_ + +* `Minneapolis Institute of Arts metadata `_ + +* `Cooper-Hewitt's Collection Database `_ + +NaturalLanguage +--------------- contribute + * `Webhose - News/Blogs in multiple languages `_ -* `Wikidata - Wikipedia databases `_ + +* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ + +* `Universal Dependencies `_ + +* `SMS Spam Collection in English `_ + +* `Stanford Question Answering Dataset (SQuAD) `_ + +* `Flickr Personal Taxonomies `_ + +* `Google Books Ngrams (2.2TB) `_ + +* `DBpedia - 4.58M things with 583M facts `_ + +* `Personae Corpus `_ + * `Wikipedia Links data - 40 Million Entities in Context `_ + +* `Automatic Keyphrase Extraction `_ + +* `ClueWeb12 FACC `_ + +* `CLiPS Stylometry Investigation Corpus `_ + +* `Making Sense of Microposts 2013 - Concept Extraction `_ + +* `ClueWeb09 FACC `_ + * `WordNet databases and tools `_ - - + +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ + +* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ + +* `Wikidata - Wikipedia databases `_ + +* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ + +* `Gutenberg eBooks List `_ + +* `Google Web 5gram (1TB, 2006) `_ + 
+* `POS/NER/Chunk annotated data `_ + +* `Freebase of people, places, and things `_ + +* `Hansards text chunks of Canadian Parliament `_ + +* `Machine Translation of European languages `_ + +* `Multi-Domain Sentiment Dataset (version 2.0) `_ + +* `USENET postings corpus of 2005~2011 `_ + +* `Open Multilingual Wordnet `_ + +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ + +* `Blogger Corpus `_ + Neuroscience -------------- - -* `Allen Institute Datasets `_ -* `Brain Catalogue `_ -* `Brainomics `_ -* `CodeNeuro Datasets `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* `FCP-INDI `_ +------------ contribute + * `Human Connectome Project `_ -* `NDAR `_ -* `NeuroData `_ + +* `Brain Catalogue `_ + +* `CodeNeuro Datasets `_ + * `Neuroelectro `_ + +* `Allen Institute Datasets `_ + +* `NDAR `_ + +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ + * `NIMH Data Archive `_ + +* `NeuroData `_ + +* `Brainomics `_ + +* `FCP-INDI `_ + * `OASIS `_ + * `OpenfMRI `_ + * `Study Forrest `_ - - + Physics -------- - +------- contribute + * `CERN Open Data Portal `_ -* `Crystallography Open Database `_ -* `NASA Exoplanet Archive `_ -* `NSSDC (NASA) data of 550 space spacecraft `_ + * `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ - - -Psychology/Cognition --------------------- - + +* `Crystallography Open Database `_ + +* `NASA Exoplanet Archive `_ + +* `NSSDC (NASA) data of 550 space spacecraft `_ + +Psychology+Cognition +-------------------- contribute + * `OSU Cognitive Modeling Repository Datasets `_ - - -Public Domains --------------- - -* `Amazon `_ -* `Archive-it from Internet Archive `_ -* `Archive.org Datasets `_ -* `CMU JASA data archive `_ -* `CMU StatLab collections `_ -* `Data.World `_ -* `Data360 `_ -* `Enigma Public `_ + +PublicDomains +------------- contribute + * `Google `_ + +* `Amazon `_ + * `Infochimps `_ -* `KDNuggets Data Collections `_ -* `Microsoft Azure Data Market Free DataSets `_ -* 
`Microsoft Data Science for Research `_ -* `Numbray `_ -* `Open Library Data Dumps `_ -* `Reddit Datasets `_ + +* `CMU StatLab collections `_ + +* `Archive.org Datasets `_ + +* `Enigma Public `_ + * `RevolutionAnalytics Collection `_ -* `Sample R data sets `_ + +* `KDNuggets Data Collections `_ + * `Stats4Stem R data sets `_ -* `StatSci.org `_ -* `The Washington Post List `_ -* `UCLA SOCR data collection `_ -* `UFO Reports `_ -* `Wikileaks 911 pager intercepts `_ + * `Yahoo Webscope `_ - - -Search Engines --------------- - + +* `Data360 `_ + +* `UCLA SOCR data collection `_ + +* `Microsoft Azure Data Market Free DataSets `_ + +* `Wikileaks 911 pager intercepts `_ + +* `Data.World `_ + +* `Reddit Datasets `_ + +* `The Washington Post List `_ + +* `StatSci.org `_ + +* `Microsoft Data Science for Research `_ + +* `Open Library Data Dumps `_ + +* `Numbray `_ + +* `Sample R data sets `_ + +* `UFO Reports `_ + +* `Archive-it from Internet Archive `_ + +* `CMU JASA data archive `_ + +SearchEngines +------------- contribute + * `Academic Torrents of data sharing from UMB `_ -* `Datahub.io `_ -* `DataMarket (Qlik) `_ -* `Harvard Dataverse Network of scientific data `_ + * `ICPSR (UMICH) `_ -* `Institute of Education Sciences `_ -* `National Technical Reports Library `_ -* `Open Data Certificates (beta) `_ + +* `Datahub.io `_ + +* `Harvard Dataverse Network of scientific data `_ + * `OpenDataNetwork - A search engine of all Socrata powered data portals `_ + +* `Institute of Education Sciences `_ + +* `DataMarket (Qlik) `_ + +* `Open Data Certificates (beta) `_ + +* `National Technical Reports Library `_ + * `Statista.com - statistics and Studies `_ + * `Zenodo - An open dependable home for the long-tail of science `_ - - -Social Networks ---------------- - -* `72 hours #gamergate Twitter Scrape `_ -* `Ancestry.com Forum Dataset over 10 years `_ -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* `CMU Enron Email of 150 users `_ -* `EDRM Enron EMail of 
151 users, hosted on S3 `_ -* `Facebook Data Scrape (2005) `_ -* `Facebook Social Networks from LAW (since 2007) `_ -* `Foursquare from UMN/Sarwat (2013) `_ -* `GitHub Collaboration Archive `_ -* `Google Scholar citation relations `_ -* `High-Resolution Contact Networks from Wearable Sensors `_ -* `Indie Map: social graph and crawl of top IndieWeb sites `_ -* `Mobile Social Networks from UMASS `_ -* `Network Twitter Data `_ + +SocialNetworks +-------------- contribute + * `Reddit Comments `_ -* `Skytrax' Air Travel Reviews Dataset `_ -* `Social Twitter Data `_ -* `SourceForge.net Research Data `_ -* `Twitter Data for Online Reputation Management `_ -* `Twitter Data for Sentiment Analysis `_ -* `Twitter Graph of entire Twitter site `_ -* `Twitter Scrape Calufa May 2011 `_ -* `UNIMI/LAW Social Network Datasets `_ -* `Yahoo! Graph and Social Data `_ + * `Youtube Video Social Graph in 2007,2008 `_ - - -Social Sciences ---------------- - -* `ACLED (Armed Conflict Location & Event Data Project) `_ -* `Canadian Legal Information Institute `_ -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ -* `Correlates of War Project `_ -* `Cryptome Conspiracy Theory Items `_ -* `Datacards `_ -* `European Social Survey `_ -* `FBI Hate Crime 2013 - aggregated data `_ -* `Fragile States Index `_ -* `GDELT Global Events Database `_ -* `General Social Survey (GSS) since 1972 `_ -* `German Social Survey `_ -* `Global Religious Futures Project `_ -* `Humanitarian Data Exchange `_ + +* `High-Resolution Contact Networks from Wearable Sensors `_ + +* `Yahoo! 
Graph and Social Data `_ + +* `Facebook Data Scrape (2005) `_ + +* `Google Scholar citation relations `_ + +* `CMU Enron Email of 150 users `_ + +* `Foursquare from UMN/Sarwat (2013) `_ + +* `Twitter Graph of entire Twitter site `_ + +* `Twitter Data for Sentiment Analysis `_ + +* `Mobile Social Networks from UMASS `_ + +* `Skytrax' Air Travel Reviews Dataset `_ + +* `Network Twitter Data `_ + +* `SourceForge.net Research Data `_ + +* `Ancestry.com Forum Dataset over 10 years `_ + +* `Social Twitter Data `_ + +* `Twitter Scrape Calufa May 2011 `_ + +* `Facebook Social Networks from LAW (since 2007) `_ + +* `Indie Map: social graph and crawl of top IndieWeb sites `_ + +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ + +* `EDRM Enron EMail of 151 users, hosted on S3 `_ + +* `UNIMI/LAW Social Network Datasets `_ + +* `72 hours #gamergate Twitter Scrape `_ + +* `Twitter Data for Online Reputation Management `_ + +* `GitHub Collaboration Archive `_ + +SocialSciences +-------------- contribute + * `INFORM Index for Risk Management `_ -* `Institute for Demographic Studies `_ -* `International Networks Archive `_ -* `International Social Survey Program ISSP `_ -* `International Studies Compendium Project `_ -* `James McGuire Cross National Data `_ -* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ + +* `Correlates of War Project `_ + +* `Canadian Legal Information Institute `_ + * `Minnesota Population Center `_ -* `MIT Reality Mining Dataset `_ -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ + +* `Datacards `_ + +* `International Social Survey Program ISSP `_ + * `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* `Paul Hensel General International Data Page `_ -* `PewResearch Internet Survey Project `_ -* `PewResearch Society Data Collection `_ -* `Political Polarity Data `_ -* `StackExchange Data Explorer `_ -* `Terrorism Research and Analysis Consortium `_ -* `Texas Inmates Executed Since 1984 `_ -* 
`Titanic Survival Data Set `_ or `on Kaggle `_ -* `UCB's Archive of Social Science Data (D-Lab) `_ -* `UCLA Social Sciences Data Archive `_ -* `UN Civil Society Database `_ -* `Universities Worldwide `_ -* `UPJOHN for Labor Employment Research `_ -* `Uppsala Conflict Data Program `_ -* `World Bank Open Data `_ + +* `International Studies Compendium Project `_ + +* `FBI Hate Crime 2013 - aggregated data `_ + +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ + +* `ACLED (Armed Conflict Location & Event Data Project) `_ + +* `Institute for Demographic Studies `_ + +* `International Networks Archive `_ + +* `General Social Survey (GSS) since 1972 `_ + * `WorldPop project - Worldwide human population distributions `_ - - + +* `PewResearch Society Data Collection `_ + +* `Terrorism Research and Analysis Consortium `_ + +* `UN Civil Society Database `_ + +* `GDELT Global Events Database `_ + +* `Humanitarian Data Exchange `_ + +* `World Bank Open Data `_ + +* `James McGuire Cross National Data `_ + +* `German Social Survey `_ + +* `PewResearch Internet Survey Project `_ + +* `Global Religious Futures Project `_ + +* `Universities Worldwide `_ + +* `Fragile States Index `_ + +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ + +* `StackExchange Data Explorer `_ + +* `European Social Survey `_ + +* `Cryptome Conspiracy Theory Items `_ + +* `Political Polarity Data `_ + +* `Texas Inmates Executed Since 1984 `_ + +* `UCLA Social Sciences Data Archive `_ + +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ + +* `UPJOHN for Labor Employment Research `_ + +* `Uppsala Conflict Data Program `_ + +* `MIT Reality Mining Dataset `_ + +* `UCB's Archive of Social Science Data (D-Lab) `_ + +* `Titanic Survival Data Set `_ + +* `Paul Hensel General International Data Page `_ + Software --------- - +-------- contribute + * `FLOSSmole data about free, libre, and open source software development `_ - + Sports ------- - -* `Betfair 
Historical Exchange Data `_ -* `Cricsheet Matches (cricket) `_ -* `Ergast Formula 1, from 1950 up to date (API) `_ +------ contribute + * `Football/Soccer resources (data and APIs) `_ -* `Lahman's Baseball Database `_ + +* `Ergast Formula 1, from 1950 up to date (API) `_ + * `Pinhooker: Thoroughbred Bloodstock Sale Data `_ + * `Retrosheet Baseball Statistics `_ -* `Tennis database of rankings, results, and stats for ATP `_, `WTA `_, `Grand Slams `_ and `Match Charting Project `_ - - -Time Series ------------ - -* `Databanks International Cross National Time Series Data Archive `_ + +* `Cricsheet Matches (cricket) `_ + +* `Tennis database of rankings, results, and stats for ATP `_ + +* `Lahman's Baseball Database `_ + +* `Betfair Historical Exchange Data `_ + +TimeSeries +---------- contribute + * `Hard Drive Failure Rates `_ -* `Heart Rate Time Series from MIT `_ + * `Time Series Data Library (TSDL) from MU `_ + * `UC Riverside Time Series Dataset `_ - - + +* `Databanks International Cross National Time Series Data Archive `_ + +* `Heart Rate Time Series from MIT `_ + Transportation --------------- - -* `Airlines OD Data 1987-2008 `_ -* `Bay Area Bike Share Data `_ -* `Bike Share Systems (BSS) collection `_ -* `GeoLife GPS Trajectory from Microsoft Research `_ -* `German train system by Deutsche Bahn `_ -* `Hubway Million Rides in MA `_ -* `Montreal BIXI Bike Share `_ -* `NYC Taxi Trip Data 2009- `_ -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ -* `NYC Uber trip data April 2014 to September 2014 `_ -* `Open Traffic collection `_ -* `OpenFlights - airport, airline and route data `_ -* `Philadelphia Bike Share Stations (JSON) `_ -* `Plane Crash Database, since 1920 `_ -* `RITA Airline On-Time Performance data `_ -* `RITA/BTS transport data collection (TranStat) `_ -* `Toronto Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ -* `Travel Tracker Survey (TTS) for Chicago `_ -* `U.S. Bureau of Transportation Statistics (BTS) `_ -* `U.S. 
Domestic Flights 1990 to 2009 `_ +-------------- contribute + * `U.S. Freight Analysis Framework since 2007 `_ + +* `RITA/BTS transport data collection (TranStat) `_ + +* `GeoLife GPS Trajectory from Microsoft Research `_ + +* `NYC Taxi Trip Data 2009- `_ + +* `Plane Crash Database, since 1920 `_ + +* `RITA Airline On-Time Performance data `_ + +* `Travel Tracker Survey (TTS) for Chicago `_ + +* `U.S. Domestic Flights 1990 to 2009 `_ + +* `Philadelphia Bike Share Stations (JSON) `_ + +* `NYC Uber trip data April 2014 to September 2014 `_ + +* `OpenFlights - airport, airline and route data `_ + +* `Bay Area Bike Share Data `_ + +* `Montreal BIXI Bike Share `_ + +* `Hubway Million Rides in MA `_ + +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ + +* `Open Traffic collection `_ + +* `Transport for London (TFL) `_ + +* `U.S. Bureau of Transportation Statistics (BTS) `_ + +* `Toronto Bike Share Stations (XML file) `_ + +* `Bike Share Systems (BSS) collection `_ + +* `German train system by Deutsche Bahn `_ + +* `Airlines OD Data 1987-2008 `_ Complementary Collections ------------------------- * `Data Packaged Core Datasets `_ + * `Database of Scientific Code Contributions `_ + * A growing collection of public datasets: `CoolDatasets. `_ + * DataWrangling: `Some Datasets Available on the Web `_ + * Inside-r: `Finding Data on the Internet `_ + * OpenDataMonitor: `An overview of available open data resources in Europe `_ + * Quora: `Where can I find large datasets open to the public? 
`_ + * RS.io: `100+ Interesting Data Sets for Statistics `_ + * StaTrek: `Leveraging open data to understand urban lives `_ + From 0bbcc7d29c5ce3c7893ac5b34354e96946481a29 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:06:25 +0800 Subject: [PATCH 69/99] Update README from APD2 --- README.rst | 58 +++++++++++++++++++++++++++--------------------------- 1 file changed, 29 insertions(+), 29 deletions(-) diff --git a/README.rst b/README.rst index e16a1ae..547fe33 100644 --- a/README.rst +++ b/README.rst @@ -25,14 +25,14 @@ Other amazingly awesome lists can be found in the Agriculture ------------ contribute +----------- * `U.S. Department of Agriculture's Nutrient Database `_ * `U.S. Department of Agriculture's PLANTS Database `_ Biology -------- contribute +------- * `NCBI Proteins `_ @@ -121,7 +121,7 @@ Biology * `1000 Genomes `_ Climate+Weather ---------------- contribute +--------------- * `Global Climate Data Since 1929 `_ @@ -158,7 +158,7 @@ Climate+Weather * `NOAA SURFRAD Meteorology and Radiation Datasets `_ ComplexNetworks ---------------- contribute +--------------- * `DIMACS Road Networks Collection `_ @@ -201,7 +201,7 @@ ComplexNetworks * `AMiner Citation Network Dataset `_ ComputerNetworks ----------------- contribute +---------------- * `53.5B Web clicks of 100K users in Indiana Univ. 
`_ @@ -228,7 +228,7 @@ ComputerNetworks * `CommonCrawl Web Data over 7 years `_ DataChallenges --------------- contribute +-------------- * `Netflix Prize `_ @@ -259,7 +259,7 @@ DataChallenges * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ EarthScience ------------- contribute +------------ * `AQUASTAT - Global water resources and uses `_ @@ -278,7 +278,7 @@ EarthScience * `USGS Earthquake Archives `_ Economics ---------- contribute +--------- * `The Center for International Data `_ @@ -315,14 +315,14 @@ Economics * `Economic Freedom of the World Data `_ Education ---------- contribute +--------- * `Student Data from Free Code Camp `_ * `College Scorecard Data `_ Energy ------- contribute +------ * `DRED `_ @@ -353,7 +353,7 @@ Energy * `REDD `_ Finance -------- contribute +------- * `NASDAQ `_ @@ -376,7 +376,7 @@ Finance * `OSU Financial data `_ GIS ---- contribute +--- * `TZ Timezones shapfiles `_ @@ -425,7 +425,7 @@ GIS * `World countries in multiple formats `_ Government ----------- contribute +---------- * `New Zealand `_ @@ -652,7 +652,7 @@ Government * `Taiwan `_ Healthcare ----------- contribute +---------- * `PhysioBank Databases - A large and growing archive of physiological data. 
`_ @@ -681,7 +681,7 @@ Healthcare * `The Cancer Genome Atlas project (TCGA) `_ ImageProcessing ---------------- contribute +--------------- * `Several Shape-from-Silhouette Datasets `_ @@ -730,7 +730,7 @@ ImageProcessing * `Animals with attributes `_ MachineLearning ---------------- contribute +--------------- * `Discogs Monthly Data `_ @@ -773,7 +773,7 @@ MachineLearning * `Machine Learning Data Set Repository `_ Museums -------- contribute +------- * `Rijksmuseum Historical Art Collection `_ @@ -790,7 +790,7 @@ Museums * `Cooper-Hewitt's Collection Database `_ NaturalLanguage ---------------- contribute +--------------- * `Webhose - News/Blogs in multiple languages `_ @@ -855,7 +855,7 @@ NaturalLanguage * `Blogger Corpus `_ Neuroscience ------------- contribute +------------ * `Human Connectome Project `_ @@ -886,7 +886,7 @@ Neuroscience * `Study Forrest `_ Physics -------- contribute +------- * `CERN Open Data Portal `_ @@ -899,12 +899,12 @@ Physics * `NSSDC (NASA) data of 550 space spacecraft `_ Psychology+Cognition --------------------- contribute +-------------------- * `OSU Cognitive Modeling Repository Datasets `_ PublicDomains -------------- contribute +------------- * `Google `_ @@ -957,7 +957,7 @@ PublicDomains * `CMU JASA data archive `_ SearchEngines -------------- contribute +------------- * `Academic Torrents of data sharing from UMB `_ @@ -982,7 +982,7 @@ SearchEngines * `Zenodo - An open dependable home for the long-tail of science `_ SocialNetworks --------------- contribute +-------------- * `Reddit Comments `_ @@ -1035,7 +1035,7 @@ SocialNetworks * `GitHub Collaboration Archive `_ SocialSciences --------------- contribute +-------------- * `INFORM Index for Risk Management `_ @@ -1120,12 +1120,12 @@ SocialSciences * `Paul Hensel General International Data Page `_ Software --------- contribute +-------- * `FLOSSmole data about free, libre, and open source software development `_ Sports ------- contribute +------ * `Football/Soccer resources (data 
and APIs) `_ @@ -1144,7 +1144,7 @@ Sports * `Betfair Historical Exchange Data `_ TimeSeries ----------- contribute +---------- * `Hard Drive Failure Rates `_ @@ -1157,7 +1157,7 @@ TimeSeries * `Heart Rate Time Series from MIT `_ Transportation --------------- contribute +-------------- * `U.S. Freight Analysis Framework since 2007 `_ From 5419b62a9fe927f89eda1ce5978d0bf032e5f4e0 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:08:24 +0800 Subject: [PATCH 70/99] Update README from APD2 --- README.rst | 1394 ++++++++++++++++++++++++++-------------------------- 1 file changed, 697 insertions(+), 697 deletions(-) diff --git a/README.rst b/README.rst index 547fe33..784a064 100644 --- a/README.rst +++ b/README.rst @@ -34,850 +34,850 @@ Agriculture Biology ------- -* `NCBI Proteins `_ - -* `Gene Expression Omnibus (GEO) `_ - -* `UniGene `_ - -* `Gene Ontology (GO) `_ - -* `UCSC Public Data `_ - -* `EBI Protein Data Bank in Europe `_ - -* `OpenSNP genotypes data `_ - -* `The Personal Genome Project `_ - -* `Stowers Institute Original Data Repository `_ +* `1000 Genomes `_ * `American Gut (Microbiome Project) `_ -* `Systems Science of Biological Dynamics (SSBD) Database `_ - -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ - * `Broad Bioimage Benchmark Collection (BBBC) `_ -* `Journal of Cell Biology DataViewer `_ - -* `NCI Genomic Data Commons `_ - -* `Protein Data Bank `_ - -* `Pathguid - Protein-Protein Interactions Catalog `_ - -* `International HapMap Project `_ - -* `Global Biotic Interactions (GloBI) `_ - -* `NCBI Taxonomy `_ - -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ - * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* `Ensembl Genomes `_ - -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ - -* `ICOS PSP Benchmark `_ - -* `PubChem Project `_ - -* `Psychiatric Genomics Consortium `_ - -* `Human Microbiome Project (HMP) `_ - -* `Stanford Microarray Data `_ - -* `EBI ArrayExpress `_ - -* `Sanger Genomics 
of Drug Sensitivity in Cancer Project (GDSC) `_ - -* `PubGene (now Coremine Medical) `_ - -* `Harvard Medical School (HMS) LINCS Project `_ - -* `ENCODE project `_ +* `Cell Image Library `_ * `Complete Genomics Public Data `_ -* `Cell Image Library `_ +* `EBI ArrayExpress `_ -* `Universal Protein Resource (UnitProt) `_ +* `EBI Protein Data Bank in Europe `_ -* `MIT Cancer Genomics Data `_ +* `ENCODE project `_ -* `The Catalogue of Life `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* `NIH Microarray data `_ +* `Ensembl Genomes `_ -* `Sequence Read Archive(SRA) `_ +* `Gene Expression Omnibus (GEO) `_ + +* `Gene Ontology (GO) `_ + +* `Global Biotic Interactions (GloBI) `_ + +* `Harvard Medical School (HMS) LINCS Project `_ * `Human Genome Diversity Project `_ -* `1000 Genomes `_ +* `Human Microbiome Project (HMP) `_ + +* `ICOS PSP Benchmark `_ + +* `International HapMap Project `_ + +* `Journal of Cell Biology DataViewer `_ + +* `MIT Cancer Genomics Data `_ + +* `NCBI Proteins `_ + +* `NCBI Taxonomy `_ + +* `NCI Genomic Data Commons `_ + +* `NIH Microarray data `_ + +* `OpenSNP genotypes data `_ + +* `Pathguid - Protein-Protein Interactions Catalog `_ + +* `Protein Data Bank `_ + +* `Psychiatric Genomics Consortium `_ + +* `PubChem Project `_ + +* `PubGene (now Coremine Medical) `_ + +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ + +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ + +* `Sequence Read Archive(SRA) `_ + +* `Stanford Microarray Data `_ + +* `Stowers Institute Original Data Repository `_ + +* `Systems Science of Biological Dynamics (SSBD) Database `_ + +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ + +* `The Catalogue of Life `_ + +* `The Personal Genome Project `_ + +* `UCSC Public Data `_ + +* `UniGene `_ + +* `Universal Protein Resource (UnitProt) `_ Climate+Weather --------------- -* `Global Climate Data Since 1929 `_ - -* `The World Bank Open Data Resources for Climate Change `_ - -* 
`Brazilian Weather - Historical data (In Portuguese) `_ - -* `NOAA Bering Sea Climate `_ - -* `WU Historical Weather Worldwide `_ - -* `Climate Data from UEA (updated monthly) `_ - * `Actuaries Climate Index `_ -* `WorldClim - Global Climate Data `_ - * `Australian Weather `_ * `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ -* `NASA Global Imagery Browse Services `_ - -* `NOAA Realtime Weather Models `_ - -* `UEA Climatic Research Unit `_ - -* `European Climate Assessment & Dataset `_ +* `Brazilian Weather - Historical data (In Portuguese) `_ * `Canadian Meteorological Centre `_ +* `Climate Data from UEA (updated monthly) `_ + +* `European Climate Assessment & Dataset `_ + +* `Global Climate Data Since 1929 `_ + +* `NASA Global Imagery Browse Services `_ + +* `NOAA Bering Sea Climate `_ + * `NOAA Climate Datasets `_ +* `NOAA Realtime Weather Models `_ + * `NOAA SURFRAD Meteorology and Radiation Datasets `_ + +* `The World Bank Open Data Resources for Climate Change `_ + +* `UEA Climatic Research Unit `_ + +* `WU Historical Weather Worldwide `_ + +* `WorldClim - Global Climate Data `_ ComplexNetworks --------------- -* `DIMACS Road Networks Collection `_ - -* `UFL sparse matrix collection `_ - -* `Stanford GraphBase `_ - -* `DBLP Citation dataset `_ - -* `Small Network Data `_ +* `AMiner Citation Network Dataset `_ * `CrossRef DOI URLs `_ -* `The Nexus Network Repository `_ +* `DBLP Citation dataset `_ -* `Stanford Longitudinal Network Data Sources `_ +* `DIMACS Road Networks Collection `_ + +* `NBER Patent Citations `_ + +* `NIST complex networks data collection `_ + +* `Network Repository with Interactive Exploratory Analysis Tools `_ + +* `Protein-protein interaction network `_ * `PyPI and Maven Dependency Network `_ +* `Scopus Citation Database `_ + +* `Small Network Data `_ + +* `Stanford GraphBase `_ + * `Stanford Large Network Dataset Collection `_ -* `WSU Graph Database `_ +* `Stanford 
Longitudinal Network Data Sources `_ * `The Koblenz Network Collection `_ * `The Laboratory for Web Algorithmics (UNIMI) `_ -* `Network Repository with Interactive Exploratory Analysis Tools `_ +* `The Nexus Network Repository `_ * `UCI Network Data Repository `_ -* `Scopus Citation Database `_ +* `UFL sparse matrix collection `_ -* `NBER Patent Citations `_ - -* `Protein-protein interaction network `_ - -* `NIST complex networks data collection `_ - -* `AMiner Citation Network Dataset `_ +* `WSU Graph Database `_ ComputerNetworks ---------------- +* `3.5B Web Pages from CommonCrawl 2012 `_ + * `53.5B Web clicks of 100K users in Indiana Univ. `_ -* `Open Mobile Data by MobiPerf `_ - -* `ClueWeb12 - 733M web pages `_ - -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ - * `CAIDA Internet Datasets `_ +* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ + * `ClueWeb09 - 1B web pages `_ -* `UCSD Network Telescope, IPv4 /8 net `_ +* `ClueWeb12 - 733M web pages `_ + +* `CommonCrawl Web Data over 7 years `_ * `Criteo click-through data `_ -* `3.5B Web Pages from CommonCrawl 2012 `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ + +* `Open Mobile Data by MobiPerf `_ * `Rapid7 Sonar Internet Scans `_ -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ - -* `CommonCrawl Web Data over 7 years `_ +* `UCSD Network Telescope, IPv4 /8 net `_ DataChallenges -------------- +* `Bruteforce Database `_ + +* `Challenges in Machine Learning `_ + +* `CrowdANALYTIX dataX `_ + +* `D4D Challenge of Orange `_ + +* `DrivenData Competitions for Social Good `_ + +* `ICWSM Data Challenge (since 2009) `_ + +* `KDD Cup by Tencent 2012 `_ + +* `Kaggle Competition Data `_ + +* `Localytics Data Visualization Challenge `_ + * `Netflix Prize `_ * `Space Apps Challenge `_ -* `ICWSM Data Challenge (since 2009) `_ - -* `DrivenData Competitions for Social Good `_ - -* `CrowdANALYTIX dataX `_ - -* `Bruteforce Database `_ - -* `Kaggle 
Competition Data `_ - -* `Yelp Dataset Challenge `_ - -* `Localytics Data Visualization Challenge `_ - -* `D4D Challenge of Orange `_ - * `Telecom Italia Big Data Challenge `_ -* `KDD Cup by Tencent 2012 `_ - -* `Challenges in Machine Learning `_ - * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ + +* `Yelp Dataset Challenge `_ EarthScience ------------ * `AQUASTAT - Global water resources and uses `_ -* `Marinexplore - Open Oceanographic Data `_ +* `BODC - marine data of ~22K vars `_ * `EOSDIS - NASA's earth observing system data `_ -* `BODC - marine data of ~22K vars `_ +* `Earth Models `_ * `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `Marinexplore - Open Oceanographic Data `_ -* `Earth Models `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ * `USGS Earthquake Archives `_ Economics --------- -* `The Center for International Data `_ +* `American Economic Association (AEA) `_ + +* `EconData from UMD `_ + +* `Economic Freedom of the World Data `_ * `Historical MacroEconomc Statistics `_ * `International Economics Database `_ -* `Internet Product Code Database `_ - -* `American Economic Association (AEA) `_ - -* `Jon Haveman International Trade Data Links `_ - -* `The Observatory of Economic Complexity `_ - -* `The Atlas of Economic Complexity `_ - -* `SciencesPo World Trade Gravity Datasets `_ - -* `Our World in Data `_ - -* `UN Commodity Trade Statistics `_ - -* `OpenCorporates Database of Companies in the World `_ - * `International Trade Statistics `_ +* `Internet Product Code Database `_ + * `Joint External Debt Data Hub `_ -* `EconData from UMD `_ +* `Jon Haveman International Trade Data Links `_ + +* `OpenCorporates Database of Companies in the World `_ + +* `Our World in Data `_ + +* `SciencesPo World Trade Gravity Datasets `_ + +* `The Atlas of Economic Complexity `_ + +* `The Center for International Data `_ + +* `The 
Observatory of Economic Complexity `_ + +* `UN Commodity Trade Statistics `_ * `UN Human Development Reports `_ - -* `Economic Freedom of the World Data `_ Education --------- -* `Student Data from Free Code Camp `_ - * `College Scorecard Data `_ + +* `Student Data from Free Code Camp `_ Energy ------ -* `DRED `_ - -* `COMBED `_ - -* `iAWE `_ - * `AMPds `_ -* `ECO `_ - -* `WHITED `_ - -* `HES - Household Electricity Study, UK `_ - -* `PLAID - The Plug Load Appliance Identification Dataset `_ - * `BLUEd `_ -* `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* `COMBED `_ -* `HFED `_ +* `DRED `_ -* `Tracebase `_ +* `ECO `_ * `EIA `_ +* `HES - Household Electricity Study, UK `_ + +* `HFED `_ + +* `PLAID - The Plug Load Appliance Identification Dataset `_ + * `REDD `_ + +* `Tracebase `_ + +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ + +* `WHITED `_ + +* `iAWE `_ Finance ------- -* `NASDAQ `_ +* `CBOE Futures Exchange `_ * `Google Finance `_ -* `Yahoo Finance `_ +* `Google Trends `_ + +* `NASDAQ `_ * `NYSE Market Data `_ -* `CBOE Futures Exchange `_ - -* `St Louis Federal `_ - -* `Quandl `_ - -* `Google Trends `_ - * `OANDA `_ * `OSU Financial data `_ + +* `Quandl `_ + +* `St Louis Federal `_ + +* `Yahoo Finance `_ GIS --- -* `TZ Timezones shapfiles `_ +* `ArcGIS Open Data portal `_ -* `Pleiades - Gazetteer and graph of ancient places `_ - -* `OpenStreetMap (OSM) `_ +* `Cambridge, MA, US, GIS data on GitHub `_ * `Factual Global Location Data `_ -* `World boundaries from the U.S. 
Department of State `_ - -* `GeoNames Worldwide `_ - -* `Landsat 8 on AWS `_ - -* `Global Administrative Areas Database (GADM) `_ - -* `Natural Earth - vectors and rasters of the world `_ - * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ * `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* `Cambridge, MA, US, GIS data on GitHub `_ +* `GeoNames Worldwide `_ -* `ArcGIS Open Data portal `_ - -* `OpenAddresses `_ - -* `UN Environmental Data `_ - -* `TwoFishes - Foursquare's coarse geocoder `_ - -* `TIGER/Line - U.S. boundaries and roads `_ - -* `Reverse Geocoder using OSM data `_ +* `Global Administrative Areas Database (GADM) `_ * `Homeland Infrastructure Foundation-Level Data `_ +* `Landsat 8 on AWS `_ + * `List of all countries in all languages `_ * `National Weather Service GIS Data Portal `_ +* `Natural Earth - vectors and rasters of the world `_ + +* `OpenAddresses `_ + +* `OpenStreetMap (OSM) `_ + +* `Pleiades - Gazetteer and graph of ancient places `_ + +* `Reverse Geocoder using OSM data `_ + +* `TIGER/Line - U.S. boundaries and roads `_ + +* `TZ Timezones shapfiles `_ + +* `TwoFishes - Foursquare's coarse geocoder `_ + +* `UN Environmental Data `_ + +* `World boundaries from the U.S. Department of State `_ + * `World countries in multiple formats `_ Government ---------- -* `New Zealand `_ +* `Alberta, Province of Canada `_ -* `Glasgow, Scotland, UK `_ +* `Antwerp, Belgium `_ -* `Puerto Rico Government `_ +* `Argentina (non official) `_ -* `Vienna, Austria `_ +* `Argentina `_ -* `Missisauga, ON, Canada `_ +* `Austin, TX, US `_ -* `Open Government Data (OGD) Platform India `_ +* `Australia (abs.gov.au) `_ -* `Montreal, QC, Canada `_ +* `Australia (data.gov.au) `_ -* `Indian Government Data `_ - -* `U.S. Food and Drug Administration (FDA) `_ - -* `MassGIS, Massachusetts, U.S. `_ - -* `Los Angeles Open Data `_ - -* `Vancouver, BC Open Data Catalog `_ - -* `U.S. 
Federal Government Agencies `_ - -* `State of Utah, US `_ - -* `Buenos Aires, Argentina `_ - -* `Texas Open Data `_ +* `Austria (data.gv.at) `_ * `Baton Rouge, LA, US `_ -* `Netherlands `_ +* `Belgium `_ -* `Uganda Bureau of Statistics `_ +* `Brazil `_ -* `Palo Alto, California, US `_ +* `Buenos Aires, Argentina `_ -* `Victoria, BC, Canada `_ +* `Calgary, AB, Canada `_ -* `U.S. CDC Public Health datasets `_ +* `Cambridge, MA, US `_ -* `NYC Open Data `_ +* `Canada `_ -* `U.S. American Community Survey `_ +* `Chicago `_ + +* `Chile `_ + +* `Dallas Open Data `_ + +* `DataBC - data from the Province of British Columbia `_ + +* `Denver Open Data `_ + +* `Durham, NC Open Data `_ + +* `Edmonton, AB, Canada `_ + +* `England LGInform `_ + +* `EuroStat `_ + +* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ + +* `FedStats `_ * `Finland `_ +* `France `_ + +* `Fredericton, NB, Canada `_ + +* `Gatineau, QC, Canada `_ + +* `Germany `_ + +* `Ghent, Belgium `_ + +* `Glasgow, Scotland, UK `_ + +* `Greece `_ + * `Guardian world governments `_ +* `Halifax, NS, Canada `_ + +* `Helsinki Region, Finland `_ + +* `Hong Kong, China `_ + +* `Houston Open Data `_ + +* `Indian Government Data `_ + +* `Indonesian Data Portal `_ + +* `Ireland's Open Data Portal `_ + * `Japan `_ -* `Portland, Oregon `_ - -* `Uruguay `_ - -* `Australia (data.gov.au) `_ - * `Laval, QC, Canada `_ * `Lexington, KY `_ -* `Helsinki Region, Finland `_ +* `London Datastore, UK `_ + +* `London, ON, Canada `_ + +* `Los Angeles Open Data `_ + +* `MassGIS, Massachusetts, U.S. 
`_ + +* `Metropolitain Transportation Commission (MTC), California, US `_ * `Mexico `_ +* `Missisauga, ON, Canada `_ + +* `Moldova `_ + +* `Moncton, NB, Canada `_ + +* `Montreal, QC, Canada `_ + +* `Mountain View, California, US (GIS) `_ + +* `NYC Open Data `_ + +* `NYC betanyc `_ + +* `Netherlands `_ + +* `New Zealand `_ + +* `OECD `_ + +* `Oakland, California, US `_ + +* `Oklahoma `_ + +* `Open Data for Africa `_ + +* `Open Government Data (OGD) Platform India `_ + +* `OpenDataSoft's list of 1,600 open data `_ + +* `Oregon `_ + +* `Ottawa, ON, Canada `_ + +* `Palo Alto, California, US `_ + +* `Portland, Oregon `_ + +* `Portugal - Pordata organization `_ + +* `Puerto Rico Government `_ + +* `Quebec City, QC, Canada `_ + +* `Quebec Province of Canada `_ + +* `Regina SK, Canada `_ + +* `Rio de Janeiro, Brazil `_ + * `Romania `_ -* `Singapore Government Data `_ - -* `Chile `_ - -* `U.K. Government Data `_ - -* `Canada `_ - -* `Cambridge, MA, US `_ +* `Russia `_ * `San Francisco Data sets `_ * `San Jose, California, US `_ -* `FedStats `_ - -* `Germany `_ - -* `DataBC - data from the Province of British Columbia `_ - -* `U.S. Federal Government Data Catalog `_ - -* `Open Data for Africa `_ - -* `Toronto, ON, Canada `_ - -* `Ghent, Belgium `_ +* `San Mateo County, California, US `_ * `Saskatchewan, Province of Canada `_ -* `Gatineau, QC, Canada `_ - -* `Dallas Open Data `_ - -* `South Africa `_ - -* `Quebec City, QC, Canada `_ - -* `OECD `_ - -* `Denver Open Data `_ - -* `Portugal - Pordata organization `_ - -* `Metropolitain Transportation Commission (MTC), California, US `_ - -* `France `_ - -* `London, ON, Canada `_ - -* `San Mateo County, California, US `_ - -* `Houston Open Data `_ - -* `Edmonton, AB, Canada `_ - -* `Argentina (non official) `_ - -* `Chicago `_ - -* `Durham, NC Open Data `_ - -* `Alberta, Province of Canada `_ - -* `Oklahoma `_ - -* `Belgium `_ - -* `Moldova `_ - -* `Austria (data.gv.at) `_ - -* `Greece `_ - -* `U.S. 
National Center for Education Statistics (NCES) `_ - -* `Brazil `_ - -* `Austin, TX, US `_ - -* `Moncton, NB, Canada `_ - -* `Mountain View, California, US (GIS) `_ - -* `OpenDataSoft's list of 1,600 open data `_ - -* `England LGInform `_ - -* `Valley Transportation Authority (VTA), California, US `_ - -* `Switzerland `_ - -* `U.S. Department of Housing and Urban Development (HUD) `_ - -* `Antwerp, Belgium `_ - -* `Ireland's Open Data Portal `_ - -* `UK 2011 Census Open Atlas Project `_ - -* `Rio de Janeiro, Brazil `_ - -* `Russia `_ - -* `Australia (abs.gov.au) `_ - -* `Taiwan g0v `_ - -* `Halifax, NS, Canada `_ - -* `Argentina `_ - -* `Hong Kong, China `_ - -* `U.S. Open Government `_ - -* `Calgary, AB, Canada `_ - -* `EuroStat `_ - * `Seattle `_ -* `NYC betanyc `_ - -* `London Datastore, UK `_ - -* `The World Bank `_ - -* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ - -* `U.S. Census Bureau `_ - -* `Tunisia `_ - -* `Indonesian Data Portal `_ - -* `Oregon `_ - -* `Fredericton, NB, Canada `_ +* `Singapore Government Data `_ * `South Africa Trade Statistics `_ -* `Ottawa, ON, Canada `_ +* `South Africa `_ -* `Regina SK, Canada `_ +* `State of Utah, US `_ + +* `Switzerland `_ + +* `Taiwan g0v `_ + +* `Taiwan `_ + +* `Texas Open Data `_ + +* `The World Bank `_ + +* `Toronto, ON, Canada `_ + +* `Tunisia `_ + +* `U.K. Government Data `_ + +* `U.S. American Community Survey `_ + +* `U.S. CDC Public Health datasets `_ + +* `U.S. Census Bureau `_ + +* `U.S. Department of Housing and Urban Development (HUD) `_ + +* `U.S. Federal Government Agencies `_ + +* `U.S. Federal Government Data Catalog `_ + +* `U.S. Food and Drug Administration (FDA) `_ + +* `U.S. National Center for Education Statistics (NCES) `_ + +* `U.S. 
Open Government `_ + +* `UK 2011 Census Open Atlas Project `_ + +* `Uganda Bureau of Statistics `_ * `United Nations `_ -* `Oakland, California, US `_ +* `Uruguay `_ -* `Quebec Province of Canada `_ +* `Valley Transportation Authority (VTA), California, US `_ -* `Taiwan `_ +* `Vancouver, BC Open Data Catalog `_ + +* `Victoria, BC, Canada `_ + +* `Vienna, Austria `_ Healthcare ---------- -* `PhysioBank Databases - A large and growing archive of physiological data. `_ +* `EHDP Large Health Data Sets `_ -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ * `Gapminder World demographic databases `_ -* `Open-ODS (structure of the UK NHS) `_ +* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ - -* `EHDP Large Health Data Sets `_ +* `Medicare Coverage Database (MCD), U.S. `_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ + +* `Open-ODS (structure of the UK NHS) `_ + * `OpenPaymentsData, Healthcare financial relationship data `_ -* `World Health Organization Global Health Observatory `_ - -* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ - -* `Medicare Coverage Database (MCD), U.S. `_ +* `PhysioBank Databases - A large and growing archive of physiological data. 
`_ * `The Cancer Genome Atlas project (TCGA) `_ + +* `World Health Organization Global Health Observatory `_ ImageProcessing --------------- -* `Several Shape-from-Silhouette Datasets `_ - -* `Stanford Dogs Dataset `_ - -* `Flickr: 32 Class Brand Logos `_ - -* `Indoor Scene Recognition `_ - -* `YouTube Faces Database `_ - -* `MNIST database of handwritten digits, near 1 million examples `_ - -* `Visual genome `_ - -* `Affective Image Classification `_ - -* `Adience Unfiltered faces for gender and age classification `_ - -* `The Oxford-IIIT Pet Dataset `_ +* `10k US Adult Faces Database `_ * `2GB of Photos of Cats `_ -* `The Action Similarity Labeling (ASLAN) Challenge `_ +* `Adience Unfiltered faces for gender and age classification `_ -* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* `Affective Image Classification `_ -* `10k US Adult Faces Database `_ +* `Animals with attributes `_ * `Caltech Pedestrian Detection Benchmark `_ -* `Massive Visual Memory Stimuli, MIT `_ +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ -* `International Affective Picture System, UFL `_ +* `Face Recognition Benchmark `_ -* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ - -* `SUN database, MIT `_ +* `Flickr: 32 Class Brand Logos `_ * `GDXray - X-ray images for X-ray testing and Computer Vision `_ * `ImageNet (in WordNet hierarchy) `_ -* `Face Recognition Benchmark `_ +* `Indoor Scene Recognition `_ -* `Animals with attributes `_ +* `International Affective Picture System, UFL `_ + +* `MNIST database of handwritten digits, near 1 million examples `_ + +* `Massive Visual Memory Stimuli, MIT `_ + +* `SUN database, MIT `_ + +* `Several Shape-from-Silhouette Datasets `_ + +* `Stanford Dogs Dataset `_ + +* `The Action Similarity Labeling (ASLAN) Challenge `_ + +* `The Oxford-IIIT Pet Dataset `_ + +* `Violent-Flows - Crowd Violence / Non-violence 
Database and benchmark `_ + +* `Visual genome `_ + +* `YouTube Faces Database `_ MachineLearning --------------- +* `Context-aware data sets from five domains `_ + +* `Delve Datasets for classification and regression `_ + * `Discogs Monthly Data `_ * `Free Music Archive `_ -* `Delve Datasets for classification and regression `_ - -* `Yahoo! Ratings and Classification Data `_ - -* `Restaurants Health Score Data in San Francisco `_ - -* `Context-aware data sets from five domains `_ - -* `More Song Datasets `_ - -* `Lending Club Loan Data `_ - -* `MovieLens Data Sets `_ - -* `Labeled Faces in the Wild (LFW) `_ - -* `eBay Online Auctions (2012) `_ - -* `UCI Machine Learning Repository `_ - -* `Youtube 8m `_ - -* `RDataMining - "R and Data Mining" ebook data `_ - * `IMDb Database `_ * `Keel Repository for classification, regression and time series `_ -* `Registered Meteorites on Earth `_ +* `Labeled Faces in the Wild (LFW) `_ + +* `Lending Club Loan Data `_ + +* `Machine Learning Data Set Repository `_ * `Million Song Dataset `_ +* `More Song Datasets `_ + +* `MovieLens Data Sets `_ + * `New Yorker caption contest ratings `_ -* `Machine Learning Data Set Repository `_ +* `RDataMining - "R and Data Mining" ebook data `_ + +* `Registered Meteorites on Earth `_ + +* `Restaurants Health Score Data in San Francisco `_ + +* `UCI Machine Learning Repository `_ + +* `Yahoo! 
Ratings and Classification Data `_ + +* `Youtube 8m `_ + +* `eBay Online Auctions (2012) `_ Museums ------- +* `Canada Science and Technology Museums Corporation's Open Data `_ + +* `Cooper-Hewitt's Collection Database `_ + +* `Minneapolis Institute of Arts metadata `_ + +* `Natural History Museum (London) Data Portal `_ + * `Rijksmuseum Historical Art Collection `_ * `Tate Collection metadata `_ -* `Canada Science and Technology Museums Corporation's Open Data `_ - -* `Natural History Museum (London) Data Portal `_ - * `The Getty vocabularies `_ - -* `Minneapolis Institute of Arts metadata `_ - -* `Cooper-Hewitt's Collection Database `_ NaturalLanguage --------------- -* `Webhose - News/Blogs in multiple languages `_ - -* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ - -* `Universal Dependencies `_ - -* `SMS Spam Collection in English `_ - -* `Stanford Question Answering Dataset (SQuAD) `_ - -* `Flickr Personal Taxonomies `_ - -* `Google Books Ngrams (2.2TB) `_ - -* `DBpedia - 4.58M things with 583M facts `_ - -* `Personae Corpus `_ - -* `Wikipedia Links data - 40 Million Entities in Context `_ - * `Automatic Keyphrase Extraction `_ -* `ClueWeb12 FACC `_ +* `Blogger Corpus `_ * `CLiPS Stylometry Investigation Corpus `_ -* `Making Sense of Microposts 2013 - Concept Extraction `_ - * `ClueWeb09 FACC `_ -* `WordNet databases and tools `_ +* `ClueWeb12 FACC `_ -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* `DBpedia - 4.58M things with 583M facts `_ -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ - -* `Wikidata - Wikipedia databases `_ - -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ - -* `Gutenberg eBooks List `_ - -* `Google Web 5gram (1TB, 2006) `_ - -* `POS/NER/Chunk annotated data `_ +* `Flickr Personal Taxonomies `_ * `Freebase of people, places, and things `_ +* `Google Books Ngrams (2.2TB) `_ + +* `Google MC-AFP 
- Generated based on the public available Gigaword dataset using Paragraph Vectors `_ + +* `Google Web 5gram (1TB, 2006) `_ + +* `Gutenberg eBooks List `_ + * `Hansards text chunks of Canadian Parliament `_ -* `Machine Translation of European languages `_ - -* `Multi-Domain Sentiment Dataset (version 2.0) `_ - -* `USENET postings corpus of 2005~2011 `_ - -* `Open Multilingual Wordnet `_ - * `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ -* `Blogger Corpus `_ +* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ + +* `Machine Translation of European languages `_ + +* `Making Sense of Microposts 2013 - Concept Extraction `_ + +* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ + +* `Multi-Domain Sentiment Dataset (version 2.0) `_ + +* `Open Multilingual Wordnet `_ + +* `POS/NER/Chunk annotated data `_ + +* `Personae Corpus `_ + +* `SMS Spam Collection in English `_ + +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ + +* `Stanford Question Answering Dataset (SQuAD) `_ + +* `USENET postings corpus of 2005~2011 `_ + +* `Universal Dependencies `_ + +* `Webhose - News/Blogs in multiple languages `_ + +* `Wikidata - Wikipedia databases `_ + +* `Wikipedia Links data - 40 Million Entities in Context `_ + +* `WordNet databases and tools `_ Neuroscience ------------ -* `Human Connectome Project `_ +* `Allen Institute Datasets `_ * `Brain Catalogue `_ +* `Brainomics `_ + * `CodeNeuro Datasets `_ -* `Neuroelectro `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* `Allen Institute Datasets `_ +* `FCP-INDI `_ + +* `Human Connectome Project `_ * `NDAR `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ - * `NIMH Data Archive `_ * `NeuroData `_ -* `Brainomics `_ - -* `FCP-INDI `_ +* `Neuroelectro `_ * `OASIS `_ @@ -890,13 +890,13 @@ Physics * `CERN Open Data Portal `_ -* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ - * 
`Crystallography Open Database `_ * `NASA Exoplanet Archive `_ * `NSSDC (NASA) data of 550 space spacecraft `_ + +* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ Psychology+Cognition -------------------- @@ -906,76 +906,76 @@ Psychology+Cognition PublicDomains ------------- -* `Google `_ - * `Amazon `_ -* `Infochimps `_ - -* `CMU StatLab collections `_ - * `Archive.org Datasets `_ -* `Enigma Public `_ - -* `RevolutionAnalytics Collection `_ - -* `KDNuggets Data Collections `_ - -* `Stats4Stem R data sets `_ - -* `Yahoo Webscope `_ - -* `Data360 `_ - -* `UCLA SOCR data collection `_ - -* `Microsoft Azure Data Market Free DataSets `_ - -* `Wikileaks 911 pager intercepts `_ - -* `Data.World `_ - -* `Reddit Datasets `_ - -* `The Washington Post List `_ - -* `StatSci.org `_ - -* `Microsoft Data Science for Research `_ - -* `Open Library Data Dumps `_ - -* `Numbray `_ - -* `Sample R data sets `_ - -* `UFO Reports `_ - * `Archive-it from Internet Archive `_ * `CMU JASA data archive `_ + +* `CMU StatLab collections `_ + +* `Data.World `_ + +* `Data360 `_ + +* `Enigma Public `_ + +* `Google `_ + +* `Infochimps `_ + +* `KDNuggets Data Collections `_ + +* `Microsoft Azure Data Market Free DataSets `_ + +* `Microsoft Data Science for Research `_ + +* `Numbray `_ + +* `Open Library Data Dumps `_ + +* `Reddit Datasets `_ + +* `RevolutionAnalytics Collection `_ + +* `Sample R data sets `_ + +* `StatSci.org `_ + +* `Stats4Stem R data sets `_ + +* `The Washington Post List `_ + +* `UCLA SOCR data collection `_ + +* `UFO Reports `_ + +* `Wikileaks 911 pager intercepts `_ + +* `Yahoo Webscope `_ SearchEngines ------------- * `Academic Torrents of data sharing from UMB `_ -* `ICPSR (UMICH) `_ +* `DataMarket (Qlik) `_ * `Datahub.io `_ * `Harvard Dataverse Network of scientific data `_ -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* `ICPSR (UMICH) `_ * `Institute of Education Sciences `_ -* `DataMarket (Qlik) `_ +* `National Technical Reports 
Library `_ * `Open Data Certificates (beta) `_ -* `National Technical Reports Library `_ +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ * `Statista.com - statistics and Studies `_ @@ -984,140 +984,140 @@ SearchEngines SocialNetworks -------------- -* `Reddit Comments `_ - -* `Youtube Video Social Graph in 2007,2008 `_ - -* `High-Resolution Contact Networks from Wearable Sensors `_ - -* `Yahoo! Graph and Social Data `_ - -* `Facebook Data Scrape (2005) `_ - -* `Google Scholar citation relations `_ - -* `CMU Enron Email of 150 users `_ - -* `Foursquare from UMN/Sarwat (2013) `_ - -* `Twitter Graph of entire Twitter site `_ - -* `Twitter Data for Sentiment Analysis `_ - -* `Mobile Social Networks from UMASS `_ - -* `Skytrax' Air Travel Reviews Dataset `_ - -* `Network Twitter Data `_ - -* `SourceForge.net Research Data `_ +* `72 hours #gamergate Twitter Scrape `_ * `Ancestry.com Forum Dataset over 10 years `_ -* `Social Twitter Data `_ - -* `Twitter Scrape Calufa May 2011 `_ - -* `Facebook Social Networks from LAW (since 2007) `_ - -* `Indie Map: social graph and crawl of top IndieWeb sites `_ +* `CMU Enron Email of 150 users `_ * `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ * `EDRM Enron EMail of 151 users, hosted on S3 `_ -* `UNIMI/LAW Social Network Datasets `_ +* `Facebook Data Scrape (2005) `_ -* `72 hours #gamergate Twitter Scrape `_ +* `Facebook Social Networks from LAW (since 2007) `_ + +* `Foursquare from UMN/Sarwat (2013) `_ + +* `GitHub Collaboration Archive `_ + +* `Google Scholar citation relations `_ + +* `High-Resolution Contact Networks from Wearable Sensors `_ + +* `Indie Map: social graph and crawl of top IndieWeb sites `_ + +* `Mobile Social Networks from UMASS `_ + +* `Network Twitter Data `_ + +* `Reddit Comments `_ + +* `Skytrax' Air Travel Reviews Dataset `_ + +* `Social Twitter Data `_ + +* `SourceForge.net Research Data `_ * `Twitter Data for Online Reputation Management `_ -* `GitHub 
Collaboration Archive `_ +* `Twitter Data for Sentiment Analysis `_ + +* `Twitter Graph of entire Twitter site `_ + +* `Twitter Scrape Calufa May 2011 `_ + +* `UNIMI/LAW Social Network Datasets `_ + +* `Yahoo! Graph and Social Data `_ + +* `Youtube Video Social Graph in 2007,2008 `_ SocialSciences -------------- -* `INFORM Index for Risk Management `_ - -* `Correlates of War Project `_ +* `ACLED (Armed Conflict Location & Event Data Project) `_ * `Canadian Legal Information Institute `_ -* `Minnesota Population Center `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ + +* `Correlates of War Project `_ + +* `Cryptome Conspiracy Theory Items `_ * `Datacards `_ -* `International Social Survey Program ISSP `_ - -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ - -* `International Studies Compendium Project `_ +* `European Social Survey `_ * `FBI Hate Crime 2013 - aggregated data `_ -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* `Fragile States Index `_ -* `ACLED (Armed Conflict Location & Event Data Project) `_ +* `GDELT Global Events Database `_ + +* `General Social Survey (GSS) since 1972 `_ + +* `German Social Survey `_ + +* `Global Religious Futures Project `_ + +* `Humanitarian Data Exchange `_ + +* `INFORM Index for Risk Management `_ * `Institute for Demographic Studies `_ * `International Networks Archive `_ -* `General Social Survey (GSS) since 1972 `_ +* `International Social Survey Program ISSP `_ -* `WorldPop project - Worldwide human population distributions `_ - -* `PewResearch Society Data Collection `_ - -* `Terrorism Research and Analysis Consortium `_ - -* `UN Civil Society Database `_ - -* `GDELT Global Events Database `_ - -* `Humanitarian Data Exchange `_ - -* `World Bank Open Data `_ +* `International Studies Compendium Project `_ * `James McGuire Cross National Data `_ -* `German Social Survey `_ - -* `PewResearch Internet Survey 
Project `_ - -* `Global Religious Futures Project `_ - -* `Universities Worldwide `_ - -* `Fragile States Index `_ - -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ - -* `StackExchange Data Explorer `_ - -* `European Social Survey `_ - -* `Cryptome Conspiracy Theory Items `_ - -* `Political Polarity Data `_ - -* `Texas Inmates Executed Since 1984 `_ - -* `UCLA Social Sciences Data Archive `_ +* `MIT Reality Mining Dataset `_ * `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ -* `UPJOHN for Labor Employment Research `_ +* `Minnesota Population Center `_ -* `Uppsala Conflict Data Program `_ +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ -* `MIT Reality Mining Dataset `_ +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* `UCB's Archive of Social Science Data (D-Lab) `_ +* `Paul Hensel General International Data Page `_ + +* `PewResearch Internet Survey Project `_ + +* `PewResearch Society Data Collection `_ + +* `Political Polarity Data `_ + +* `StackExchange Data Explorer `_ + +* `Terrorism Research and Analysis Consortium `_ + +* `Texas Inmates Executed Since 1984 `_ * `Titanic Survival Data Set `_ -* `Paul Hensel General International Data Page `_ +* `UCB's Archive of Social Science Data (D-Lab) `_ + +* `UCLA Social Sciences Data Archive `_ + +* `UN Civil Society Database `_ + +* `UPJOHN for Labor Employment Research `_ + +* `Universities Worldwide `_ + +* `Uppsala Conflict Data Program `_ + +* `World Bank Open Data `_ + +* `WorldPop project - Worldwide human population distributions `_ Software -------- @@ -1127,81 +1127,81 @@ Software Sports ------ -* `Football/Soccer resources (data and APIs) `_ +* `Betfair Historical Exchange Data `_ + +* `Cricsheet Matches (cricket) `_ * `Ergast Formula 1, from 1950 up to date (API) `_ +* `Football/Soccer resources (data and APIs) `_ + +* `Lahman's Baseball Database `_ + * `Pinhooker: Thoroughbred Bloodstock Sale Data `_ * `Retrosheet Baseball Statistics `_ -* `Cricsheet 
Matches (cricket) `_ - * `Tennis database of rankings, results, and stats for ATP `_ - -* `Lahman's Baseball Database `_ - -* `Betfair Historical Exchange Data `_ TimeSeries ---------- +* `Databanks International Cross National Time Series Data Archive `_ + * `Hard Drive Failure Rates `_ +* `Heart Rate Time Series from MIT `_ + * `Time Series Data Library (TSDL) from MU `_ * `UC Riverside Time Series Dataset `_ - -* `Databanks International Cross National Time Series Data Archive `_ - -* `Heart Rate Time Series from MIT `_ Transportation -------------- -* `U.S. Freight Analysis Framework since 2007 `_ +* `Airlines OD Data 1987-2008 `_ -* `RITA/BTS transport data collection (TranStat) `_ +* `Bay Area Bike Share Data `_ + +* `Bike Share Systems (BSS) collection `_ * `GeoLife GPS Trajectory from Microsoft Research `_ +* `German train system by Deutsche Bahn `_ + +* `Hubway Million Rides in MA `_ + +* `Montreal BIXI Bike Share `_ + * `NYC Taxi Trip Data 2009- `_ +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ + +* `NYC Uber trip data April 2014 to September 2014 `_ + +* `Open Traffic collection `_ + +* `OpenFlights - airport, airline and route data `_ + +* `Philadelphia Bike Share Stations (JSON) `_ + * `Plane Crash Database, since 1920 `_ * `RITA Airline On-Time Performance data `_ -* `Travel Tracker Survey (TTS) for Chicago `_ - -* `U.S. Domestic Flights 1990 to 2009 `_ - -* `Philadelphia Bike Share Stations (JSON) `_ - -* `NYC Uber trip data April 2014 to September 2014 `_ - -* `OpenFlights - airport, airline and route data `_ - -* `Bay Area Bike Share Data `_ - -* `Montreal BIXI Bike Share `_ - -* `Hubway Million Rides in MA `_ - -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ - -* `Open Traffic collection `_ - -* `Transport for London (TFL) `_ - -* `U.S. 
Bureau of Transportation Statistics (BTS) `_ +* `RITA/BTS transport data collection (TranStat) `_ * `Toronto Bike Share Stations (XML file) `_ -* `Bike Share Systems (BSS) collection `_ +* `Transport for London (TFL) `_ -* `German train system by Deutsche Bahn `_ +* `Travel Tracker Survey (TTS) for Chicago `_ -* `Airlines OD Data 1987-2008 `_ +* `U.S. Bureau of Transportation Statistics (BTS) `_ + +* `U.S. Domestic Flights 1990 to 2009 `_ + +* `U.S. Freight Analysis Framework since 2007 `_ Complementary Collections From e48dfe7f8abd3b2230d7fb8432c99a2ae773caba Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:18:56 +0800 Subject: [PATCH 71/99] Remove travis.yml --- .travis.yml | 10 ---------- 1 file changed, 10 deletions(-) delete mode 100644 .travis.yml diff --git a/.travis.yml b/.travis.yml deleted file mode 100644 index 066e607..0000000 --- a/.travis.yml +++ /dev/null @@ -1,10 +0,0 @@ -# language: ruby -# rvm: -# - 2.2 -# before_script: -# - gem install awesome_bot -# script: -# - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ -# - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi -# - site503=datamob.org,research.microsoft.com -# - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 From edad2c21378715b38cc4607ba59625c66ceaa614 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 11:54:59 +0800 Subject: [PATCH 72/99] 
Update README from APD2 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 784a064..7e07728 100644 --- a/README.rst +++ b/README.rst @@ -7,9 +7,9 @@ Awesome Public Datasets **NOTICE**: This repo is automatically generated by `APD2 `_. -Please **DO NOT** modify this file directly. We now provide +Please **DO NOT** modify this file directly. We have provided `a new way `_ -to contribute to Awesome Public Datasets. +to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. `This list of a topic-centric public data sources `_ From 5a514018faf7dbc230fb63bed0dc3b22adb1db59 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 15 Jan 2018 08:56:17 +0000 Subject: [PATCH 73/99] Update README from APD2: b6cb4c7e6ede5a60f527bcf52a799144a71134a4 --- README.rst | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 7e07728..b3f5bc0 100644 --- a/README.rst +++ b/README.rst @@ -15,9 +15,7 @@ to contribute to Awesome Public Datasets. The original PR entrance directly on r `This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. -Other amazingly awesome lists can be found in the -`awesome-awesomeness `_ and -`sindresorhus's awesome `_ list. +Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list. .. 
contents:: Table of Contents From 08be9a61d56d443ee4211f26a287c7eee6831d2a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 15 Jan 2018 17:04:57 +0000 Subject: [PATCH 74/99] Update README from APD2: 38dab34a03d13dba645974a2dfe88a7bbe74aab9 --- README.rst | 1096 ++++++++++++++++++++++++++-------------------------- 1 file changed, 549 insertions(+), 547 deletions(-) diff --git a/README.rst b/README.rst index b3f5bc0..67c4bda 100644 --- a/README.rst +++ b/README.rst @@ -5,6 +5,8 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome +.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/ok-24.png +.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/fixme-24.png **NOTICE**: This repo is automatically generated by `APD2 `_. Please **DO NOT** modify this file directly. We have provided @@ -18,1188 +20,1188 @@ Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list. -.. contents:: Table of Contents +.. contents:: **Table of Contents** Agriculture ----------- -* `U.S. Department of Agriculture's Nutrient Database `_ +* `U.S. Department of Agriculture's Nutrient Database `_ |OK_ICON| -* `U.S. Department of Agriculture's PLANTS Database `_ +* `U.S. 
Department of Agriculture's PLANTS Database `_ |OK_ICON| Biology ------- -* `1000 Genomes `_ +* `1000 Genomes `_ |OK_ICON| -* `American Gut (Microbiome Project) `_ +* `American Gut (Microbiome Project) `_ |OK_ICON| -* `Broad Bioimage Benchmark Collection (BBBC) `_ +* `Broad Bioimage Benchmark Collection (BBBC) `_ |OK_ICON| -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ |OK_ICON| -* `Cell Image Library `_ +* `Cell Image Library `_ |OK_ICON| -* `Complete Genomics Public Data `_ +* `Complete Genomics Public Data `_ |OK_ICON| -* `EBI ArrayExpress `_ +* `EBI ArrayExpress `_ |OK_ICON| -* `EBI Protein Data Bank in Europe `_ +* `EBI Protein Data Bank in Europe `_ |OK_ICON| -* `ENCODE project `_ +* `ENCODE project `_ |OK_ICON| -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ |OK_ICON| -* `Ensembl Genomes `_ +* `Ensembl Genomes `_ |OK_ICON| -* `Gene Expression Omnibus (GEO) `_ +* `Gene Expression Omnibus (GEO) `_ |OK_ICON| -* `Gene Ontology (GO) `_ +* `Gene Ontology (GO) `_ |OK_ICON| -* `Global Biotic Interactions (GloBI) `_ +* `Global Biotic Interactions (GloBI) `_ |OK_ICON| -* `Harvard Medical School (HMS) LINCS Project `_ +* `Harvard Medical School (HMS) LINCS Project `_ |OK_ICON| -* `Human Genome Diversity Project `_ +* `Human Genome Diversity Project `_ |OK_ICON| -* `Human Microbiome Project (HMP) `_ +* `Human Microbiome Project (HMP) `_ |OK_ICON| -* `ICOS PSP Benchmark `_ +* `ICOS PSP Benchmark `_ |OK_ICON| -* `International HapMap Project `_ +* `International HapMap Project `_ |OK_ICON| -* `Journal of Cell Biology DataViewer `_ +* `Journal of Cell Biology DataViewer `_ |OK_ICON| -* `MIT Cancer Genomics Data `_ +* `MIT Cancer Genomics Data `_ |OK_ICON| -* `NCBI Proteins `_ +* `NCBI Proteins `_ |OK_ICON| -* `NCBI Taxonomy `_ +* `NCBI Taxonomy `_ |OK_ICON| -* `NCI Genomic Data Commons `_ +* `NCI Genomic Data Commons `_ |OK_ICON| -* `NIH Microarray data `_ 
+* `NIH Microarray data `_ |FIXME_ICON| -* `OpenSNP genotypes data `_ +* `OpenSNP genotypes data `_ |OK_ICON| -* `Pathguid - Protein-Protein Interactions Catalog `_ +* `Pathguid - Protein-Protein Interactions Catalog `_ |OK_ICON| -* `Protein Data Bank `_ +* `Protein Data Bank `_ |OK_ICON| -* `Psychiatric Genomics Consortium `_ +* `Psychiatric Genomics Consortium `_ |OK_ICON| -* `PubChem Project `_ +* `PubChem Project `_ |OK_ICON| -* `PubGene (now Coremine Medical) `_ +* `PubGene (now Coremine Medical) `_ |OK_ICON| -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ |OK_ICON| -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ |OK_ICON| -* `Sequence Read Archive(SRA) `_ +* `Sequence Read Archive(SRA) `_ |OK_ICON| -* `Stanford Microarray Data `_ +* `Stanford Microarray Data `_ |FIXME_ICON| -* `Stowers Institute Original Data Repository `_ +* `Stowers Institute Original Data Repository `_ |OK_ICON| -* `Systems Science of Biological Dynamics (SSBD) Database `_ +* `Systems Science of Biological Dynamics (SSBD) Database `_ |OK_ICON| -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ |OK_ICON| -* `The Catalogue of Life `_ +* `The Catalogue of Life `_ |OK_ICON| -* `The Personal Genome Project `_ +* `The Personal Genome Project `_ |OK_ICON| -* `UCSC Public Data `_ +* `UCSC Public Data `_ |OK_ICON| -* `UniGene `_ +* `UniGene `_ |OK_ICON| -* `Universal Protein Resource (UnitProt) `_ +* `Universal Protein Resource (UnitProt) `_ |OK_ICON| Climate+Weather --------------- -* `Actuaries Climate Index `_ +* `Actuaries Climate Index `_ |OK_ICON| -* `Australian Weather `_ +* `Australian Weather `_ |OK_ICON| -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ +* `Aviation Weather Center - 
Consistent, timely and accurate weather information for the world airspace system `_ |OK_ICON| -* `Brazilian Weather - Historical data (In Portuguese) `_ +* `Brazilian Weather - Historical data (In Portuguese) `_ |OK_ICON| -* `Canadian Meteorological Centre `_ +* `Canadian Meteorological Centre `_ |OK_ICON| -* `Climate Data from UEA (updated monthly) `_ +* `Climate Data from UEA (updated monthly) `_ |OK_ICON| -* `European Climate Assessment & Dataset `_ +* `European Climate Assessment & Dataset `_ |OK_ICON| -* `Global Climate Data Since 1929 `_ +* `Global Climate Data Since 1929 `_ |OK_ICON| -* `NASA Global Imagery Browse Services `_ +* `NASA Global Imagery Browse Services `_ |OK_ICON| -* `NOAA Bering Sea Climate `_ +* `NOAA Bering Sea Climate `_ |FIXME_ICON| -* `NOAA Climate Datasets `_ +* `NOAA Climate Datasets `_ |OK_ICON| -* `NOAA Realtime Weather Models `_ +* `NOAA Realtime Weather Models `_ |OK_ICON| -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ |OK_ICON| -* `The World Bank Open Data Resources for Climate Change `_ +* `The World Bank Open Data Resources for Climate Change `_ |OK_ICON| -* `UEA Climatic Research Unit `_ +* `UEA Climatic Research Unit `_ |OK_ICON| -* `WU Historical Weather Worldwide `_ +* `WU Historical Weather Worldwide `_ |OK_ICON| -* `WorldClim - Global Climate Data `_ +* `WorldClim - Global Climate Data `_ |OK_ICON| ComplexNetworks --------------- -* `AMiner Citation Network Dataset `_ +* `AMiner Citation Network Dataset `_ |OK_ICON| -* `CrossRef DOI URLs `_ +* `CrossRef DOI URLs `_ |OK_ICON| -* `DBLP Citation dataset `_ +* `DBLP Citation dataset `_ |OK_ICON| -* `DIMACS Road Networks Collection `_ +* `DIMACS Road Networks Collection `_ |OK_ICON| -* `NBER Patent Citations `_ +* `NBER Patent Citations `_ |OK_ICON| -* `NIST complex networks data collection `_ +* `NIST complex networks data collection `_ |OK_ICON| -* `Network Repository with Interactive Exploratory Analysis Tools `_ 
+* `Network Repository with Interactive Exploratory Analysis Tools `_ |OK_ICON| -* `Protein-protein interaction network `_ +* `Protein-protein interaction network `_ |OK_ICON| -* `PyPI and Maven Dependency Network `_ +* `PyPI and Maven Dependency Network `_ |OK_ICON| -* `Scopus Citation Database `_ +* `Scopus Citation Database `_ |OK_ICON| -* `Small Network Data `_ +* `Small Network Data `_ |OK_ICON| -* `Stanford GraphBase `_ +* `Stanford GraphBase `_ |OK_ICON| -* `Stanford Large Network Dataset Collection `_ +* `Stanford Large Network Dataset Collection `_ |OK_ICON| -* `Stanford Longitudinal Network Data Sources `_ +* `Stanford Longitudinal Network Data Sources `_ |OK_ICON| -* `The Koblenz Network Collection `_ +* `The Koblenz Network Collection `_ |OK_ICON| -* `The Laboratory for Web Algorithmics (UNIMI) `_ +* `The Laboratory for Web Algorithmics (UNIMI) `_ |OK_ICON| -* `The Nexus Network Repository `_ +* `The Nexus Network Repository `_ |FIXME_ICON| -* `UCI Network Data Repository `_ +* `UCI Network Data Repository `_ |OK_ICON| -* `UFL sparse matrix collection `_ +* `UFL sparse matrix collection `_ |OK_ICON| -* `WSU Graph Database `_ +* `WSU Graph Database `_ |OK_ICON| ComputerNetworks ---------------- -* `3.5B Web Pages from CommonCrawl 2012 `_ +* `3.5B Web Pages from CommonCrawl 2012 `_ |OK_ICON| -* `53.5B Web clicks of 100K users in Indiana Univ. `_ +* `53.5B Web clicks of 100K users in Indiana Univ. `_ |OK_ICON| -* `CAIDA Internet Datasets `_ +* `CAIDA Internet Datasets `_ |OK_ICON| -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ |FIXME_ICON| -* `ClueWeb09 - 1B web pages `_ +* `ClueWeb09 - 1B web pages `_ |OK_ICON| -* `ClueWeb12 - 733M web pages `_ +* `ClueWeb12 - 733M web pages `_ |OK_ICON| -* `CommonCrawl Web Data over 7 years `_ +* `CommonCrawl Web Data over 7 years `_ |OK_ICON| -* `Criteo click-through data `_ +* `Criteo click-through data `_ |OK_ICON| -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ |OK_ICON| -* `Open Mobile Data by MobiPerf `_ +* `Open Mobile Data by MobiPerf `_ |OK_ICON| -* `Rapid7 Sonar Internet Scans `_ +* `Rapid7 Sonar Internet Scans `_ |OK_ICON| -* `UCSD Network Telescope, IPv4 /8 net `_ +* `UCSD Network Telescope, IPv4 /8 net `_ |OK_ICON| DataChallenges -------------- -* `Bruteforce Database `_ +* `Bruteforce Database `_ |OK_ICON| -* `Challenges in Machine Learning `_ +* `Challenges in Machine Learning `_ |OK_ICON| -* `CrowdANALYTIX dataX `_ +* `CrowdANALYTIX dataX `_ |OK_ICON| -* `D4D Challenge of Orange `_ +* `D4D Challenge of Orange `_ |FIXME_ICON| -* `DrivenData Competitions for Social Good `_ +* `DrivenData Competitions for Social Good `_ |OK_ICON| -* `ICWSM Data Challenge (since 2009) `_ +* `ICWSM Data Challenge (since 2009) `_ |FIXME_ICON| -* `KDD Cup by Tencent 2012 `_ +* `KDD Cup by Tencent 2012 `_ |OK_ICON| -* `Kaggle Competition Data `_ +* `Kaggle Competition Data `_ |OK_ICON| -* `Localytics Data Visualization Challenge `_ +* `Localytics Data Visualization Challenge `_ |OK_ICON| -* `Netflix Prize `_ +* `Netflix Prize `_ |OK_ICON| -* `Space Apps Challenge `_ +* `Space Apps Challenge `_ |OK_ICON| -* `Telecom Italia Big Data Challenge `_ +* `Telecom Italia Big Data Challenge `_ |OK_ICON| -* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ |OK_ICON| -* `Yelp Dataset Challenge `_ +* `Yelp Dataset Challenge `_ |OK_ICON| EarthScience ------------ -* `AQUASTAT - Global water resources 
and uses `_ +* `AQUASTAT - Global water resources and uses `_ |OK_ICON| -* `BODC - marine data of ~22K vars `_ +* `BODC - marine data of ~22K vars `_ |OK_ICON| -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ |OK_ICON| -* `Earth Models `_ +* `Earth Models `_ |OK_ICON| -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ |OK_ICON| -* `Marinexplore - Open Oceanographic Data `_ +* `Marinexplore - Open Oceanographic Data `_ |OK_ICON| -* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ |OK_ICON| -* `USGS Earthquake Archives `_ +* `USGS Earthquake Archives `_ |OK_ICON| Economics --------- -* `American Economic Association (AEA) `_ +* `American Economic Association (AEA) `_ |OK_ICON| -* `EconData from UMD `_ +* `EconData from UMD `_ |OK_ICON| -* `Economic Freedom of the World Data `_ +* `Economic Freedom of the World Data `_ |FIXME_ICON| -* `Historical MacroEconomc Statistics `_ +* `Historical MacroEconomc Statistics `_ |OK_ICON| -* `International Economics Database `_ +* `International Economics Database `_ |OK_ICON| -* `International Trade Statistics `_ +* `International Trade Statistics `_ |OK_ICON| -* `Internet Product Code Database `_ +* `Internet Product Code Database `_ |OK_ICON| -* `Joint External Debt Data Hub `_ +* `Joint External Debt Data Hub `_ |OK_ICON| -* `Jon Haveman International Trade Data Links `_ +* `Jon Haveman International Trade Data Links `_ |OK_ICON| -* `OpenCorporates Database of Companies in the World `_ +* `OpenCorporates Database of Companies in the World `_ |OK_ICON| -* `Our World in Data `_ +* `Our World in Data `_ |OK_ICON| -* `SciencesPo World Trade Gravity Datasets `_ +* `SciencesPo World Trade Gravity Datasets `_ |OK_ICON| -* `The Atlas of Economic Complexity `_ +* `The Atlas of 
Economic Complexity `_ |OK_ICON| -* `The Center for International Data `_ +* `The Center for International Data `_ |OK_ICON| -* `The Observatory of Economic Complexity `_ +* `The Observatory of Economic Complexity `_ |OK_ICON| -* `UN Commodity Trade Statistics `_ +* `UN Commodity Trade Statistics `_ |OK_ICON| -* `UN Human Development Reports `_ +* `UN Human Development Reports `_ |OK_ICON| Education --------- -* `College Scorecard Data `_ +* `College Scorecard Data `_ |OK_ICON| -* `Student Data from Free Code Camp `_ +* `Student Data from Free Code Camp `_ |OK_ICON| Energy ------ -* `AMPds `_ +* `AMPds `_ |OK_ICON| -* `BLUEd `_ +* `BLUEd `_ |OK_ICON| -* `COMBED `_ +* `COMBED `_ |OK_ICON| -* `DRED `_ +* `DRED `_ |OK_ICON| -* `ECO `_ +* `ECO `_ |OK_ICON| -* `EIA `_ +* `EIA `_ |OK_ICON| -* `HES - Household Electricity Study, UK `_ +* `HES - Household Electricity Study, UK `_ |OK_ICON| -* `HFED `_ +* `HFED `_ |OK_ICON| -* `PLAID - The Plug Load Appliance Identification Dataset `_ +* `PLAID - The Plug Load Appliance Identification Dataset `_ |FIXME_ICON| -* `REDD `_ +* `REDD `_ |OK_ICON| -* `Tracebase `_ +* `Tracebase `_ |OK_ICON| -* `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ |OK_ICON| -* `WHITED `_ +* `WHITED `_ |OK_ICON| -* `iAWE `_ +* `iAWE `_ |OK_ICON| Finance ------- -* `CBOE Futures Exchange `_ +* `CBOE Futures Exchange `_ |FIXME_ICON| -* `Google Finance `_ +* `Google Finance `_ |OK_ICON| -* `Google Trends `_ +* `Google Trends `_ |OK_ICON| -* `NASDAQ `_ +* `NASDAQ `_ |OK_ICON| -* `NYSE Market Data `_ +* `NYSE Market Data `_ |OK_ICON| -* `OANDA `_ +* `OANDA `_ |OK_ICON| -* `OSU Financial data `_ +* `OSU Financial data `_ |OK_ICON| -* `Quandl `_ +* `Quandl `_ |OK_ICON| -* `St Louis Federal `_ +* `St Louis Federal `_ |OK_ICON| -* `Yahoo Finance `_ +* `Yahoo Finance `_ |OK_ICON| GIS --- -* `ArcGIS Open Data portal `_ +* `ArcGIS Open Data portal `_ |OK_ICON| -* `Cambridge, MA, US, GIS data on GitHub `_ 
+* `Cambridge, MA, US, GIS data on GitHub `_ |OK_ICON| -* `Factual Global Location Data `_ +* `Factual Global Location Data `_ |OK_ICON| -* `Geo Spatial Data from ASU `_ +* `Geo Spatial Data from ASU `_ |OK_ICON| -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ |OK_ICON| -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ |OK_ICON| -* `GeoNames Worldwide `_ +* `GeoNames Worldwide `_ |OK_ICON| -* `Global Administrative Areas Database (GADM) `_ +* `Global Administrative Areas Database (GADM) `_ |OK_ICON| -* `Homeland Infrastructure Foundation-Level Data `_ +* `Homeland Infrastructure Foundation-Level Data `_ |OK_ICON| -* `Landsat 8 on AWS `_ +* `Landsat 8 on AWS `_ |OK_ICON| -* `List of all countries in all languages `_ +* `List of all countries in all languages `_ |OK_ICON| -* `National Weather Service GIS Data Portal `_ +* `National Weather Service GIS Data Portal `_ |OK_ICON| -* `Natural Earth - vectors and rasters of the world `_ +* `Natural Earth - vectors and rasters of the world `_ |OK_ICON| -* `OpenAddresses `_ +* `OpenAddresses `_ |OK_ICON| -* `OpenStreetMap (OSM) `_ +* `OpenStreetMap (OSM) `_ |OK_ICON| -* `Pleiades - Gazetteer and graph of ancient places `_ +* `Pleiades - Gazetteer and graph of ancient places `_ |OK_ICON| -* `Reverse Geocoder using OSM data `_ +* `Reverse Geocoder using OSM data `_ |OK_ICON| -* `TIGER/Line - U.S. boundaries and roads `_ +* `TIGER/Line - U.S. boundaries and roads `_ |FIXME_ICON| -* `TZ Timezones shapfiles `_ +* `TZ Timezones shapfiles `_ |OK_ICON| -* `TwoFishes - Foursquare's coarse geocoder `_ +* `TwoFishes - Foursquare's coarse geocoder `_ |OK_ICON| -* `UN Environmental Data `_ +* `UN Environmental Data `_ |OK_ICON| -* `World boundaries from the U.S. Department of State `_ +* `World boundaries from the U.S. 
Department of State `_ |FIXME_ICON| -* `World countries in multiple formats `_ +* `World countries in multiple formats `_ |OK_ICON| Government ---------- -* `Alberta, Province of Canada `_ +* `Alberta, Province of Canada `_ |OK_ICON| -* `Antwerp, Belgium `_ +* `Antwerp, Belgium `_ |OK_ICON| -* `Argentina (non official) `_ +* `Argentina (non official) `_ |OK_ICON| -* `Argentina `_ +* `Argentina `_ |FIXME_ICON| -* `Austin, TX, US `_ +* `Austin, TX, US `_ |OK_ICON| -* `Australia (abs.gov.au) `_ +* `Australia (abs.gov.au) `_ |OK_ICON| -* `Australia (data.gov.au) `_ +* `Australia (data.gov.au) `_ |OK_ICON| -* `Austria (data.gv.at) `_ +* `Austria (data.gv.at) `_ |OK_ICON| -* `Baton Rouge, LA, US `_ +* `Baton Rouge, LA, US `_ |OK_ICON| -* `Belgium `_ +* `Belgium `_ |OK_ICON| -* `Brazil `_ +* `Brazil `_ |OK_ICON| -* `Buenos Aires, Argentina `_ +* `Buenos Aires, Argentina `_ |OK_ICON| -* `Calgary, AB, Canada `_ +* `Calgary, AB, Canada `_ |FIXME_ICON| -* `Cambridge, MA, US `_ +* `Cambridge, MA, US `_ |OK_ICON| -* `Canada `_ +* `Canada `_ |FIXME_ICON| -* `Chicago `_ +* `Chicago `_ |OK_ICON| -* `Chile `_ +* `Chile `_ |OK_ICON| -* `Dallas Open Data `_ +* `Dallas Open Data `_ |OK_ICON| -* `DataBC - data from the Province of British Columbia `_ +* `DataBC - data from the Province of British Columbia `_ |OK_ICON| -* `Denver Open Data `_ +* `Denver Open Data `_ |OK_ICON| -* `Durham, NC Open Data `_ +* `Durham, NC Open Data `_ |OK_ICON| -* `Edmonton, AB, Canada `_ +* `Edmonton, AB, Canada `_ |OK_ICON| -* `England LGInform `_ +* `England LGInform `_ |OK_ICON| -* `EuroStat `_ +* `EuroStat `_ |OK_ICON| -* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ +* `EveryPolitician - Ongoing project collating and sharing data on every politician. 
`_ |OK_ICON| -* `FedStats `_ +* `FedStats `_ |OK_ICON| -* `Finland `_ +* `Finland `_ |OK_ICON| -* `France `_ +* `France `_ |OK_ICON| -* `Fredericton, NB, Canada `_ +* `Fredericton, NB, Canada `_ |OK_ICON| -* `Gatineau, QC, Canada `_ +* `Gatineau, QC, Canada `_ |OK_ICON| -* `Germany `_ +* `Germany `_ |OK_ICON| -* `Ghent, Belgium `_ +* `Ghent, Belgium `_ |FIXME_ICON| -* `Glasgow, Scotland, UK `_ +* `Glasgow, Scotland, UK `_ |FIXME_ICON| -* `Greece `_ +* `Greece `_ |OK_ICON| -* `Guardian world governments `_ +* `Guardian world governments `_ |OK_ICON| -* `Halifax, NS, Canada `_ +* `Halifax, NS, Canada `_ |FIXME_ICON| -* `Helsinki Region, Finland `_ +* `Helsinki Region, Finland `_ |OK_ICON| -* `Hong Kong, China `_ +* `Hong Kong, China `_ |OK_ICON| -* `Houston Open Data `_ +* `Houston Open Data `_ |FIXME_ICON| -* `Indian Government Data `_ +* `Indian Government Data `_ |OK_ICON| -* `Indonesian Data Portal `_ +* `Indonesian Data Portal `_ |OK_ICON| -* `Ireland's Open Data Portal `_ +* `Ireland's Open Data Portal `_ |OK_ICON| -* `Japan `_ +* `Japan `_ |OK_ICON| -* `Laval, QC, Canada `_ +* `Laval, QC, Canada `_ |OK_ICON| -* `Lexington, KY `_ +* `Lexington, KY `_ |OK_ICON| -* `London Datastore, UK `_ +* `London Datastore, UK `_ |OK_ICON| -* `London, ON, Canada `_ +* `London, ON, Canada `_ |OK_ICON| -* `Los Angeles Open Data `_ +* `Los Angeles Open Data `_ |OK_ICON| -* `MassGIS, Massachusetts, U.S. `_ +* `MassGIS, Massachusetts, U.S. 
`_ |OK_ICON| -* `Metropolitain Transportation Commission (MTC), California, US `_ +* `Metropolitain Transportation Commission (MTC), California, US `_ |OK_ICON| -* `Mexico `_ +* `Mexico `_ |OK_ICON| -* `Missisauga, ON, Canada `_ +* `Missisauga, ON, Canada `_ |OK_ICON| -* `Moldova `_ +* `Moldova `_ |OK_ICON| -* `Moncton, NB, Canada `_ +* `Moncton, NB, Canada `_ |OK_ICON| -* `Montreal, QC, Canada `_ +* `Montreal, QC, Canada `_ |OK_ICON| -* `Mountain View, California, US (GIS) `_ +* `Mountain View, California, US (GIS) `_ |OK_ICON| -* `NYC Open Data `_ +* `NYC Open Data `_ |FIXME_ICON| -* `NYC betanyc `_ +* `NYC betanyc `_ |OK_ICON| -* `Netherlands `_ +* `Netherlands `_ |OK_ICON| -* `New Zealand `_ +* `New Zealand `_ |OK_ICON| -* `OECD `_ +* `OECD `_ |OK_ICON| -* `Oakland, California, US `_ +* `Oakland, California, US `_ |OK_ICON| -* `Oklahoma `_ +* `Oklahoma `_ |OK_ICON| -* `Open Data for Africa `_ +* `Open Data for Africa `_ |OK_ICON| -* `Open Government Data (OGD) Platform India `_ +* `Open Government Data (OGD) Platform India `_ |OK_ICON| -* `OpenDataSoft's list of 1,600 open data `_ +* `OpenDataSoft's list of 1,600 open data `_ |OK_ICON| -* `Oregon `_ +* `Oregon `_ |OK_ICON| -* `Ottawa, ON, Canada `_ +* `Ottawa, ON, Canada `_ |OK_ICON| -* `Palo Alto, California, US `_ +* `Palo Alto, California, US `_ |OK_ICON| -* `Portland, Oregon `_ +* `Portland, Oregon `_ |OK_ICON| -* `Portugal - Pordata organization `_ +* `Portugal - Pordata organization `_ |OK_ICON| -* `Puerto Rico Government `_ +* `Puerto Rico Government `_ |OK_ICON| -* `Quebec City, QC, Canada `_ +* `Quebec City, QC, Canada `_ |OK_ICON| -* `Quebec Province of Canada `_ +* `Quebec Province of Canada `_ |OK_ICON| -* `Regina SK, Canada `_ +* `Regina SK, Canada `_ |OK_ICON| -* `Rio de Janeiro, Brazil `_ +* `Rio de Janeiro, Brazil `_ |FIXME_ICON| -* `Romania `_ +* `Romania `_ |OK_ICON| -* `Russia `_ +* `Russia `_ |OK_ICON| -* `San Francisco Data sets `_ +* `San Francisco Data sets `_ |OK_ICON| -* `San Jose, 
California, US `_ +* `San Jose, California, US `_ |OK_ICON| -* `San Mateo County, California, US `_ +* `San Mateo County, California, US `_ |OK_ICON| -* `Saskatchewan, Province of Canada `_ +* `Saskatchewan, Province of Canada `_ |OK_ICON| -* `Seattle `_ +* `Seattle `_ |OK_ICON| -* `Singapore Government Data `_ +* `Singapore Government Data `_ |OK_ICON| -* `South Africa Trade Statistics `_ +* `South Africa Trade Statistics `_ |OK_ICON| -* `South Africa `_ +* `South Africa `_ |OK_ICON| -* `State of Utah, US `_ +* `State of Utah, US `_ |OK_ICON| -* `Switzerland `_ +* `Switzerland `_ |OK_ICON| -* `Taiwan g0v `_ +* `Taiwan g0v `_ |OK_ICON| -* `Taiwan `_ +* `Taiwan `_ |OK_ICON| -* `Texas Open Data `_ +* `Texas Open Data `_ |OK_ICON| -* `The World Bank `_ +* `The World Bank `_ |FIXME_ICON| -* `Toronto, ON, Canada `_ +* `Toronto, ON, Canada `_ |OK_ICON| -* `Tunisia `_ +* `Tunisia `_ |OK_ICON| -* `U.K. Government Data `_ +* `U.K. Government Data `_ |OK_ICON| -* `U.S. American Community Survey `_ +* `U.S. American Community Survey `_ |OK_ICON| -* `U.S. CDC Public Health datasets `_ +* `U.S. CDC Public Health datasets `_ |OK_ICON| -* `U.S. Census Bureau `_ +* `U.S. Census Bureau `_ |OK_ICON| -* `U.S. Department of Housing and Urban Development (HUD) `_ +* `U.S. Department of Housing and Urban Development (HUD) `_ |OK_ICON| -* `U.S. Federal Government Agencies `_ +* `U.S. Federal Government Agencies `_ |OK_ICON| -* `U.S. Federal Government Data Catalog `_ +* `U.S. Federal Government Data Catalog `_ |OK_ICON| -* `U.S. Food and Drug Administration (FDA) `_ +* `U.S. Food and Drug Administration (FDA) `_ |OK_ICON| -* `U.S. National Center for Education Statistics (NCES) `_ +* `U.S. National Center for Education Statistics (NCES) `_ |OK_ICON| -* `U.S. Open Government `_ +* `U.S. 
Open Government `_ |OK_ICON| -* `UK 2011 Census Open Atlas Project `_ +* `UK 2011 Census Open Atlas Project `_ |FIXME_ICON| -* `Uganda Bureau of Statistics `_ +* `Uganda Bureau of Statistics `_ |OK_ICON| -* `United Nations `_ +* `United Nations `_ |OK_ICON| -* `Uruguay `_ +* `Uruguay `_ |OK_ICON| -* `Valley Transportation Authority (VTA), California, US `_ +* `Valley Transportation Authority (VTA), California, US `_ |OK_ICON| -* `Vancouver, BC Open Data Catalog `_ +* `Vancouver, BC Open Data Catalog `_ |OK_ICON| -* `Victoria, BC, Canada `_ +* `Victoria, BC, Canada `_ |FIXME_ICON| -* `Vienna, Austria `_ +* `Vienna, Austria `_ |OK_ICON| Healthcare ---------- -* `EHDP Large Health Data Sets `_ +* `EHDP Large Health Data Sets `_ |OK_ICON| -* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ |OK_ICON| -* `Gapminder World demographic databases `_ +* `Gapminder World demographic databases `_ |OK_ICON| -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ |OK_ICON| -* `Medicare Coverage Database (MCD), U.S. `_ +* `Medicare Coverage Database (MCD), U.S. `_ |OK_ICON| -* `Medicare Data Engine of medicare.gov Data `_ +* `Medicare Data Engine of medicare.gov Data `_ |OK_ICON| -* `Medicare Data File `_ +* `Medicare Data File `_ |OK_ICON| -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ |FIXME_ICON| -* `Open-ODS (structure of the UK NHS) `_ +* `Open-ODS (structure of the UK NHS) `_ |OK_ICON| -* `OpenPaymentsData, Healthcare financial relationship data `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ |OK_ICON| -* `PhysioBank Databases - A large and growing archive of physiological data. `_ +* `PhysioBank Databases - A large and growing archive of physiological data. 
`_ |OK_ICON| -* `The Cancer Genome Atlas project (TCGA) `_ +* `The Cancer Genome Atlas project (TCGA) `_ |OK_ICON| -* `World Health Organization Global Health Observatory `_ +* `World Health Organization Global Health Observatory `_ |OK_ICON| ImageProcessing --------------- -* `10k US Adult Faces Database `_ +* `10k US Adult Faces Database `_ |OK_ICON| -* `2GB of Photos of Cats `_ +* `2GB of Photos of Cats `_ |FIXME_ICON| -* `Adience Unfiltered faces for gender and age classification `_ +* `Adience Unfiltered faces for gender and age classification `_ |OK_ICON| -* `Affective Image Classification `_ +* `Affective Image Classification `_ |OK_ICON| -* `Animals with attributes `_ +* `Animals with attributes `_ |OK_ICON| -* `Caltech Pedestrian Detection Benchmark `_ +* `Caltech Pedestrian Detection Benchmark `_ |OK_ICON| -* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ |OK_ICON| -* `Face Recognition Benchmark `_ +* `Face Recognition Benchmark `_ |OK_ICON| -* `Flickr: 32 Class Brand Logos `_ +* `Flickr: 32 Class Brand Logos `_ |OK_ICON| -* `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* `GDXray - X-ray images for X-ray testing and Computer Vision `_ |OK_ICON| -* `ImageNet (in WordNet hierarchy) `_ +* `ImageNet (in WordNet hierarchy) `_ |OK_ICON| -* `Indoor Scene Recognition `_ +* `Indoor Scene Recognition `_ |OK_ICON| -* `International Affective Picture System, UFL `_ +* `International Affective Picture System, UFL `_ |OK_ICON| -* `MNIST database of handwritten digits, near 1 million examples `_ +* `MNIST database of handwritten digits, near 1 million examples `_ |OK_ICON| -* `Massive Visual Memory Stimuli, MIT `_ +* `Massive Visual Memory Stimuli, MIT `_ |OK_ICON| -* `SUN database, MIT `_ +* `SUN database, MIT `_ |OK_ICON| -* `Several Shape-from-Silhouette Datasets `_ +* `Several 
Shape-from-Silhouette Datasets `_ |FIXME_ICON| -* `Stanford Dogs Dataset `_ +* `Stanford Dogs Dataset `_ |OK_ICON| -* `The Action Similarity Labeling (ASLAN) Challenge `_ +* `The Action Similarity Labeling (ASLAN) Challenge `_ |OK_ICON| -* `The Oxford-IIIT Pet Dataset `_ +* `The Oxford-IIIT Pet Dataset `_ |OK_ICON| -* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ +* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ |OK_ICON| -* `Visual genome `_ +* `Visual genome `_ |OK_ICON| -* `YouTube Faces Database `_ +* `YouTube Faces Database `_ |OK_ICON| MachineLearning --------------- -* `Context-aware data sets from five domains `_ +* `Context-aware data sets from five domains `_ |OK_ICON| -* `Delve Datasets for classification and regression `_ +* `Delve Datasets for classification and regression `_ |OK_ICON| -* `Discogs Monthly Data `_ +* `Discogs Monthly Data `_ |OK_ICON| -* `Free Music Archive `_ +* `Free Music Archive `_ |OK_ICON| -* `IMDb Database `_ +* `IMDb Database `_ |OK_ICON| -* `Keel Repository for classification, regression and time series `_ +* `Keel Repository for classification, regression and time series `_ |OK_ICON| -* `Labeled Faces in the Wild (LFW) `_ +* `Labeled Faces in the Wild (LFW) `_ |OK_ICON| -* `Lending Club Loan Data `_ +* `Lending Club Loan Data `_ |OK_ICON| -* `Machine Learning Data Set Repository `_ +* `Machine Learning Data Set Repository `_ |OK_ICON| -* `Million Song Dataset `_ +* `Million Song Dataset `_ |OK_ICON| -* `More Song Datasets `_ +* `More Song Datasets `_ |OK_ICON| -* `MovieLens Data Sets `_ +* `MovieLens Data Sets `_ |OK_ICON| -* `New Yorker caption contest ratings `_ +* `New Yorker caption contest ratings `_ |OK_ICON| -* `RDataMining - "R and Data Mining" ebook data `_ +* `RDataMining - "R and Data Mining" ebook data `_ |OK_ICON| -* `Registered Meteorites on Earth `_ +* `Registered Meteorites on Earth `_ |OK_ICON| -* `Restaurants Health Score Data in San Francisco `_ +* 
`Restaurants Health Score Data in San Francisco `_ |FIXME_ICON| -* `UCI Machine Learning Repository `_ +* `UCI Machine Learning Repository `_ |OK_ICON| -* `Yahoo! Ratings and Classification Data `_ +* `Yahoo! Ratings and Classification Data `_ |FIXME_ICON| -* `Youtube 8m `_ +* `Youtube 8m `_ |OK_ICON| -* `eBay Online Auctions (2012) `_ +* `eBay Online Auctions (2012) `_ |OK_ICON| Museums ------- -* `Canada Science and Technology Museums Corporation's Open Data `_ +* `Canada Science and Technology Museums Corporation's Open Data `_ |OK_ICON| -* `Cooper-Hewitt's Collection Database `_ +* `Cooper-Hewitt's Collection Database `_ |OK_ICON| -* `Minneapolis Institute of Arts metadata `_ +* `Minneapolis Institute of Arts metadata `_ |OK_ICON| -* `Natural History Museum (London) Data Portal `_ +* `Natural History Museum (London) Data Portal `_ |OK_ICON| -* `Rijksmuseum Historical Art Collection `_ +* `Rijksmuseum Historical Art Collection `_ |OK_ICON| -* `Tate Collection metadata `_ +* `Tate Collection metadata `_ |OK_ICON| -* `The Getty vocabularies `_ +* `The Getty vocabularies `_ |OK_ICON| NaturalLanguage --------------- -* `Automatic Keyphrase Extraction `_ +* `Automatic Keyphrase Extraction `_ |OK_ICON| -* `Blogger Corpus `_ +* `Blogger Corpus `_ |OK_ICON| -* `CLiPS Stylometry Investigation Corpus `_ +* `CLiPS Stylometry Investigation Corpus `_ |OK_ICON| -* `ClueWeb09 FACC `_ +* `ClueWeb09 FACC `_ |OK_ICON| -* `ClueWeb12 FACC `_ +* `ClueWeb12 FACC `_ |OK_ICON| -* `DBpedia - 4.58M things with 583M facts `_ +* `DBpedia - 4.58M things with 583M facts `_ |OK_ICON| -* `Flickr Personal Taxonomies `_ +* `Flickr Personal Taxonomies `_ |OK_ICON| -* `Freebase of people, places, and things `_ +* `Freebase of people, places, and things `_ |OK_ICON| -* `Google Books Ngrams (2.2TB) `_ +* `Google Books Ngrams (2.2TB) `_ |OK_ICON| -* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ +* `Google MC-AFP - Generated based on the public 
available Gigaword dataset using Paragraph Vectors `_ |OK_ICON| -* `Google Web 5gram (1TB, 2006) `_ +* `Google Web 5gram (1TB, 2006) `_ |OK_ICON| -* `Gutenberg eBooks List `_ +* `Gutenberg eBooks List `_ |OK_ICON| -* `Hansards text chunks of Canadian Parliament `_ +* `Hansards text chunks of Canadian Parliament `_ |OK_ICON| -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ |OK_ICON| -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ |OK_ICON| -* `Machine Translation of European languages `_ +* `Machine Translation of European languages `_ |OK_ICON| -* `Making Sense of Microposts 2013 - Concept Extraction `_ +* `Making Sense of Microposts 2013 - Concept Extraction `_ |FIXME_ICON| -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ +* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ |OK_ICON| -* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ |OK_ICON| -* `Open Multilingual Wordnet `_ +* `Open Multilingual Wordnet `_ |OK_ICON| -* `POS/NER/Chunk annotated data `_ +* `POS/NER/Chunk annotated data `_ |OK_ICON| -* `Personae Corpus `_ +* `Personae Corpus `_ |OK_ICON| -* `SMS Spam Collection in English `_ +* `SMS Spam Collection in English `_ |OK_ICON| -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ |OK_ICON| -* `Stanford Question Answering Dataset (SQuAD) `_ +* `Stanford Question Answering Dataset (SQuAD) `_ |OK_ICON| -* `USENET postings corpus of 2005~2011 `_ +* `USENET postings corpus of 2005~2011 `_ |OK_ICON| -* `Universal Dependencies `_ +* `Universal Dependencies `_ |OK_ICON| -* `Webhose - News/Blogs in multiple languages `_ +* `Webhose - News/Blogs in multiple 
languages `_ |OK_ICON| -* `Wikidata - Wikipedia databases `_ +* `Wikidata - Wikipedia databases `_ |OK_ICON| -* `Wikipedia Links data - 40 Million Entities in Context `_ +* `Wikipedia Links data - 40 Million Entities in Context `_ |OK_ICON| -* `WordNet databases and tools `_ +* `WordNet databases and tools `_ |OK_ICON| Neuroscience ------------ -* `Allen Institute Datasets `_ +* `Allen Institute Datasets `_ |OK_ICON| -* `Brain Catalogue `_ +* `Brain Catalogue `_ |OK_ICON| -* `Brainomics `_ +* `Brainomics `_ |OK_ICON| -* `CodeNeuro Datasets `_ +* `CodeNeuro Datasets `_ |OK_ICON| -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ |OK_ICON| -* `FCP-INDI `_ +* `FCP-INDI `_ |OK_ICON| -* `Human Connectome Project `_ +* `Human Connectome Project `_ |OK_ICON| -* `NDAR `_ +* `NDAR `_ |OK_ICON| -* `NIMH Data Archive `_ +* `NIMH Data Archive `_ |OK_ICON| -* `NeuroData `_ +* `NeuroData `_ |OK_ICON| -* `Neuroelectro `_ +* `Neuroelectro `_ |OK_ICON| -* `OASIS `_ +* `OASIS `_ |OK_ICON| -* `OpenfMRI `_ +* `OpenfMRI `_ |OK_ICON| -* `Study Forrest `_ +* `Study Forrest `_ |OK_ICON| Physics ------- -* `CERN Open Data Portal `_ +* `CERN Open Data Portal `_ |OK_ICON| -* `Crystallography Open Database `_ +* `Crystallography Open Database `_ |OK_ICON| -* `NASA Exoplanet Archive `_ +* `NASA Exoplanet Archive `_ |OK_ICON| -* `NSSDC (NASA) data of 550 space spacecraft `_ +* `NSSDC (NASA) data of 550 space spacecraft `_ |OK_ICON| -* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ +* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ |OK_ICON| Psychology+Cognition -------------------- -* `OSU Cognitive Modeling Repository Datasets `_ +* `OSU Cognitive Modeling Repository Datasets `_ |FIXME_ICON| PublicDomains ------------- -* `Amazon `_ +* `Amazon `_ |OK_ICON| -* `Archive.org Datasets `_ +* `Archive.org Datasets `_ |OK_ICON| -* `Archive-it from Internet Archive `_ +* `Archive-it from 
Internet Archive `_ |OK_ICON| -* `CMU JASA data archive `_ +* `CMU JASA data archive `_ |OK_ICON| -* `CMU StatLab collections `_ +* `CMU StatLab collections `_ |OK_ICON| -* `Data.World `_ +* `Data.World `_ |OK_ICON| -* `Data360 `_ +* `Data360 `_ |OK_ICON| -* `Enigma Public `_ +* `Enigma Public `_ |OK_ICON| -* `Google `_ +* `Google `_ |OK_ICON| -* `Infochimps `_ +* `Infochimps `_ |FIXME_ICON| -* `KDNuggets Data Collections `_ +* `KDNuggets Data Collections `_ |OK_ICON| -* `Microsoft Azure Data Market Free DataSets `_ +* `Microsoft Azure Data Market Free DataSets `_ |OK_ICON| -* `Microsoft Data Science for Research `_ +* `Microsoft Data Science for Research `_ |OK_ICON| -* `Numbray `_ +* `Numbray `_ |FIXME_ICON| -* `Open Library Data Dumps `_ +* `Open Library Data Dumps `_ |OK_ICON| -* `Reddit Datasets `_ +* `Reddit Datasets `_ |OK_ICON| -* `RevolutionAnalytics Collection `_ +* `RevolutionAnalytics Collection `_ |OK_ICON| -* `Sample R data sets `_ +* `Sample R data sets `_ |OK_ICON| -* `StatSci.org `_ +* `StatSci.org `_ |OK_ICON| -* `Stats4Stem R data sets `_ +* `Stats4Stem R data sets `_ |FIXME_ICON| -* `The Washington Post List `_ +* `The Washington Post List `_ |OK_ICON| -* `UCLA SOCR data collection `_ +* `UCLA SOCR data collection `_ |OK_ICON| -* `UFO Reports `_ +* `UFO Reports `_ |OK_ICON| -* `Wikileaks 911 pager intercepts `_ +* `Wikileaks 911 pager intercepts `_ |OK_ICON| -* `Yahoo Webscope `_ +* `Yahoo Webscope `_ |FIXME_ICON| SearchEngines ------------- -* `Academic Torrents of data sharing from UMB `_ +* `Academic Torrents of data sharing from UMB `_ |OK_ICON| -* `DataMarket (Qlik) `_ +* `DataMarket (Qlik) `_ |OK_ICON| -* `Datahub.io `_ +* `Datahub.io `_ |OK_ICON| -* `Harvard Dataverse Network of scientific data `_ +* `Harvard Dataverse Network of scientific data `_ |OK_ICON| -* `ICPSR (UMICH) `_ +* `ICPSR (UMICH) `_ |OK_ICON| -* `Institute of Education Sciences `_ +* `Institute of Education Sciences `_ |OK_ICON| -* `National Technical Reports Library `_ 
+* `National Technical Reports Library `_ |FIXME_ICON| -* `Open Data Certificates (beta) `_ +* `Open Data Certificates (beta) `_ |OK_ICON| -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ |OK_ICON| -* `Statista.com - statistics and Studies `_ +* `Statista.com - statistics and Studies `_ |OK_ICON| -* `Zenodo - An open dependable home for the long-tail of science `_ +* `Zenodo - An open dependable home for the long-tail of science `_ |OK_ICON| SocialNetworks -------------- -* `72 hours #gamergate Twitter Scrape `_ +* `72 hours #gamergate Twitter Scrape `_ |OK_ICON| -* `Ancestry.com Forum Dataset over 10 years `_ +* `Ancestry.com Forum Dataset over 10 years `_ |OK_ICON| -* `CMU Enron Email of 150 users `_ +* `CMU Enron Email of 150 users `_ |OK_ICON| -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ |OK_ICON| -* `EDRM Enron EMail of 151 users, hosted on S3 `_ +* `EDRM Enron EMail of 151 users, hosted on S3 `_ |OK_ICON| -* `Facebook Data Scrape (2005) `_ +* `Facebook Data Scrape (2005) `_ |OK_ICON| -* `Facebook Social Networks from LAW (since 2007) `_ +* `Facebook Social Networks from LAW (since 2007) `_ |OK_ICON| -* `Foursquare from UMN/Sarwat (2013) `_ +* `Foursquare from UMN/Sarwat (2013) `_ |OK_ICON| -* `GitHub Collaboration Archive `_ +* `GitHub Collaboration Archive `_ |OK_ICON| -* `Google Scholar citation relations `_ +* `Google Scholar citation relations `_ |OK_ICON| -* `High-Resolution Contact Networks from Wearable Sensors `_ +* `High-Resolution Contact Networks from Wearable Sensors `_ |OK_ICON| -* `Indie Map: social graph and crawl of top IndieWeb sites `_ +* `Indie Map: social graph and crawl of top IndieWeb sites `_ |OK_ICON| -* `Mobile Social Networks from UMASS `_ +* `Mobile Social Networks from UMASS `_ |OK_ICON| -* `Network Twitter Data `_ +* `Network 
Twitter Data `_ |OK_ICON| -* `Reddit Comments `_ +* `Reddit Comments `_ |OK_ICON| -* `Skytrax' Air Travel Reviews Dataset `_ +* `Skytrax' Air Travel Reviews Dataset `_ |OK_ICON| -* `Social Twitter Data `_ +* `Social Twitter Data `_ |OK_ICON| -* `SourceForge.net Research Data `_ +* `SourceForge.net Research Data `_ |OK_ICON| -* `Twitter Data for Online Reputation Management `_ +* `Twitter Data for Online Reputation Management `_ |OK_ICON| -* `Twitter Data for Sentiment Analysis `_ +* `Twitter Data for Sentiment Analysis `_ |OK_ICON| -* `Twitter Graph of entire Twitter site `_ +* `Twitter Graph of entire Twitter site `_ |OK_ICON| -* `Twitter Scrape Calufa May 2011 `_ +* `Twitter Scrape Calufa May 2011 `_ |FIXME_ICON| -* `UNIMI/LAW Social Network Datasets `_ +* `UNIMI/LAW Social Network Datasets `_ |OK_ICON| -* `Yahoo! Graph and Social Data `_ +* `Yahoo! Graph and Social Data `_ |FIXME_ICON| -* `Youtube Video Social Graph in 2007,2008 `_ +* `Youtube Video Social Graph in 2007,2008 `_ |OK_ICON| SocialSciences -------------- -* `ACLED (Armed Conflict Location & Event Data Project) `_ +* `ACLED (Armed Conflict Location & Event Data Project) `_ |OK_ICON| -* `Canadian Legal Information Institute `_ +* `Canadian Legal Information Institute `_ |FIXME_ICON| -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ |OK_ICON| -* `Correlates of War Project `_ +* `Correlates of War Project `_ |OK_ICON| -* `Cryptome Conspiracy Theory Items `_ +* `Cryptome Conspiracy Theory Items `_ |OK_ICON| -* `Datacards `_ +* `Datacards `_ |FIXME_ICON| -* `European Social Survey `_ +* `European Social Survey `_ |OK_ICON| -* `FBI Hate Crime 2013 - aggregated data `_ +* `FBI Hate Crime 2013 - aggregated data `_ |OK_ICON| -* `Fragile States Index `_ +* `Fragile States Index `_ |FIXME_ICON| -* `GDELT Global Events Database `_ +* `GDELT Global Events Database `_ |OK_ICON| -* 
`General Social Survey (GSS) since 1972 `_ +* `General Social Survey (GSS) since 1972 `_ |OK_ICON| -* `German Social Survey `_ +* `German Social Survey `_ |OK_ICON| -* `Global Religious Futures Project `_ +* `Global Religious Futures Project `_ |OK_ICON| -* `Humanitarian Data Exchange `_ +* `Humanitarian Data Exchange `_ |FIXME_ICON| -* `INFORM Index for Risk Management `_ +* `INFORM Index for Risk Management `_ |OK_ICON| -* `Institute for Demographic Studies `_ +* `Institute for Demographic Studies `_ |OK_ICON| -* `International Networks Archive `_ +* `International Networks Archive `_ |OK_ICON| -* `International Social Survey Program ISSP `_ +* `International Social Survey Program ISSP `_ |OK_ICON| -* `International Studies Compendium Project `_ +* `International Studies Compendium Project `_ |OK_ICON| -* `James McGuire Cross National Data `_ +* `James McGuire Cross National Data `_ |OK_ICON| -* `MIT Reality Mining Dataset `_ +* `MIT Reality Mining Dataset `_ |OK_ICON| -* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ |OK_ICON| -* `Minnesota Population Center `_ +* `Minnesota Population Center `_ |OK_ICON| -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ |OK_ICON| -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ |OK_ICON| -* `Paul Hensel General International Data Page `_ +* `Paul Hensel General International Data Page `_ |OK_ICON| -* `PewResearch Internet Survey Project `_ +* `PewResearch Internet Survey Project `_ |FIXME_ICON| -* `PewResearch Society Data Collection `_ +* `PewResearch Society Data Collection `_ |OK_ICON| -* `Political Polarity Data `_ +* `Political Polarity Data `_ |OK_ICON| -* `StackExchange Data Explorer `_ +* `StackExchange Data Explorer `_ |OK_ICON| -* `Terrorism Research and Analysis Consortium `_ +* 
`Terrorism Research and Analysis Consortium `_ |OK_ICON| -* `Texas Inmates Executed Since 1984 `_ +* `Texas Inmates Executed Since 1984 `_ |FIXME_ICON| -* `Titanic Survival Data Set `_ +* `Titanic Survival Data Set `_ |OK_ICON| -* `UCB's Archive of Social Science Data (D-Lab) `_ +* `UCB's Archive of Social Science Data (D-Lab) `_ |OK_ICON| -* `UCLA Social Sciences Data Archive `_ +* `UCLA Social Sciences Data Archive `_ |FIXME_ICON| -* `UN Civil Society Database `_ +* `UN Civil Society Database `_ |OK_ICON| -* `UPJOHN for Labor Employment Research `_ +* `UPJOHN for Labor Employment Research `_ |OK_ICON| -* `Universities Worldwide `_ +* `Universities Worldwide `_ |OK_ICON| -* `Uppsala Conflict Data Program `_ +* `Uppsala Conflict Data Program `_ |OK_ICON| -* `World Bank Open Data `_ +* `World Bank Open Data `_ |OK_ICON| -* `WorldPop project - Worldwide human population distributions `_ +* `WorldPop project - Worldwide human population distributions `_ |OK_ICON| Software -------- -* `FLOSSmole data about free, libre, and open source software development `_ +* `FLOSSmole data about free, libre, and open source software development `_ |OK_ICON| Sports ------ -* `Betfair Historical Exchange Data `_ +* `Betfair Historical Exchange Data `_ |OK_ICON| -* `Cricsheet Matches (cricket) `_ +* `Cricsheet Matches (cricket) `_ |OK_ICON| -* `Ergast Formula 1, from 1950 up to date (API) `_ +* `Ergast Formula 1, from 1950 up to date (API) `_ |OK_ICON| -* `Football/Soccer resources (data and APIs) `_ +* `Football/Soccer resources (data and APIs) `_ |OK_ICON| -* `Lahman's Baseball Database `_ +* `Lahman's Baseball Database `_ |OK_ICON| -* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ +* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ |OK_ICON| -* `Retrosheet Baseball Statistics `_ +* `Retrosheet Baseball Statistics `_ |OK_ICON| -* `Tennis database of rankings, results, and stats for ATP `_ +* `Tennis database of rankings, results, and stats for ATP `_ |OK_ICON| TimeSeries 
---------- -* `Databanks International Cross National Time Series Data Archive `_ +* `Databanks International Cross National Time Series Data Archive `_ |OK_ICON| -* `Hard Drive Failure Rates `_ +* `Hard Drive Failure Rates `_ |OK_ICON| -* `Heart Rate Time Series from MIT `_ +* `Heart Rate Time Series from MIT `_ |OK_ICON| -* `Time Series Data Library (TSDL) from MU `_ +* `Time Series Data Library (TSDL) from MU `_ |OK_ICON| -* `UC Riverside Time Series Dataset `_ +* `UC Riverside Time Series Dataset `_ |OK_ICON| Transportation -------------- -* `Airlines OD Data 1987-2008 `_ +* `Airlines OD Data 1987-2008 `_ |OK_ICON| -* `Bay Area Bike Share Data `_ +* `Bay Area Bike Share Data `_ |OK_ICON| -* `Bike Share Systems (BSS) collection `_ +* `Bike Share Systems (BSS) collection `_ |OK_ICON| -* `GeoLife GPS Trajectory from Microsoft Research `_ +* `GeoLife GPS Trajectory from Microsoft Research `_ |OK_ICON| -* `German train system by Deutsche Bahn `_ +* `German train system by Deutsche Bahn `_ |OK_ICON| -* `Hubway Million Rides in MA `_ +* `Hubway Million Rides in MA `_ |OK_ICON| -* `Montreal BIXI Bike Share `_ +* `Montreal BIXI Bike Share `_ |OK_ICON| -* `NYC Taxi Trip Data 2009- `_ +* `NYC Taxi Trip Data 2009- `_ |OK_ICON| -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ |OK_ICON| -* `NYC Uber trip data April 2014 to September 2014 `_ +* `NYC Uber trip data April 2014 to September 2014 `_ |OK_ICON| -* `Open Traffic collection `_ +* `Open Traffic collection `_ |OK_ICON| -* `OpenFlights - airport, airline and route data `_ +* `OpenFlights - airport, airline and route data `_ |OK_ICON| -* `Philadelphia Bike Share Stations (JSON) `_ +* `Philadelphia Bike Share Stations (JSON) `_ |FIXME_ICON| -* `Plane Crash Database, since 1920 `_ +* `Plane Crash Database, since 1920 `_ |OK_ICON| -* `RITA Airline On-Time Performance data `_ +* `RITA Airline On-Time Performance data `_ |OK_ICON| -* `RITA/BTS transport data collection (TranStat) `_ +* 
`RITA/BTS transport data collection (TranStat) `_ |OK_ICON| -* `Toronto Bike Share Stations (XML file) `_ +* `Toronto Bike Share Stations (XML file) `_ |FIXME_ICON| -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ |OK_ICON| -* `Travel Tracker Survey (TTS) for Chicago `_ +* `Travel Tracker Survey (TTS) for Chicago `_ |OK_ICON| -* `U.S. Bureau of Transportation Statistics (BTS) `_ +* `U.S. Bureau of Transportation Statistics (BTS) `_ |OK_ICON| -* `U.S. Domestic Flights 1990 to 2009 `_ +* `U.S. Domestic Flights 1990 to 2009 `_ |OK_ICON| -* `U.S. Freight Analysis Framework since 2007 `_ +* `U.S. Freight Analysis Framework since 2007 `_ |OK_ICON| Complementary Collections From 85d9454b7da1adb7b328284dd3a8c8451b641caf Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 15 Jan 2018 17:31:48 +0000 Subject: [PATCH 75/99] Update README from APD2: d5c9eda3c1e4bf884eddae1e6caa492683d42d87 --- README.rst | 1092 ++++++++++++++++++++++++++-------------------------- 1 file changed, 546 insertions(+), 546 deletions(-) diff --git a/README.rst b/README.rst index 67c4bda..ea9a499 100644 --- a/README.rst +++ b/README.rst @@ -27,1181 +27,1181 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ |OK_ICON| +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ -* `U.S. Department of Agriculture's PLANTS Database `_ |OK_ICON| +* |OK_ICON| `U.S. 
Department of Agriculture's PLANTS Database `_ Biology ------- -* `1000 Genomes `_ |OK_ICON| +* |OK_ICON| `1000 Genomes `_ -* `American Gut (Microbiome Project) `_ |OK_ICON| +* |OK_ICON| `American Gut (Microbiome Project) `_ -* `Broad Bioimage Benchmark Collection (BBBC) `_ |OK_ICON| +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ |OK_ICON| +* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* `Cell Image Library `_ |OK_ICON| +* |OK_ICON| `Cell Image Library `_ -* `Complete Genomics Public Data `_ |OK_ICON| +* |OK_ICON| `Complete Genomics Public Data `_ -* `EBI ArrayExpress `_ |OK_ICON| +* |OK_ICON| `EBI ArrayExpress `_ -* `EBI Protein Data Bank in Europe `_ |OK_ICON| +* |OK_ICON| `EBI Protein Data Bank in Europe `_ -* `ENCODE project `_ |OK_ICON| +* |OK_ICON| `ENCODE project `_ -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ |OK_ICON| +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* `Ensembl Genomes `_ |OK_ICON| +* |OK_ICON| `Ensembl Genomes `_ -* `Gene Expression Omnibus (GEO) `_ |OK_ICON| +* |OK_ICON| `Gene Expression Omnibus (GEO) `_ -* `Gene Ontology (GO) `_ |OK_ICON| +* |OK_ICON| `Gene Ontology (GO) `_ -* `Global Biotic Interactions (GloBI) `_ |OK_ICON| +* |OK_ICON| `Global Biotic Interactions (GloBI) `_ -* `Harvard Medical School (HMS) LINCS Project `_ |OK_ICON| +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ -* `Human Genome Diversity Project `_ |OK_ICON| +* |OK_ICON| `Human Genome Diversity Project `_ -* `Human Microbiome Project (HMP) `_ |OK_ICON| +* |OK_ICON| `Human Microbiome Project (HMP) `_ -* `ICOS PSP Benchmark `_ |OK_ICON| +* |OK_ICON| `ICOS PSP Benchmark `_ -* `International HapMap Project `_ |OK_ICON| +* |OK_ICON| `International HapMap Project `_ -* `Journal of Cell Biology DataViewer `_ |OK_ICON| +* |OK_ICON| `Journal of Cell Biology DataViewer `_ -* `MIT Cancer Genomics Data `_ |OK_ICON| +* |OK_ICON| `MIT Cancer Genomics Data `_ -* 
`NCBI Proteins `_ |OK_ICON| +* |OK_ICON| `NCBI Proteins `_ -* `NCBI Taxonomy `_ |OK_ICON| +* |OK_ICON| `NCBI Taxonomy `_ -* `NCI Genomic Data Commons `_ |OK_ICON| +* |OK_ICON| `NCI Genomic Data Commons `_ -* `NIH Microarray data `_ |FIXME_ICON| +* |FIXME_ICON| `NIH Microarray data `_ -* `OpenSNP genotypes data `_ |OK_ICON| +* |OK_ICON| `OpenSNP genotypes data `_ -* `Pathguid - Protein-Protein Interactions Catalog `_ |OK_ICON| +* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ -* `Protein Data Bank `_ |OK_ICON| +* |OK_ICON| `Protein Data Bank `_ -* `Psychiatric Genomics Consortium `_ |OK_ICON| +* |OK_ICON| `Psychiatric Genomics Consortium `_ -* `PubChem Project `_ |OK_ICON| +* |OK_ICON| `PubChem Project `_ -* `PubGene (now Coremine Medical) `_ |OK_ICON| +* |OK_ICON| `PubGene (now Coremine Medical) `_ -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ |OK_ICON| +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ |OK_ICON| +* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* `Sequence Read Archive(SRA) `_ |OK_ICON| +* |OK_ICON| `Sequence Read Archive(SRA) `_ -* `Stanford Microarray Data `_ |FIXME_ICON| +* |FIXME_ICON| `Stanford Microarray Data `_ -* `Stowers Institute Original Data Repository `_ |OK_ICON| +* |OK_ICON| `Stowers Institute Original Data Repository `_ -* `Systems Science of Biological Dynamics (SSBD) Database `_ |OK_ICON| +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ |OK_ICON| +* |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ -* `The Catalogue of Life `_ |OK_ICON| +* |OK_ICON| `The Catalogue of Life `_ -* `The Personal Genome Project `_ |OK_ICON| +* |OK_ICON| `The Personal Genome Project `_ -* `UCSC Public Data `_ |OK_ICON| +* |OK_ICON| `UCSC Public Data `_ -* `UniGene `_ |OK_ICON| +* |OK_ICON| 
`UniGene `_ -* `Universal Protein Resource (UnitProt) `_ |OK_ICON| +* |OK_ICON| `Universal Protein Resource (UnitProt) `_ Climate+Weather --------------- -* `Actuaries Climate Index `_ |OK_ICON| +* |OK_ICON| `Actuaries Climate Index `_ -* `Australian Weather `_ |OK_ICON| +* |OK_ICON| `Australian Weather `_ -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ |OK_ICON| +* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ -* `Brazilian Weather - Historical data (In Portuguese) `_ |OK_ICON| +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ -* `Canadian Meteorological Centre `_ |OK_ICON| +* |OK_ICON| `Canadian Meteorological Centre `_ -* `Climate Data from UEA (updated monthly) `_ |OK_ICON| +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* `European Climate Assessment & Dataset `_ |OK_ICON| +* |OK_ICON| `European Climate Assessment & Dataset `_ -* `Global Climate Data Since 1929 `_ |OK_ICON| +* |OK_ICON| `Global Climate Data Since 1929 `_ -* `NASA Global Imagery Browse Services `_ |OK_ICON| +* |OK_ICON| `NASA Global Imagery Browse Services `_ -* `NOAA Bering Sea Climate `_ |FIXME_ICON| +* |FIXME_ICON| `NOAA Bering Sea Climate `_ -* `NOAA Climate Datasets `_ |OK_ICON| +* |OK_ICON| `NOAA Climate Datasets `_ -* `NOAA Realtime Weather Models `_ |OK_ICON| +* |OK_ICON| `NOAA Realtime Weather Models `_ -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ |OK_ICON| +* |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ -* `The World Bank Open Data Resources for Climate Change `_ |OK_ICON| +* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ -* `UEA Climatic Research Unit `_ |OK_ICON| +* |OK_ICON| `UEA Climatic Research Unit `_ -* `WU Historical Weather Worldwide `_ |OK_ICON| +* |OK_ICON| `WU Historical Weather Worldwide `_ -* `WorldClim - Global Climate Data `_ |OK_ICON| +* |OK_ICON| 
`WorldClim - Global Climate Data `_ ComplexNetworks --------------- -* `AMiner Citation Network Dataset `_ |OK_ICON| +* |OK_ICON| `AMiner Citation Network Dataset `_ -* `CrossRef DOI URLs `_ |OK_ICON| +* |OK_ICON| `CrossRef DOI URLs `_ -* `DBLP Citation dataset `_ |OK_ICON| +* |OK_ICON| `DBLP Citation dataset `_ -* `DIMACS Road Networks Collection `_ |OK_ICON| +* |OK_ICON| `DIMACS Road Networks Collection `_ -* `NBER Patent Citations `_ |OK_ICON| +* |OK_ICON| `NBER Patent Citations `_ -* `NIST complex networks data collection `_ |OK_ICON| +* |OK_ICON| `NIST complex networks data collection `_ -* `Network Repository with Interactive Exploratory Analysis Tools `_ |OK_ICON| +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ -* `Protein-protein interaction network `_ |OK_ICON| +* |OK_ICON| `Protein-protein interaction network `_ -* `PyPI and Maven Dependency Network `_ |OK_ICON| +* |OK_ICON| `PyPI and Maven Dependency Network `_ -* `Scopus Citation Database `_ |OK_ICON| +* |OK_ICON| `Scopus Citation Database `_ -* `Small Network Data `_ |OK_ICON| +* |OK_ICON| `Small Network Data `_ -* `Stanford GraphBase `_ |OK_ICON| +* |OK_ICON| `Stanford GraphBase `_ -* `Stanford Large Network Dataset Collection `_ |OK_ICON| +* |OK_ICON| `Stanford Large Network Dataset Collection `_ -* `Stanford Longitudinal Network Data Sources `_ |OK_ICON| +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ -* `The Koblenz Network Collection `_ |OK_ICON| +* |OK_ICON| `The Koblenz Network Collection `_ -* `The Laboratory for Web Algorithmics (UNIMI) `_ |OK_ICON| +* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ -* `The Nexus Network Repository `_ |FIXME_ICON| +* |FIXME_ICON| `The Nexus Network Repository `_ -* `UCI Network Data Repository `_ |OK_ICON| +* |OK_ICON| `UCI Network Data Repository `_ -* `UFL sparse matrix collection `_ |OK_ICON| +* |OK_ICON| `UFL sparse matrix collection `_ -* `WSU Graph Database `_ |OK_ICON| +* |OK_ICON| `WSU Graph 
Database `_ ComputerNetworks ---------------- -* `3.5B Web Pages from CommonCrawl 2012 `_ |OK_ICON| +* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ -* `53.5B Web clicks of 100K users in Indiana Univ. `_ |OK_ICON| +* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ -* `CAIDA Internet Datasets `_ |OK_ICON| +* |OK_ICON| `CAIDA Internet Datasets `_ -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ |FIXME_ICON| +* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ -* `ClueWeb09 - 1B web pages `_ |OK_ICON| +* |OK_ICON| `ClueWeb09 - 1B web pages `_ -* `ClueWeb12 - 733M web pages `_ |OK_ICON| +* |OK_ICON| `ClueWeb12 - 733M web pages `_ -* `CommonCrawl Web Data over 7 years `_ |OK_ICON| +* |OK_ICON| `CommonCrawl Web Data over 7 years `_ -* `Criteo click-through data `_ |OK_ICON| +* |OK_ICON| `Criteo click-through data `_ -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ |OK_ICON| +* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ -* `Open Mobile Data by MobiPerf `_ |OK_ICON| +* |OK_ICON| `Open Mobile Data by MobiPerf `_ -* `Rapid7 Sonar Internet Scans `_ |OK_ICON| +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ -* `UCSD Network Telescope, IPv4 /8 net `_ |OK_ICON| +* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ DataChallenges -------------- -* `Bruteforce Database `_ |OK_ICON| +* |OK_ICON| `Bruteforce Database `_ -* `Challenges in Machine Learning `_ |OK_ICON| +* |OK_ICON| `Challenges in Machine Learning `_ -* `CrowdANALYTIX dataX `_ |OK_ICON| +* |OK_ICON| `CrowdANALYTIX dataX `_ -* `D4D Challenge of Orange `_ |FIXME_ICON| +* |FIXME_ICON| `D4D Challenge of Orange `_ -* `DrivenData Competitions for Social Good `_ |OK_ICON| +* |OK_ICON| `DrivenData Competitions for Social Good `_ -* `ICWSM Data Challenge (since 2009) `_ |FIXME_ICON| +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ -* `KDD Cup by Tencent 2012 `_ |OK_ICON| +* |OK_ICON| `KDD Cup by Tencent 2012 `_ -* 
`Kaggle Competition Data `_ |OK_ICON| +* |OK_ICON| `Kaggle Competition Data `_ -* `Localytics Data Visualization Challenge `_ |OK_ICON| +* |OK_ICON| `Localytics Data Visualization Challenge `_ -* `Netflix Prize `_ |OK_ICON| +* |OK_ICON| `Netflix Prize `_ -* `Space Apps Challenge `_ |OK_ICON| +* |OK_ICON| `Space Apps Challenge `_ -* `Telecom Italia Big Data Challenge `_ |OK_ICON| +* |OK_ICON| `Telecom Italia Big Data Challenge `_ -* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ |OK_ICON| +* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* `Yelp Dataset Challenge `_ |OK_ICON| +* |OK_ICON| `Yelp Dataset Challenge `_ EarthScience ------------ -* `AQUASTAT - Global water resources and uses `_ |OK_ICON| +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ -* `BODC - marine data of ~22K vars `_ |OK_ICON| +* |OK_ICON| `BODC - marine data of ~22K vars `_ -* `EOSDIS - NASA's earth observing system data `_ |OK_ICON| +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ -* `Earth Models `_ |OK_ICON| +* |OK_ICON| `Earth Models `_ -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ |OK_ICON| +* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* `Marinexplore - Open Oceanographic Data `_ |OK_ICON| +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ -* `Smithsonian Institution Global Volcano and Eruption Database `_ |OK_ICON| +* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ -* `USGS Earthquake Archives `_ |OK_ICON| +* |OK_ICON| `USGS Earthquake Archives `_ Economics --------- -* `American Economic Association (AEA) `_ |OK_ICON| +* |OK_ICON| `American Economic Association (AEA) `_ -* `EconData from UMD `_ |OK_ICON| +* |OK_ICON| `EconData from UMD `_ -* `Economic Freedom of the World Data `_ |FIXME_ICON| +* |FIXME_ICON| `Economic Freedom of the World Data `_ -* `Historical MacroEconomc Statistics `_ |OK_ICON| +* |OK_ICON| `Historical 
MacroEconomc Statistics `_ -* `International Economics Database `_ |OK_ICON| +* |OK_ICON| `International Economics Database `_ -* `International Trade Statistics `_ |OK_ICON| +* |OK_ICON| `International Trade Statistics `_ -* `Internet Product Code Database `_ |OK_ICON| +* |OK_ICON| `Internet Product Code Database `_ -* `Joint External Debt Data Hub `_ |OK_ICON| +* |OK_ICON| `Joint External Debt Data Hub `_ -* `Jon Haveman International Trade Data Links `_ |OK_ICON| +* |OK_ICON| `Jon Haveman International Trade Data Links `_ -* `OpenCorporates Database of Companies in the World `_ |OK_ICON| +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ -* `Our World in Data `_ |OK_ICON| +* |OK_ICON| `Our World in Data `_ -* `SciencesPo World Trade Gravity Datasets `_ |OK_ICON| +* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ -* `The Atlas of Economic Complexity `_ |OK_ICON| +* |OK_ICON| `The Atlas of Economic Complexity `_ -* `The Center for International Data `_ |OK_ICON| +* |OK_ICON| `The Center for International Data `_ -* `The Observatory of Economic Complexity `_ |OK_ICON| +* |OK_ICON| `The Observatory of Economic Complexity `_ -* `UN Commodity Trade Statistics `_ |OK_ICON| +* |OK_ICON| `UN Commodity Trade Statistics `_ -* `UN Human Development Reports `_ |OK_ICON| +* |OK_ICON| `UN Human Development Reports `_ Education --------- -* `College Scorecard Data `_ |OK_ICON| +* |OK_ICON| `College Scorecard Data `_ -* `Student Data from Free Code Camp `_ |OK_ICON| +* |OK_ICON| `Student Data from Free Code Camp `_ Energy ------ -* `AMPds `_ |OK_ICON| +* |OK_ICON| `AMPds `_ -* `BLUEd `_ |OK_ICON| +* |OK_ICON| `BLUEd `_ -* `COMBED `_ |OK_ICON| +* |OK_ICON| `COMBED `_ -* `DRED `_ |OK_ICON| +* |OK_ICON| `DRED `_ -* `ECO `_ |OK_ICON| +* |OK_ICON| `ECO `_ -* `EIA `_ |OK_ICON| +* |OK_ICON| `EIA `_ -* `HES - Household Electricity Study, UK `_ |OK_ICON| +* |OK_ICON| `HES - Household Electricity Study, UK `_ -* `HFED `_ |OK_ICON| +* |OK_ICON| `HFED `_ -* `PLAID - 
The Plug Load Appliance Identification Dataset `_ |FIXME_ICON| +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ -* `REDD `_ |OK_ICON| +* |OK_ICON| `REDD `_ -* `Tracebase `_ |OK_ICON| +* |OK_ICON| `Tracebase `_ -* `UK-DALE - UK Domestic Appliance-Level Electricity `_ |OK_ICON| +* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ -* `WHITED `_ |OK_ICON| +* |OK_ICON| `WHITED `_ -* `iAWE `_ |OK_ICON| +* |OK_ICON| `iAWE `_ Finance ------- -* `CBOE Futures Exchange `_ |FIXME_ICON| +* |FIXME_ICON| `CBOE Futures Exchange `_ -* `Google Finance `_ |OK_ICON| +* |OK_ICON| `Google Finance `_ -* `Google Trends `_ |OK_ICON| +* |OK_ICON| `Google Trends `_ -* `NASDAQ `_ |OK_ICON| +* |OK_ICON| `NASDAQ `_ -* `NYSE Market Data `_ |OK_ICON| +* |OK_ICON| `NYSE Market Data `_ -* `OANDA `_ |OK_ICON| +* |OK_ICON| `OANDA `_ -* `OSU Financial data `_ |OK_ICON| +* |OK_ICON| `OSU Financial data `_ -* `Quandl `_ |OK_ICON| +* |OK_ICON| `Quandl `_ -* `St Louis Federal `_ |OK_ICON| +* |OK_ICON| `St Louis Federal `_ -* `Yahoo Finance `_ |OK_ICON| +* |OK_ICON| `Yahoo Finance `_ GIS --- -* `ArcGIS Open Data portal `_ |OK_ICON| +* |OK_ICON| `ArcGIS Open Data portal `_ -* `Cambridge, MA, US, GIS data on GitHub `_ |OK_ICON| +* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ -* `Factual Global Location Data `_ |OK_ICON| +* |OK_ICON| `Factual Global Location Data `_ -* `Geo Spatial Data from ASU `_ |OK_ICON| +* |OK_ICON| `Geo Spatial Data from ASU `_ -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ |OK_ICON| +* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ |OK_ICON| +* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* `GeoNames Worldwide `_ |OK_ICON| +* |OK_ICON| `GeoNames Worldwide `_ -* `Global Administrative Areas Database (GADM) `_ |OK_ICON| +* |OK_ICON| `Global Administrative Areas Database (GADM) `_ -* 
`Homeland Infrastructure Foundation-Level Data `_ |OK_ICON| +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ -* `Landsat 8 on AWS `_ |OK_ICON| +* |OK_ICON| `Landsat 8 on AWS `_ -* `List of all countries in all languages `_ |OK_ICON| +* |OK_ICON| `List of all countries in all languages `_ -* `National Weather Service GIS Data Portal `_ |OK_ICON| +* |OK_ICON| `National Weather Service GIS Data Portal `_ -* `Natural Earth - vectors and rasters of the world `_ |OK_ICON| +* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ -* `OpenAddresses `_ |OK_ICON| +* |OK_ICON| `OpenAddresses `_ -* `OpenStreetMap (OSM) `_ |OK_ICON| +* |OK_ICON| `OpenStreetMap (OSM) `_ -* `Pleiades - Gazetteer and graph of ancient places `_ |OK_ICON| +* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ -* `Reverse Geocoder using OSM data `_ |OK_ICON| +* |OK_ICON| `Reverse Geocoder using OSM data `_ -* `TIGER/Line - U.S. boundaries and roads `_ |FIXME_ICON| +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ -* `TZ Timezones shapfiles `_ |OK_ICON| +* |OK_ICON| `TZ Timezones shapfiles `_ -* `TwoFishes - Foursquare's coarse geocoder `_ |OK_ICON| +* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ -* `UN Environmental Data `_ |OK_ICON| +* |OK_ICON| `UN Environmental Data `_ -* `World boundaries from the U.S. Department of State `_ |FIXME_ICON| +* |FIXME_ICON| `World boundaries from the U.S. 
Department of State `_ -* `World countries in multiple formats `_ |OK_ICON| +* |OK_ICON| `World countries in multiple formats `_ Government ---------- -* `Alberta, Province of Canada `_ |OK_ICON| +* |OK_ICON| `Alberta, Province of Canada `_ -* `Antwerp, Belgium `_ |OK_ICON| +* |OK_ICON| `Antwerp, Belgium `_ -* `Argentina (non official) `_ |OK_ICON| +* |OK_ICON| `Argentina (non official) `_ -* `Argentina `_ |FIXME_ICON| +* |FIXME_ICON| `Argentina `_ -* `Austin, TX, US `_ |OK_ICON| +* |OK_ICON| `Austin, TX, US `_ -* `Australia (abs.gov.au) `_ |OK_ICON| +* |OK_ICON| `Australia (abs.gov.au) `_ -* `Australia (data.gov.au) `_ |OK_ICON| +* |OK_ICON| `Australia (data.gov.au) `_ -* `Austria (data.gv.at) `_ |OK_ICON| +* |OK_ICON| `Austria (data.gv.at) `_ -* `Baton Rouge, LA, US `_ |OK_ICON| +* |OK_ICON| `Baton Rouge, LA, US `_ -* `Belgium `_ |OK_ICON| +* |OK_ICON| `Belgium `_ -* `Brazil `_ |OK_ICON| +* |OK_ICON| `Brazil `_ -* `Buenos Aires, Argentina `_ |OK_ICON| +* |OK_ICON| `Buenos Aires, Argentina `_ -* `Calgary, AB, Canada `_ |FIXME_ICON| +* |FIXME_ICON| `Calgary, AB, Canada `_ -* `Cambridge, MA, US `_ |OK_ICON| +* |OK_ICON| `Cambridge, MA, US `_ -* `Canada `_ |FIXME_ICON| +* |FIXME_ICON| `Canada `_ -* `Chicago `_ |OK_ICON| +* |OK_ICON| `Chicago `_ -* `Chile `_ |OK_ICON| +* |OK_ICON| `Chile `_ -* `Dallas Open Data `_ |OK_ICON| +* |OK_ICON| `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ |OK_ICON| +* |OK_ICON| `DataBC - data from the Province of British Columbia `_ -* `Denver Open Data `_ |OK_ICON| +* |OK_ICON| `Denver Open Data `_ -* `Durham, NC Open Data `_ |OK_ICON| +* |OK_ICON| `Durham, NC Open Data `_ -* `Edmonton, AB, Canada `_ |OK_ICON| +* |OK_ICON| `Edmonton, AB, Canada `_ -* `England LGInform `_ |OK_ICON| +* |OK_ICON| `England LGInform `_ -* `EuroStat `_ |OK_ICON| +* |OK_ICON| `EuroStat `_ -* `EveryPolitician - Ongoing project collating and sharing data on every politician. 
`_ |OK_ICON| +* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ -* `FedStats `_ |OK_ICON| +* |OK_ICON| `FedStats `_ -* `Finland `_ |OK_ICON| +* |OK_ICON| `Finland `_ -* `France `_ |OK_ICON| +* |OK_ICON| `France `_ -* `Fredericton, NB, Canada `_ |OK_ICON| +* |OK_ICON| `Fredericton, NB, Canada `_ -* `Gatineau, QC, Canada `_ |OK_ICON| +* |OK_ICON| `Gatineau, QC, Canada `_ -* `Germany `_ |OK_ICON| +* |OK_ICON| `Germany `_ -* `Ghent, Belgium `_ |FIXME_ICON| +* |FIXME_ICON| `Ghent, Belgium `_ -* `Glasgow, Scotland, UK `_ |FIXME_ICON| +* |FIXME_ICON| `Glasgow, Scotland, UK `_ -* `Greece `_ |OK_ICON| +* |OK_ICON| `Greece `_ -* `Guardian world governments `_ |OK_ICON| +* |OK_ICON| `Guardian world governments `_ -* `Halifax, NS, Canada `_ |FIXME_ICON| +* |FIXME_ICON| `Halifax, NS, Canada `_ -* `Helsinki Region, Finland `_ |OK_ICON| +* |OK_ICON| `Helsinki Region, Finland `_ -* `Hong Kong, China `_ |OK_ICON| +* |OK_ICON| `Hong Kong, China `_ -* `Houston Open Data `_ |FIXME_ICON| +* |FIXME_ICON| `Houston Open Data `_ -* `Indian Government Data `_ |OK_ICON| +* |OK_ICON| `Indian Government Data `_ -* `Indonesian Data Portal `_ |OK_ICON| +* |OK_ICON| `Indonesian Data Portal `_ -* `Ireland's Open Data Portal `_ |OK_ICON| +* |OK_ICON| `Ireland's Open Data Portal `_ -* `Japan `_ |OK_ICON| +* |OK_ICON| `Japan `_ -* `Laval, QC, Canada `_ |OK_ICON| +* |OK_ICON| `Laval, QC, Canada `_ -* `Lexington, KY `_ |OK_ICON| +* |OK_ICON| `Lexington, KY `_ -* `London Datastore, UK `_ |OK_ICON| +* |OK_ICON| `London Datastore, UK `_ -* `London, ON, Canada `_ |OK_ICON| +* |OK_ICON| `London, ON, Canada `_ -* `Los Angeles Open Data `_ |OK_ICON| +* |OK_ICON| `Los Angeles Open Data `_ -* `MassGIS, Massachusetts, U.S. `_ |OK_ICON| +* |OK_ICON| `MassGIS, Massachusetts, U.S. 
`_ -* `Metropolitain Transportation Commission (MTC), California, US `_ |OK_ICON| +* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ -* `Mexico `_ |OK_ICON| +* |OK_ICON| `Mexico `_ -* `Missisauga, ON, Canada `_ |OK_ICON| +* |OK_ICON| `Missisauga, ON, Canada `_ -* `Moldova `_ |OK_ICON| +* |OK_ICON| `Moldova `_ -* `Moncton, NB, Canada `_ |OK_ICON| +* |OK_ICON| `Moncton, NB, Canada `_ -* `Montreal, QC, Canada `_ |OK_ICON| +* |OK_ICON| `Montreal, QC, Canada `_ -* `Mountain View, California, US (GIS) `_ |OK_ICON| +* |OK_ICON| `Mountain View, California, US (GIS) `_ -* `NYC Open Data `_ |FIXME_ICON| +* |FIXME_ICON| `NYC Open Data `_ -* `NYC betanyc `_ |OK_ICON| +* |OK_ICON| `NYC betanyc `_ -* `Netherlands `_ |OK_ICON| +* |OK_ICON| `Netherlands `_ -* `New Zealand `_ |OK_ICON| +* |OK_ICON| `New Zealand `_ -* `OECD `_ |OK_ICON| +* |OK_ICON| `OECD `_ -* `Oakland, California, US `_ |OK_ICON| +* |OK_ICON| `Oakland, California, US `_ -* `Oklahoma `_ |OK_ICON| +* |OK_ICON| `Oklahoma `_ -* `Open Data for Africa `_ |OK_ICON| +* |OK_ICON| `Open Data for Africa `_ -* `Open Government Data (OGD) Platform India `_ |OK_ICON| +* |OK_ICON| `Open Government Data (OGD) Platform India `_ -* `OpenDataSoft's list of 1,600 open data `_ |OK_ICON| +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ -* `Oregon `_ |OK_ICON| +* |OK_ICON| `Oregon `_ -* `Ottawa, ON, Canada `_ |OK_ICON| +* |OK_ICON| `Ottawa, ON, Canada `_ -* `Palo Alto, California, US `_ |OK_ICON| +* |OK_ICON| `Palo Alto, California, US `_ -* `Portland, Oregon `_ |OK_ICON| +* |OK_ICON| `Portland, Oregon `_ -* `Portugal - Pordata organization `_ |OK_ICON| +* |OK_ICON| `Portugal - Pordata organization `_ -* `Puerto Rico Government `_ |OK_ICON| +* |OK_ICON| `Puerto Rico Government `_ -* `Quebec City, QC, Canada `_ |OK_ICON| +* |OK_ICON| `Quebec City, QC, Canada `_ -* `Quebec Province of Canada `_ |OK_ICON| +* |OK_ICON| `Quebec Province of Canada `_ -* `Regina SK, Canada `_ |OK_ICON| +* |OK_ICON| `Regina 
SK, Canada `_ -* `Rio de Janeiro, Brazil `_ |FIXME_ICON| +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ -* `Romania `_ |OK_ICON| +* |OK_ICON| `Romania `_ -* `Russia `_ |OK_ICON| +* |OK_ICON| `Russia `_ -* `San Francisco Data sets `_ |OK_ICON| +* |OK_ICON| `San Francisco Data sets `_ -* `San Jose, California, US `_ |OK_ICON| +* |OK_ICON| `San Jose, California, US `_ -* `San Mateo County, California, US `_ |OK_ICON| +* |OK_ICON| `San Mateo County, California, US `_ -* `Saskatchewan, Province of Canada `_ |OK_ICON| +* |OK_ICON| `Saskatchewan, Province of Canada `_ -* `Seattle `_ |OK_ICON| +* |OK_ICON| `Seattle `_ -* `Singapore Government Data `_ |OK_ICON| +* |OK_ICON| `Singapore Government Data `_ -* `South Africa Trade Statistics `_ |OK_ICON| +* |OK_ICON| `South Africa Trade Statistics `_ -* `South Africa `_ |OK_ICON| +* |OK_ICON| `South Africa `_ -* `State of Utah, US `_ |OK_ICON| +* |OK_ICON| `State of Utah, US `_ -* `Switzerland `_ |OK_ICON| +* |OK_ICON| `Switzerland `_ -* `Taiwan g0v `_ |OK_ICON| +* |OK_ICON| `Taiwan g0v `_ -* `Taiwan `_ |OK_ICON| +* |OK_ICON| `Taiwan `_ -* `Texas Open Data `_ |OK_ICON| +* |OK_ICON| `Texas Open Data `_ -* `The World Bank `_ |FIXME_ICON| +* |FIXME_ICON| `The World Bank `_ -* `Toronto, ON, Canada `_ |OK_ICON| +* |OK_ICON| `Toronto, ON, Canada `_ -* `Tunisia `_ |OK_ICON| +* |OK_ICON| `Tunisia `_ -* `U.K. Government Data `_ |OK_ICON| +* |OK_ICON| `U.K. Government Data `_ -* `U.S. American Community Survey `_ |OK_ICON| +* |OK_ICON| `U.S. American Community Survey `_ -* `U.S. CDC Public Health datasets `_ |OK_ICON| +* |OK_ICON| `U.S. CDC Public Health datasets `_ -* `U.S. Census Bureau `_ |OK_ICON| +* |OK_ICON| `U.S. Census Bureau `_ -* `U.S. Department of Housing and Urban Development (HUD) `_ |OK_ICON| +* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ -* `U.S. Federal Government Agencies `_ |OK_ICON| +* |OK_ICON| `U.S. Federal Government Agencies `_ -* `U.S. 
Federal Government Data Catalog `_ |OK_ICON| +* |OK_ICON| `U.S. Federal Government Data Catalog `_ -* `U.S. Food and Drug Administration (FDA) `_ |OK_ICON| +* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* `U.S. National Center for Education Statistics (NCES) `_ |OK_ICON| +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ -* `U.S. Open Government `_ |OK_ICON| +* |OK_ICON| `U.S. Open Government `_ -* `UK 2011 Census Open Atlas Project `_ |FIXME_ICON| +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ -* `Uganda Bureau of Statistics `_ |OK_ICON| +* |OK_ICON| `Uganda Bureau of Statistics `_ -* `United Nations `_ |OK_ICON| +* |OK_ICON| `United Nations `_ -* `Uruguay `_ |OK_ICON| +* |OK_ICON| `Uruguay `_ -* `Valley Transportation Authority (VTA), California, US `_ |OK_ICON| +* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ -* `Vancouver, BC Open Data Catalog `_ |OK_ICON| +* |OK_ICON| `Vancouver, BC Open Data Catalog `_ -* `Victoria, BC, Canada `_ |FIXME_ICON| +* |FIXME_ICON| `Victoria, BC, Canada `_ -* `Vienna, Austria `_ |OK_ICON| +* |OK_ICON| `Vienna, Austria `_ Healthcare ---------- -* `EHDP Large Health Data Sets `_ |OK_ICON| +* |OK_ICON| `EHDP Large Health Data Sets `_ -* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ |OK_ICON| +* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ -* `Gapminder World demographic databases `_ |OK_ICON| +* |OK_ICON| `Gapminder World demographic databases `_ -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ |OK_ICON| +* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* `Medicare Coverage Database (MCD), U.S. `_ |OK_ICON| +* |OK_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ -* `Medicare Data Engine of medicare.gov Data `_ |OK_ICON| +* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ -* `Medicare Data File `_ |OK_ICON| +* |OK_ICON| `Medicare Data File `_ -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ |FIXME_ICON| +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ -* `Open-ODS (structure of the UK NHS) `_ |OK_ICON| +* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ -* `OpenPaymentsData, Healthcare financial relationship data `_ |OK_ICON| +* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* `PhysioBank Databases - A large and growing archive of physiological data. `_ |OK_ICON| +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ -* `The Cancer Genome Atlas project (TCGA) `_ |OK_ICON| +* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ -* `World Health Organization Global Health Observatory `_ |OK_ICON| +* |OK_ICON| `World Health Organization Global Health Observatory `_ ImageProcessing --------------- -* `10k US Adult Faces Database `_ |OK_ICON| +* |OK_ICON| `10k US Adult Faces Database `_ -* `2GB of Photos of Cats `_ |FIXME_ICON| +* |FIXME_ICON| `2GB of Photos of Cats `_ -* `Adience Unfiltered faces for gender and age classification `_ |OK_ICON| +* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ -* `Affective Image Classification `_ |OK_ICON| +* |OK_ICON| `Affective Image Classification `_ -* `Animals with attributes `_ |OK_ICON| +* |OK_ICON| `Animals with attributes `_ -* `Caltech Pedestrian Detection Benchmark `_ |OK_ICON| +* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ -* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ |OK_ICON| +* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ -* `Face Recognition Benchmark `_ |OK_ICON| +* |OK_ICON| `Face Recognition 
Benchmark `_ -* `Flickr: 32 Class Brand Logos `_ |OK_ICON| +* |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* `GDXray - X-ray images for X-ray testing and Computer Vision `_ |OK_ICON| +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* `ImageNet (in WordNet hierarchy) `_ |OK_ICON| +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ -* `Indoor Scene Recognition `_ |OK_ICON| +* |OK_ICON| `Indoor Scene Recognition `_ -* `International Affective Picture System, UFL `_ |OK_ICON| +* |OK_ICON| `International Affective Picture System, UFL `_ -* `MNIST database of handwritten digits, near 1 million examples `_ |OK_ICON| +* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ -* `Massive Visual Memory Stimuli, MIT `_ |OK_ICON| +* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ -* `SUN database, MIT `_ |OK_ICON| +* |OK_ICON| `SUN database, MIT `_ -* `Several Shape-from-Silhouette Datasets `_ |FIXME_ICON| +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ -* `Stanford Dogs Dataset `_ |OK_ICON| +* |OK_ICON| `Stanford Dogs Dataset `_ -* `The Action Similarity Labeling (ASLAN) Challenge `_ |OK_ICON| +* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* `The Oxford-IIIT Pet Dataset `_ |OK_ICON| +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ -* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ |OK_ICON| +* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ -* `Visual genome `_ |OK_ICON| +* |OK_ICON| `Visual genome `_ -* `YouTube Faces Database `_ |OK_ICON| +* |OK_ICON| `YouTube Faces Database `_ MachineLearning --------------- -* `Context-aware data sets from five domains `_ |OK_ICON| +* |OK_ICON| `Context-aware data sets from five domains `_ -* `Delve Datasets for classification and regression `_ |OK_ICON| +* |OK_ICON| `Delve Datasets for classification and regression `_ -* `Discogs Monthly Data `_ |OK_ICON| +* |OK_ICON| `Discogs Monthly Data `_ -* `Free Music 
Archive `_ |OK_ICON| +* |OK_ICON| `Free Music Archive `_ -* `IMDb Database `_ |OK_ICON| +* |OK_ICON| `IMDb Database `_ -* `Keel Repository for classification, regression and time series `_ |OK_ICON| +* |OK_ICON| `Keel Repository for classification, regression and time series `_ -* `Labeled Faces in the Wild (LFW) `_ |OK_ICON| +* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ -* `Lending Club Loan Data `_ |OK_ICON| +* |OK_ICON| `Lending Club Loan Data `_ -* `Machine Learning Data Set Repository `_ |OK_ICON| +* |OK_ICON| `Machine Learning Data Set Repository `_ -* `Million Song Dataset `_ |OK_ICON| +* |OK_ICON| `Million Song Dataset `_ -* `More Song Datasets `_ |OK_ICON| +* |OK_ICON| `More Song Datasets `_ -* `MovieLens Data Sets `_ |OK_ICON| +* |OK_ICON| `MovieLens Data Sets `_ -* `New Yorker caption contest ratings `_ |OK_ICON| +* |OK_ICON| `New Yorker caption contest ratings `_ -* `RDataMining - "R and Data Mining" ebook data `_ |OK_ICON| +* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ -* `Registered Meteorites on Earth `_ |OK_ICON| +* |OK_ICON| `Registered Meteorites on Earth `_ -* `Restaurants Health Score Data in San Francisco `_ |FIXME_ICON| +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ -* `UCI Machine Learning Repository `_ |OK_ICON| +* |OK_ICON| `UCI Machine Learning Repository `_ -* `Yahoo! Ratings and Classification Data `_ |FIXME_ICON| +* |FIXME_ICON| `Yahoo! 
Ratings and Classification Data `_ -* `Youtube 8m `_ |OK_ICON| +* |OK_ICON| `Youtube 8m `_ -* `eBay Online Auctions (2012) `_ |OK_ICON| +* |OK_ICON| `eBay Online Auctions (2012) `_ Museums ------- -* `Canada Science and Technology Museums Corporation's Open Data `_ |OK_ICON| +* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ -* `Cooper-Hewitt's Collection Database `_ |OK_ICON| +* |OK_ICON| `Cooper-Hewitt's Collection Database `_ -* `Minneapolis Institute of Arts metadata `_ |OK_ICON| +* |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* `Natural History Museum (London) Data Portal `_ |OK_ICON| +* |OK_ICON| `Natural History Museum (London) Data Portal `_ -* `Rijksmuseum Historical Art Collection `_ |OK_ICON| +* |OK_ICON| `Rijksmuseum Historical Art Collection `_ -* `Tate Collection metadata `_ |OK_ICON| +* |OK_ICON| `Tate Collection metadata `_ -* `The Getty vocabularies `_ |OK_ICON| +* |OK_ICON| `The Getty vocabularies `_ NaturalLanguage --------------- -* `Automatic Keyphrase Extraction `_ |OK_ICON| +* |OK_ICON| `Automatic Keyphrase Extraction `_ -* `Blogger Corpus `_ |OK_ICON| +* |OK_ICON| `Blogger Corpus `_ -* `CLiPS Stylometry Investigation Corpus `_ |OK_ICON| +* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ -* `ClueWeb09 FACC `_ |OK_ICON| +* |OK_ICON| `ClueWeb09 FACC `_ -* `ClueWeb12 FACC `_ |OK_ICON| +* |OK_ICON| `ClueWeb12 FACC `_ -* `DBpedia - 4.58M things with 583M facts `_ |OK_ICON| +* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ -* `Flickr Personal Taxonomies `_ |OK_ICON| +* |OK_ICON| `Flickr Personal Taxonomies `_ -* `Freebase of people, places, and things `_ |OK_ICON| +* |OK_ICON| `Freebase of people, places, and things `_ -* `Google Books Ngrams (2.2TB) `_ |OK_ICON| +* |OK_ICON| `Google Books Ngrams (2.2TB) `_ -* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ |OK_ICON| +* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword 
dataset using Paragraph Vectors `_ -* `Google Web 5gram (1TB, 2006) `_ |OK_ICON| +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* `Gutenberg eBooks List `_ |OK_ICON| +* |FIXME_ICON| `Gutenberg eBooks List `_ -* `Hansards text chunks of Canadian Parliament `_ |OK_ICON| +* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ |OK_ICON| +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ |OK_ICON| +* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* `Machine Translation of European languages `_ |OK_ICON| +* |OK_ICON| `Machine Translation of European languages `_ -* `Making Sense of Microposts 2013 - Concept Extraction `_ |FIXME_ICON| +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ |OK_ICON| +* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ -* `Multi-Domain Sentiment Dataset (version 2.0) `_ |OK_ICON| +* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ -* `Open Multilingual Wordnet `_ |OK_ICON| +* |OK_ICON| `Open Multilingual Wordnet `_ -* `POS/NER/Chunk annotated data `_ |OK_ICON| +* |OK_ICON| `POS/NER/Chunk annotated data `_ -* `Personae Corpus `_ |OK_ICON| +* |OK_ICON| `Personae Corpus `_ -* `SMS Spam Collection in English `_ |OK_ICON| +* |OK_ICON| `SMS Spam Collection in English `_ -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ |OK_ICON| +* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ -* `Stanford Question Answering Dataset (SQuAD) `_ |OK_ICON| +* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ -* `USENET postings corpus of 2005~2011 `_ |OK_ICON| +* |OK_ICON| `USENET postings corpus of 2005~2011 `_ -* `Universal 
Dependencies `_ |OK_ICON| +* |OK_ICON| `Universal Dependencies `_ -* `Webhose - News/Blogs in multiple languages `_ |OK_ICON| +* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ -* `Wikidata - Wikipedia databases `_ |OK_ICON| +* |OK_ICON| `Wikidata - Wikipedia databases `_ -* `Wikipedia Links data - 40 Million Entities in Context `_ |OK_ICON| +* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ -* `WordNet databases and tools `_ |OK_ICON| +* |OK_ICON| `WordNet databases and tools `_ Neuroscience ------------ -* `Allen Institute Datasets `_ |OK_ICON| +* |OK_ICON| `Allen Institute Datasets `_ -* `Brain Catalogue `_ |OK_ICON| +* |OK_ICON| `Brain Catalogue `_ -* `Brainomics `_ |OK_ICON| +* |OK_ICON| `Brainomics `_ -* `CodeNeuro Datasets `_ |OK_ICON| +* |OK_ICON| `CodeNeuro Datasets `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ |OK_ICON| +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* `FCP-INDI `_ |OK_ICON| +* |OK_ICON| `FCP-INDI `_ -* `Human Connectome Project `_ |OK_ICON| +* |OK_ICON| `Human Connectome Project `_ -* `NDAR `_ |OK_ICON| +* |OK_ICON| `NDAR `_ -* `NIMH Data Archive `_ |OK_ICON| +* |OK_ICON| `NIMH Data Archive `_ -* `NeuroData `_ |OK_ICON| +* |OK_ICON| `NeuroData `_ -* `Neuroelectro `_ |OK_ICON| +* |OK_ICON| `Neuroelectro `_ -* `OASIS `_ |OK_ICON| +* |OK_ICON| `OASIS `_ -* `OpenfMRI `_ |OK_ICON| +* |OK_ICON| `OpenfMRI `_ -* `Study Forrest `_ |OK_ICON| +* |OK_ICON| `Study Forrest `_ Physics ------- -* `CERN Open Data Portal `_ |OK_ICON| +* |OK_ICON| `CERN Open Data Portal `_ -* `Crystallography Open Database `_ |OK_ICON| +* |OK_ICON| `Crystallography Open Database `_ -* `NASA Exoplanet Archive `_ |OK_ICON| +* |OK_ICON| `NASA Exoplanet Archive `_ -* `NSSDC (NASA) data of 550 space spacecraft `_ |OK_ICON| +* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ -* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ |OK_ICON| +* |OK_ICON| `Sloan Digital Sky Survey 
(SDSS) - Mapping the Universe `_ Psychology+Cognition -------------------- -* `OSU Cognitive Modeling Repository Datasets `_ |FIXME_ICON| +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ PublicDomains ------------- -* `Amazon `_ |OK_ICON| +* |OK_ICON| `Amazon `_ -* `Archive.org Datasets `_ |OK_ICON| +* |OK_ICON| `Archive.org Datasets `_ -* `Archive-it from Internet Archive `_ |OK_ICON| +* |OK_ICON| `Archive-it from Internet Archive `_ -* `CMU JASA data archive `_ |OK_ICON| +* |OK_ICON| `CMU JASA data archive `_ -* `CMU StatLab collections `_ |OK_ICON| +* |OK_ICON| `CMU StatLab collections `_ -* `Data.World `_ |OK_ICON| +* |OK_ICON| `Data.World `_ -* `Data360 `_ |OK_ICON| +* |OK_ICON| `Data360 `_ -* `Enigma Public `_ |OK_ICON| +* |OK_ICON| `Enigma Public `_ -* `Google `_ |OK_ICON| +* |OK_ICON| `Google `_ -* `Infochimps `_ |FIXME_ICON| +* |FIXME_ICON| `Infochimps `_ -* `KDNuggets Data Collections `_ |OK_ICON| +* |OK_ICON| `KDNuggets Data Collections `_ -* `Microsoft Azure Data Market Free DataSets `_ |OK_ICON| +* |OK_ICON| `Microsoft Azure Data Market Free DataSets `_ -* `Microsoft Data Science for Research `_ |OK_ICON| +* |OK_ICON| `Microsoft Data Science for Research `_ -* `Numbray `_ |FIXME_ICON| +* |FIXME_ICON| `Numbray `_ -* `Open Library Data Dumps `_ |OK_ICON| +* |OK_ICON| `Open Library Data Dumps `_ -* `Reddit Datasets `_ |OK_ICON| +* |OK_ICON| `Reddit Datasets `_ -* `RevolutionAnalytics Collection `_ |OK_ICON| +* |OK_ICON| `RevolutionAnalytics Collection `_ -* `Sample R data sets `_ |OK_ICON| +* |OK_ICON| `Sample R data sets `_ -* `StatSci.org `_ |OK_ICON| +* |OK_ICON| `StatSci.org `_ -* `Stats4Stem R data sets `_ |FIXME_ICON| +* |FIXME_ICON| `Stats4Stem R data sets `_ -* `The Washington Post List `_ |OK_ICON| +* |OK_ICON| `The Washington Post List `_ -* `UCLA SOCR data collection `_ |OK_ICON| +* |OK_ICON| `UCLA SOCR data collection `_ -* `UFO Reports `_ |OK_ICON| +* |OK_ICON| `UFO Reports `_ -* `Wikileaks 911 pager intercepts `_ |OK_ICON| +* 
|OK_ICON| `Wikileaks 911 pager intercepts `_ -* `Yahoo Webscope `_ |FIXME_ICON| +* |FIXME_ICON| `Yahoo Webscope `_ SearchEngines ------------- -* `Academic Torrents of data sharing from UMB `_ |OK_ICON| +* |OK_ICON| `Academic Torrents of data sharing from UMB `_ -* `DataMarket (Qlik) `_ |OK_ICON| +* |OK_ICON| `DataMarket (Qlik) `_ -* `Datahub.io `_ |OK_ICON| +* |OK_ICON| `Datahub.io `_ -* `Harvard Dataverse Network of scientific data `_ |OK_ICON| +* |OK_ICON| `Harvard Dataverse Network of scientific data `_ -* `ICPSR (UMICH) `_ |OK_ICON| +* |OK_ICON| `ICPSR (UMICH) `_ -* `Institute of Education Sciences `_ |OK_ICON| +* |OK_ICON| `Institute of Education Sciences `_ -* `National Technical Reports Library `_ |FIXME_ICON| +* |FIXME_ICON| `National Technical Reports Library `_ -* `Open Data Certificates (beta) `_ |OK_ICON| +* |OK_ICON| `Open Data Certificates (beta) `_ -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ |OK_ICON| +* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ -* `Statista.com - statistics and Studies `_ |OK_ICON| +* |OK_ICON| `Statista.com - statistics and Studies `_ -* `Zenodo - An open dependable home for the long-tail of science `_ |OK_ICON| +* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ SocialNetworks -------------- -* `72 hours #gamergate Twitter Scrape `_ |OK_ICON| +* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ -* `Ancestry.com Forum Dataset over 10 years `_ |OK_ICON| +* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ -* `CMU Enron Email of 150 users `_ |OK_ICON| +* |OK_ICON| `CMU Enron Email of 150 users `_ -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ |OK_ICON| +* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* `EDRM Enron EMail of 151 users, hosted on S3 `_ |OK_ICON| +* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ -* `Facebook Data Scrape (2005) `_ |OK_ICON| +* 
|OK_ICON| `Facebook Data Scrape (2005) `_ -* `Facebook Social Networks from LAW (since 2007) `_ |OK_ICON| +* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ -* `Foursquare from UMN/Sarwat (2013) `_ |OK_ICON| +* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ -* `GitHub Collaboration Archive `_ |OK_ICON| +* |OK_ICON| `GitHub Collaboration Archive `_ -* `Google Scholar citation relations `_ |OK_ICON| +* |OK_ICON| `Google Scholar citation relations `_ -* `High-Resolution Contact Networks from Wearable Sensors `_ |OK_ICON| +* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ -* `Indie Map: social graph and crawl of top IndieWeb sites `_ |OK_ICON| +* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ -* `Mobile Social Networks from UMASS `_ |OK_ICON| +* |OK_ICON| `Mobile Social Networks from UMASS `_ -* `Network Twitter Data `_ |OK_ICON| +* |OK_ICON| `Network Twitter Data `_ -* `Reddit Comments `_ |OK_ICON| +* |OK_ICON| `Reddit Comments `_ -* `Skytrax' Air Travel Reviews Dataset `_ |OK_ICON| +* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ -* `Social Twitter Data `_ |OK_ICON| +* |OK_ICON| `Social Twitter Data `_ -* `SourceForge.net Research Data `_ |OK_ICON| +* |OK_ICON| `SourceForge.net Research Data `_ -* `Twitter Data for Online Reputation Management `_ |OK_ICON| +* |OK_ICON| `Twitter Data for Online Reputation Management `_ -* `Twitter Data for Sentiment Analysis `_ |OK_ICON| +* |OK_ICON| `Twitter Data for Sentiment Analysis `_ -* `Twitter Graph of entire Twitter site `_ |OK_ICON| +* |OK_ICON| `Twitter Graph of entire Twitter site `_ -* `Twitter Scrape Calufa May 2011 `_ |FIXME_ICON| +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ -* `UNIMI/LAW Social Network Datasets `_ |OK_ICON| +* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ -* `Yahoo! Graph and Social Data `_ |FIXME_ICON| +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ -* `Youtube Video Social Graph in 2007,2008 `_ |OK_ICON| +* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ SocialSciences -------------- -* `ACLED (Armed Conflict Location & Event Data Project) `_ |OK_ICON| +* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* `Canadian Legal Information Institute `_ |FIXME_ICON| +* |OK_ICON| `Canadian Legal Information Institute `_ -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ |OK_ICON| +* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ -* `Correlates of War Project `_ |OK_ICON| +* |OK_ICON| `Correlates of War Project `_ -* `Cryptome Conspiracy Theory Items `_ |OK_ICON| +* |OK_ICON| `Cryptome Conspiracy Theory Items `_ -* `Datacards `_ |FIXME_ICON| +* |FIXME_ICON| `Datacards `_ -* `European Social Survey `_ |OK_ICON| +* |OK_ICON| `European Social Survey `_ -* `FBI Hate Crime 2013 - aggregated data `_ |OK_ICON| +* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ -* `Fragile States Index `_ |FIXME_ICON| +* |FIXME_ICON| `Fragile States Index `_ -* `GDELT Global Events Database `_ |OK_ICON| +* |OK_ICON| `GDELT Global Events Database `_ -* `General Social Survey (GSS) since 1972 `_ |OK_ICON| +* |OK_ICON| `General Social Survey (GSS) since 1972 `_ -* `German Social Survey `_ |OK_ICON| +* |OK_ICON| `German Social Survey `_ -* `Global Religious Futures Project `_ |OK_ICON| +* |OK_ICON| `Global Religious Futures Project `_ -* `Humanitarian Data Exchange `_ |FIXME_ICON| +* |FIXME_ICON| `Humanitarian Data Exchange `_ -* `INFORM Index for Risk Management `_ |OK_ICON| +* |OK_ICON| `INFORM Index for Risk Management `_ -* `Institute for Demographic Studies `_ |OK_ICON| +* |OK_ICON| `Institute for Demographic Studies `_ -* `International Networks Archive `_ |OK_ICON| +* |OK_ICON| `International Networks Archive `_ -* `International Social Survey Program ISSP `_ |OK_ICON| +* |OK_ICON| `International 
Social Survey Program ISSP `_ -* `International Studies Compendium Project `_ |OK_ICON| +* |OK_ICON| `International Studies Compendium Project `_ -* `James McGuire Cross National Data `_ |OK_ICON| +* |OK_ICON| `James McGuire Cross National Data `_ -* `MIT Reality Mining Dataset `_ |OK_ICON| +* |OK_ICON| `MIT Reality Mining Dataset `_ -* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ |OK_ICON| +* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ -* `Minnesota Population Center `_ |OK_ICON| +* |OK_ICON| `Minnesota Population Center `_ -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ |OK_ICON| +* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ |OK_ICON| +* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* `Paul Hensel General International Data Page `_ |OK_ICON| +* |OK_ICON| `Paul Hensel General International Data Page `_ -* `PewResearch Internet Survey Project `_ |FIXME_ICON| +* |FIXME_ICON| `PewResearch Internet Survey Project `_ -* `PewResearch Society Data Collection `_ |OK_ICON| +* |OK_ICON| `PewResearch Society Data Collection `_ -* `Political Polarity Data `_ |OK_ICON| +* |OK_ICON| `Political Polarity Data `_ -* `StackExchange Data Explorer `_ |OK_ICON| +* |OK_ICON| `StackExchange Data Explorer `_ -* `Terrorism Research and Analysis Consortium `_ |OK_ICON| +* |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* `Texas Inmates Executed Since 1984 `_ |FIXME_ICON| +* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ -* `Titanic Survival Data Set `_ |OK_ICON| +* |OK_ICON| `Titanic Survival Data Set `_ -* `UCB's Archive of Social Science Data (D-Lab) `_ |OK_ICON| +* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ -* `UCLA Social Sciences Data Archive `_ |FIXME_ICON| +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ -* `UN Civil Society Database `_ |OK_ICON| +* |OK_ICON| `UN Civil 
Society Database `_ -* `UPJOHN for Labor Employment Research `_ |OK_ICON| +* |OK_ICON| `UPJOHN for Labor Employment Research `_ -* `Universities Worldwide `_ |OK_ICON| +* |OK_ICON| `Universities Worldwide `_ -* `Uppsala Conflict Data Program `_ |OK_ICON| +* |OK_ICON| `Uppsala Conflict Data Program `_ -* `World Bank Open Data `_ |OK_ICON| +* |OK_ICON| `World Bank Open Data `_ -* `WorldPop project - Worldwide human population distributions `_ |OK_ICON| +* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ Software -------- -* `FLOSSmole data about free, libre, and open source software development `_ |OK_ICON| +* |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ Sports ------ -* `Betfair Historical Exchange Data `_ |OK_ICON| +* |OK_ICON| `Betfair Historical Exchange Data `_ -* `Cricsheet Matches (cricket) `_ |OK_ICON| +* |OK_ICON| `Cricsheet Matches (cricket) `_ -* `Ergast Formula 1, from 1950 up to date (API) `_ |OK_ICON| +* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ -* `Football/Soccer resources (data and APIs) `_ |OK_ICON| +* |OK_ICON| `Football/Soccer resources (data and APIs) `_ -* `Lahman's Baseball Database `_ |OK_ICON| +* |OK_ICON| `Lahman's Baseball Database `_ -* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ |OK_ICON| +* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ -* `Retrosheet Baseball Statistics `_ |OK_ICON| +* |OK_ICON| `Retrosheet Baseball Statistics `_ -* `Tennis database of rankings, results, and stats for ATP `_ |OK_ICON| +* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ TimeSeries ---------- -* `Databanks International Cross National Time Series Data Archive `_ |OK_ICON| +* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ -* `Hard Drive Failure Rates `_ |OK_ICON| +* |OK_ICON| `Hard Drive Failure Rates `_ -* `Heart Rate Time Series from MIT `_ |OK_ICON| +* |OK_ICON| `Heart Rate Time Series from MIT `_ -* 
`Time Series Data Library (TSDL) from MU `_ |OK_ICON| +* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ -* `UC Riverside Time Series Dataset `_ |OK_ICON| +* |OK_ICON| `UC Riverside Time Series Dataset `_ Transportation -------------- -* `Airlines OD Data 1987-2008 `_ |OK_ICON| +* |OK_ICON| `Airlines OD Data 1987-2008 `_ -* `Bay Area Bike Share Data `_ |OK_ICON| +* |OK_ICON| `Bay Area Bike Share Data `_ -* `Bike Share Systems (BSS) collection `_ |OK_ICON| +* |OK_ICON| `Bike Share Systems (BSS) collection `_ -* `GeoLife GPS Trajectory from Microsoft Research `_ |OK_ICON| +* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ -* `German train system by Deutsche Bahn `_ |OK_ICON| +* |OK_ICON| `German train system by Deutsche Bahn `_ -* `Hubway Million Rides in MA `_ |OK_ICON| +* |OK_ICON| `Hubway Million Rides in MA `_ -* `Montreal BIXI Bike Share `_ |OK_ICON| +* |OK_ICON| `Montreal BIXI Bike Share `_ -* `NYC Taxi Trip Data 2009- `_ |OK_ICON| +* |OK_ICON| `NYC Taxi Trip Data 2009- `_ -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ |OK_ICON| +* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ -* `NYC Uber trip data April 2014 to September 2014 `_ |OK_ICON| +* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ -* `Open Traffic collection `_ |OK_ICON| +* |OK_ICON| `Open Traffic collection `_ -* `OpenFlights - airport, airline and route data `_ |OK_ICON| +* |OK_ICON| `OpenFlights - airport, airline and route data `_ -* `Philadelphia Bike Share Stations (JSON) `_ |FIXME_ICON| +* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ -* `Plane Crash Database, since 1920 `_ |OK_ICON| +* |OK_ICON| `Plane Crash Database, since 1920 `_ -* `RITA Airline On-Time Performance data `_ |OK_ICON| +* |OK_ICON| `RITA Airline On-Time Performance data `_ -* `RITA/BTS transport data collection (TranStat) `_ |OK_ICON| +* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ -* `Toronto Bike Share Stations (XML file) `_ |FIXME_ICON| +* |FIXME_ICON| `Toronto 
Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ |OK_ICON| +* |OK_ICON| `Transport for London (TFL) `_ -* `Travel Tracker Survey (TTS) for Chicago `_ |OK_ICON| +* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ -* `U.S. Bureau of Transportation Statistics (BTS) `_ |OK_ICON| +* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ -* `U.S. Domestic Flights 1990 to 2009 `_ |OK_ICON| +* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ -* `U.S. Freight Analysis Framework since 2007 `_ |OK_ICON| +* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ Complementary Collections From b74a8a0d274e079131fff1932d225318beafe09a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 16 Jan 2018 02:58:40 +0000 Subject: [PATCH 76/99] Update README from APD2: 7a429745cef43c18a251a3efcdc9f3d24bb76f29 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index ea9a499..1fa5075 100644 --- a/README.rst +++ b/README.rst @@ -455,7 +455,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -814,7 +814,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -918,7 +918,7 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1039,7 +1039,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1099,7 +1099,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ +* |OK_ICON| 
`Texas Inmates Executed Since 1984 `_ * |OK_ICON| `Titanic Survival Data Set `_ From c6adff41110a2aea439b1cbd80c26c615a61b445 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 16 Jan 2018 11:01:20 +0000 Subject: [PATCH 77/99] Update README from APD2: c1ced64df9666838f351d50f03fb2df7454e4964 --- README.rst | 26 +++++++++++++------------- 1 file changed, 13 insertions(+), 13 deletions(-) diff --git a/README.rst b/README.rst index 1fa5075..e9be04d 100644 --- a/README.rst +++ b/README.rst @@ -5,12 +5,12 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome -.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/ok-24.png -.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/fixme-24.png +.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png +.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/fixme-24.png -**NOTICE**: This repo is automatically generated by `APD2 `_. +**NOTICE**: This repo is automatically generated by `apd-core `_. Please **DO NOT** modify this file directly. We have provided -`a new way `_ +`a new way `_ to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. 
@@ -455,7 +455,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |FIXME_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -703,7 +703,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -721,7 +721,7 @@ ImageProcessing * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ +* |FIXME_ICON| `The Oxford-IIIT Pet Dataset `_ * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ @@ -814,7 +814,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -871,9 +871,9 @@ Neuroscience * |OK_ICON| `Human Connectome Project `_ -* |OK_ICON| `NDAR `_ +* |FIXME_ICON| `NDAR `_ -* |OK_ICON| `NIMH Data Archive `_ +* |FIXME_ICON| `NIMH Data Archive `_ * |OK_ICON| `NeuroData `_ @@ -918,13 +918,13 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |FIXME_ICON| `Data360 `_ +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1039,7 +1039,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From c916bc87f7b3cbb824f32aa511d08052a9379b81 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Jan 2018 06:18:35 +0000 Subject: [PATCH 78/99] Update README from APD2: c5ae7a39118b657109b4d22433828bc71272e719 --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 
deletions(-) diff --git a/README.rst b/README.rst index e9be04d..3453a64 100644 --- a/README.rst +++ b/README.rst @@ -455,7 +455,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -577,7 +577,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -674,7 +674,7 @@ Healthcare * |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |FIXME_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ @@ -703,7 +703,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -721,7 +721,7 @@ ImageProcessing * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* |FIXME_ICON| `The Oxford-IIIT Pet Dataset `_ +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ @@ -871,9 +871,9 @@ Neuroscience * |OK_ICON| `Human Connectome Project `_ -* |FIXME_ICON| `NDAR `_ +* |OK_ICON| `NDAR `_ -* |FIXME_ICON| `NIMH Data Archive `_ +* |OK_ICON| `NIMH Data Archive `_ * |OK_ICON| `NeuroData `_ @@ -924,7 +924,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From a42b1af4f7aa63c6ffcce4ca833f6f42c985f566 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Jan 2018 10:32:32 +0000 Subject: [PATCH 79/99] Update README from APD2: 362913faebaddee52093e4e00dc07fde582c07ae --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git 
a/README.rst b/README.rst index 3453a64..e917faa 100644 --- a/README.rst +++ b/README.rst @@ -14,7 +14,7 @@ Please **DO NOT** modify this file directly. We have provided to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. -`This list of a topic-centric public data sources `_ +`This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list. @@ -577,7 +577,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -609,7 +609,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -674,7 +674,7 @@ Healthcare * |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* |FIXME_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. 
`_ * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ @@ -814,7 +814,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -924,7 +924,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1039,7 +1039,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1101,7 +1101,7 @@ SocialSciences * |OK_ICON| `Texas Inmates Executed Since 1984 `_ -* |OK_ICON| `Titanic Survival Data Set `_ +* |OK_ICON| `Titanic Survival Data Set `_ * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ From 054344640b32dfb6952f4e81db5d757a71bcb4dc Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Jan 2018 13:46:48 +0000 Subject: [PATCH 80/99] Update README from APD2: 1beef07e22f1a175369d199949a45b7bcd8f09f2 --- README.rst | 16 ++++++++++++---- 1 file changed, 12 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index e917faa..e6d20ca 100644 --- a/README.rst +++ b/README.rst @@ -219,6 +219,8 @@ ComputerNetworks * |OK_ICON| `Criteo click-through data `_ +* |OK_ICON| `Internet-Wide Scan Data Repository `_ + * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ * |OK_ICON| `Open Mobile Data by MobiPerf `_ @@ -256,6 +258,8 @@ DataChallenges * |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ + * |OK_ICON| `Yelp Dataset Challenge `_ EarthScience @@ -288,6 +292,8 @@ Economics * |OK_ICON| `Historical MacroEconomc Statistics `_ +* 
|OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ + * |OK_ICON| `International Economics Database `_ * |OK_ICON| `International Trade Statistics `_ @@ -609,7 +615,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -637,6 +643,8 @@ Government * |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ +* |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ + * |OK_ICON| `Uganda Bureau of Statistics `_ * |OK_ICON| `United Nations `_ @@ -814,7 +822,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -924,7 +932,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1039,7 +1047,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From d150cc8476fa5a77eb4d9f1ca27e64f23cb91098 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 18 Jan 2018 13:02:21 +0000 Subject: [PATCH 81/99] Update README from APD2: 41715b0f16f7271e07cb3b53a81e4d8addd87138 --- README.rst | 20 +++++++++++++++----- 1 file changed, 15 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index e6d20ca..e78f3ce 100644 --- a/README.rst +++ b/README.rst @@ -346,7 +346,7 @@ Energy * |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `REDD `_ @@ -390,6 +390,8 @@ GIS * |OK_ICON| `Factual Global Location Data `_ +* |OK_ICON| `Geo Maps - 
High Quality GeoJSON maps programmatically generated `_ + * |OK_ICON| `Geo Spatial Data from ASU `_ * |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -583,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -684,6 +686,8 @@ Healthcare * |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ + * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ * |OK_ICON| `World Health Organization Global Health Observatory `_ @@ -776,6 +780,8 @@ MachineLearning * |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ +* |OK_ICON| `YouTube-BoundingBoxes `_ + * |OK_ICON| `Youtube 8m `_ * |OK_ICON| `eBay Online Auctions (2012) `_ @@ -822,7 +828,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -900,6 +906,8 @@ Physics * |OK_ICON| `Crystallography Open Database `_ +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ + * |OK_ICON| `NASA Exoplanet Archive `_ * |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ @@ -932,7 +940,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1047,7 +1055,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1131,6 +1139,8 @@ Software -------- * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ + +* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ Sports 
------ From caa686418b513e95e19938dde85d8591637eb4b7 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 18 Jan 2018 16:26:59 +0000 Subject: [PATCH 82/99] Update README from APD2: 38dada0ceb4035ec4e5f6a0d8e7c37a5c702f142 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index e78f3ce..352af52 100644 --- a/README.rst +++ b/README.rst @@ -172,7 +172,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ * |OK_ICON| `Protein-protein interaction network `_ @@ -304,7 +304,7 @@ Economics * |OK_ICON| `Jon Haveman International Trade Data Links `_ -* |OK_ICON| `OpenCorporates Database of Companies in the World `_ +* |FIXME_ICON| `OpenCorporates Database of Companies in the World `_ * |OK_ICON| `Our World in Data `_ @@ -420,7 +420,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |OK_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -585,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -940,7 +940,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From 956f09f1a47ec758a40e2b034758fc0a350316ee Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 19 Jan 2018 09:03:32 +0000 Subject: [PATCH 83/99] Update README from APD2: c591a5cea95bce873dbc1fa021efa74773f36855 --- README.rst | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 352af52..a9e5eb3 100644 --- a/README.rst +++ b/README.rst @@ -172,7 +172,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ * |OK_ICON| `Protein-protein interaction network `_ @@ -304,7 +304,7 @@ Economics * |OK_ICON| `Jon Haveman International Trade Data Links `_ -* |FIXME_ICON| `OpenCorporates Database of Companies in the World `_ +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ * |OK_ICON| `Our World in Data `_ @@ -420,7 +420,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -585,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -605,7 +605,7 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |OK_ICON| `South Africa `_ +* |FIXME_ICON| `South Africa `_ * |OK_ICON| `State of Utah, US `_ @@ -617,7 +617,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -940,7 +940,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1055,7 +1055,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1103,6 +1103,8 @@ SocialSciences * |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, criminal, or economic interest. `_ + * |OK_ICON| `Paul Hensel General International Data Page `_ * |FIXME_ICON| `PewResearch Internet Survey Project `_ From 336f1cf06282dbe589fd5d6037ff5a83e69b0488 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 20 Jan 2018 11:28:42 +0000 Subject: [PATCH 84/99] Update README from APD2: fb0da44af98ee7d3d74b5220d422da069f3a9ade --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index a9e5eb3..04eb9d1 100644 --- a/README.rst +++ b/README.rst @@ -485,7 +485,7 @@ Government * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. 
`_ -* |OK_ICON| `FedStats `_ +* |FIXME_ICON| `FedStats `_ * |OK_ICON| `Finland `_ @@ -585,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -605,7 +605,7 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |FIXME_ICON| `South Africa `_ +* |OK_ICON| `South Africa `_ * |OK_ICON| `State of Utah, US `_ @@ -828,7 +828,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ From bd5c5661efd7b9745cac4927df465af2cace83ce Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 10 Feb 2018 16:00:41 +0000 Subject: [PATCH 85/99] Update README from APD2: cccfb99adf5aef45554bf946376a819811ef8c51 --- README.rst | 22 ++++++++++++---------- 1 file changed, 12 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index 04eb9d1..44b30b9 100644 --- a/README.rst +++ b/README.rst @@ -13,6 +13,8 @@ Please **DO NOT** modify this file directly. We have provided `a new way `_ to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. +* |OK_ICON| I am well. +* |FIXME_ICON| Please fix me. `This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. @@ -209,7 +211,7 @@ ComputerNetworks * |OK_ICON| `CAIDA Internet Datasets `_ -* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * |OK_ICON| `ClueWeb09 - 1B web pages `_ @@ -388,7 +390,7 @@ GIS * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ -* |OK_ICON| `Factual Global Location Data `_ +* |FIXME_ICON| `Factual Global Location Data `_ * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ @@ -455,7 +457,7 @@ Government * |OK_ICON| `Belgium `_ -* |OK_ICON| `Brazil `_ +* |FIXME_ICON| `Brazil `_ * |OK_ICON| `Buenos Aires, Argentina `_ @@ -485,7 +487,7 @@ Government * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ -* |FIXME_ICON| `FedStats `_ +* |OK_ICON| `FedStats `_ * |OK_ICON| `Finland `_ @@ -617,9 +619,9 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ -* |OK_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ * |OK_ICON| `Tunisia `_ @@ -647,7 +649,7 @@ Government * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |OK_ICON| `Uganda Bureau of Statistics `_ +* |FIXME_ICON| `Uganda Bureau of Statistics `_ * |OK_ICON| `United Nations `_ @@ -713,7 +715,7 @@ ImageProcessing * |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |FIXME_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ * |OK_ICON| `ImageNet (in WordNet hierarchy) `_ @@ -877,7 +879,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ -* |OK_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ @@ -940,7 +942,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From b9c053f1f812445119c67be64c97aea7b905f76f Mon Sep 17 00:00:00 2001 From: jozefdickins <30645291+jozefdickins@users.noreply.github.com> Date: Wed, 14 Feb 2018 16:34:22 +0000 Subject: [PATCH 86/99] updated humanitarian data 
exchange url (#353) --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 44b30b9..e57772a 100644 --- a/README.rst +++ b/README.rst @@ -1081,7 +1081,7 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ From f30e99c95b61d018d9e84eb239934130cf9e208f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:32:17 +0000 Subject: [PATCH 87/99] Update README from APD2: e63a04009271e4b2a58a654268ba516c57d0ca13 --- README.rst | 30 +++++++++++++++--------------- 1 file changed, 15 insertions(+), 15 deletions(-) diff --git a/README.rst b/README.rst index e57772a..dddd799 100644 --- a/README.rst +++ b/README.rst @@ -137,13 +137,13 @@ Climate+Weather * |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* |OK_ICON| `European Climate Assessment & Dataset `_ +* |FIXME_ICON| `European Climate Assessment & Dataset `_ * |OK_ICON| `Global Climate Data Since 1929 `_ * |OK_ICON| `NASA Global Imagery Browse Services `_ -* |FIXME_ICON| `NOAA Bering Sea Climate `_ +* |OK_ICON| `NOAA Bering Sea Climate `_ * |OK_ICON| `NOAA Climate Datasets `_ @@ -166,7 +166,7 @@ ComplexNetworks * |OK_ICON| `CrossRef DOI URLs `_ -* |OK_ICON| `DBLP Citation dataset `_ +* |FIXME_ICON| `DBLP Citation dataset `_ * |OK_ICON| `DIMACS Road Networks Collection `_ @@ -457,7 +457,7 @@ Government * |OK_ICON| `Belgium `_ -* |FIXME_ICON| `Brazil `_ +* |OK_ICON| `Brazil `_ * |OK_ICON| `Buenos Aires, Argentina `_ @@ -465,7 +465,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -501,7 +501,7 @@ Government * |FIXME_ICON| `Ghent, Belgium `_ -* |FIXME_ICON| `Glasgow, Scotland, UK `_ +* |OK_ICON| `Glasgow, Scotland, UK `_ * |OK_ICON| `Greece `_ @@ -619,7 +619,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The 
World Bank `_ +* |OK_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ @@ -649,7 +649,7 @@ Government * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |FIXME_ICON| `Uganda Bureau of Statistics `_ +* |OK_ICON| `Uganda Bureau of Statistics `_ * |OK_ICON| `United Nations `_ @@ -715,7 +715,7 @@ ImageProcessing * |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |FIXME_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ * |OK_ICON| `ImageNet (in WordNet hierarchy) `_ @@ -828,9 +828,9 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ +* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -868,7 +868,7 @@ NaturalLanguage * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ -* |OK_ICON| `WordNet databases and tools `_ +* |FIXME_ICON| `WordNet databases and tools `_ Neuroscience ------------ @@ -946,7 +946,7 @@ PublicDomains * |OK_ICON| `KDNuggets Data Collections `_ -* |OK_ICON| `Microsoft Azure Data Market Free DataSets `_ +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ * |OK_ICON| `Microsoft Data Science for Research `_ @@ -1026,7 +1026,7 @@ SocialNetworks * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ -* |OK_ICON| `Mobile Social Networks from UMASS `_ +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ * |OK_ICON| `Network Twitter Data `_ @@ -1081,7 +1081,7 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ From b794c955f2411ba08c4dee09f7b7f6367b60c45a Mon Sep 17 
00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:33:08 +0000 Subject: [PATCH 88/99] Update README from APD2: ab2804abb4be030a49b3140694e6deb74cc18264 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index dddd799..6d7e36a 100644 --- a/README.rst +++ b/README.rst @@ -404,7 +404,7 @@ GIS * |OK_ICON| `Global Administrative Areas Database (GADM) `_ -* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ * |OK_ICON| `Landsat 8 on AWS `_ @@ -619,7 +619,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ From d141d80669a8e47d7b3cad6f592bf26e275a6ae1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:33:15 +0000 Subject: [PATCH 89/99] Update README from APD2: a02d6b02613c2d7589cd482478312aaa30a2983a --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 6d7e36a..adce4f0 100644 --- a/README.rst +++ b/README.rst @@ -499,7 +499,7 @@ Government * |OK_ICON| `Germany `_ -* |FIXME_ICON| `Ghent, Belgium `_ +* |OK_ICON| `Ghent, Belgium `_ * |OK_ICON| `Glasgow, Scotland, UK `_ From 2e62a828bb5ae0e692a3c6f9af3afc5d86dab3f2 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:35:10 +0000 Subject: [PATCH 90/99] Update README from APD2: 6e46cc79126bb5f3fd09278af6a5a195f93ae179 --- README.rst | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index adce4f0..2a2da19 100644 --- a/README.rst +++ b/README.rst @@ -76,6 +76,8 @@ Biology * |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular 
datasets generated by genome sequencing and other high-throughput experimental technologies. `_ + * |OK_ICON| `MIT Cancer Genomics Data `_ * |OK_ICON| `NCBI Proteins `_ @@ -521,6 +523,8 @@ Government * |OK_ICON| `Ireland's Open Data Portal `_ +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati relativi ai dati rilasciati in formato aperto dalle pubbliche amministrazioni italiane. Il Portale è promosso dal Governo Italiano e gestito dall’Agenzia per l’Italia digitale con il supporto di FormezPA. `_ + * |OK_ICON| `Japan `_ * |OK_ICON| `Laval, QC, Canada `_ @@ -619,7 +623,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ From 475b88e63883a9d44df4cb8e8175ee2babc04667 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:35:46 +0000 Subject: [PATCH 91/99] Update README from APD2: f1e675f05b44aa18eee1583c9dbd6c9d691a6b08 --- README.rst | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 2a2da19..7f00156 100644 --- a/README.rst +++ b/README.rst @@ -76,8 +76,6 @@ Biology * |OK_ICON| `Journal of Cell Biology DataViewer `_ -* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. 
`_ - * |OK_ICON| `MIT Cancer Genomics Data `_ * |OK_ICON| `NCBI Proteins `_ @@ -1061,7 +1059,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From 69070361bd6730201bb4c6cbe8ae0a25109b2a0a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 17:00:48 +0000 Subject: [PATCH 92/99] Update README from APD2: 8547a5f2ec94f40268d31fbbf22e84253d7d47c9 --- README.rst | 14 +++++++++++--- 1 file changed, 11 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 7f00156..bfe2ad0 100644 --- a/README.rst +++ b/README.rst @@ -76,6 +76,8 @@ Biology * |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. `_ + * |OK_ICON| `MIT Cancer Genomics Data `_ * |OK_ICON| `NCBI Proteins `_ @@ -589,7 +591,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -619,6 +621,8 @@ Government * |OK_ICON| `Taiwan `_ +* |OK_ICON| `Tel-Aviv Open Data `_ + * |OK_ICON| `Texas Open Data `_ * |OK_ICON| `The World Bank `_ @@ -668,6 +672,8 @@ Government Healthcare ---------- +* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard Reference - The database consists of several sets of data: food descriptions, nutrients, weights and measures, footnotes, and sources of data. 
The Nutrient Data file contains mean nutrient values per 100 g of the edible portion of food, along with fields to further describe the mean value. `_ + * |OK_ICON| `EHDP Large Health Data Sets `_ * |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ @@ -719,7 +725,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -1059,7 +1065,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1166,6 +1172,8 @@ Sports * |OK_ICON| `Retrosheet Baseball Statistics `_ * |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ + +* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ TimeSeries ---------- From b4219e45cd4216cdbe371e62b3ba3f391bdc2fce Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 7 Apr 2018 06:44:49 +0000 Subject: [PATCH 93/99] Update README from APD2: 20ea239b2803cb4556e485c3a5a1a1ae3a09f1be --- README.rst | 1129 ++++++++++++++++++++++++++-------------------------- 1 file changed, 565 insertions(+), 564 deletions(-) diff --git a/README.rst b/README.rst index bfe2ad0..0df7843 100644 --- a/README.rst +++ b/README.rst @@ -2,12 +2,14 @@ Awesome Public Datasets ======================= .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg - :alt: Awesome - :target: https://github.com/sindresorhus/awesome +:alt: Awesome +:target: https://github.com/sindresorhus/awesome + .. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png .. 
|FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/fixme-24.png + **NOTICE**: This repo is automatically generated by `apd-core `_. Please **DO NOT** modify this file directly. We have provided `a new way `_ @@ -24,1216 +26,1215 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ [`fixme `_] -* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ +* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ [`fixme `_] Biology ------- -* |OK_ICON| `1000 Genomes `_ +* |OK_ICON| `1000 Genomes `_ [`fixme `_] -* |OK_ICON| `American Gut (Microbiome Project) `_ +* |OK_ICON| `American Gut (Microbiome Project) `_ [`fixme `_] -* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ [`fixme `_] -* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ [`fixme `_] -* |OK_ICON| `Cell Image Library `_ +* |OK_ICON| `Cell Image Library `_ [`fixme `_] -* |OK_ICON| `Complete Genomics Public Data `_ +* |OK_ICON| `Complete Genomics Public Data `_ [`fixme `_] -* |OK_ICON| `EBI ArrayExpress `_ +* |OK_ICON| `EBI ArrayExpress `_ [`fixme `_] -* |OK_ICON| `EBI Protein Data Bank in Europe `_ +* |OK_ICON| `EBI Protein Data Bank in Europe `_ [`fixme `_] -* |OK_ICON| `ENCODE project `_ +* |OK_ICON| `ENCODE project `_ [`fixme `_] -* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ [`fixme `_] -* |OK_ICON| `Ensembl Genomes `_ +* |OK_ICON| `Ensembl Genomes `_ [`fixme `_] -* |OK_ICON| `Gene Expression Omnibus (GEO) `_ +* |OK_ICON| `Gene Expression Omnibus (GEO) `_ [`fixme `_] -* |OK_ICON| `Gene Ontology (GO) `_ +* |OK_ICON| `Gene Ontology (GO) `_ [`fixme `_] -* |OK_ICON| `Global Biotic Interactions (GloBI) `_ +* |OK_ICON| `Global Biotic Interactions 
(GloBI) `_ [`fixme `_] -* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ [`fixme `_] -* |OK_ICON| `Human Genome Diversity Project `_ +* |OK_ICON| `Human Genome Diversity Project `_ [`fixme `_] -* |OK_ICON| `Human Microbiome Project (HMP) `_ +* |OK_ICON| `Human Microbiome Project (HMP) `_ [`fixme `_] -* |OK_ICON| `ICOS PSP Benchmark `_ +* |OK_ICON| `ICOS PSP Benchmark `_ [`fixme `_] -* |OK_ICON| `International HapMap Project `_ +* |OK_ICON| `International HapMap Project `_ [`fixme `_] -* |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |OK_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] -* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. `_ +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] 
`_ [`fixme `_] -* |OK_ICON| `MIT Cancer Genomics Data `_ +* |OK_ICON| `MIT Cancer Genomics Data `_ [`fixme `_] -* |OK_ICON| `NCBI Proteins `_ +* |OK_ICON| `NCBI Proteins `_ [`fixme `_] -* |OK_ICON| `NCBI Taxonomy `_ +* |OK_ICON| `NCBI Taxonomy `_ [`fixme `_] -* |OK_ICON| `NCI Genomic Data Commons `_ +* |OK_ICON| `NCI Genomic Data Commons `_ [`fixme `_] -* |FIXME_ICON| `NIH Microarray data `_ +* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] -* |OK_ICON| `OpenSNP genotypes data `_ +* |OK_ICON| `OpenSNP genotypes data `_ [`fixme `_] -* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ +* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ [`fixme `_] -* |OK_ICON| `Protein Data Bank `_ +* |OK_ICON| `Protein Data Bank `_ [`fixme `_] -* |OK_ICON| `Psychiatric Genomics Consortium `_ +* |OK_ICON| `Psychiatric Genomics Consortium `_ [`fixme `_] -* |OK_ICON| `PubChem Project `_ +* |OK_ICON| `PubChem Project `_ [`fixme `_] -* |OK_ICON| `PubGene (now Coremine Medical) `_ +* |OK_ICON| `PubGene (now Coremine Medical) `_ [`fixme `_] -* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ [`fixme `_] -* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ +* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ [`fixme `_] -* |OK_ICON| `Sequence Read Archive(SRA) `_ +* |OK_ICON| `Sequence Read Archive(SRA) `_ [`fixme `_] -* |FIXME_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] -* |OK_ICON| `Stowers Institute Original Data Repository `_ +* |OK_ICON| `Stowers Institute Original Data Repository `_ [`fixme `_] -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ [`fixme `_] -* |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ +* |OK_ICON| `The Cancer Genome Atlas (TCGA), 
available via Broad GDAC `_ [`fixme `_] -* |OK_ICON| `The Catalogue of Life `_ +* |OK_ICON| `The Catalogue of Life `_ [`fixme `_] -* |OK_ICON| `The Personal Genome Project `_ +* |OK_ICON| `The Personal Genome Project `_ [`fixme `_] -* |OK_ICON| `UCSC Public Data `_ +* |OK_ICON| `UCSC Public Data `_ [`fixme `_] -* |OK_ICON| `UniGene `_ +* |OK_ICON| `UniGene `_ [`fixme `_] -* |OK_ICON| `Universal Protein Resource (UnitProt) `_ +* |OK_ICON| `Universal Protein Resource (UnitProt) `_ [`fixme `_] Climate+Weather --------------- -* |OK_ICON| `Actuaries Climate Index `_ +* |OK_ICON| `Actuaries Climate Index `_ [`fixme `_] -* |OK_ICON| `Australian Weather `_ +* |OK_ICON| `Australian Weather `_ [`fixme `_] -* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ +* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ [`fixme `_] -* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ [`fixme `_] -* |OK_ICON| `Canadian Meteorological Centre `_ +* |OK_ICON| `Canadian Meteorological Centre `_ [`fixme `_] -* |OK_ICON| `Climate Data from UEA (updated monthly) `_ +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`fixme `_] -* |FIXME_ICON| `European Climate Assessment & Dataset `_ +* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] -* |OK_ICON| `Global Climate Data Since 1929 `_ +* |OK_ICON| `Global Climate Data Since 1929 `_ [`fixme `_] -* |OK_ICON| `NASA Global Imagery Browse Services `_ +* |OK_ICON| `NASA Global Imagery Browse Services `_ [`fixme `_] -* |OK_ICON| `NOAA Bering Sea Climate `_ +* |OK_ICON| `NOAA Bering Sea Climate `_ [`fixme `_] -* |OK_ICON| `NOAA Climate Datasets `_ +* |OK_ICON| `NOAA Climate Datasets `_ [`fixme `_] -* |OK_ICON| `NOAA Realtime Weather Models `_ +* |OK_ICON| `NOAA Realtime Weather Models `_ [`fixme `_] -* |OK_ICON| `NOAA SURFRAD Meteorology 
and Radiation Datasets `_ +* |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ [`fixme `_] -* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ +* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ [`fixme `_] -* |OK_ICON| `UEA Climatic Research Unit `_ +* |OK_ICON| `UEA Climatic Research Unit `_ [`fixme `_] -* |OK_ICON| `WU Historical Weather Worldwide `_ +* |OK_ICON| `WU Historical Weather Worldwide `_ [`fixme `_] -* |OK_ICON| `WorldClim - Global Climate Data `_ +* |OK_ICON| `WorldClim - Global Climate Data `_ [`fixme `_] ComplexNetworks --------------- -* |OK_ICON| `AMiner Citation Network Dataset `_ +* |OK_ICON| `AMiner Citation Network Dataset `_ [`fixme `_] -* |OK_ICON| `CrossRef DOI URLs `_ +* |OK_ICON| `CrossRef DOI URLs `_ [`fixme `_] -* |FIXME_ICON| `DBLP Citation dataset `_ +* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] -* |OK_ICON| `DIMACS Road Networks Collection `_ +* |OK_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] -* |OK_ICON| `NBER Patent Citations `_ +* |OK_ICON| `NBER Patent Citations `_ [`fixme `_] -* |OK_ICON| `NIST complex networks data collection `_ +* |OK_ICON| `NIST complex networks data collection `_ [`fixme `_] -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] -* |OK_ICON| `Protein-protein interaction network `_ +* |OK_ICON| `Protein-protein interaction network `_ [`fixme `_] -* |OK_ICON| `PyPI and Maven Dependency Network `_ +* |OK_ICON| `PyPI and Maven Dependency Network `_ [`fixme `_] -* |OK_ICON| `Scopus Citation Database `_ +* |OK_ICON| `Scopus Citation Database `_ [`fixme `_] -* |OK_ICON| `Small Network Data `_ +* |OK_ICON| `Small Network Data `_ [`fixme `_] -* |OK_ICON| `Stanford GraphBase `_ +* |OK_ICON| `Stanford GraphBase `_ [`fixme `_] -* |OK_ICON| `Stanford Large Network Dataset Collection `_ +* |OK_ICON| `Stanford Large Network Dataset 
Collection `_ [`fixme `_] -* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |OK_ICON| `The Koblenz Network Collection `_ +* |OK_ICON| `The Koblenz Network Collection `_ [`fixme `_] -* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ +* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`fixme `_] -* |FIXME_ICON| `The Nexus Network Repository `_ +* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] -* |OK_ICON| `UCI Network Data Repository `_ +* |OK_ICON| `UCI Network Data Repository `_ [`fixme `_] -* |OK_ICON| `UFL sparse matrix collection `_ +* |OK_ICON| `UFL sparse matrix collection `_ [`fixme `_] -* |OK_ICON| `WSU Graph Database `_ +* |OK_ICON| `WSU Graph Database `_ [`fixme `_] ComputerNetworks ---------------- -* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ +* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ [`fixme `_] -* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ +* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ [`fixme `_] -* |OK_ICON| `CAIDA Internet Datasets `_ +* |OK_ICON| `CAIDA Internet Datasets `_ [`fixme `_] -* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ [`fixme `_] -* |OK_ICON| `ClueWeb09 - 1B web pages `_ +* |OK_ICON| `ClueWeb09 - 1B web pages `_ [`fixme `_] -* |OK_ICON| `ClueWeb12 - 733M web pages `_ +* |OK_ICON| `ClueWeb12 - 733M web pages `_ [`fixme `_] -* |OK_ICON| `CommonCrawl Web Data over 7 years `_ +* |OK_ICON| `CommonCrawl Web Data over 7 years `_ [`fixme `_] -* |OK_ICON| `Criteo click-through data `_ +* |OK_ICON| `Criteo click-through data `_ [`fixme `_] -* |OK_ICON| `Internet-Wide Scan Data Repository `_ +* |OK_ICON| `Internet-Wide Scan Data Repository `_ [`fixme `_] -* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ +* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ [`fixme `_] -* |OK_ICON| `Open Mobile Data by MobiPerf `_ +* |OK_ICON| `Open Mobile Data by MobiPerf `_ [`fixme `_] -* |OK_ICON| `Rapid7 Sonar Internet Scans `_ +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ [`fixme `_] -* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ +* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ [`fixme `_] DataChallenges -------------- -* |OK_ICON| `Bruteforce Database `_ +* |OK_ICON| `Bruteforce Database `_ [`fixme `_] -* |OK_ICON| `Challenges in Machine Learning `_ +* |OK_ICON| `Challenges in Machine Learning `_ [`fixme `_] -* |OK_ICON| `CrowdANALYTIX dataX `_ +* |OK_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] -* |FIXME_ICON| `D4D Challenge of Orange `_ +* |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] -* |OK_ICON| `DrivenData Competitions for Social Good `_ +* |OK_ICON| `DrivenData Competitions for Social Good `_ [`fixme `_] -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] -* |OK_ICON| `KDD Cup by Tencent 2012 `_ +* |OK_ICON| `KDD Cup by Tencent 2012 `_ [`fixme `_] -* |OK_ICON| `Kaggle Competition Data `_ +* |OK_ICON| `Kaggle Competition Data `_ [`fixme `_] -* |OK_ICON| `Localytics Data Visualization Challenge `_ +* |OK_ICON| `Localytics Data 
Visualization Challenge `_ [`fixme `_] -* |OK_ICON| `Netflix Prize `_ +* |OK_ICON| `Netflix Prize `_ [`fixme `_] -* |OK_ICON| `Space Apps Challenge `_ +* |OK_ICON| `Space Apps Challenge `_ [`fixme `_] -* |OK_ICON| `Telecom Italia Big Data Challenge `_ +* |OK_ICON| `Telecom Italia Big Data Challenge `_ [`fixme `_] -* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ [`fixme `_] -* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ [`fixme `_] -* |OK_ICON| `Yelp Dataset Challenge `_ +* |OK_ICON| `Yelp Dataset Challenge `_ [`fixme `_] EarthScience ------------ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] -* |OK_ICON| `BODC - marine data of ~22K vars `_ +* |OK_ICON| `BODC - marine data of ~22K vars `_ [`fixme `_] -* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ [`fixme `_] -* |OK_ICON| `Earth Models `_ +* |OK_ICON| `Earth Models `_ [`fixme `_] -* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ +* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ [`fixme `_] -* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ [`fixme `_] -* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ +* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ [`fixme `_] -* |OK_ICON| `USGS Earthquake Archives `_ +* |OK_ICON| `USGS Earthquake Archives `_ [`fixme `_] Economics --------- -* |OK_ICON| `American Economic Association (AEA) `_ +* |OK_ICON| `American Economic Association (AEA) `_ [`fixme `_] -* |OK_ICON| `EconData from UMD `_ +* 
|OK_ICON| `EconData from UMD `_ [`fixme `_] -* |FIXME_ICON| `Economic Freedom of the World Data `_ +* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] -* |OK_ICON| `Historical MacroEconomc Statistics `_ +* |OK_ICON| `Historical MacroEconomc Statistics `_ [`fixme `_] -* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ +* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ [`fixme `_] -* |OK_ICON| `International Economics Database `_ +* |OK_ICON| `International Economics Database `_ [`fixme `_] -* |OK_ICON| `International Trade Statistics `_ +* |OK_ICON| `International Trade Statistics `_ [`fixme `_] -* |OK_ICON| `Internet Product Code Database `_ +* |OK_ICON| `Internet Product Code Database `_ [`fixme `_] -* |OK_ICON| `Joint External Debt Data Hub `_ +* |OK_ICON| `Joint External Debt Data Hub `_ [`fixme `_] -* |OK_ICON| `Jon Haveman International Trade Data Links `_ +* |OK_ICON| `Jon Haveman International Trade Data Links `_ [`fixme `_] -* |OK_ICON| `OpenCorporates Database of Companies in the World `_ +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ [`fixme `_] -* |OK_ICON| `Our World in Data `_ +* |OK_ICON| `Our World in Data `_ [`fixme `_] -* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ +* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ [`fixme `_] -* |OK_ICON| `The Atlas of Economic Complexity `_ +* |OK_ICON| `The Atlas of Economic Complexity `_ [`fixme `_] -* |OK_ICON| `The Center for International Data `_ +* |OK_ICON| `The Center for International Data `_ [`fixme `_] -* |OK_ICON| `The Observatory of Economic Complexity `_ +* |OK_ICON| `The Observatory of Economic Complexity `_ [`fixme `_] -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |OK_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] -* |OK_ICON| `UN Human Development Reports `_ +* |OK_ICON| `UN Human Development Reports `_ [`fixme `_] Education --------- -* |OK_ICON| `College Scorecard Data `_ 
+* |OK_ICON| `College Scorecard Data `_ [`fixme `_] -* |OK_ICON| `Student Data from Free Code Camp `_ +* |OK_ICON| `Student Data from Free Code Camp `_ [`fixme `_] Energy ------ -* |OK_ICON| `AMPds `_ +* |OK_ICON| `AMPds `_ [`fixme `_] -* |OK_ICON| `BLUEd `_ +* |OK_ICON| `BLUEd `_ [`fixme `_] -* |OK_ICON| `COMBED `_ +* |OK_ICON| `COMBED `_ [`fixme `_] -* |OK_ICON| `DRED `_ +* |OK_ICON| `DRED `_ [`fixme `_] -* |OK_ICON| `ECO `_ +* |OK_ICON| `ECO `_ [`fixme `_] -* |OK_ICON| `EIA `_ +* |OK_ICON| `EIA `_ [`fixme `_] -* |OK_ICON| `HES - Household Electricity Study, UK `_ +* |OK_ICON| `HES - Household Electricity Study, UK `_ [`fixme `_] -* |OK_ICON| `HFED `_ +* |OK_ICON| `HFED `_ [`fixme `_] -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] -* |OK_ICON| `REDD `_ +* |OK_ICON| `REDD `_ [`fixme `_] -* |OK_ICON| `Tracebase `_ +* |OK_ICON| `Tracebase `_ [`fixme `_] -* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`fixme `_] -* |OK_ICON| `WHITED `_ +* |OK_ICON| `WHITED `_ [`fixme `_] -* |OK_ICON| `iAWE `_ +* |OK_ICON| `iAWE `_ [`fixme `_] Finance ------- -* |FIXME_ICON| `CBOE Futures Exchange `_ +* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] -* |OK_ICON| `Google Finance `_ +* |OK_ICON| `Google Finance `_ [`fixme `_] -* |OK_ICON| `Google Trends `_ +* |OK_ICON| `Google Trends `_ [`fixme `_] -* |OK_ICON| `NASDAQ `_ +* |OK_ICON| `NASDAQ `_ [`fixme `_] -* |OK_ICON| `NYSE Market Data `_ +* |OK_ICON| `NYSE Market Data `_ [`fixme `_] -* |OK_ICON| `OANDA `_ +* |OK_ICON| `OANDA `_ [`fixme `_] -* |OK_ICON| `OSU Financial data `_ +* |OK_ICON| `OSU Financial data `_ [`fixme `_] -* |OK_ICON| `Quandl `_ +* |OK_ICON| `Quandl `_ [`fixme `_] -* |OK_ICON| `St Louis Federal `_ +* |OK_ICON| `St Louis Federal `_ [`fixme `_] -* |OK_ICON| `Yahoo Finance `_ +* |OK_ICON| `Yahoo Finance `_ [`fixme `_] 
GIS --- -* |OK_ICON| `ArcGIS Open Data portal `_ +* |OK_ICON| `ArcGIS Open Data portal `_ [`fixme `_] -* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ +* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`fixme `_] -* |FIXME_ICON| `Factual Global Location Data `_ +* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] -* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ +* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`fixme `_] -* |OK_ICON| `Geo Spatial Data from ASU `_ +* |OK_ICON| `Geo Spatial Data from ASU `_ [`fixme `_] -* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ [`fixme `_] -* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ +* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ [`fixme `_] -* |OK_ICON| `GeoNames Worldwide `_ +* |OK_ICON| `GeoNames Worldwide `_ [`fixme `_] -* |OK_ICON| `Global Administrative Areas Database (GADM) `_ +* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] -* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`fixme `_] -* |OK_ICON| `Landsat 8 on AWS `_ +* |OK_ICON| `Landsat 8 on AWS `_ [`fixme `_] -* |OK_ICON| `List of all countries in all languages `_ +* |OK_ICON| `List of all countries in all languages `_ [`fixme `_] -* |OK_ICON| `National Weather Service GIS Data Portal `_ +* |OK_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] -* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ +* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ [`fixme `_] -* |OK_ICON| `OpenAddresses `_ +* |OK_ICON| `OpenAddresses `_ [`fixme `_] -* |OK_ICON| `OpenStreetMap (OSM) `_ +* |OK_ICON| `OpenStreetMap (OSM) `_ [`fixme `_] -* |OK_ICON| `Pleiades - Gazetteer and graph of ancient 
places `_ +* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ [`fixme `_] -* |OK_ICON| `Reverse Geocoder using OSM data `_ +* |OK_ICON| `Reverse Geocoder using OSM data `_ [`fixme `_] -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] -* |OK_ICON| `TZ Timezones shapfiles `_ +* |OK_ICON| `TZ Timezones shapfiles `_ [`fixme `_] -* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ +* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ [`fixme `_] -* |OK_ICON| `UN Environmental Data `_ +* |OK_ICON| `UN Environmental Data `_ [`fixme `_] -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ +* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] -* |OK_ICON| `World countries in multiple formats `_ +* |OK_ICON| `World countries in multiple formats `_ [`fixme `_] Government ---------- -* |OK_ICON| `Alberta, Province of Canada `_ +* |OK_ICON| `Alberta, Province of Canada `_ [`fixme `_] -* |OK_ICON| `Antwerp, Belgium `_ +* |OK_ICON| `Antwerp, Belgium `_ [`fixme `_] -* |OK_ICON| `Argentina (non official) `_ +* |OK_ICON| `Argentina (non official) `_ [`fixme `_] -* |FIXME_ICON| `Argentina `_ +* |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] 
`_ [`fixme `_] -* |OK_ICON| `Austin, TX, US `_ +* |OK_ICON| `Austin, TX, US `_ [`fixme `_] -* |OK_ICON| `Australia (abs.gov.au) `_ +* |OK_ICON| `Australia (abs.gov.au) `_ [`fixme `_] -* |OK_ICON| `Australia (data.gov.au) `_ +* |OK_ICON| `Australia (data.gov.au) `_ [`fixme `_] -* |OK_ICON| `Austria (data.gv.at) `_ +* |OK_ICON| `Austria (data.gv.at) `_ [`fixme `_] -* |OK_ICON| `Baton Rouge, LA, US `_ +* |OK_ICON| `Baton Rouge, LA, US `_ [`fixme `_] -* |OK_ICON| `Belgium `_ +* |OK_ICON| `Belgium `_ [`fixme `_] -* |OK_ICON| `Brazil `_ +* |OK_ICON| `Brazil `_ [`fixme `_] -* |OK_ICON| `Buenos Aires, Argentina `_ +* |OK_ICON| `Buenos Aires, Argentina `_ [`fixme `_] -* |FIXME_ICON| `Calgary, AB, Canada `_ +* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] -* |OK_ICON| `Cambridge, MA, US `_ +* |OK_ICON| `Cambridge, MA, US `_ [`fixme `_] -* |OK_ICON| `Canada `_ +* |OK_ICON| `Canada `_ [`fixme `_] -* |OK_ICON| `Chicago `_ +* |OK_ICON| `Chicago `_ [`fixme `_] -* |OK_ICON| `Chile `_ +* |OK_ICON| `Chile `_ [`fixme `_] -* |OK_ICON| `Dallas Open Data `_ +* |OK_ICON| `Dallas Open Data `_ [`fixme `_] -* |OK_ICON| `DataBC - data from the Province of British Columbia `_ +* |OK_ICON| `DataBC - data from the Province of British Columbia `_ [`fixme `_] -* |OK_ICON| `Denver Open Data `_ +* |OK_ICON| `Denver Open Data `_ [`fixme `_] -* |OK_ICON| `Durham, NC Open Data `_ +* |OK_ICON| `Durham, NC Open Data `_ [`fixme `_] -* |OK_ICON| `Edmonton, AB, Canada `_ +* |OK_ICON| `Edmonton, AB, Canada `_ [`fixme `_] -* |OK_ICON| `England LGInform `_ +* |OK_ICON| `England LGInform `_ [`fixme `_] -* |OK_ICON| `EuroStat `_ +* |OK_ICON| `EuroStat `_ [`fixme `_] -* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ +* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] 
`_ [`fixme `_] -* |OK_ICON| `FedStats `_ +* |OK_ICON| `FedStats `_ [`fixme `_] -* |OK_ICON| `Finland `_ +* |OK_ICON| `Finland `_ [`fixme `_] -* |OK_ICON| `France `_ +* |OK_ICON| `France `_ [`fixme `_] -* |OK_ICON| `Fredericton, NB, Canada `_ +* |OK_ICON| `Fredericton, NB, Canada `_ [`fixme `_] -* |OK_ICON| `Gatineau, QC, Canada `_ +* |OK_ICON| `Gatineau, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Germany `_ +* |OK_ICON| `Germany `_ [`fixme `_] -* |OK_ICON| `Ghent, Belgium `_ +* |OK_ICON| `Ghent, Belgium `_ [`fixme `_] -* |OK_ICON| `Glasgow, Scotland, UK `_ +* |OK_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] -* |OK_ICON| `Greece `_ +* |OK_ICON| `Greece `_ [`fixme `_] -* |OK_ICON| `Guardian world governments `_ +* |OK_ICON| `Guardian world governments `_ [`fixme `_] -* |FIXME_ICON| `Halifax, NS, Canada `_ +* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] -* |OK_ICON| `Helsinki Region, Finland `_ +* |OK_ICON| `Helsinki Region, Finland `_ [`fixme `_] -* |OK_ICON| `Hong Kong, China `_ +* |OK_ICON| `Hong Kong, China `_ [`fixme `_] -* |FIXME_ICON| `Houston Open Data `_ +* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] -* |OK_ICON| `Indian Government Data `_ +* |OK_ICON| `Indian Government Data `_ [`fixme `_] -* |OK_ICON| `Indonesian Data Portal `_ +* |OK_ICON| `Indonesian Data Portal `_ [`fixme `_] -* |OK_ICON| `Ireland's Open Data Portal `_ +* |OK_ICON| `Ireland's Open Data Portal `_ [`fixme `_] -* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati relativi ai dati rilasciati in formato aperto dalle pubbliche amministrazioni italiane. Il Portale è promosso dal Governo Italiano e gestito dall’Agenzia per l’Italia digitale con il supporto di FormezPA. `_ +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] 
`_ [`fixme `_] -* |OK_ICON| `Japan `_ +* |OK_ICON| `Japan `_ [`fixme `_] -* |OK_ICON| `Laval, QC, Canada `_ +* |OK_ICON| `Laval, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Lexington, KY `_ +* |OK_ICON| `Lexington, KY `_ [`fixme `_] -* |OK_ICON| `London Datastore, UK `_ +* |OK_ICON| `London Datastore, UK `_ [`fixme `_] -* |OK_ICON| `London, ON, Canada `_ +* |OK_ICON| `London, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Los Angeles Open Data `_ +* |OK_ICON| `Los Angeles Open Data `_ [`fixme `_] -* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ +* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ [`fixme `_] -* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ +* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ [`fixme `_] -* |OK_ICON| `Mexico `_ +* |OK_ICON| `Mexico `_ [`fixme `_] -* |OK_ICON| `Missisauga, ON, Canada `_ +* |OK_ICON| `Missisauga, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Moldova `_ +* |OK_ICON| `Moldova `_ [`fixme `_] -* |OK_ICON| `Moncton, NB, Canada `_ +* |OK_ICON| `Moncton, NB, Canada `_ [`fixme `_] -* |OK_ICON| `Montreal, QC, Canada `_ +* |OK_ICON| `Montreal, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Mountain View, California, US (GIS) `_ +* |OK_ICON| `Mountain View, California, US (GIS) `_ [`fixme `_] -* |FIXME_ICON| `NYC Open Data `_ +* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] -* |OK_ICON| `NYC betanyc `_ +* |OK_ICON| `NYC betanyc `_ [`fixme `_] -* |OK_ICON| `Netherlands `_ +* |OK_ICON| `Netherlands `_ [`fixme `_] -* |OK_ICON| `New Zealand `_ +* |OK_ICON| `New Zealand `_ [`fixme `_] -* |OK_ICON| `OECD `_ +* |OK_ICON| `OECD `_ [`fixme `_] -* |OK_ICON| `Oakland, California, US `_ +* |OK_ICON| `Oakland, California, US `_ [`fixme `_] -* |OK_ICON| `Oklahoma `_ +* |OK_ICON| `Oklahoma `_ [`fixme `_] -* |OK_ICON| `Open Data for Africa `_ +* |OK_ICON| `Open Data for Africa `_ [`fixme `_] -* |OK_ICON| `Open Government Data (OGD) Platform India `_ +* |OK_ICON| `Open Government Data (OGD) Platform India `_ [`fixme `_] -* 
|OK_ICON| `OpenDataSoft's list of 1,600 open data `_ +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ [`fixme `_] -* |OK_ICON| `Oregon `_ +* |OK_ICON| `Oregon `_ [`fixme `_] -* |OK_ICON| `Ottawa, ON, Canada `_ +* |OK_ICON| `Ottawa, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Palo Alto, California, US `_ +* |OK_ICON| `Palo Alto, California, US `_ [`fixme `_] -* |OK_ICON| `Portland, Oregon `_ +* |OK_ICON| `Portland, Oregon `_ [`fixme `_] -* |OK_ICON| `Portugal - Pordata organization `_ +* |OK_ICON| `Portugal - Pordata organization `_ [`fixme `_] -* |OK_ICON| `Puerto Rico Government `_ +* |OK_ICON| `Puerto Rico Government `_ [`fixme `_] -* |OK_ICON| `Quebec City, QC, Canada `_ +* |OK_ICON| `Quebec City, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Quebec Province of Canada `_ +* |OK_ICON| `Quebec Province of Canada `_ [`fixme `_] -* |OK_ICON| `Regina SK, Canada `_ +* |OK_ICON| `Regina SK, Canada `_ [`fixme `_] -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] -* |OK_ICON| `Romania `_ +* |OK_ICON| `Romania `_ [`fixme `_] -* |OK_ICON| `Russia `_ +* |OK_ICON| `Russia `_ [`fixme `_] -* |OK_ICON| `San Francisco Data sets `_ +* |OK_ICON| `San Francisco Data sets `_ [`fixme `_] -* |OK_ICON| `San Jose, California, US `_ +* |OK_ICON| `San Jose, California, US `_ [`fixme `_] -* |OK_ICON| `San Mateo County, California, US `_ +* |OK_ICON| `San Mateo County, California, US `_ [`fixme `_] -* |OK_ICON| `Saskatchewan, Province of Canada `_ +* |OK_ICON| `Saskatchewan, Province of Canada `_ [`fixme `_] -* |OK_ICON| `Seattle `_ +* |OK_ICON| `Seattle `_ [`fixme `_] -* |OK_ICON| `Singapore Government Data `_ +* |OK_ICON| `Singapore Government Data `_ [`fixme `_] -* |OK_ICON| `South Africa Trade Statistics `_ +* |OK_ICON| `South Africa Trade Statistics `_ [`fixme `_] -* |OK_ICON| `South Africa `_ +* |OK_ICON| `South Africa `_ [`fixme `_] -* |OK_ICON| `State of Utah, US `_ +* |OK_ICON| `State of Utah, US `_ [`fixme `_] -* |OK_ICON| `Switzerland `_ 
+* |OK_ICON| `Switzerland `_ [`fixme `_] -* |OK_ICON| `Taiwan g0v `_ +* |OK_ICON| `Taiwan g0v `_ [`fixme `_] -* |OK_ICON| `Taiwan `_ +* |OK_ICON| `Taiwan `_ [`fixme `_] -* |OK_ICON| `Tel-Aviv Open Data `_ +* |OK_ICON| `Tel-Aviv Open Data `_ [`fixme `_] -* |OK_ICON| `Texas Open Data `_ +* |OK_ICON| `Texas Open Data `_ [`fixme `_] -* |OK_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ [`fixme `_] -* |FIXME_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Tunisia `_ +* |OK_ICON| `Tunisia `_ [`fixme `_] -* |OK_ICON| `U.K. Government Data `_ +* |OK_ICON| `U.K. Government Data `_ [`fixme `_] -* |OK_ICON| `U.S. American Community Survey `_ +* |OK_ICON| `U.S. American Community Survey `_ [`fixme `_] -* |OK_ICON| `U.S. CDC Public Health datasets `_ +* |OK_ICON| `U.S. CDC Public Health datasets `_ [`fixme `_] -* |OK_ICON| `U.S. Census Bureau `_ +* |OK_ICON| `U.S. Census Bureau `_ [`fixme `_] -* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ +* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ [`fixme `_] -* |OK_ICON| `U.S. Federal Government Agencies `_ +* |OK_ICON| `U.S. Federal Government Agencies `_ [`fixme `_] -* |OK_ICON| `U.S. Federal Government Data Catalog `_ +* |OK_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] -* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ +* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ [`fixme `_] -* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] -* |OK_ICON| `U.S. Open Government `_ +* |OK_ICON| `U.S. Open Government `_ [`fixme `_] -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] -* |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ +* |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ [`fixme `_] -* |OK_ICON| `Uganda Bureau of Statistics `_ +* |OK_ICON| `Uganda Bureau of Statistics `_ [`fixme `_] -* |OK_ICON| `United Nations `_ +* |OK_ICON| `United Nations `_ [`fixme `_] -* |OK_ICON| `Uruguay `_ +* |OK_ICON| `Uruguay `_ [`fixme `_] -* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ +* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] -* |OK_ICON| `Vancouver, BC Open Data Catalog `_ +* |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] -* |FIXME_ICON| `Victoria, BC, Canada `_ +* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] -* |OK_ICON| `Vienna, Austria `_ +* |OK_ICON| `Vienna, Austria `_ [`fixme `_] Healthcare ---------- -* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard Reference - The database consists of several sets of data: food descriptions, nutrients, weights and measures, footnotes, and sources of data. The Nutrient Data file contains mean nutrient values per 100 g of the edible portion of food, along with fields to further describe the mean value. `_ +* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ [`fixme `_] -* |OK_ICON| `EHDP Large Health Data Sets `_ +* |OK_ICON| `EHDP Large Health Data Sets `_ [`fixme `_] -* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ [`fixme `_] -* |OK_ICON| `Gapminder World demographic databases `_ +* |OK_ICON| `Gapminder World demographic databases `_ [`fixme `_] -* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] -* |OK_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ +* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ [`fixme `_] -* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ +* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ [`fixme `_] -* |OK_ICON| `Medicare Data File `_ +* |OK_ICON| `Medicare Data File `_ [`fixme `_] -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] -* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ +* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`fixme `_] -* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ +* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ [`fixme `_] -* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ [`fixme `_] -* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ [`fixme `_] -* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ +* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ [`fixme `_] -* |OK_ICON| `World Health Organization Global Health Observatory `_ +* |OK_ICON| `World Health Organization Global Health Observatory `_ [`fixme `_] ImageProcessing --------------- -* |OK_ICON| `10k US Adult Faces Database `_ +* |OK_ICON| `10k US Adult Faces Database `_ [`fixme `_] -* |FIXME_ICON| `2GB of Photos of Cats `_ +* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] -* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ +* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ [`fixme `_] -* |OK_ICON| `Affective Image Classification `_ +* |OK_ICON| `Affective Image Classification `_ [`fixme `_] -* |OK_ICON| `Animals with attributes `_ +* |OK_ICON| `Animals with attributes `_ [`fixme `_] -* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ +* |OK_ICON| `Caltech 
Pedestrian Detection Benchmark `_ [`fixme `_] -* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ [`fixme `_] -* |OK_ICON| `Face Recognition Benchmark `_ +* |OK_ICON| `Face Recognition Benchmark `_ [`fixme `_] -* |OK_ICON| `Flickr: 32 Class Brand Logos `_ +* |OK_ICON| `Flickr: 32 Class Brand Logos `_ [`fixme `_] -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`fixme `_] -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] -* |OK_ICON| `Indoor Scene Recognition `_ +* |OK_ICON| `Indoor Scene Recognition `_ [`fixme `_] -* |OK_ICON| `International Affective Picture System, UFL `_ +* |OK_ICON| `International Affective Picture System, UFL `_ [`fixme `_] -* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ +* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ [`fixme `_] -* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ +* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ [`fixme `_] -* |OK_ICON| `SUN database, MIT `_ +* |OK_ICON| `SUN database, MIT `_ [`fixme `_] -* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] -* |OK_ICON| `Stanford Dogs Dataset `_ +* |OK_ICON| `Stanford Dogs Dataset `_ [`fixme `_] -* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ +* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ [`fixme `_] -* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ [`fixme `_] -* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ +* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ [`fixme 
`_] -* |OK_ICON| `Visual genome `_ +* |OK_ICON| `Visual genome `_ [`fixme `_] -* |OK_ICON| `YouTube Faces Database `_ +* |OK_ICON| `YouTube Faces Database `_ [`fixme `_] MachineLearning --------------- -* |OK_ICON| `Context-aware data sets from five domains `_ +* |OK_ICON| `Context-aware data sets from five domains `_ [`fixme `_] -* |OK_ICON| `Delve Datasets for classification and regression `_ +* |OK_ICON| `Delve Datasets for classification and regression `_ [`fixme `_] -* |OK_ICON| `Discogs Monthly Data `_ +* |OK_ICON| `Discogs Monthly Data `_ [`fixme `_] -* |OK_ICON| `Free Music Archive `_ +* |OK_ICON| `Free Music Archive `_ [`fixme `_] -* |OK_ICON| `IMDb Database `_ +* |OK_ICON| `IMDb Database `_ [`fixme `_] -* |OK_ICON| `Keel Repository for classification, regression and time series `_ +* |OK_ICON| `Keel Repository for classification, regression and time series `_ [`fixme `_] -* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ +* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ [`fixme `_] -* |OK_ICON| `Lending Club Loan Data `_ +* |OK_ICON| `Lending Club Loan Data `_ [`fixme `_] -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |OK_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] -* |OK_ICON| `Million Song Dataset `_ +* |OK_ICON| `Million Song Dataset `_ [`fixme `_] -* |OK_ICON| `More Song Datasets `_ +* |OK_ICON| `More Song Datasets `_ [`fixme `_] -* |OK_ICON| `MovieLens Data Sets `_ +* |OK_ICON| `MovieLens Data Sets `_ [`fixme `_] -* |OK_ICON| `New Yorker caption contest ratings `_ +* |OK_ICON| `New Yorker caption contest ratings `_ [`fixme `_] -* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ +* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ [`fixme `_] -* |OK_ICON| `Registered Meteorites on Earth `_ +* |OK_ICON| `Registered Meteorites on Earth `_ [`fixme `_] -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] -* |OK_ICON| `UCI 
Machine Learning Repository `_ +* |OK_ICON| `UCI Machine Learning Repository `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ +* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] -* |OK_ICON| `YouTube-BoundingBoxes `_ +* |OK_ICON| `YouTube-BoundingBoxes `_ [`fixme `_] -* |OK_ICON| `Youtube 8m `_ +* |OK_ICON| `Youtube 8m `_ [`fixme `_] -* |OK_ICON| `eBay Online Auctions (2012) `_ +* |OK_ICON| `eBay Online Auctions (2012) `_ [`fixme `_] Museums ------- -* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ +* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ [`fixme `_] -* |OK_ICON| `Cooper-Hewitt's Collection Database `_ +* |OK_ICON| `Cooper-Hewitt's Collection Database `_ [`fixme `_] -* |OK_ICON| `Minneapolis Institute of Arts metadata `_ +* |OK_ICON| `Minneapolis Institute of Arts metadata `_ [`fixme `_] -* |OK_ICON| `Natural History Museum (London) Data Portal `_ +* |OK_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] -* |OK_ICON| `Rijksmuseum Historical Art Collection `_ +* |OK_ICON| `Rijksmuseum Historical Art Collection `_ [`fixme `_] -* |OK_ICON| `Tate Collection metadata `_ +* |OK_ICON| `Tate Collection metadata `_ [`fixme `_] -* |OK_ICON| `The Getty vocabularies `_ +* |OK_ICON| `The Getty vocabularies `_ [`fixme `_] NaturalLanguage --------------- -* |OK_ICON| `Automatic Keyphrase Extraction `_ +* |OK_ICON| `Automatic Keyphrase Extraction `_ [`fixme `_] -* |OK_ICON| `Blogger Corpus `_ +* |OK_ICON| `Blogger Corpus `_ [`fixme `_] -* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ +* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ [`fixme `_] -* |OK_ICON| `ClueWeb09 FACC `_ +* |OK_ICON| `ClueWeb09 FACC `_ [`fixme `_] -* |OK_ICON| `ClueWeb12 FACC `_ +* |OK_ICON| `ClueWeb12 FACC `_ [`fixme `_] -* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ +* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ [`fixme `_] -* |OK_ICON| `Flickr Personal 
Taxonomies `_ +* |OK_ICON| `Flickr Personal Taxonomies `_ [`fixme `_] -* |OK_ICON| `Freebase of people, places, and things `_ +* |OK_ICON| `Freebase of people, places, and things `_ [`fixme `_] -* |OK_ICON| `Google Books Ngrams (2.2TB) `_ +* |OK_ICON| `Google Books Ngrams (2.2TB) `_ [`fixme `_] -* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ +* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ [`fixme `_] -* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] -* |OK_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ [`fixme `_] -* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ +* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ [`fixme `_] -* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] -* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ [`fixme `_] -* |OK_ICON| `Machine Translation of European languages `_ +* |OK_ICON| `Machine Translation of European languages `_ [`fixme `_] -* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] -* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ +* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`fixme `_] -* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ +* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ [`fixme `_] -* |OK_ICON| `Open Multilingual Wordnet `_ +* |OK_ICON| `Open Multilingual Wordnet `_ [`fixme `_] -* |OK_ICON| `POS/NER/Chunk annotated data `_ +* |OK_ICON| `POS/NER/Chunk annotated data 
`_ [`fixme `_] -* |OK_ICON| `Personae Corpus `_ +* |OK_ICON| `Personae Corpus `_ [`fixme `_] -* |OK_ICON| `SMS Spam Collection in English `_ +* |OK_ICON| `SMS Spam Collection in English `_ [`fixme `_] -* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ [`fixme `_] -* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ +* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ [`fixme `_] -* |OK_ICON| `USENET postings corpus of 2005~2011 `_ +* |OK_ICON| `USENET postings corpus of 2005~2011 `_ [`fixme `_] -* |OK_ICON| `Universal Dependencies `_ +* |OK_ICON| `Universal Dependencies `_ [`fixme `_] -* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ +* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ [`fixme `_] -* |OK_ICON| `Wikidata - Wikipedia databases `_ +* |OK_ICON| `Wikidata - Wikipedia databases `_ [`fixme `_] -* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ +* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] -* |FIXME_ICON| `WordNet databases and tools `_ +* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] Neuroscience ------------ -* |OK_ICON| `Allen Institute Datasets `_ +* |OK_ICON| `Allen Institute Datasets `_ [`fixme `_] -* |OK_ICON| `Brain Catalogue `_ +* |OK_ICON| `Brain Catalogue `_ [`fixme `_] -* |OK_ICON| `Brainomics `_ +* |OK_ICON| `Brainomics `_ [`fixme `_] -* |FIXME_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] -* |OK_ICON| `FCP-INDI `_ +* |OK_ICON| `FCP-INDI `_ [`fixme `_] -* |OK_ICON| `Human Connectome Project `_ +* |OK_ICON| `Human Connectome Project `_ [`fixme `_] -* |OK_ICON| `NDAR `_ +* |OK_ICON| `NDAR `_ [`fixme `_] -* |OK_ICON| 
`NIMH Data Archive `_ +* |OK_ICON| `NIMH Data Archive `_ [`fixme `_] -* |OK_ICON| `NeuroData `_ +* |OK_ICON| `NeuroData `_ [`fixme `_] -* |OK_ICON| `Neuroelectro `_ +* |OK_ICON| `Neuroelectro `_ [`fixme `_] -* |OK_ICON| `OASIS `_ +* |OK_ICON| `OASIS `_ [`fixme `_] -* |OK_ICON| `OpenfMRI `_ +* |OK_ICON| `OpenfMRI `_ [`fixme `_] -* |OK_ICON| `Study Forrest `_ +* |OK_ICON| `Study Forrest `_ [`fixme `_] Physics ------- -* |OK_ICON| `CERN Open Data Portal `_ +* |OK_ICON| `CERN Open Data Portal `_ [`fixme `_] -* |OK_ICON| `Crystallography Open Database `_ +* |OK_ICON| `Crystallography Open Database `_ [`fixme `_] -* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ [`fixme `_] -* |OK_ICON| `NASA Exoplanet Archive `_ +* |OK_ICON| `NASA Exoplanet Archive `_ [`fixme `_] -* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ +* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ [`fixme `_] -* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ +* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ [`fixme `_] Psychology+Cognition -------------------- -* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ [`fixme `_] PublicDomains ------------- -* |OK_ICON| `Amazon `_ +* |OK_ICON| `Amazon `_ [`fixme `_] -* |OK_ICON| `Archive.org Datasets `_ +* |OK_ICON| `Archive.org Datasets `_ [`fixme `_] -* |OK_ICON| `Archive-it from Internet Archive `_ +* |OK_ICON| `Archive-it from Internet Archive `_ [`fixme `_] -* |OK_ICON| `CMU JASA data archive `_ +* |OK_ICON| `CMU JASA data archive `_ [`fixme `_] -* |OK_ICON| `CMU StatLab collections `_ +* |OK_ICON| `CMU StatLab collections `_ [`fixme `_] -* |OK_ICON| `Data.World `_ +* |OK_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |OK_ICON| `Data360 `_ [`fixme `_] -* |OK_ICON| `Enigma Public `_ +* |OK_ICON| `Enigma Public `_ [`fixme `_] -* |OK_ICON| `Google 
`_ +* |OK_ICON| `Google `_ [`fixme `_] -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ [`fixme `_] -* |OK_ICON| `KDNuggets Data Collections `_ +* |OK_ICON| `KDNuggets Data Collections `_ [`fixme `_] -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] -* |OK_ICON| `Microsoft Data Science for Research `_ +* |OK_ICON| `Microsoft Data Science for Research `_ [`fixme `_] -* |FIXME_ICON| `Numbray `_ +* |FIXME_ICON| `Numbray `_ [`fixme `_] -* |OK_ICON| `Open Library Data Dumps `_ +* |OK_ICON| `Open Library Data Dumps `_ [`fixme `_] -* |OK_ICON| `Reddit Datasets `_ +* |OK_ICON| `Reddit Datasets `_ [`fixme `_] -* |OK_ICON| `RevolutionAnalytics Collection `_ +* |OK_ICON| `RevolutionAnalytics Collection `_ [`fixme `_] -* |OK_ICON| `Sample R data sets `_ +* |OK_ICON| `Sample R data sets `_ [`fixme `_] -* |OK_ICON| `StatSci.org `_ +* |OK_ICON| `StatSci.org `_ [`fixme `_] -* |FIXME_ICON| `Stats4Stem R data sets `_ +* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] -* |OK_ICON| `The Washington Post List `_ +* |OK_ICON| `The Washington Post List `_ [`fixme `_] -* |OK_ICON| `UCLA SOCR data collection `_ +* |OK_ICON| `UCLA SOCR data collection `_ [`fixme `_] -* |OK_ICON| `UFO Reports `_ +* |OK_ICON| `UFO Reports `_ [`fixme `_] -* |OK_ICON| `Wikileaks 911 pager intercepts `_ +* |OK_ICON| `Wikileaks 911 pager intercepts `_ [`fixme `_] -* |FIXME_ICON| `Yahoo Webscope `_ +* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] SearchEngines ------------- -* |OK_ICON| `Academic Torrents of data sharing from UMB `_ +* |OK_ICON| `Academic Torrents of data sharing from UMB `_ [`fixme `_] -* |OK_ICON| `DataMarket (Qlik) `_ +* |OK_ICON| `DataMarket (Qlik) `_ [`fixme `_] -* |OK_ICON| `Datahub.io `_ +* |OK_ICON| `Datahub.io `_ [`fixme `_] -* |OK_ICON| `Harvard Dataverse Network of scientific data `_ +* |OK_ICON| `Harvard Dataverse Network of scientific data `_ [`fixme `_] -* |OK_ICON| `ICPSR (UMICH) `_ +* 
|OK_ICON| `ICPSR (UMICH) `_ [`fixme `_] -* |OK_ICON| `Institute of Education Sciences `_ +* |OK_ICON| `Institute of Education Sciences `_ [`fixme `_] -* |FIXME_ICON| `National Technical Reports Library `_ +* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] -* |OK_ICON| `Open Data Certificates (beta) `_ +* |OK_ICON| `Open Data Certificates (beta) `_ [`fixme `_] -* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ [`fixme `_] -* |OK_ICON| `Statista.com - statistics and Studies `_ +* |OK_ICON| `Statista.com - statistics and Studies `_ [`fixme `_] -* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ +* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`fixme `_] SocialNetworks -------------- -* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ +* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ [`fixme `_] -* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ +* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ [`fixme `_] -* |OK_ICON| `CMU Enron Email of 150 users `_ +* |OK_ICON| `CMU Enron Email of 150 users `_ [`fixme `_] -* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`fixme `_] -* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ +* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ [`fixme `_] -* |OK_ICON| `Facebook Data Scrape (2005) `_ +* |OK_ICON| `Facebook Data Scrape (2005) `_ [`fixme `_] -* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ +* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ [`fixme `_] -* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ +* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`fixme `_] -* |OK_ICON| `GitHub Collaboration Archive `_ +* |OK_ICON| `GitHub Collaboration Archive `_ [`fixme 
`_] -* |OK_ICON| `Google Scholar citation relations `_ +* |OK_ICON| `Google Scholar citation relations `_ [`fixme `_] -* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ +* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`fixme `_] -* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ +* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`fixme `_] -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] -* |OK_ICON| `Network Twitter Data `_ +* |OK_ICON| `Network Twitter Data `_ [`fixme `_] -* |OK_ICON| `Reddit Comments `_ +* |OK_ICON| `Reddit Comments `_ [`fixme `_] -* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ +* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ [`fixme `_] -* |OK_ICON| `Social Twitter Data `_ +* |OK_ICON| `Social Twitter Data `_ [`fixme `_] -* |OK_ICON| `SourceForge.net Research Data `_ +* |OK_ICON| `SourceForge.net Research Data `_ [`fixme `_] -* |OK_ICON| `Twitter Data for Online Reputation Management `_ +* |OK_ICON| `Twitter Data for Online Reputation Management `_ [`fixme `_] -* |OK_ICON| `Twitter Data for Sentiment Analysis `_ +* |OK_ICON| `Twitter Data for Sentiment Analysis `_ [`fixme `_] -* |OK_ICON| `Twitter Graph of entire Twitter site `_ +* |OK_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] -* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] -* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ +* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ [`fixme `_] -* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ +* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] SocialSciences -------------- -* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ +* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ [`fixme `_] -* |OK_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ [`fixme `_] -* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] -* |OK_ICON| `Correlates of War Project `_ +* |OK_ICON| `Correlates of War Project `_ [`fixme `_] -* |OK_ICON| `Cryptome Conspiracy Theory Items `_ +* |FIXME_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] -* |FIXME_ICON| `Datacards `_ +* |FIXME_ICON| `Datacards `_ [`fixme `_] -* |OK_ICON| `European Social Survey `_ +* |OK_ICON| `European Social Survey `_ [`fixme `_] -* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ +* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`fixme `_] -* |FIXME_ICON| `Fragile States Index `_ +* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] -* |OK_ICON| `GDELT Global Events Database `_ +* |OK_ICON| `GDELT Global Events Database `_ [`fixme `_] -* |OK_ICON| `General Social Survey (GSS) since 1972 `_ +* |OK_ICON| `General Social Survey (GSS) since 1972 `_ [`fixme `_] -* |OK_ICON| `German Social Survey `_ +* |OK_ICON| `German Social Survey `_ [`fixme `_] -* |OK_ICON| `Global Religious Futures Project `_ +* |OK_ICON| `Global Religious Futures Project `_ [`fixme `_] -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] -* |OK_ICON| `INFORM Index for Risk Management `_ +* |OK_ICON| `INFORM Index for Risk Management `_ [`fixme `_] -* |OK_ICON| `Institute for Demographic Studies `_ +* |OK_ICON| `Institute for 
Demographic Studies `_ [`fixme `_] -* |OK_ICON| `International Networks Archive `_ +* |OK_ICON| `International Networks Archive `_ [`fixme `_] -* |OK_ICON| `International Social Survey Program ISSP `_ +* |OK_ICON| `International Social Survey Program ISSP `_ [`fixme `_] -* |OK_ICON| `International Studies Compendium Project `_ +* |OK_ICON| `International Studies Compendium Project `_ [`fixme `_] -* |OK_ICON| `James McGuire Cross National Data `_ +* |OK_ICON| `James McGuire Cross National Data `_ [`fixme `_] -* |OK_ICON| `MIT Reality Mining Dataset `_ +* |OK_ICON| `MIT Reality Mining Dataset `_ [`fixme `_] -* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ [`fixme `_] -* |OK_ICON| `Minnesota Population Center `_ +* |OK_ICON| `Minnesota Population Center `_ [`fixme `_] -* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ +* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] -* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ [`fixme `_] -* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, criminal, or economic interest. `_ +* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, [...] 
`_ [`fixme `_] -* |OK_ICON| `Paul Hensel General International Data Page `_ +* |OK_ICON| `Paul Hensel General International Data Page `_ [`fixme `_] -* |FIXME_ICON| `PewResearch Internet Survey Project `_ +* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] -* |OK_ICON| `PewResearch Society Data Collection `_ +* |OK_ICON| `PewResearch Society Data Collection `_ [`fixme `_] -* |OK_ICON| `Political Polarity Data `_ +* |OK_ICON| `Political Polarity Data `_ [`fixme `_] -* |OK_ICON| `StackExchange Data Explorer `_ +* |OK_ICON| `StackExchange Data Explorer `_ [`fixme `_] -* |OK_ICON| `Terrorism Research and Analysis Consortium `_ +* |OK_ICON| `Terrorism Research and Analysis Consortium `_ [`fixme `_] -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] -* |OK_ICON| `Titanic Survival Data Set `_ +* |OK_ICON| `Titanic Survival Data Set `_ [`fixme `_] -* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ +* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] -* |OK_ICON| `UN Civil Society Database `_ +* |OK_ICON| `UN Civil Society Database `_ [`fixme `_] -* |OK_ICON| `UPJOHN for Labor Employment Research `_ +* |OK_ICON| `UPJOHN for Labor Employment Research `_ [`fixme `_] -* |OK_ICON| `Universities Worldwide `_ +* |OK_ICON| `Universities Worldwide `_ [`fixme `_] -* |OK_ICON| `Uppsala Conflict Data Program `_ +* |OK_ICON| `Uppsala Conflict Data Program `_ [`fixme `_] -* |OK_ICON| `World Bank Open Data `_ +* |OK_ICON| `World Bank Open Data `_ [`fixme `_] -* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ +* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ [`fixme `_] Software -------- -* |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ +* |OK_ICON| `FLOSSmole data about 
free, libre, and open source software development `_ [`fixme `_] -* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ +* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`fixme `_] Sports ------ -* |OK_ICON| `Betfair Historical Exchange Data `_ +* |OK_ICON| `Betfair Historical Exchange Data `_ [`fixme `_] -* |OK_ICON| `Cricsheet Matches (cricket) `_ +* |OK_ICON| `Cricsheet Matches (cricket) `_ [`fixme `_] -* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ +* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] -* |OK_ICON| `Football/Soccer resources (data and APIs) `_ +* |OK_ICON| `Football/Soccer resources (data and APIs) `_ [`fixme `_] -* |OK_ICON| `Lahman's Baseball Database `_ +* |OK_ICON| `Lahman's Baseball Database `_ [`fixme `_] -* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ +* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ [`fixme `_] -* |OK_ICON| `Retrosheet Baseball Statistics `_ +* |OK_ICON| `Retrosheet Baseball Statistics `_ [`fixme `_] -* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ +* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ [`fixme `_] -* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ +* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ [`fixme `_] TimeSeries ---------- -* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ +* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ [`fixme `_] -* |OK_ICON| `Hard Drive Failure Rates `_ +* |OK_ICON| `Hard Drive Failure Rates `_ [`fixme `_] -* |OK_ICON| `Heart Rate Time Series from MIT `_ +* |OK_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] -* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ +* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] -* |OK_ICON| `UC Riverside Time Series Dataset `_ +* |OK_ICON| `UC Riverside Time 
Series Dataset `_ [`fixme `_] Transportation -------------- -* |OK_ICON| `Airlines OD Data 1987-2008 `_ +* |OK_ICON| `Airlines OD Data 1987-2008 `_ [`fixme `_] -* |OK_ICON| `Bay Area Bike Share Data `_ +* |OK_ICON| `Bay Area Bike Share Data `_ [`fixme `_] -* |OK_ICON| `Bike Share Systems (BSS) collection `_ +* |OK_ICON| `Bike Share Systems (BSS) collection `_ [`fixme `_] -* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ +* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`fixme `_] -* |OK_ICON| `German train system by Deutsche Bahn `_ +* |OK_ICON| `German train system by Deutsche Bahn `_ [`fixme `_] -* |OK_ICON| `Hubway Million Rides in MA `_ +* |OK_ICON| `Hubway Million Rides in MA `_ [`fixme `_] -* |OK_ICON| `Montreal BIXI Bike Share `_ +* |OK_ICON| `Montreal BIXI Bike Share `_ [`fixme `_] -* |OK_ICON| `NYC Taxi Trip Data 2009- `_ +* |OK_ICON| `NYC Taxi Trip Data 2009- `_ [`fixme `_] -* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`fixme `_] -* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ +* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ [`fixme `_] -* |OK_ICON| `Open Traffic collection `_ +* |OK_ICON| `Open Traffic collection `_ [`fixme `_] -* |OK_ICON| `OpenFlights - airport, airline and route data `_ +* |OK_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] -* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ +* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_] -* |OK_ICON| `Plane Crash Database, since 1920 `_ +* |OK_ICON| `Plane Crash Database, since 1920 `_ [`fixme `_] -* |OK_ICON| `RITA Airline On-Time Performance data `_ +* |OK_ICON| `RITA Airline On-Time Performance data `_ [`fixme `_] -* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ +* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] -* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ +* 
|FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ [`fixme `_] -* |OK_ICON| `Transport for London (TFL) `_ +* |OK_ICON| `Transport for London (TFL) `_ [`fixme `_] -* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ +* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ [`fixme `_] -* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ +* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ [`fixme `_] -* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ +* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ [`fixme `_] -* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ +* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ [`fixme `_] Complementary Collections From 41fdb1a3811c127cb3a434c20afbc10513e2536e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 7 Apr 2018 15:56:40 +0000 Subject: [PATCH 94/99] Update README from APD2: cccdf5f5191e3e18ce966ea97de7cf1a954859bb --- README.rst | 100 ++++++++++++++++++++++++++--------------------------- 1 file changed, 50 insertions(+), 50 deletions(-) diff --git a/README.rst b/README.rst index 0df7843..8962a68 100644 --- a/README.rst +++ b/README.rst @@ -2,8 +2,8 @@ Awesome Public Datasets ======================= .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg -:alt: Awesome -:target: https://github.com/sindresorhus/awesome + :alt: Awesome + :target: https://github.com/sindresorhus/awesome .. 
|OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png @@ -87,7 +87,7 @@ Biology * |OK_ICON| `NCI Genomic Data Commons `_ [`fixme `_] -* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] +* |FIXME_ICON| `NIH Microarray data `_ * |OK_ICON| `OpenSNP genotypes data `_ [`fixme `_] @@ -107,7 +107,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ [`fixme `_] -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |FIXME_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ [`fixme `_] @@ -140,7 +140,7 @@ Climate+Weather * |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`fixme `_] -* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] +* |FIXME_ICON| `European Climate Assessment & Dataset `_ * |OK_ICON| `Global Climate Data Since 1929 `_ [`fixme `_] @@ -169,7 +169,7 @@ ComplexNetworks * |OK_ICON| `CrossRef DOI URLs `_ [`fixme `_] -* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] +* |FIXME_ICON| `DBLP Citation dataset `_ * |OK_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] @@ -197,7 +197,7 @@ ComplexNetworks * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`fixme `_] -* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] +* |FIXME_ICON| `The Nexus Network Repository `_ * |OK_ICON| `UCI Network Data Repository `_ [`fixme `_] @@ -243,11 +243,11 @@ DataChallenges * |OK_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] -* |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] +* |FIXME_ICON| `D4D Challenge of Orange `_ * |OK_ICON| `DrivenData Competitions for Social Good `_ [`fixme `_] -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ * |OK_ICON| `KDD Cup by Tencent 2012 `_ [`fixme `_] @@ -293,7 +293,7 @@ Economics * |OK_ICON| `EconData from UMD `_ [`fixme `_] -* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] +* |FIXME_ICON| `Economic Freedom of the World Data 
`_ * |OK_ICON| `Historical MacroEconomc Statistics `_ [`fixme `_] @@ -351,7 +351,7 @@ Energy * |OK_ICON| `HFED `_ [`fixme `_] -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `REDD `_ [`fixme `_] @@ -366,7 +366,7 @@ Energy Finance ------- -* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] +* |FIXME_ICON| `CBOE Futures Exchange `_ * |OK_ICON| `Google Finance `_ [`fixme `_] @@ -393,7 +393,7 @@ GIS * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`fixme `_] -* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] +* |FIXME_ICON| `Factual Global Location Data `_ * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`fixme `_] @@ -405,7 +405,7 @@ GIS * |OK_ICON| `GeoNames Worldwide `_ [`fixme `_] -* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] +* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ * |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`fixme `_] @@ -425,7 +425,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ [`fixme `_] -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ [`fixme `_] @@ -433,7 +433,7 @@ GIS * |OK_ICON| `UN Environmental Data `_ [`fixme `_] -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] +* |FIXME_ICON| `World boundaries from the U.S. 
Department of State `_ * |OK_ICON| `World countries in multiple formats `_ [`fixme `_] @@ -464,7 +464,7 @@ Government * |OK_ICON| `Buenos Aires, Argentina `_ [`fixme `_] -* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] +* |FIXME_ICON| `Calgary, AB, Canada `_ * |OK_ICON| `Cambridge, MA, US `_ [`fixme `_] @@ -510,13 +510,13 @@ Government * |OK_ICON| `Guardian world governments `_ [`fixme `_] -* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] +* |FIXME_ICON| `Halifax, NS, Canada `_ * |OK_ICON| `Helsinki Region, Finland `_ [`fixme `_] * |OK_ICON| `Hong Kong, China `_ [`fixme `_] -* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] +* |FIXME_ICON| `Houston Open Data `_ * |OK_ICON| `Indian Government Data `_ [`fixme `_] @@ -554,7 +554,7 @@ Government * |OK_ICON| `Mountain View, California, US (GIS) `_ [`fixme `_] -* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] +* |FIXME_ICON| `NYC Open Data `_ * |OK_ICON| `NYC betanyc `_ [`fixme `_] @@ -592,7 +592,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ [`fixme `_] -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ [`fixme `_] @@ -628,7 +628,7 @@ Government * |OK_ICON| `The World Bank `_ [`fixme `_] -* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] +* |FIXME_ICON| `Toronto, ON, Canada `_ * |OK_ICON| `Tunisia `_ [`fixme `_] @@ -652,7 +652,7 @@ Government * |OK_ICON| `U.S. Open Government `_ [`fixme `_] -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ * |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ [`fixme `_] @@ -666,7 +666,7 @@ Government * |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] -* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] +* |FIXME_ICON| `Victoria, BC, Canada `_ * |OK_ICON| `Vienna, Austria `_ [`fixme `_] @@ -689,7 +689,7 @@ Healthcare * |OK_ICON| `Medicare Data File `_ [`fixme `_] -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`fixme `_] @@ -708,7 +708,7 @@ ImageProcessing * |OK_ICON| `10k US Adult Faces Database `_ [`fixme `_] -* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] +* |FIXME_ICON| `2GB of Photos of Cats `_ * |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ [`fixme `_] @@ -738,7 +738,7 @@ ImageProcessing * |OK_ICON| `SUN database, MIT `_ [`fixme `_] -* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ * |OK_ICON| `Stanford Dogs Dataset `_ [`fixme `_] @@ -785,11 +785,11 @@ MachineLearning * |OK_ICON| `Registered Meteorites on Earth `_ [`fixme `_] -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ * |OK_ICON| `UCI Machine Learning Repository `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] +* |FIXME_ICON| `Yahoo! 
Ratings and Classification Data `_ * |OK_ICON| `YouTube-BoundingBoxes `_ [`fixme `_] @@ -849,7 +849,7 @@ NaturalLanguage * |OK_ICON| `Machine Translation of European languages `_ [`fixme `_] -* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ * |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`fixme `_] @@ -877,7 +877,7 @@ NaturalLanguage * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] -* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] +* |FIXME_ICON| `WordNet databases and tools `_ Neuroscience ------------ @@ -888,7 +888,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ [`fixme `_] -* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] +* |FIXME_ICON| `CodeNeuro Datasets `_ * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] @@ -928,7 +928,7 @@ Physics Psychology+Cognition -------------------- -* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ [`fixme `_] +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ PublicDomains ------------- @@ -951,15 +951,15 @@ PublicDomains * |OK_ICON| `Google `_ [`fixme `_] -* |OK_ICON| `Infochimps `_ [`fixme `_] +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ [`fixme `_] -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ * |OK_ICON| `Microsoft Data Science for Research `_ [`fixme `_] -* |FIXME_ICON| `Numbray `_ [`fixme `_] +* |FIXME_ICON| `Numbray `_ * |OK_ICON| `Open Library Data Dumps `_ [`fixme `_] @@ -971,7 +971,7 @@ PublicDomains * |OK_ICON| `StatSci.org `_ [`fixme `_] -* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] +* |FIXME_ICON| `Stats4Stem R data sets `_ * |OK_ICON| `The Washington Post List `_ [`fixme `_] @@ -981,7 +981,7 @@ PublicDomains * |OK_ICON| `Wikileaks 911 pager 
intercepts `_ [`fixme `_] -* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] +* |FIXME_ICON| `Yahoo Webscope `_ SearchEngines ------------- @@ -998,7 +998,7 @@ SearchEngines * |OK_ICON| `Institute of Education Sciences `_ [`fixme `_] -* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] +* |FIXME_ICON| `National Technical Reports Library `_ * |OK_ICON| `Open Data Certificates (beta) `_ [`fixme `_] @@ -1035,7 +1035,7 @@ SocialNetworks * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`fixme `_] -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ * |OK_ICON| `Network Twitter Data `_ [`fixme `_] @@ -1053,11 +1053,11 @@ SocialNetworks * |OK_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] -* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ [`fixme `_] +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ * |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] @@ -1072,15 +1072,15 @@ SocialSciences * |OK_ICON| `Correlates of War Project `_ [`fixme `_] -* |FIXME_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] +* |OK_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] -* |FIXME_ICON| `Datacards `_ [`fixme `_] +* |FIXME_ICON| `Datacards `_ * |OK_ICON| `European Social Survey `_ [`fixme `_] * |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`fixme `_] -* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] +* |FIXME_ICON| `Fragile States Index `_ * |OK_ICON| `GDELT Global Events Database `_ [`fixme `_] @@ -1090,7 +1090,7 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ [`fixme `_] -* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] +* |FIXME_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ [`fixme `_] @@ -1118,7 +1118,7 @@ SocialSciences * |OK_ICON| `Paul Hensel General International Data Page `_ [`fixme `_] -* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] +* |FIXME_ICON| `PewResearch Internet Survey Project `_ * |OK_ICON| `PewResearch Society Data Collection `_ [`fixme `_] @@ -1134,7 +1134,7 @@ SocialSciences * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ * |OK_ICON| `UN Civil Society Database `_ [`fixme `_] @@ -1216,7 +1216,7 @@ Transportation * |OK_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] -* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_] +* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ * |OK_ICON| `Plane Crash Database, since 1920 `_ [`fixme `_] @@ -1224,7 +1224,7 @@ Transportation * |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] -* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ [`fixme `_] +* 
|FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ * |OK_ICON| `Transport for London (TFL) `_ [`fixme `_] From 3aa7e82edef97211af84e3accc9c02655d6a0531 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 10 Apr 2018 17:06:33 +0000 Subject: [PATCH 95/99] Update README from APD2: e5206d532a80a0319d5ab7752d84f410c4eadf14 --- README.rst | 1122 ++++++++++++++++++++++++++-------------------------- 1 file changed, 561 insertions(+), 561 deletions(-) diff --git a/README.rst b/README.rst index 8962a68..500aff0 100644 --- a/README.rst +++ b/README.rst @@ -30,1211 +30,1211 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ -* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ Biology ------- -* |OK_ICON| `1000 Genomes `_ [`fixme `_] +* |OK_ICON| `1000 Genomes `_ -* |OK_ICON| `American Gut (Microbiome Project) `_ [`fixme `_] +* |OK_ICON| `American Gut (Microbiome Project) `_ -* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ [`fixme `_] +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ -* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ [`fixme `_] +* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* |OK_ICON| `Cell Image Library `_ [`fixme `_] +* |OK_ICON| `Cell Image Library `_ -* |OK_ICON| `Complete Genomics Public Data `_ [`fixme `_] +* |OK_ICON| `Complete Genomics Public Data `_ -* |OK_ICON| `EBI ArrayExpress `_ [`fixme `_] +* |OK_ICON| `EBI ArrayExpress `_ -* |OK_ICON| `EBI Protein Data Bank in Europe `_ [`fixme `_] +* |OK_ICON| `EBI Protein Data Bank in Europe `_ -* |OK_ICON| `ENCODE project `_ [`fixme `_] +* |OK_ICON| `ENCODE project `_ -* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ [`fixme `_] +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* |OK_ICON| `Ensembl Genomes `_ [`fixme `_] +* |OK_ICON| 
`Ensembl Genomes `_ -* |OK_ICON| `Gene Expression Omnibus (GEO) `_ [`fixme `_] +* |OK_ICON| `Gene Expression Omnibus (GEO) `_ -* |OK_ICON| `Gene Ontology (GO) `_ [`fixme `_] +* |OK_ICON| `Gene Ontology (GO) `_ -* |OK_ICON| `Global Biotic Interactions (GloBI) `_ [`fixme `_] +* |OK_ICON| `Global Biotic Interactions (GloBI) `_ -* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ [`fixme `_] +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ -* |OK_ICON| `Human Genome Diversity Project `_ [`fixme `_] +* |OK_ICON| `Human Genome Diversity Project `_ -* |OK_ICON| `Human Microbiome Project (HMP) `_ [`fixme `_] +* |OK_ICON| `Human Microbiome Project (HMP) `_ -* |OK_ICON| `ICOS PSP Benchmark `_ [`fixme `_] +* |OK_ICON| `ICOS PSP Benchmark `_ -* |OK_ICON| `International HapMap Project `_ [`fixme `_] +* |OK_ICON| `International HapMap Project `_ -* |OK_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] +* |OK_ICON| `Journal of Cell Biology DataViewer `_ -* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] `_ [`fixme `_] +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] 
`_ -* |OK_ICON| `MIT Cancer Genomics Data `_ [`fixme `_] +* |OK_ICON| `MIT Cancer Genomics Data `_ -* |OK_ICON| `NCBI Proteins `_ [`fixme `_] +* |OK_ICON| `NCBI Proteins `_ -* |OK_ICON| `NCBI Taxonomy `_ [`fixme `_] +* |OK_ICON| `NCBI Taxonomy `_ -* |OK_ICON| `NCI Genomic Data Commons `_ [`fixme `_] +* |OK_ICON| `NCI Genomic Data Commons `_ -* |FIXME_ICON| `NIH Microarray data `_ +* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] -* |OK_ICON| `OpenSNP genotypes data `_ [`fixme `_] +* |OK_ICON| `OpenSNP genotypes data `_ -* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ [`fixme `_] +* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ -* |OK_ICON| `Protein Data Bank `_ [`fixme `_] +* |OK_ICON| `Protein Data Bank `_ -* |OK_ICON| `Psychiatric Genomics Consortium `_ [`fixme `_] +* |OK_ICON| `Psychiatric Genomics Consortium `_ -* |OK_ICON| `PubChem Project `_ [`fixme `_] +* |OK_ICON| `PubChem Project `_ -* |OK_ICON| `PubGene (now Coremine Medical) `_ [`fixme `_] +* |OK_ICON| `PubGene (now Coremine Medical) `_ -* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ [`fixme `_] +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ -* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ [`fixme `_] +* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* |OK_ICON| `Sequence Read Archive(SRA) `_ [`fixme `_] +* |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] -* |OK_ICON| `Stowers Institute Original Data Repository `_ [`fixme `_] +* |OK_ICON| `Stowers Institute Original Data Repository `_ -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ [`fixme `_] +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ -* |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ [`fixme `_] +* |OK_ICON| `The Cancer Genome Atlas (TCGA), 
available via Broad GDAC `_ -* |OK_ICON| `The Catalogue of Life `_ [`fixme `_] +* |OK_ICON| `The Catalogue of Life `_ -* |OK_ICON| `The Personal Genome Project `_ [`fixme `_] +* |OK_ICON| `The Personal Genome Project `_ -* |OK_ICON| `UCSC Public Data `_ [`fixme `_] +* |OK_ICON| `UCSC Public Data `_ -* |OK_ICON| `UniGene `_ [`fixme `_] +* |OK_ICON| `UniGene `_ -* |OK_ICON| `Universal Protein Resource (UnitProt) `_ [`fixme `_] +* |OK_ICON| `Universal Protein Resource (UnitProt) `_ Climate+Weather --------------- -* |OK_ICON| `Actuaries Climate Index `_ [`fixme `_] +* |OK_ICON| `Actuaries Climate Index `_ -* |OK_ICON| `Australian Weather `_ [`fixme `_] +* |OK_ICON| `Australian Weather `_ -* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ [`fixme `_] +* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ -* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ [`fixme `_] +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ -* |OK_ICON| `Canadian Meteorological Centre `_ [`fixme `_] +* |OK_ICON| `Canadian Meteorological Centre `_ -* |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`fixme `_] +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* |FIXME_ICON| `European Climate Assessment & Dataset `_ +* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] -* |OK_ICON| `Global Climate Data Since 1929 `_ [`fixme `_] +* |OK_ICON| `Global Climate Data Since 1929 `_ -* |OK_ICON| `NASA Global Imagery Browse Services `_ [`fixme `_] +* |OK_ICON| `NASA Global Imagery Browse Services `_ -* |OK_ICON| `NOAA Bering Sea Climate `_ [`fixme `_] +* |OK_ICON| `NOAA Bering Sea Climate `_ -* |OK_ICON| `NOAA Climate Datasets `_ [`fixme `_] +* |OK_ICON| `NOAA Climate Datasets `_ -* |OK_ICON| `NOAA Realtime Weather Models `_ [`fixme `_] +* |OK_ICON| `NOAA Realtime Weather Models `_ -* |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ [`fixme `_] +* |OK_ICON| 
`NOAA SURFRAD Meteorology and Radiation Datasets `_ -* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ [`fixme `_] +* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ -* |OK_ICON| `UEA Climatic Research Unit `_ [`fixme `_] +* |OK_ICON| `UEA Climatic Research Unit `_ -* |OK_ICON| `WU Historical Weather Worldwide `_ [`fixme `_] +* |OK_ICON| `WU Historical Weather Worldwide `_ -* |OK_ICON| `WorldClim - Global Climate Data `_ [`fixme `_] +* |OK_ICON| `WorldClim - Global Climate Data `_ ComplexNetworks --------------- -* |OK_ICON| `AMiner Citation Network Dataset `_ [`fixme `_] +* |OK_ICON| `AMiner Citation Network Dataset `_ -* |OK_ICON| `CrossRef DOI URLs `_ [`fixme `_] +* |OK_ICON| `CrossRef DOI URLs `_ -* |FIXME_ICON| `DBLP Citation dataset `_ +* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] -* |OK_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] +* |OK_ICON| `DIMACS Road Networks Collection `_ -* |OK_ICON| `NBER Patent Citations `_ [`fixme `_] +* |OK_ICON| `NBER Patent Citations `_ -* |OK_ICON| `NIST complex networks data collection `_ [`fixme `_] +* |OK_ICON| `NIST complex networks data collection `_ -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ -* |OK_ICON| `Protein-protein interaction network `_ [`fixme `_] +* |OK_ICON| `Protein-protein interaction network `_ -* |OK_ICON| `PyPI and Maven Dependency Network `_ [`fixme `_] +* |OK_ICON| `PyPI and Maven Dependency Network `_ -* |OK_ICON| `Scopus Citation Database `_ [`fixme `_] +* |OK_ICON| `Scopus Citation Database `_ -* |OK_ICON| `Small Network Data `_ [`fixme `_] +* |OK_ICON| `Small Network Data `_ -* |OK_ICON| `Stanford GraphBase `_ [`fixme `_] +* |OK_ICON| `Stanford GraphBase `_ -* |OK_ICON| `Stanford Large Network Dataset Collection `_ [`fixme `_] +* |OK_ICON| `Stanford Large Network Dataset Collection `_ -* |OK_ICON| `Stanford 
Longitudinal Network Data Sources `_ [`fixme `_] +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ -* |OK_ICON| `The Koblenz Network Collection `_ [`fixme `_] +* |OK_ICON| `The Koblenz Network Collection `_ -* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`fixme `_] +* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ -* |FIXME_ICON| `The Nexus Network Repository `_ +* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] -* |OK_ICON| `UCI Network Data Repository `_ [`fixme `_] +* |OK_ICON| `UCI Network Data Repository `_ -* |OK_ICON| `UFL sparse matrix collection `_ [`fixme `_] +* |OK_ICON| `UFL sparse matrix collection `_ -* |OK_ICON| `WSU Graph Database `_ [`fixme `_] +* |OK_ICON| `WSU Graph Database `_ ComputerNetworks ---------------- -* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ [`fixme `_] +* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ -* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ [`fixme `_] +* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ -* |OK_ICON| `CAIDA Internet Datasets `_ [`fixme `_] +* |OK_ICON| `CAIDA Internet Datasets `_ -* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ [`fixme `_] +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ -* |OK_ICON| `ClueWeb09 - 1B web pages `_ [`fixme `_] +* |OK_ICON| `ClueWeb09 - 1B web pages `_ -* |OK_ICON| `ClueWeb12 - 733M web pages `_ [`fixme `_] +* |OK_ICON| `ClueWeb12 - 733M web pages `_ -* |OK_ICON| `CommonCrawl Web Data over 7 years `_ [`fixme `_] +* |OK_ICON| `CommonCrawl Web Data over 7 years `_ -* |OK_ICON| `Criteo click-through data `_ [`fixme `_] +* |OK_ICON| `Criteo click-through data `_ -* |OK_ICON| `Internet-Wide Scan Data Repository `_ [`fixme `_] +* |OK_ICON| `Internet-Wide Scan Data Repository `_ -* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ [`fixme `_] +* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ -* |OK_ICON| `Open Mobile Data by MobiPerf `_ [`fixme `_] +* |OK_ICON| `Open Mobile Data by MobiPerf `_ -* |OK_ICON| `Rapid7 Sonar Internet Scans `_ [`fixme `_] +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ -* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ [`fixme `_] +* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ DataChallenges -------------- -* |OK_ICON| `Bruteforce Database `_ [`fixme `_] +* |OK_ICON| `Bruteforce Database `_ -* |OK_ICON| `Challenges in Machine Learning `_ [`fixme `_] +* |OK_ICON| `Challenges in Machine Learning `_ -* |OK_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] +* |OK_ICON| `CrowdANALYTIX dataX `_ -* |FIXME_ICON| `D4D Challenge of Orange `_ +* |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] -* |OK_ICON| `DrivenData Competitions for Social Good `_ [`fixme `_] +* |OK_ICON| `DrivenData Competitions for Social Good `_ -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] -* |OK_ICON| `KDD Cup by Tencent 2012 `_ [`fixme `_] +* |OK_ICON| `KDD Cup by Tencent 2012 `_ -* |OK_ICON| `Kaggle Competition Data `_ [`fixme `_] +* |OK_ICON| `Kaggle Competition Data `_ -* |OK_ICON| `Localytics Data Visualization Challenge `_ [`fixme `_] +* |OK_ICON| `Localytics Data 
Visualization Challenge `_ -* |OK_ICON| `Netflix Prize `_ [`fixme `_] +* |OK_ICON| `Netflix Prize `_ -* |OK_ICON| `Space Apps Challenge `_ [`fixme `_] +* |OK_ICON| `Space Apps Challenge `_ -* |OK_ICON| `Telecom Italia Big Data Challenge `_ [`fixme `_] +* |OK_ICON| `Telecom Italia Big Data Challenge `_ -* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ [`fixme `_] +* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ [`fixme `_] +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ -* |OK_ICON| `Yelp Dataset Challenge `_ [`fixme `_] +* |OK_ICON| `Yelp Dataset Challenge `_ EarthScience ------------ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ -* |OK_ICON| `BODC - marine data of ~22K vars `_ [`fixme `_] +* |OK_ICON| `BODC - marine data of ~22K vars `_ -* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ [`fixme `_] +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ -* |OK_ICON| `Earth Models `_ [`fixme `_] +* |OK_ICON| `Earth Models `_ -* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ [`fixme `_] +* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ [`fixme `_] +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ -* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ [`fixme `_] +* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ -* |OK_ICON| `USGS Earthquake Archives `_ [`fixme `_] +* |OK_ICON| `USGS Earthquake Archives `_ Economics --------- -* |OK_ICON| `American Economic Association (AEA) `_ [`fixme `_] +* |OK_ICON| `American Economic Association (AEA) `_ -* |OK_ICON| `EconData from UMD `_ [`fixme `_] +* 
|OK_ICON| `EconData from UMD `_ -* |FIXME_ICON| `Economic Freedom of the World Data `_ +* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] -* |OK_ICON| `Historical MacroEconomc Statistics `_ [`fixme `_] +* |OK_ICON| `Historical MacroEconomc Statistics `_ -* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ [`fixme `_] +* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ -* |OK_ICON| `International Economics Database `_ [`fixme `_] +* |OK_ICON| `International Economics Database `_ -* |OK_ICON| `International Trade Statistics `_ [`fixme `_] +* |OK_ICON| `International Trade Statistics `_ -* |OK_ICON| `Internet Product Code Database `_ [`fixme `_] +* |OK_ICON| `Internet Product Code Database `_ -* |OK_ICON| `Joint External Debt Data Hub `_ [`fixme `_] +* |OK_ICON| `Joint External Debt Data Hub `_ -* |OK_ICON| `Jon Haveman International Trade Data Links `_ [`fixme `_] +* |OK_ICON| `Jon Haveman International Trade Data Links `_ -* |OK_ICON| `OpenCorporates Database of Companies in the World `_ [`fixme `_] +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ -* |OK_ICON| `Our World in Data `_ [`fixme `_] +* |OK_ICON| `Our World in Data `_ -* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ [`fixme `_] +* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ -* |OK_ICON| `The Atlas of Economic Complexity `_ [`fixme `_] +* |OK_ICON| `The Atlas of Economic Complexity `_ -* |OK_ICON| `The Center for International Data `_ [`fixme `_] +* |OK_ICON| `The Center for International Data `_ -* |OK_ICON| `The Observatory of Economic Complexity `_ [`fixme `_] +* |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] +* |OK_ICON| `UN Commodity Trade Statistics `_ -* |OK_ICON| `UN Human Development Reports `_ [`fixme `_] +* |OK_ICON| `UN Human Development Reports `_ Education --------- -* |OK_ICON| `College Scorecard Data `_ [`fixme `_] 
+* |OK_ICON| `College Scorecard Data `_ -* |OK_ICON| `Student Data from Free Code Camp `_ [`fixme `_] +* |OK_ICON| `Student Data from Free Code Camp `_ Energy ------ -* |OK_ICON| `AMPds `_ [`fixme `_] +* |OK_ICON| `AMPds `_ -* |OK_ICON| `BLUEd `_ [`fixme `_] +* |OK_ICON| `BLUEd `_ -* |OK_ICON| `COMBED `_ [`fixme `_] +* |OK_ICON| `COMBED `_ -* |OK_ICON| `DRED `_ [`fixme `_] +* |OK_ICON| `DRED `_ -* |OK_ICON| `ECO `_ [`fixme `_] +* |OK_ICON| `ECO `_ -* |OK_ICON| `EIA `_ [`fixme `_] +* |OK_ICON| `EIA `_ -* |OK_ICON| `HES - Household Electricity Study, UK `_ [`fixme `_] +* |OK_ICON| `HES - Household Electricity Study, UK `_ -* |OK_ICON| `HFED `_ [`fixme `_] +* |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] -* |OK_ICON| `REDD `_ [`fixme `_] +* |OK_ICON| `REDD `_ -* |OK_ICON| `Tracebase `_ [`fixme `_] +* |OK_ICON| `Tracebase `_ -* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`fixme `_] +* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ -* |OK_ICON| `WHITED `_ [`fixme `_] +* |OK_ICON| `WHITED `_ -* |OK_ICON| `iAWE `_ [`fixme `_] +* |OK_ICON| `iAWE `_ Finance ------- -* |FIXME_ICON| `CBOE Futures Exchange `_ +* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] -* |OK_ICON| `Google Finance `_ [`fixme `_] +* |OK_ICON| `Google Finance `_ -* |OK_ICON| `Google Trends `_ [`fixme `_] +* |OK_ICON| `Google Trends `_ -* |OK_ICON| `NASDAQ `_ [`fixme `_] +* |OK_ICON| `NASDAQ `_ -* |OK_ICON| `NYSE Market Data `_ [`fixme `_] +* |OK_ICON| `NYSE Market Data `_ -* |OK_ICON| `OANDA `_ [`fixme `_] +* |OK_ICON| `OANDA `_ -* |OK_ICON| `OSU Financial data `_ [`fixme `_] +* |OK_ICON| `OSU Financial data `_ -* |OK_ICON| `Quandl `_ [`fixme `_] +* |OK_ICON| `Quandl `_ -* |OK_ICON| `St Louis Federal `_ [`fixme `_] +* |OK_ICON| `St Louis Federal `_ -* |OK_ICON| `Yahoo Finance `_ [`fixme `_] +* |OK_ICON| `Yahoo Finance `_ GIS --- -* 
|OK_ICON| `ArcGIS Open Data portal `_ [`fixme `_] +* |OK_ICON| `ArcGIS Open Data portal `_ -* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`fixme `_] +* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ -* |FIXME_ICON| `Factual Global Location Data `_ +* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] -* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`fixme `_] +* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ -* |OK_ICON| `Geo Spatial Data from ASU `_ [`fixme `_] +* |OK_ICON| `Geo Spatial Data from ASU `_ -* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ [`fixme `_] +* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ -* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ [`fixme `_] +* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* |OK_ICON| `GeoNames Worldwide `_ [`fixme `_] +* |OK_ICON| `GeoNames Worldwide `_ -* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ +* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] -* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`fixme `_] +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ -* |OK_ICON| `Landsat 8 on AWS `_ [`fixme `_] +* |OK_ICON| `Landsat 8 on AWS `_ -* |OK_ICON| `List of all countries in all languages `_ [`fixme `_] +* |OK_ICON| `List of all countries in all languages `_ -* |OK_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] +* |OK_ICON| `National Weather Service GIS Data Portal `_ -* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ [`fixme `_] +* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ -* |OK_ICON| `OpenAddresses `_ [`fixme `_] +* |OK_ICON| `OpenAddresses `_ -* |OK_ICON| `OpenStreetMap (OSM) `_ [`fixme `_] +* |OK_ICON| `OpenStreetMap (OSM) `_ -* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ 
[`fixme `_] +* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ -* |OK_ICON| `Reverse Geocoder using OSM data `_ [`fixme `_] +* |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] -* |OK_ICON| `TZ Timezones shapfiles `_ [`fixme `_] +* |OK_ICON| `TZ Timezones shapfiles `_ -* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ [`fixme `_] +* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ -* |OK_ICON| `UN Environmental Data `_ [`fixme `_] +* |OK_ICON| `UN Environmental Data `_ -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ +* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] -* |OK_ICON| `World countries in multiple formats `_ [`fixme `_] +* |OK_ICON| `World countries in multiple formats `_ Government ---------- -* |OK_ICON| `Alberta, Province of Canada `_ [`fixme `_] +* |OK_ICON| `Alberta, Province of Canada `_ -* |OK_ICON| `Antwerp, Belgium `_ [`fixme `_] +* |OK_ICON| `Antwerp, Belgium `_ -* |OK_ICON| `Argentina (non official) `_ [`fixme `_] +* |OK_ICON| `Argentina (non official) `_ -* |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] `_ [`fixme `_] +* |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] 
`_ -* |OK_ICON| `Austin, TX, US `_ [`fixme `_] +* |OK_ICON| `Austin, TX, US `_ -* |OK_ICON| `Australia (abs.gov.au) `_ [`fixme `_] +* |OK_ICON| `Australia (abs.gov.au) `_ -* |OK_ICON| `Australia (data.gov.au) `_ [`fixme `_] +* |OK_ICON| `Australia (data.gov.au) `_ -* |OK_ICON| `Austria (data.gv.at) `_ [`fixme `_] +* |OK_ICON| `Austria (data.gv.at) `_ -* |OK_ICON| `Baton Rouge, LA, US `_ [`fixme `_] +* |OK_ICON| `Baton Rouge, LA, US `_ -* |OK_ICON| `Belgium `_ [`fixme `_] +* |OK_ICON| `Belgium `_ -* |OK_ICON| `Brazil `_ [`fixme `_] +* |OK_ICON| `Brazil `_ -* |OK_ICON| `Buenos Aires, Argentina `_ [`fixme `_] +* |OK_ICON| `Buenos Aires, Argentina `_ -* |FIXME_ICON| `Calgary, AB, Canada `_ +* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] -* |OK_ICON| `Cambridge, MA, US `_ [`fixme `_] +* |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ [`fixme `_] +* |OK_ICON| `Canada `_ -* |OK_ICON| `Chicago `_ [`fixme `_] +* |OK_ICON| `Chicago `_ -* |OK_ICON| `Chile `_ [`fixme `_] +* |OK_ICON| `Chile `_ -* |OK_ICON| `Dallas Open Data `_ [`fixme `_] +* |OK_ICON| `Dallas Open Data `_ -* |OK_ICON| `DataBC - data from the Province of British Columbia `_ [`fixme `_] +* |OK_ICON| `DataBC - data from the Province of British Columbia `_ -* |OK_ICON| `Denver Open Data `_ [`fixme `_] +* |OK_ICON| `Denver Open Data `_ -* |OK_ICON| `Durham, NC Open Data `_ [`fixme `_] +* |OK_ICON| `Durham, NC Open Data `_ -* |OK_ICON| `Edmonton, AB, Canada `_ [`fixme `_] +* |OK_ICON| `Edmonton, AB, Canada `_ -* |OK_ICON| `England LGInform `_ [`fixme `_] +* |OK_ICON| `England LGInform `_ -* |OK_ICON| `EuroStat `_ [`fixme `_] +* |OK_ICON| `EuroStat `_ -* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] `_ [`fixme `_] +* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] 
`_ -* |OK_ICON| `FedStats `_ [`fixme `_] +* |OK_ICON| `FedStats `_ -* |OK_ICON| `Finland `_ [`fixme `_] +* |OK_ICON| `Finland `_ -* |OK_ICON| `France `_ [`fixme `_] +* |OK_ICON| `France `_ -* |OK_ICON| `Fredericton, NB, Canada `_ [`fixme `_] +* |OK_ICON| `Fredericton, NB, Canada `_ -* |OK_ICON| `Gatineau, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Gatineau, QC, Canada `_ -* |OK_ICON| `Germany `_ [`fixme `_] +* |OK_ICON| `Germany `_ -* |OK_ICON| `Ghent, Belgium `_ [`fixme `_] +* |OK_ICON| `Ghent, Belgium `_ -* |OK_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] +* |OK_ICON| `Glasgow, Scotland, UK `_ -* |OK_ICON| `Greece `_ [`fixme `_] +* |OK_ICON| `Greece `_ -* |OK_ICON| `Guardian world governments `_ [`fixme `_] +* |OK_ICON| `Guardian world governments `_ -* |FIXME_ICON| `Halifax, NS, Canada `_ +* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] -* |OK_ICON| `Helsinki Region, Finland `_ [`fixme `_] +* |OK_ICON| `Helsinki Region, Finland `_ -* |OK_ICON| `Hong Kong, China `_ [`fixme `_] +* |OK_ICON| `Hong Kong, China `_ -* |FIXME_ICON| `Houston Open Data `_ +* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] -* |OK_ICON| `Indian Government Data `_ [`fixme `_] +* |OK_ICON| `Indian Government Data `_ -* |OK_ICON| `Indonesian Data Portal `_ [`fixme `_] +* |OK_ICON| `Indonesian Data Portal `_ -* |OK_ICON| `Ireland's Open Data Portal `_ [`fixme `_] +* |OK_ICON| `Ireland's Open Data Portal `_ -* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ [`fixme `_] +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] 
`_ -* |OK_ICON| `Japan `_ [`fixme `_] +* |OK_ICON| `Japan `_ -* |OK_ICON| `Laval, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Laval, QC, Canada `_ -* |OK_ICON| `Lexington, KY `_ [`fixme `_] +* |OK_ICON| `Lexington, KY `_ -* |OK_ICON| `London Datastore, UK `_ [`fixme `_] +* |OK_ICON| `London Datastore, UK `_ -* |OK_ICON| `London, ON, Canada `_ [`fixme `_] +* |OK_ICON| `London, ON, Canada `_ -* |OK_ICON| `Los Angeles Open Data `_ [`fixme `_] +* |OK_ICON| `Los Angeles Open Data `_ -* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ [`fixme `_] +* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ -* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ [`fixme `_] +* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ -* |OK_ICON| `Mexico `_ [`fixme `_] +* |OK_ICON| `Mexico `_ -* |OK_ICON| `Missisauga, ON, Canada `_ [`fixme `_] +* |OK_ICON| `Missisauga, ON, Canada `_ -* |OK_ICON| `Moldova `_ [`fixme `_] +* |OK_ICON| `Moldova `_ -* |OK_ICON| `Moncton, NB, Canada `_ [`fixme `_] +* |OK_ICON| `Moncton, NB, Canada `_ -* |OK_ICON| `Montreal, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Montreal, QC, Canada `_ -* |OK_ICON| `Mountain View, California, US (GIS) `_ [`fixme `_] +* |OK_ICON| `Mountain View, California, US (GIS) `_ -* |FIXME_ICON| `NYC Open Data `_ +* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] -* |OK_ICON| `NYC betanyc `_ [`fixme `_] +* |OK_ICON| `NYC betanyc `_ -* |OK_ICON| `Netherlands `_ [`fixme `_] +* |OK_ICON| `Netherlands `_ -* |OK_ICON| `New Zealand `_ [`fixme `_] +* |OK_ICON| `New Zealand `_ -* |OK_ICON| `OECD `_ [`fixme `_] +* |OK_ICON| `OECD `_ -* |OK_ICON| `Oakland, California, US `_ [`fixme `_] +* |OK_ICON| `Oakland, California, US `_ -* |OK_ICON| `Oklahoma `_ [`fixme `_] +* |OK_ICON| `Oklahoma `_ -* |OK_ICON| `Open Data for Africa `_ [`fixme `_] +* |OK_ICON| `Open Data for Africa `_ -* |OK_ICON| `Open Government Data (OGD) Platform India `_ [`fixme `_] +* |OK_ICON| `Open Government Data (OGD) Platform India `_ -* |OK_ICON| 
`OpenDataSoft's list of 1,600 open data `_ [`fixme `_] +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ -* |OK_ICON| `Oregon `_ [`fixme `_] +* |OK_ICON| `Oregon `_ -* |OK_ICON| `Ottawa, ON, Canada `_ [`fixme `_] +* |OK_ICON| `Ottawa, ON, Canada `_ -* |OK_ICON| `Palo Alto, California, US `_ [`fixme `_] +* |OK_ICON| `Palo Alto, California, US `_ -* |OK_ICON| `Portland, Oregon `_ [`fixme `_] +* |OK_ICON| `Portland, Oregon `_ -* |OK_ICON| `Portugal - Pordata organization `_ [`fixme `_] +* |OK_ICON| `Portugal - Pordata organization `_ -* |OK_ICON| `Puerto Rico Government `_ [`fixme `_] +* |OK_ICON| `Puerto Rico Government `_ -* |OK_ICON| `Quebec City, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Quebec City, QC, Canada `_ -* |OK_ICON| `Quebec Province of Canada `_ [`fixme `_] +* |OK_ICON| `Quebec Province of Canada `_ -* |OK_ICON| `Regina SK, Canada `_ [`fixme `_] +* |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] -* |OK_ICON| `Romania `_ [`fixme `_] +* |OK_ICON| `Romania `_ -* |OK_ICON| `Russia `_ [`fixme `_] +* |OK_ICON| `Russia `_ -* |OK_ICON| `San Francisco Data sets `_ [`fixme `_] +* |OK_ICON| `San Francisco Data sets `_ -* |OK_ICON| `San Jose, California, US `_ [`fixme `_] +* |OK_ICON| `San Jose, California, US `_ -* |OK_ICON| `San Mateo County, California, US `_ [`fixme `_] +* |OK_ICON| `San Mateo County, California, US `_ -* |OK_ICON| `Saskatchewan, Province of Canada `_ [`fixme `_] +* |OK_ICON| `Saskatchewan, Province of Canada `_ -* |OK_ICON| `Seattle `_ [`fixme `_] +* |OK_ICON| `Seattle `_ -* |OK_ICON| `Singapore Government Data `_ [`fixme `_] +* |OK_ICON| `Singapore Government Data `_ -* |OK_ICON| `South Africa Trade Statistics `_ [`fixme `_] +* |OK_ICON| `South Africa Trade Statistics `_ -* |OK_ICON| `South Africa `_ [`fixme `_] +* |OK_ICON| `South Africa `_ -* |OK_ICON| `State of Utah, US `_ [`fixme `_] +* |OK_ICON| `State of Utah, US `_ -* |OK_ICON| `Switzerland `_ [`fixme 
`_] +* |OK_ICON| `Switzerland `_ -* |OK_ICON| `Taiwan g0v `_ [`fixme `_] +* |OK_ICON| `Taiwan g0v `_ -* |OK_ICON| `Taiwan `_ [`fixme `_] +* |OK_ICON| `Taiwan `_ -* |OK_ICON| `Tel-Aviv Open Data `_ [`fixme `_] +* |OK_ICON| `Tel-Aviv Open Data `_ -* |OK_ICON| `Texas Open Data `_ [`fixme `_] +* |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ [`fixme `_] +* |FIXME_ICON| `The World Bank `_ [`fixme `_] -* |FIXME_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ -* |OK_ICON| `U.K. Government Data `_ [`fixme `_] +* |OK_ICON| `U.K. Government Data `_ -* |OK_ICON| `U.S. American Community Survey `_ [`fixme `_] +* |OK_ICON| `U.S. American Community Survey `_ -* |OK_ICON| `U.S. CDC Public Health datasets `_ [`fixme `_] +* |OK_ICON| `U.S. CDC Public Health datasets `_ -* |OK_ICON| `U.S. Census Bureau `_ [`fixme `_] +* |OK_ICON| `U.S. Census Bureau `_ -* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ -* |OK_ICON| `U.S. Federal Government Agencies `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Agencies `_ -* |OK_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Data Catalog `_ -* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ [`fixme `_] +* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ -* |OK_ICON| `U.S. Open Government `_ [`fixme `_] +* |OK_ICON| `U.S. Open Government `_ -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] -* |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ [`fixme `_] +* |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |OK_ICON| `Uganda Bureau of Statistics `_ [`fixme `_] +* |OK_ICON| `Uganda Bureau of Statistics `_ -* |OK_ICON| `United Nations `_ [`fixme `_] +* |OK_ICON| `United Nations `_ -* |OK_ICON| `Uruguay `_ [`fixme `_] +* |OK_ICON| `Uruguay `_ -* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] +* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ -* |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] +* |OK_ICON| `Vancouver, BC Open Data Catalog `_ -* |FIXME_ICON| `Victoria, BC, Canada `_ +* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] -* |OK_ICON| `Vienna, Austria `_ [`fixme `_] +* |OK_ICON| `Vienna, Austria `_ Healthcare ---------- -* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ [`fixme `_] +* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ -* |OK_ICON| `EHDP Large Health Data Sets `_ [`fixme `_] +* |OK_ICON| `EHDP Large Health Data Sets `_ -* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ [`fixme `_] +* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ -* |OK_ICON| `Gapminder World demographic databases `_ [`fixme `_] +* |OK_ICON| `Gapminder World demographic databases `_ -* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] +* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ [`fixme `_] +* |OK_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ -* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ [`fixme `_] +* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ -* |OK_ICON| `Medicare Data File `_ [`fixme `_] +* |OK_ICON| `Medicare Data File `_ -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] -* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`fixme `_] +* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ -* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ [`fixme `_] +* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ [`fixme `_] +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ -* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ [`fixme `_] +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ -* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ [`fixme `_] +* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ -* |OK_ICON| `World Health Organization Global Health Observatory `_ [`fixme `_] +* |OK_ICON| `World Health Organization Global Health Observatory `_ ImageProcessing --------------- -* |OK_ICON| `10k US Adult Faces Database `_ [`fixme `_] +* |OK_ICON| `10k US Adult Faces Database `_ -* |FIXME_ICON| `2GB of Photos of Cats `_ +* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] -* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ [`fixme `_] +* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ -* |OK_ICON| `Affective Image Classification `_ [`fixme `_] +* |OK_ICON| `Affective Image Classification `_ -* |OK_ICON| `Animals with attributes `_ [`fixme `_] +* |OK_ICON| `Animals with attributes `_ -* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ [`fixme `_] +* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ -* |OK_ICON| `Chars74K 
dataset - Character Recognition in Natural Images (both English [...] `_ [`fixme `_] +* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ -* |OK_ICON| `Face Recognition Benchmark `_ [`fixme `_] +* |OK_ICON| `Face Recognition Benchmark `_ -* |OK_ICON| `Flickr: 32 Class Brand Logos `_ [`fixme `_] +* |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`fixme `_] +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] -* |OK_ICON| `Indoor Scene Recognition `_ [`fixme `_] +* |OK_ICON| `Indoor Scene Recognition `_ -* |OK_ICON| `International Affective Picture System, UFL `_ [`fixme `_] +* |OK_ICON| `International Affective Picture System, UFL `_ -* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ [`fixme `_] +* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ -* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ [`fixme `_] +* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ -* |OK_ICON| `SUN database, MIT `_ [`fixme `_] +* |OK_ICON| `SUN database, MIT `_ -* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] -* |OK_ICON| `Stanford Dogs Dataset `_ [`fixme `_] +* |OK_ICON| `Stanford Dogs Dataset `_ -* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ [`fixme `_] +* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ [`fixme `_] +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ -* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ [`fixme `_] +* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ -* |OK_ICON| `Visual genome `_ [`fixme `_] +* |OK_ICON| `Visual genome `_ 
-* |OK_ICON| `YouTube Faces Database `_ [`fixme `_] +* |OK_ICON| `YouTube Faces Database `_ MachineLearning --------------- -* |OK_ICON| `Context-aware data sets from five domains `_ [`fixme `_] +* |OK_ICON| `Context-aware data sets from five domains `_ -* |OK_ICON| `Delve Datasets for classification and regression `_ [`fixme `_] +* |OK_ICON| `Delve Datasets for classification and regression `_ -* |OK_ICON| `Discogs Monthly Data `_ [`fixme `_] +* |OK_ICON| `Discogs Monthly Data `_ -* |OK_ICON| `Free Music Archive `_ [`fixme `_] +* |OK_ICON| `Free Music Archive `_ -* |OK_ICON| `IMDb Database `_ [`fixme `_] +* |OK_ICON| `IMDb Database `_ -* |OK_ICON| `Keel Repository for classification, regression and time series `_ [`fixme `_] +* |OK_ICON| `Keel Repository for classification, regression and time series `_ -* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ [`fixme `_] +* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ -* |OK_ICON| `Lending Club Loan Data `_ [`fixme `_] +* |FIXME_ICON| `Lending Club Loan Data `_ [`fixme `_] -* |OK_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set Repository `_ -* |OK_ICON| `Million Song Dataset `_ [`fixme `_] +* |OK_ICON| `Million Song Dataset `_ -* |OK_ICON| `More Song Datasets `_ [`fixme `_] +* |OK_ICON| `More Song Datasets `_ -* |OK_ICON| `MovieLens Data Sets `_ [`fixme `_] +* |OK_ICON| `MovieLens Data Sets `_ -* |OK_ICON| `New Yorker caption contest ratings `_ [`fixme `_] +* |OK_ICON| `New Yorker caption contest ratings `_ -* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ [`fixme `_] +* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ -* |OK_ICON| `Registered Meteorites on Earth `_ [`fixme `_] +* |OK_ICON| `Registered Meteorites on Earth `_ -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] -* |OK_ICON| `UCI Machine Learning Repository `_ [`fixme `_] +* |OK_ICON| `UCI 
Machine Learning Repository `_ -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ +* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] -* |OK_ICON| `YouTube-BoundingBoxes `_ [`fixme `_] +* |OK_ICON| `YouTube-BoundingBoxes `_ -* |OK_ICON| `Youtube 8m `_ [`fixme `_] +* |OK_ICON| `Youtube 8m `_ -* |OK_ICON| `eBay Online Auctions (2012) `_ [`fixme `_] +* |OK_ICON| `eBay Online Auctions (2012) `_ Museums ------- -* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ [`fixme `_] +* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ -* |OK_ICON| `Cooper-Hewitt's Collection Database `_ [`fixme `_] +* |OK_ICON| `Cooper-Hewitt's Collection Database `_ -* |OK_ICON| `Minneapolis Institute of Arts metadata `_ [`fixme `_] +* |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* |OK_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] +* |OK_ICON| `Natural History Museum (London) Data Portal `_ -* |OK_ICON| `Rijksmuseum Historical Art Collection `_ [`fixme `_] +* |OK_ICON| `Rijksmuseum Historical Art Collection `_ -* |OK_ICON| `Tate Collection metadata `_ [`fixme `_] +* |OK_ICON| `Tate Collection metadata `_ -* |OK_ICON| `The Getty vocabularies `_ [`fixme `_] +* |OK_ICON| `The Getty vocabularies `_ NaturalLanguage --------------- -* |OK_ICON| `Automatic Keyphrase Extraction `_ [`fixme `_] +* |OK_ICON| `Automatic Keyphrase Extraction `_ -* |OK_ICON| `Blogger Corpus `_ [`fixme `_] +* |OK_ICON| `Blogger Corpus `_ -* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ [`fixme `_] +* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ -* |OK_ICON| `ClueWeb09 FACC `_ [`fixme `_] +* |OK_ICON| `ClueWeb09 FACC `_ -* |OK_ICON| `ClueWeb12 FACC `_ [`fixme `_] +* |OK_ICON| `ClueWeb12 FACC `_ -* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ [`fixme `_] +* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ -* |OK_ICON| `Flickr Personal Taxonomies `_ [`fixme `_] +* |OK_ICON| `Flickr Personal 
Taxonomies `_ -* |OK_ICON| `Freebase of people, places, and things `_ [`fixme `_] +* |OK_ICON| `Freebase of people, places, and things `_ -* |OK_ICON| `Google Books Ngrams (2.2TB) `_ [`fixme `_] +* |OK_ICON| `Google Books Ngrams (2.2TB) `_ -* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ [`fixme `_] +* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ [`fixme `_] +* |OK_ICON| `Gutenberg eBooks List `_ -* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ [`fixme `_] +* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ -* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ -* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ [`fixme `_] +* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* |OK_ICON| `Machine Translation of European languages `_ [`fixme `_] +* |OK_ICON| `Machine Translation of European languages `_ -* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] -* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`fixme `_] +* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ -* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ [`fixme `_] +* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ -* |OK_ICON| `Open Multilingual Wordnet `_ [`fixme `_] +* |OK_ICON| `Open Multilingual Wordnet `_ -* |OK_ICON| `POS/NER/Chunk annotated data `_ [`fixme `_] +* |OK_ICON| `POS/NER/Chunk annotated data `_ -* |OK_ICON| `Personae Corpus `_ [`fixme `_] +* |OK_ICON| 
`Personae Corpus `_ -* |OK_ICON| `SMS Spam Collection in English `_ [`fixme `_] +* |OK_ICON| `SMS Spam Collection in English `_ -* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ [`fixme `_] +* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ -* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ [`fixme `_] +* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ -* |OK_ICON| `USENET postings corpus of 2005~2011 `_ [`fixme `_] +* |OK_ICON| `USENET postings corpus of 2005~2011 `_ -* |OK_ICON| `Universal Dependencies `_ [`fixme `_] +* |OK_ICON| `Universal Dependencies `_ -* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ [`fixme `_] +* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ -* |OK_ICON| `Wikidata - Wikipedia databases `_ [`fixme `_] +* |OK_ICON| `Wikidata - Wikipedia databases `_ -* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] +* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ -* |FIXME_ICON| `WordNet databases and tools `_ +* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] Neuroscience ------------ -* |OK_ICON| `Allen Institute Datasets `_ [`fixme `_] +* |OK_ICON| `Allen Institute Datasets `_ -* |OK_ICON| `Brain Catalogue `_ [`fixme `_] +* |OK_ICON| `Brain Catalogue `_ -* |OK_ICON| `Brainomics `_ [`fixme `_] +* |OK_ICON| `Brainomics `_ -* |FIXME_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* |OK_ICON| `FCP-INDI `_ [`fixme `_] +* |OK_ICON| `FCP-INDI `_ -* |OK_ICON| `Human Connectome Project `_ [`fixme `_] +* |OK_ICON| `Human Connectome Project `_ -* |OK_ICON| `NDAR `_ [`fixme `_] +* |OK_ICON| `NDAR `_ -* |OK_ICON| `NIMH Data Archive `_ [`fixme `_] +* |OK_ICON| `NIMH Data Archive `_ -* 
|OK_ICON| `NeuroData `_ [`fixme `_] +* |OK_ICON| `NeuroData `_ -* |OK_ICON| `Neuroelectro `_ [`fixme `_] +* |OK_ICON| `Neuroelectro `_ -* |OK_ICON| `OASIS `_ [`fixme `_] +* |OK_ICON| `OASIS `_ -* |OK_ICON| `OpenfMRI `_ [`fixme `_] +* |OK_ICON| `OpenfMRI `_ -* |OK_ICON| `Study Forrest `_ [`fixme `_] +* |OK_ICON| `Study Forrest `_ Physics ------- -* |OK_ICON| `CERN Open Data Portal `_ [`fixme `_] +* |OK_ICON| `CERN Open Data Portal `_ -* |OK_ICON| `Crystallography Open Database `_ [`fixme `_] +* |OK_ICON| `Crystallography Open Database `_ -* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ [`fixme `_] +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ -* |OK_ICON| `NASA Exoplanet Archive `_ [`fixme `_] +* |OK_ICON| `NASA Exoplanet Archive `_ -* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ [`fixme `_] +* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ -* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ [`fixme `_] +* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ Psychology+Cognition -------------------- -* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ [`fixme `_] PublicDomains ------------- -* |OK_ICON| `Amazon `_ [`fixme `_] +* |OK_ICON| `Amazon `_ -* |OK_ICON| `Archive.org Datasets `_ [`fixme `_] +* |OK_ICON| `Archive.org Datasets `_ -* |OK_ICON| `Archive-it from Internet Archive `_ [`fixme `_] +* |OK_ICON| `Archive-it from Internet Archive `_ -* |OK_ICON| `CMU JASA data archive `_ [`fixme `_] +* |OK_ICON| `CMU JASA data archive `_ -* |OK_ICON| `CMU StatLab collections `_ [`fixme `_] +* |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ -* |OK_ICON| `Enigma Public `_ [`fixme `_] +* |OK_ICON| `Enigma Public `_ -* |OK_ICON| `Google `_ [`fixme `_] +* |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ 
+* |FIXME_ICON| `Infochimps `_ [`fixme `_] -* |OK_ICON| `KDNuggets Data Collections `_ [`fixme `_] +* |OK_ICON| `KDNuggets Data Collections `_ -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] -* |OK_ICON| `Microsoft Data Science for Research `_ [`fixme `_] +* |OK_ICON| `Microsoft Data Science for Research `_ -* |FIXME_ICON| `Numbray `_ +* |FIXME_ICON| `Numbray `_ [`fixme `_] -* |OK_ICON| `Open Library Data Dumps `_ [`fixme `_] +* |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ -* |OK_ICON| `RevolutionAnalytics Collection `_ [`fixme `_] +* |OK_ICON| `RevolutionAnalytics Collection `_ -* |OK_ICON| `Sample R data sets `_ [`fixme `_] +* |OK_ICON| `Sample R data sets `_ -* |OK_ICON| `StatSci.org `_ [`fixme `_] +* |OK_ICON| `StatSci.org `_ -* |FIXME_ICON| `Stats4Stem R data sets `_ +* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] -* |OK_ICON| `The Washington Post List `_ [`fixme `_] +* |OK_ICON| `The Washington Post List `_ -* |OK_ICON| `UCLA SOCR data collection `_ [`fixme `_] +* |OK_ICON| `UCLA SOCR data collection `_ -* |OK_ICON| `UFO Reports `_ [`fixme `_] +* |OK_ICON| `UFO Reports `_ -* |OK_ICON| `Wikileaks 911 pager intercepts `_ [`fixme `_] +* |OK_ICON| `Wikileaks 911 pager intercepts `_ -* |FIXME_ICON| `Yahoo Webscope `_ +* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] SearchEngines ------------- -* |OK_ICON| `Academic Torrents of data sharing from UMB `_ [`fixme `_] +* |OK_ICON| `Academic Torrents of data sharing from UMB `_ -* |OK_ICON| `DataMarket (Qlik) `_ [`fixme `_] +* |OK_ICON| `DataMarket (Qlik) `_ -* |OK_ICON| `Datahub.io `_ [`fixme `_] +* |OK_ICON| `Datahub.io `_ -* |OK_ICON| `Harvard Dataverse Network of scientific data `_ [`fixme `_] +* |OK_ICON| `Harvard Dataverse Network of scientific data `_ -* |OK_ICON| `ICPSR (UMICH) `_ [`fixme `_] +* |OK_ICON| `ICPSR (UMICH) `_ -* |OK_ICON| `Institute of 
Education Sciences `_ [`fixme `_] +* |OK_ICON| `Institute of Education Sciences `_ -* |FIXME_ICON| `National Technical Reports Library `_ +* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] -* |OK_ICON| `Open Data Certificates (beta) `_ [`fixme `_] +* |OK_ICON| `Open Data Certificates (beta) `_ -* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ [`fixme `_] +* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ -* |OK_ICON| `Statista.com - statistics and Studies `_ [`fixme `_] +* |OK_ICON| `Statista.com - statistics and Studies `_ -* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`fixme `_] +* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ SocialNetworks -------------- -* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ [`fixme `_] +* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ -* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ [`fixme `_] +* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ -* |OK_ICON| `CMU Enron Email of 150 users `_ [`fixme `_] +* |OK_ICON| `CMU Enron Email of 150 users `_ -* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`fixme `_] +* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ [`fixme `_] +* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ -* |OK_ICON| `Facebook Data Scrape (2005) `_ [`fixme `_] +* |OK_ICON| `Facebook Data Scrape (2005) `_ -* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ [`fixme `_] +* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ -* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`fixme `_] +* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ -* |OK_ICON| `GitHub Collaboration Archive `_ [`fixme `_] +* |OK_ICON| `GitHub Collaboration Archive `_ -* |OK_ICON| `Google Scholar citation relations `_ [`fixme `_] 
+* |OK_ICON| `Google Scholar citation relations `_ -* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`fixme `_] +* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ -* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`fixme `_] +* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] -* |OK_ICON| `Network Twitter Data `_ [`fixme `_] +* |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ -* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ [`fixme `_] +* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ -* |OK_ICON| `Social Twitter Data `_ [`fixme `_] +* |OK_ICON| `Social Twitter Data `_ -* |OK_ICON| `SourceForge.net Research Data `_ [`fixme `_] +* |OK_ICON| `SourceForge.net Research Data `_ -* |OK_ICON| `Twitter Data for Online Reputation Management `_ [`fixme `_] +* |OK_ICON| `Twitter Data for Online Reputation Management `_ -* |OK_ICON| `Twitter Data for Sentiment Analysis `_ [`fixme `_] +* |OK_ICON| `Twitter Data for Sentiment Analysis `_ -* |OK_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] +* |OK_ICON| `Twitter Graph of entire Twitter site `_ -* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] -* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`fixme `_] +* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ [`fixme `_] -* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] +* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ SocialSciences -------------- -* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ [`fixme `_] +* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ -* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] +* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ -* |OK_ICON| `Correlates of War Project `_ [`fixme `_] +* |OK_ICON| `Correlates of War Project `_ -* |OK_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] +* |OK_ICON| `Cryptome Conspiracy Theory Items `_ -* |FIXME_ICON| `Datacards `_ +* |FIXME_ICON| `Datacards `_ [`fixme `_] -* |OK_ICON| `European Social Survey `_ [`fixme `_] +* |OK_ICON| `European Social Survey `_ -* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`fixme `_] +* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ -* |FIXME_ICON| `Fragile States Index `_ +* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] -* |OK_ICON| `GDELT Global Events Database `_ [`fixme `_] +* |OK_ICON| `GDELT Global Events Database `_ -* |OK_ICON| `General Social Survey (GSS) since 1972 `_ [`fixme `_] +* |OK_ICON| `General Social Survey (GSS) since 1972 `_ -* |OK_ICON| `German Social Survey `_ [`fixme `_] +* |OK_ICON| `German Social Survey `_ -* |OK_ICON| `Global Religious Futures Project `_ [`fixme `_] +* |OK_ICON| `Global Religious Futures Project `_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] -* |OK_ICON| `INFORM Index for Risk Management `_ [`fixme `_] +* |OK_ICON| `INFORM Index for Risk Management `_ -* |OK_ICON| `Institute for Demographic Studies `_ [`fixme `_] +* |OK_ICON| 
`Institute for Demographic Studies `_ -* |OK_ICON| `International Networks Archive `_ [`fixme `_] +* |OK_ICON| `International Networks Archive `_ -* |OK_ICON| `International Social Survey Program ISSP `_ [`fixme `_] +* |OK_ICON| `International Social Survey Program ISSP `_ -* |OK_ICON| `International Studies Compendium Project `_ [`fixme `_] +* |OK_ICON| `International Studies Compendium Project `_ -* |OK_ICON| `James McGuire Cross National Data `_ [`fixme `_] +* |OK_ICON| `James McGuire Cross National Data `_ -* |OK_ICON| `MIT Reality Mining Dataset `_ [`fixme `_] +* |OK_ICON| `MIT Reality Mining Dataset `_ -* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ [`fixme `_] +* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ -* |OK_ICON| `Minnesota Population Center `_ [`fixme `_] +* |OK_ICON| `Minnesota Population Center `_ -* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] +* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ -* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ [`fixme `_] +* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, [...] `_ [`fixme `_] +* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, [...] 
`_ -* |OK_ICON| `Paul Hensel General International Data Page `_ [`fixme `_] +* |OK_ICON| `Paul Hensel General International Data Page `_ -* |FIXME_ICON| `PewResearch Internet Survey Project `_ +* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] -* |OK_ICON| `PewResearch Society Data Collection `_ [`fixme `_] +* |OK_ICON| `PewResearch Society Data Collection `_ -* |OK_ICON| `Political Polarity Data `_ [`fixme `_] +* |OK_ICON| `Political Polarity Data `_ -* |OK_ICON| `StackExchange Data Explorer `_ [`fixme `_] +* |OK_ICON| `StackExchange Data Explorer `_ -* |OK_ICON| `Terrorism Research and Analysis Consortium `_ [`fixme `_] +* |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ -* |OK_ICON| `Titanic Survival Data Set `_ [`fixme `_] +* |OK_ICON| `Titanic Survival Data Set `_ -* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] +* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] -* |OK_ICON| `UN Civil Society Database `_ [`fixme `_] +* |OK_ICON| `UN Civil Society Database `_ -* |OK_ICON| `UPJOHN for Labor Employment Research `_ [`fixme `_] +* |OK_ICON| `UPJOHN for Labor Employment Research `_ -* |OK_ICON| `Universities Worldwide `_ [`fixme `_] +* |OK_ICON| `Universities Worldwide `_ -* |OK_ICON| `Uppsala Conflict Data Program `_ [`fixme `_] +* |OK_ICON| `Uppsala Conflict Data Program `_ -* |OK_ICON| `World Bank Open Data `_ [`fixme `_] +* |OK_ICON| `World Bank Open Data `_ -* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ [`fixme `_] +* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ Software -------- -* |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ [`fixme `_] +* |OK_ICON| `FLOSSmole data about 
free, libre, and open source software development `_

-* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`fixme `_]
+* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_


 Sports
 ------

-* |OK_ICON| `Betfair Historical Exchange Data `_ [`fixme `_]
+* |OK_ICON| `Betfair Historical Exchange Data `_

-* |OK_ICON| `Cricsheet Matches (cricket) `_ [`fixme `_]
+* |OK_ICON| `Cricsheet Matches (cricket) `_

-* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_]
+* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_

-* |OK_ICON| `Football/Soccer resources (data and APIs) `_ [`fixme `_]
+* |OK_ICON| `Football/Soccer resources (data and APIs) `_

-* |OK_ICON| `Lahman's Baseball Database `_ [`fixme `_]
+* |OK_ICON| `Lahman's Baseball Database `_

-* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ [`fixme `_]
+* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_

-* |OK_ICON| `Retrosheet Baseball Statistics `_ [`fixme `_]
+* |OK_ICON| `Retrosheet Baseball Statistics `_

-* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ [`fixme `_]
+* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_

-* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ [`fixme `_]
+* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_


 TimeSeries
 ----------

-* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ [`fixme `_]
+* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_

-* |OK_ICON| `Hard Drive Failure Rates `_ [`fixme `_]
+* |OK_ICON| `Hard Drive Failure Rates `_

-* |OK_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_]
+* |OK_ICON| `Heart Rate Time Series from MIT `_

-* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_]
+* |OK_ICON| `Time Series Data Library (TSDL) from MU `_

-* |OK_ICON| `UC Riverside Time Series Dataset `_ [`fixme `_]
+* |OK_ICON| `UC Riverside Time Series Dataset `_


 Transportation
 --------------

-* |OK_ICON| `Airlines OD Data 1987-2008 `_ [`fixme `_]
+* |OK_ICON| `Airlines OD Data 1987-2008 `_

-* |OK_ICON| `Bay Area Bike Share Data `_ [`fixme `_]
+* |OK_ICON| `Bay Area Bike Share Data `_

-* |OK_ICON| `Bike Share Systems (BSS) collection `_ [`fixme `_]
+* |OK_ICON| `Bike Share Systems (BSS) collection `_

-* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`fixme `_]
+* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_

-* |OK_ICON| `German train system by Deutsche Bahn `_ [`fixme `_]
+* |OK_ICON| `German train system by Deutsche Bahn `_

-* |OK_ICON| `Hubway Million Rides in MA `_ [`fixme `_]
+* |OK_ICON| `Hubway Million Rides in MA `_

-* |OK_ICON| `Montreal BIXI Bike Share `_ [`fixme `_]
+* |OK_ICON| `Montreal BIXI Bike Share `_

-* |OK_ICON| `NYC Taxi Trip Data 2009- `_ [`fixme `_]
+* |OK_ICON| `NYC Taxi Trip Data 2009- `_

-* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`fixme `_]
+* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_

-* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ [`fixme `_]
+* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_

-* |OK_ICON| `Open Traffic collection `_ [`fixme `_]
+* |OK_ICON| `Open Traffic collection `_

-* |OK_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_]
+* |OK_ICON| `OpenFlights - airport, airline and route data `_

-* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_
+* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_]

-* |OK_ICON| `Plane Crash Database, since 1920 `_ [`fixme `_]
+* |OK_ICON| `Plane Crash Database, since 1920 `_

-* |OK_ICON| `RITA Airline On-Time Performance data `_ [`fixme `_]
+* |OK_ICON| `RITA Airline On-Time Performance data `_

-* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_]
+* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_

-* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_
+* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ [`fixme `_]

-* |OK_ICON| `Transport for London (TFL) `_ [`fixme `_]
+* |OK_ICON| `Transport for London (TFL) `_

-* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ [`fixme `_]
+* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_

-* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ [`fixme `_]
+* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_

-* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ [`fixme `_]
+* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_

-* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ [`fixme `_]
+* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_


 Complementary Collections

From eeb636d2a9a8403acbd16e7020ab7d269e507883 Mon Sep 17 00:00:00 2001
From: Travis CI
Date: Thu, 12 Apr 2018 17:54:06 +0000
Subject: [PATCH 96/99] Update README from APD2: 6c901fba4a66a3d47d8647c016567ca09a6a5ab9

---
 README.rst | 16 +++++++++-------
 1 file changed, 9 insertions(+), 7 deletions(-)

diff --git a/README.rst b/README.rst
index 500aff0..37e44fa 100644
--- a/README.rst
+++ b/README.rst
@@ -201,7 +201,7 @@ ComplexNetworks

 * |OK_ICON| `UCI Network Data Repository `_

-* |OK_ICON| `UFL sparse matrix collection `_
+* |FIXME_ICON| `UFL sparse matrix collection `_ [`fixme `_]

 * |OK_ICON| `WSU Graph Database `_

@@ -351,7 +351,7 @@ Energy

 * |OK_ICON| `HFED `_

-* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_]
+* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_

 * |OK_ICON| `REDD `_

@@ -538,6 +538,8 @@ Government

 * |OK_ICON| `Los Angeles Open Data `_

+* |OK_ICON| `Luxembourg - Luxembourgish Open Data Portal `_
+
 * |OK_ICON| `MassGIS, Massachusetts, U.S. `_

 * |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_
@@ -626,13 +628,13 @@ Government

 * |OK_ICON| `Texas Open Data `_

-* |FIXME_ICON| `The World Bank `_ [`fixme `_]
+* |OK_ICON| `The World Bank `_

 * |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_]

 * |OK_ICON| `Tunisia `_

-* |OK_ICON| `U.K. Government Data `_
+* |FIXME_ICON| `U.K. Government Data `_ [`fixme `_]

 * |OK_ICON| `U.S. American Community Survey `_

@@ -748,7 +750,7 @@ ImageProcessing

 * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_

-* |OK_ICON| `Visual genome `_
+* |FIXME_ICON| `Visual genome `_ [`fixme `_]

 * |OK_ICON| `YouTube Faces Database `_

@@ -769,7 +771,7 @@ MachineLearning

 * |OK_ICON| `Labeled Faces in the Wild (LFW) `_

-* |FIXME_ICON| `Lending Club Loan Data `_ [`fixme `_]
+* |OK_ICON| `Lending Club Loan Data `_

 * |OK_ICON| `Machine Learning Data Set Repository `_

@@ -963,7 +965,7 @@ PublicDomains

 * |OK_ICON| `Open Library Data Dumps `_

-* |OK_ICON| `Reddit Datasets `_
+* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_]

 * |OK_ICON| `RevolutionAnalytics Collection `_


From d8bc59e390f34caab9ef66681119c07b28fd6836 Mon Sep 17 00:00:00 2001
From: Travis CI
Date: Thu, 19 Apr 2018 16:24:59 +0000
Subject: [PATCH 97/99] Update README from APD2: 08859db8a925116d622bba0f5cc221c09d2f5aac

---
 README.rst | 20 +++++++++++++-------
 1 file changed, 13 insertions(+), 7 deletions(-)

diff --git a/README.rst b/README.rst
index 37e44fa..75420dd 100644
--- a/README.rst
+++ b/README.rst
@@ -201,7 +201,7 @@ ComplexNetworks

 * |OK_ICON| `UCI Network Data Repository `_

-* |FIXME_ICON| `UFL sparse matrix collection `_ [`fixme `_]
+* |OK_ICON| `UFL sparse matrix collection `_

 * |OK_ICON| `WSU Graph Database `_

@@ -347,6 +347,8 @@ Energy

 * |OK_ICON| `EIA `_

+* |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] `_
+
 * |OK_ICON| `HES - Household Electricity Study, UK `_

 * |OK_ICON| `HFED `_
@@ -582,6 +584,8 @@ Government

 * |OK_ICON| `Palo Alto, California, US `_

+* |OK_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open data in the [...] `_
+
 * |OK_ICON| `Portland, Oregon `_

 * |OK_ICON| `Portugal - Pordata organization `_
@@ -590,16 +594,18 @@ Government

 * |OK_ICON| `Quebec City, QC, Canada `_

-* |OK_ICON| `Quebec Province of Canada `_
+* |FIXME_ICON| `Quebec Province of Canada `_ [`fixme `_]

 * |OK_ICON| `Regina SK, Canada `_

-* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_]
+* |OK_ICON| `Rio de Janeiro, Brazil `_

 * |OK_ICON| `Romania `_

 * |OK_ICON| `Russia `_

+* |OK_ICON| `San Antonio, TX - Community Information Now - CI:Now is a nonprofit [...] `_
+
 * |OK_ICON| `San Francisco Data sets `_

 * |OK_ICON| `San Jose, California, US `_
@@ -634,7 +640,7 @@ Government

 * |OK_ICON| `Tunisia `_

-* |FIXME_ICON| `U.K. Government Data `_ [`fixme `_]
+* |OK_ICON| `U.K. Government Data `_

 * |OK_ICON| `U.S. American Community Survey `_

@@ -750,7 +756,7 @@ ImageProcessing

 * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_

-* |FIXME_ICON| `Visual genome `_ [`fixme `_]
+* |OK_ICON| `Visual genome `_

 * |OK_ICON| `YouTube Faces Database `_

@@ -965,7 +971,7 @@ PublicDomains

 * |OK_ICON| `Open Library Data Dumps `_

-* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_]
+* |OK_ICON| `Reddit Datasets `_

 * |OK_ICON| `RevolutionAnalytics Collection `_

@@ -1168,7 +1174,7 @@ Sports

 * |OK_ICON| `Football/Soccer resources (data and APIs) `_

-* |OK_ICON| `Lahman's Baseball Database `_
+* |FIXME_ICON| `Lahman's Baseball Database `_ [`fixme `_]

 * |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_


From dc7a35d34dcafdab6513891c497ba3297351dbc6 Mon Sep 17 00:00:00 2001
From: Travis CI
Date: Thu, 19 Apr 2018 16:25:08 +0000
Subject: [PATCH 98/99] Update README from APD2: dcaa222d448688c69f44c4a58df2c6acf96a245d

---
 README.rst | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/README.rst b/README.rst
index 75420dd..60100be 100644
--- a/README.rst
+++ b/README.rst
@@ -347,8 +347,6 @@ Energy

 * |OK_ICON| `EIA `_

-* |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] `_
-
 * |OK_ICON| `HES - Household Electricity Study, UK `_

 * |OK_ICON| `HFED `_
@@ -1074,7 +1072,7 @@ SocialSciences

 * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_

-* |OK_ICON| `Canadian Legal Information Institute `_
+* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_]

 * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_


From 7dbbb7477b0115f3101a097b459dda092c160200 Mon Sep 17 00:00:00 2001
From: Travis CI
Date: Thu, 19 Apr 2018 16:28:35 +0000
Subject: [PATCH 99/99] Update README from APD2: 554e46ebfed0eb6915b2822e3a2aa58a6b338f7a

---
 README.rst | 6 +++++-
 1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/README.rst b/README.rst
index 60100be..fa55f14 100644
--- a/README.rst
+++ b/README.rst
@@ -347,6 +347,8 @@ Energy

 * |OK_ICON| `EIA `_

+* |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] `_
+
 * |OK_ICON| `HES - Household Electricity Study, UK `_

 * |OK_ICON| `HFED `_
@@ -632,7 +634,7 @@ Government

 * |OK_ICON| `Texas Open Data `_

-* |OK_ICON| `The World Bank `_
+* |FIXME_ICON| `The World Bank `_ [`fixme `_]

 * |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_]

@@ -1096,6 +1098,8 @@ SocialSciences

 * |OK_ICON| `Global Religious Futures Project `_

+* |OK_ICON| `Gun Violence Data - A comprehensive, accessible database that contains [...] `_
+
 * |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_]

 * |OK_ICON| `INFORM Index for Risk Management `_