429 Commits

Author SHA1 Message Date
Xiaming Chen
dd3a17cb02 Merge pull request #260 from MaxwellRebo/master
Add keyphrase extraction datasets
2016-12-18 23:25:20 +08:00
Xiaming Chen
0d0117a88a Update new image sets and three NLP sets
Images: Chars74K dataset and MNIST, NLP: Google MC-AFP, MS MARCO, and MDST
2016-12-18 16:08:36 +08:00
Maxwell Rebo
4dc886ac00 Update README.rst 2016-12-11 15:17:54 +04:00
Arturo Filastò
6b7120dad2 Add OONI data
Add a link to data provided by the Open Observatory of Network Interference on internet censorship
2016-12-08 18:44:01 +00:00
Diomidis Spinellis
80ecc66409 Add Microsoft's Data Science for Research 2016-11-27 10:47:59 +02:00
Samuel Taylor
57d9c7bff7 Remove dead link to GetGlue 2016-11-12 09:41:05 -06:00
Xiaming Chen
0954d9aa6b Add Kaggle link to Titanic data 2016-11-11 09:48:18 +08:00
Sammy X Chen
2530bbf133 Update README.rst 2016-08-15 14:04:32 +08:00
Sammy X Chen
9d1f4fb10d Add AQUASTAT and category Earth Science
Earth Science maintains data from geoscience and earth-related fields, such as environment and water.
2016-08-15 13:59:28 +08:00
Sammy X Chen
980b0564eb Merge pull request #226 from JackKelly/master
Update README.rst
2016-08-15 11:22:03 +08:00
Xiaming Chen
e2e48c39a0 #230 2016-08-15 11:18:24 +08:00
Xiaming Chen
25abbff1b6 #231 merge to add category Neuroscience 2016-08-15 11:10:46 +08:00
Sammy X Chen
059a2b974d Merge pull request #233 from arademaker/master
wordnet and the corpora from UD project
2016-08-15 11:02:29 +08:00
Sammy X Chen
90825c7654 Merge pull request #235 from handmadeby/patch-1
Updated TFL to current API link.
2016-08-15 10:58:47 +08:00
Sammy X Chen
a54b091c21 Merge pull request #236 from hckiang/master
Added Uppsala Conflict Data Program
2016-08-15 10:56:35 +08:00
Sammy X Chen
8d59ba8dfe Merge pull request #237 from stsievert/master
New Yorker caption contest ratings
2016-08-15 10:53:50 +08:00
Sammy X Chen
7270eb0b1f Merge pull request #239 from jeremie-seguin/master
Fix broken link: Netflix prize
2016-08-15 10:49:34 +08:00
Sammy X Chen
86fe0cf6dc add AWC 2016-08-11 10:51:08 +08:00
Sammy X Chen
71d9c2466d add International Economics Database 2016-08-11 10:45:55 +08:00
jeremie
9bb6ab1e89 Fix broken link: Netflix prize 2016-08-10 11:04:50 +02:00
Scott Sievert
2bf5f661f4 adds caption contest dataset 2016-07-22 10:52:48 -05:00
Haochi Kiang
21ffee83e3 Added Uppsala Conflict Data Program
"The Uppsala Conflict Data Program (UCDP) offers a number of datasets on organised violence and peacemaking, all of which can be downloaded for free through the links below."
2016-07-20 10:39:51 +08:00
handmadeby
af605c3869 Updated TFL to current API link.
The Transport for London API link was pointing to a legacy page - I updated to the current valid page.
2016-07-07 14:33:06 +01:00
Alexandre Rademaker
a3bde36abb wordnet and the corpora from UD project 2016-07-05 05:34:44 -03:00
John Pellman
7e00e1a52b Neuroscience data added; new section for neuroscience 2016-07-04 11:05:14 -04:00
John Pellman
2f40e980d2 Added Brain Catalogue. 2016-06-23 05:24:21 -04:00
Jack Kelly
4400bf5a80 Update README.rst
Adding more Energy datasets.  And fixing capitalisation for UK-DALE and PLAID
2016-06-08 13:19:18 +01:00
Pierre Fenoll
b59f3bbb65 Add NYSE 2016-04-26 20:54:35 +02:00
Xiaming
8a09814e77 Add EMPIAR to bio. cat #215 2016-04-15 14:02:08 +08:00
David Dao
0f85053046 Adding Broad Bioimage Benchmark Collection (BBBC)
The Broad Bioimage Benchmark Collection (BBBC) is a large curated collection of published data sets in bio imaging. It includes all the images, metadata and ground truths. The BBBC resource is described in the following publication: Ljosa V, Sokolnicki KL, Carpenter AE (2012). Annotated high-throughput microscopy image sets for validation. Nature Methods 9(7):637 / doi. PMID: 22743765 PMCID: PMC3627348. Available at http://dx.doi.org/10.1038/nmeth.2083
2016-03-18 09:36:16 -04:00
Xiaming Chen
a355d0ef93 Clean TOC 2016-02-26 11:14:07 +08:00
Xiaming Chen
5c55314427 Add OpenDataSoft's portal list #208;
Move collected government to separated file to make the list short and clean.
2016-02-26 11:06:00 +08:00
Xiaming
ddc77bdf69 Add AMiner Citation Network Dataset 2016-02-25 19:28:36 +08:00
Xiaming
954600c51b Merge pull request #207 from alexurquhart/patch-1
Added HIFLD GIS data
2016-02-25 19:25:57 +08:00
Alex Urquhart
08e3bda416 Added HIFLD GIS data
Homeland Infrastructure Foundation-Level Data - https://hifld-dhs-gii.opendata.arcgis.com/
2016-02-25 05:48:48 -05:00
Ron
abd28a9836 added network repository to complex networks 2016-02-24 15:21:28 -08:00
lukeleslie
dea18ce158 Add Road Networks source to Complex Networks. 2016-02-19 17:32:46 -06:00
Xiaming
be1883f181 Merge pull request #202 from megansquire/patch-2
Added FLOSSmole
2016-02-16 01:30:02 +08:00
Megan Squire
feb840727c Update README.rst
Added FLOSSmole 60,000 data sets about free, libre, and open source software development practices with corrected link
2016-02-14 12:58:38 -05:00
Prayag Verma
b259eb2a3f Fix typos
`Interations` → `Interactions`
`Longitudnal` → `Longitudinal`
2016-02-14 23:07:37 +05:30
anatoly techtonik
c9a3a0affc Add Crystallography Open Database 2016-02-14 07:12:18 +03:00
Xiaming Chen
38ecc63b95 Change GeoSpace/GIS to GIS/Environment;
Add IMOS data;
2016-02-14 01:25:23 +08:00
Xiaming Chen
fb909aa46f Move ArchiveIt! to PublicDomains; 2016-02-14 01:18:12 +08:00
Xiaming Chen
9dd7a97da3 Merge #189 2016-02-14 01:09:49 +08:00
Xiaming
a9bdb878e9 Merge pull request #198 from ccjeng/master
Datasets from Taiwan added
2016-02-14 01:05:47 +08:00
Xiaming
7b44371da8 Merge pull request #197 from suvjunmd/moldova
Added Moldova government data site
2016-02-14 01:05:33 +08:00
Xiaming
fa521f12e5 Merge pull request #196 from rmporsch/master
Please add data from the Psychiatric Genomics Consortium
2016-02-14 01:03:09 +08:00
Xiaming
6739631cec Merge pull request #195 from panisson/master
Add High-Resolution Contact Networks from Wearable Sensors
2016-02-14 01:02:32 +08:00
Xiaming
beb2fb568a Merge pull request #194 from shaih82/master
Update README.rst
2016-02-14 01:00:45 +08:00
Xiaming
7250a28974 Merge pull request #193 from mrvaldes/patch-1
added Chile Open Data to README.rst
2016-02-14 00:58:54 +08:00
Xiaming
78459c25b8 Merge pull request #191 from duyetdev/master
Add Bruteforce Database
2016-02-14 00:58:20 +08:00
Xiaming
8815526dd9 Merge pull request #190 from pdeardorff-r7/master
Add Rapid7 Sonar internet scans
2016-02-14 00:56:43 +08:00
Xiaming
757d2c3497 Merge pull request #188 from damiano/patch-1
Adding 'Twitter Data for Online Reputation Management'
2016-02-14 00:54:26 +08:00
andycheng
9a18e153b2 Datasets from Taiwan added 2016-02-13 18:18:13 +08:00
Dmitri Suvorov
2467b46057 Added Moldova government data site 2016-02-13 00:25:00 +02:00
Robert Porsch
28765b8cbc Added data available from the Psychiatric Genomics Consortium 2016-02-11 16:34:14 +08:00
André Panisson
71d2854ec5 Add High-Resolution Contact Networks from Wearable Sensors 2016-02-10 16:45:43 +01:00
shai harel
18f0b961bf Update README.rst
added Adience, ASLAN and violent flow datasets
2016-02-10 17:39:44 +02:00
M. Valdes
734dc4a407 add Chile Open Data to README.rst 2016-02-10 03:09:40 -03:00
Van-Duyet Le
a8d357192b Add Bruteforce Database 2016-02-10 12:09:46 +07:00
pdeardorff-r7
39abe36670 Add Rapid7 Sonar internet scans 2016-02-09 21:04:27 -08:00
Damiano Spina
299dd2c952 Adding 'Twitter Data for Online Reputation Management'
Added the RepLab 2013 dataset into the 'Social Networks' category
2016-02-09 23:33:36 +11:00
HashirZahir
46e601cfa3 Added Basketball Player Database and Statistics 2016-02-09 12:19:23 +08:00
Xiaming
f09a3d9b83 Merge pull request #185 from kenguish/master
Add Hong Kong (China) government data site
2016-02-09 11:04:15 +08:00
Xiaming
18ad7c6a2b Merge pull request #184 from dspinellis/master
Add Greece's government data site
2016-02-09 11:03:40 +08:00
kenguish
0a0bf5b1e0 Add Hong Kong (China) government data site 2016-02-08 03:47:32 +08:00
Diomidis Spinellis
31b6c3c087 Add Greece's government data site 2016-02-07 12:36:29 +02:00
Brant Strand
a00a61fe4e Adding UniProt proteins 2016-02-05 14:27:41 -08:00
Brant Strand
74fb770e3a Adding NCBI protein and taxonomy databases 2016-02-05 14:25:29 -08:00
Xiaming Chen
a467d56ac5 Clean format and thanks for every contribution in last days 2016-02-04 22:20:49 +08:00
Xiaming Chen
5323085486 Merge #167 2016-02-04 22:14:31 +08:00
Xiaming Chen
d5030b0f5b Merge #175 2016-02-04 22:12:01 +08:00
Xiaming Chen
845e78f006 Merge #178 2016-02-04 22:10:29 +08:00
Xiaming Chen
de00186b96 Merge #179 2016-02-04 22:09:31 +08:00
Xiaming Chen
c0fbb8cc0e Merge #180 2016-02-04 22:06:44 +08:00
Xiaming
b97f7cceb6 Merge pull request #177 from bmaeser/patch-1
added the Vienna (Austria) 'Open Government Data' catalogue
2016-02-04 21:57:32 +08:00
Xiaming
ded94a22ef Merge pull request #176 from danfowler/patch-1
Update README.rst
2016-02-04 21:57:19 +08:00
Xiaming
edcc197a46 Merge pull request #174 from openlexington/add_lexington_ky
add Lexington's open data collection
2016-02-04 21:56:25 +08:00
Xiaming
1c0619e025 Merge pull request #173 from alexurquhart/master
Added various GIS & social science links
2016-02-04 21:56:05 +08:00
Xiaming
790655fab9 Merge pull request #172 from verhoevenben/master
Update README.rst
2016-02-04 21:55:28 +08:00
Xiaming
d66a3ce3a0 Merge pull request #171 from seaninryan/irelandOpenData
Ireland's open data
2016-02-04 21:55:07 +08:00
Xiaming
46daedde78 Merge pull request #170 from QuincyLarson/patch-1
Add Free Code Camp's 150,000 record open data set
2016-02-04 21:54:48 +08:00
Xiaming
d45138520f Merge pull request #165 from tomazin/master
Added Portuguese stats database
2016-02-04 21:53:03 +08:00
Xiaming
4dde25596c Merge pull request #164 from danbartlett/open-atlas-latest-version
Update link to latest version of Census Open Atlas
2016-02-04 21:51:10 +08:00
Xiaming
aa1294ba10 Merge pull request #162 from karussell/patch-1
Added open traffic data collection
2016-02-04 21:49:52 +08:00
Xiaming
16aab574c8 Merge pull request #160 from woemler/master
Added some cancer genomics resources.
2016-02-04 21:49:15 +08:00
Xiaming
7f3c2e9b34 Merge pull request #155 from suyashss/master
Added more genomics datasets HGDP/HapMap/CGI
2016-02-04 21:48:44 +08:00
Xiaming
f195f38b17 Merge pull request #154 from m4thfr34k/master
Update README.rst
2016-02-04 21:48:16 +08:00
Bernhard Mäser
4726d58dcb added the Vienna (Austria) 'Open Government Data' catalogue 2016-02-03 16:37:29 +01:00
Daniel Fowler
ccb6eb82c6 Update README.rst
Add data packaged "core" datasets
2016-02-03 16:40:50 +03:00
Chase Southard
c6b678ad6a add link to Lexington's open data collection 2016-02-02 14:13:34 -05:00
Alex Urquhart
a9b5b6095e Update README.rst 2016-02-02 12:44:33 -05:00
Alex Urquhart
a1534d5cf5 Update README.rst 2016-02-02 12:39:02 -05:00
Alex Urquhart
80c484fa7e Update README.rst 2016-02-02 12:25:44 -05:00
Alex Urquhart
717a5e4900 Update README.rst 2016-02-02 12:21:51 -05:00
Ben Verhoeven
59a5dc490b Update README.rst
added Personae and CSI corpus to Natural Language
2016-02-02 13:25:17 +01:00
Sean Ryan
baee4a3fdd Ireland's open data 2016-02-02 09:17:02 +00:00
Quincy Larson
ce186bb56d Add Free Code Camp's 150,000 record open data set
For more information on the dataset: https://medium.freecodecamp.com/free-code-camp-christmas-special-giving-the-gift-of-data-6ecbf0313d62#.4y2k11ta2
2016-02-01 14:42:02 -08:00
Tome
9792bace9e Added Portuguese stats database 2016-01-31 15:24:54 +00:00
Tome
64f0325f38 Added Portuguese stats database 2016-01-31 15:09:49 +00:00
Dan Bartlett
41db200856 Update README.rst
Link to latest version of Census Open Atlas
2016-01-31 15:07:13 +00:00
Peter
4f9f1181ef added open traffic collection 2016-01-31 14:39:53 +01:00
Will Oemler
1418f271f8 Added some cancer genomics resources. 2016-01-31 08:10:01 -05:00
Suyash Shringarpure
29fccee399 Added more genomics datasets HGDP/HapMap/CGI
Added datasets from the Human Genome Diversity Project, HapMap Project and Complete Genomics.
2016-01-30 22:54:38 -08:00
Daniel
788b7af22e Update README.rst
Added Open Payments Data
2016-01-30 22:49:11 -05:00
Jordan Matelsky
8df05809de Update README.rst 2016-01-30 22:11:43 -05:00
Helen Flynn
cd8064eafe Add OME powered data repositories 2016-01-21 16:38:29 +00:00
Xiaming Chen
52183c015f Add WorldPop project 2016-01-18 15:33:16 +08:00
Phill
7916027e4d Fix Broken Links
Travis build failed on a number of broken links. I've rectified some of the links, but the following I cannot:

  3. http://cvcl.mit.edu/MM/stimuli.html Connection refused - connect(2) for "cvcl.mit.edu" port 80
  4. 403 http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
  5. http://data.ohouston.org Net::ReadTimeout
  6. http://data.rio.rj.gov.br/ Connection timed out - connect(2) for "data.rio.rj.gov.br" port 80
2016-01-17 12:22:58 +00:00
Phill
535be187b1 Fix Pinhooker URL 2016-01-17 10:33:21 +00:00
Phill
60a7a434aa Added Pinhooker to Sport 2016-01-17 10:32:07 +00:00
Xiaming
d7e13d83b3 Merge pull request #146 from chaitan94/patch-1
Added the dataset 'Labeled Faces in the Wild'
2016-01-11 10:56:35 +08:00
Krishna Chaitanya
bf251cea26 Added the dataset 'Labeled Faces in the Wild' 2016-01-10 11:16:22 +05:30
Wes Turner
c7828639c8 DOC: README.rst: .. contents:: 2016-01-08 07:10:53 -06:00
raybuhr
81cd6895ca add http:// prefix to a few links
Some of the links returned 404 error messages due to the rst used. Rst assumes a link without a prefix is contained in the local directory, though none of the links in this file are. 

For example, the line 

* `The Atlas of Economic Complexity <atlas.cid.harvard.edu>`_

would proceed to the url https://github.com/caesar0301/awesome-public-datasets/blob/master/atlas.cid.harvard.edu, resulting in a 404 error. 

My change prepends http:// to the link so that line now routes to the correct address.
New line: 

* `The Atlas of Economic Complexity <http://atlas.cid.harvard.edu>`_
2016-01-05 00:11:23 -06:00
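
The fix described in the commit message above can be sketched in Python. This is a minimal illustration, not the change the author actually made: the names `RST_LINK`, `HAS_SCHEME`, and `add_http_prefix` are hypothetical, and the sketch assumes, as the message states, that no link in the file is meant to point at a local path.

```python
import re

# Hypothetical pattern: an RST inline hyperlink of the form `text <target>`_
RST_LINK = re.compile(r"`([^`<]+?)\s*<([^>`]+)>`_")

# A target counts as absolute if it starts with a URI scheme
# such as http:, https:, ftp: or mailto:
HAS_SCHEME = re.compile(r"^[a-zA-Z][a-zA-Z0-9+.-]*:")


def add_http_prefix(line: str) -> str:
    """Prepend http:// to every link target on the line that has no scheme."""

    def fix(match: re.Match) -> str:
        text, target = match.group(1), match.group(2)
        if HAS_SCHEME.match(target):
            # Already absolute; leave the link exactly as written.
            return match.group(0)
        return f"`{text} <http://{target}>`_"

    return RST_LINK.sub(fix, line)


if __name__ == "__main__":
    before = "* `The Atlas of Economic Complexity <atlas.cid.harvard.edu>`_"
    print(add_http_prefix(before))
```

The scheme check is deliberately crude: a target such as `localhost:8080` would be mistaken for an absolute URL, so a real cleanup pass would still need a manual review of the results.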
Xiaming
a9c241aa87 Remove dup GSS 2016-01-05 00:06:18 +08:00
Xiaming Chen
d2f8cb8549 Clean list format 2016-01-02 20:23:00 +08:00
usuallycwdillon
c990c1085e Added several links from my personal bookmarks 2015-12-31 14:56:58 -05:00
usuallycwdillon
69ed965a14 update with current; include changes
Merge remote-tracking branch 'upstream/master'
2015-12-31 10:05:56 -05:00
CW Dillon
549c99ca14 Adding a few data sources from my data bookmarks 2015-12-31 08:24:05 -05:00
François Pelletier
4c94713af0 Update README.rst 2015-12-30 23:55:15 -05:00
François Pelletier
f9cdb924cd New data sources from Canada
Added Canada and other miscellaneous open data sources
2015-12-30 23:52:03 -05:00
CW Dillon
9d895a6473 Adding a few data sources from my data bookmarks 2015-12-30 20:45:52 -05:00
Herman Slatman
fbf46c30e2 OpenCorporates database of companies 2015-12-31 00:44:45 +01:00
Xiaming Chen
795252c7f7 1. Add society data from Pew Research Center;
2. Merge social networks into social science;
2015-12-30 17:18:44 +08:00
Camilo Nova
c178c90b66 Fix typo 2015-12-29 13:58:26 -05:00
Xiaming
2691595ffc Merge pull request #139 from tcarnus/patch-1
Adding european climate assessment dataset
2015-12-24 10:42:03 +08:00
Tim Carnus
309c82668d Adding european climate assessment dataset 2015-12-24 00:22:56 +00:00
Marcus Emmanuel Barnes
19647877e1 Update README.rst
Government of British Columbia (Canada) data portal, which includes access to over 1,500 data sets licensed under the Open Government License – British Columbia.
2015-12-23 13:59:55 -08:00
Ignacio Peluffo
e6dc40ad85 Datasets from Argentina added
Datasets from Argentina added to the Government list.
I added two open data resources for Argentina and one for Buenos Aires
2015-12-23 11:04:00 -03:00
Xiaming Chen
0b0766acc9 Update 2G cat link 2015-12-23 16:25:56 +08:00
Xiaming Chen
b048ab1afe add SSL certificate error to white list 2015-12-23 16:15:22 +08:00
Xiaming Chen
628f0ed694 Recheck travis list 2015-12-23 16:04:01 +08:00
Kai Wolf
09ebe4a9a7 Add some shape-from-silhouette datasets 2015-12-22 14:25:14 +01:00
Xiaming Chen
4a0d02d9a2 Fix NIH FTP link 2015-12-22 13:00:21 +08:00
Xiaming Chen
bc53888819 Add awesome_bot to check links;
Repair several dead links reported by awesome_bot.
#130
2015-12-22 12:52:22 +08:00
Xiaming
716778896a Add MCTest #133 2015-12-22 10:50:15 +08:00
ReadmeCritic
1c336735d5 Update README URLs based on HTTP redirects 2015-12-21 08:36:13 -08:00
Xiaming
b7edc5cb38 Add German train data. #131 2015-12-21 18:50:36 +08:00
Xiaming
f6b2578c27 Merge pull request #129 from MarcusBarnes/master
Add the Canada Science and Technology Museums Corporation open data page under the Museums section.
2015-12-18 10:18:48 +08:00
Marcus Emmanuel Barnes
c51feea86c Update README.rst
Add the Canada Science and Technology Museums Corporation open data page under the Museums section.
2015-12-17 14:44:45 -08:00
Derwin McGeary
b2d27fdf2e Add Ensemble Genomes 2015-12-16 12:31:28 +00:00
Xiaming
334650ec54 Update README.rst 2015-12-16 20:03:54 +08:00
Erich Schubert
4a13b653df Add fast reverse-geocoder using OSM data 2015-12-16 10:20:48 +01:00
Xiaming
078e4e353d FQ 2011, 2012 data link dead 2015-12-08 16:05:03 +08:00
Xiaming
64d40f70bf Fix link of JPJOHN employment research data 2015-12-08 15:25:12 +08:00
Xiaming
13c09f81d8 Remove dup UCI net data 2015-12-08 14:47:50 +08:00
Xiaming Chen
bf5e282f43 Add TCGA #126;
Clear format.
2015-12-08 13:23:43 +08:00
Xiaming
c716fe4040 Merge pull request #125 from derwinmcgeary/patch-1
Add worldclim.org
2015-12-08 13:07:50 +08:00
Derwin McGeary
1788e514c2 Add worldclim.org
This site has good global climate data. Licence: "This dataset is freely available for academic and other non-commercial use. Redistribution, or commercial use is not allowed without prior permission."
2015-12-08 00:33:13 +00:00