Commit Graph

334 Commits

Author SHA1 Message Date
Jos Polfliet
58a8710638 Add Ubuntu Dialogue Corpus
The Ubuntu Dialogue Corpus (UDC) is one of the largest public dialog datasets available. It’s based on chat logs from the Ubuntu channels on a public IRC network.
2017-01-10 14:02:13 +01:00
Daniel Darabos
3ba773df2d Fix typo. 2017-01-05 17:07:31 +01:00
Xiaming Chen
7f779b7cad Merge pull request #259 from hellais/patch-1
Add OONI data
2016-12-19 10:04:54 +08:00
Victor Laerte Oliveira
d5a61529bc Adding TravisTorrent MSR2017 Mining Challenge.
TravisTorrent, a GHTorrent partner project, provides free and easy-to-use Travis CI build analyses to the masses through its open database.
2016-12-18 20:57:22 -03:00
Xiaming Chen
4b285a07fc Merge pull request #256 from ssaamm/master
Remove dead link to GetGlue
2016-12-18 23:39:42 +08:00
Xiaming Chen
d2c2f914a7 Merge pull request #258 from dspinellis/master
Add Microsoft's Data Science for Research
2016-12-18 23:32:44 +08:00
Xiaming Chen
dd3a17cb02 Merge pull request #260 from MaxwellRebo/master
Add keyphrase extraction datasets
2016-12-18 23:25:20 +08:00
Xiaming Chen
0d0117a88a Update new image sets and three NLP sets
Images: Chars74K dataset and MNIST; NLP: Google MC-AFP, MS MARCO, and MDST
2016-12-18 16:08:36 +08:00
Maxwell Rebo
4dc886ac00 Update README.rst 2016-12-11 15:17:54 +04:00
Arturo Filastò
6b7120dad2 Add OONI data
Add a link to data provided by the Open Observatory of Network Interference on internet censorship
2016-12-08 18:44:01 +00:00
Diomidis Spinellis
80ecc66409 Add Microsoft's Data Science for Research 2016-11-27 10:47:59 +02:00
Samuel Taylor
57d9c7bff7 Remove dead link to GetGlue 2016-11-12 09:41:05 -06:00
Xiaming Chen
0954d9aa6b Add Kaggle link to Titanic data 2016-11-11 09:48:18 +08:00
Sammy X Chen
2530bbf133 Update README.rst 2016-08-15 14:04:32 +08:00
Sammy X Chen
9d1f4fb10d Add AQUASTAT and category Earth Science
Earth Science maintains data from geoscience and earth-related fields, such as environment and water.
2016-08-15 13:59:28 +08:00
Sammy X Chen
980b0564eb Merge pull request #226 from JackKelly/master
Update README.rst
2016-08-15 11:22:03 +08:00
Xiaming Chen
e2e48c39a0 #230 2016-08-15 11:18:24 +08:00
Xiaming Chen
25abbff1b6 #231 merge to add category Neuroscience 2016-08-15 11:10:46 +08:00
Sammy X Chen
059a2b974d Merge pull request #233 from arademaker/master
wordnet and the corpora from UD project
2016-08-15 11:02:29 +08:00
Sammy X Chen
90825c7654 Merge pull request #235 from handmadeby/patch-1
Updated TFL to current API link.
2016-08-15 10:58:47 +08:00
Sammy X Chen
a54b091c21 Merge pull request #236 from hckiang/master
Added Uppsala Conflict Data Program
2016-08-15 10:56:35 +08:00
Sammy X Chen
8d59ba8dfe Merge pull request #237 from stsievert/master
New Yorker caption contest ratings
2016-08-15 10:53:50 +08:00
Sammy X Chen
7270eb0b1f Merge pull request #239 from jeremie-seguin/master
Fix broken link: Netflix prize
2016-08-15 10:49:34 +08:00
Sammy X Chen
86fe0cf6dc add AWC 2016-08-11 10:51:08 +08:00
Sammy X Chen
71d9c2466d add International Economics Database 2016-08-11 10:45:55 +08:00
jeremie
9bb6ab1e89 Fix broken link: Netflix prize 2016-08-10 11:04:50 +02:00
Scott Sievert
2bf5f661f4 adds caption contest dataset 2016-07-22 10:52:48 -05:00
Haochi Kiang
21ffee83e3 Added Uppsala Conflict Data Program
"The Uppsala Conflict Data Program (UCDP) offers a number of datasets on organised violence and peacemaking, all of which can be downloaded for free through the links below."
2016-07-20 10:39:51 +08:00
handmadeby
af605c3869 Updated TFL to current API link.
The Transport for London API link was pointing to a legacy page - I updated to the current valid page.
2016-07-07 14:33:06 +01:00
Alexandre Rademaker
a3bde36abb wordnet and the corpora from UD project 2016-07-05 05:34:44 -03:00
John Pellman
7e00e1a52b Neuroscience data added; new section for neuroscience 2016-07-04 11:05:14 -04:00
John Pellman
2f40e980d2 Added Brain Catalogue. 2016-06-23 05:24:21 -04:00
Jack Kelly
4400bf5a80 Update README.rst
Adding more Energy datasets and fixing capitalisation for UK-DALE and PLAID.
2016-06-08 13:19:18 +01:00
Pierre Fenoll
b59f3bbb65 Add NYSE 2016-04-26 20:54:35 +02:00
Xiaming
8a09814e77 Add EMPIAR to bio. cat #215 2016-04-15 14:02:08 +08:00
David Dao
0f85053046 Adding Broad Bioimage Benchmark Collection (BBBC)
The Broad Bioimage Benchmark Collection (BBBC) is a large curated collection of published data sets in bio imaging. It includes all the images, metadata and ground truths. The BBBC resource is described in the following publication: Ljosa V, Sokolnicki KL, Carpenter AE (2012). Annotated high-throughput microscopy image sets for validation. Nature Methods 9(7):637. PMID: 22743765, PMCID: PMC3627348. Available at http://dx.doi.org/10.1038/nmeth.2083
2016-03-18 09:36:16 -04:00
Xiaming Chen
a355d0ef93 Clean TOC 2016-02-26 11:14:07 +08:00
Xiaming Chen
5c55314427 Add OpenDataSoft's portal list #208;
Move the collected government data to a separate file to keep the list short and clean.
2016-02-26 11:06:00 +08:00
Xiaming
ddc77bdf69 Add AMiner Citation Network Dataset 2016-02-25 19:28:36 +08:00
Xiaming
954600c51b Merge pull request #207 from alexurquhart/patch-1
Added HIFLD GIS data
2016-02-25 19:25:57 +08:00
Alex Urquhart
08e3bda416 Added HIFLD GIS data
Homeland Infrastructure Foundation-Level Data - https://hifld-dhs-gii.opendata.arcgis.com/
2016-02-25 05:48:48 -05:00
Ron
abd28a9836 added network repository to complex networks 2016-02-24 15:21:28 -08:00
lukeleslie
dea18ce158 Add Road Networks source to Complex Networks. 2016-02-19 17:32:46 -06:00
Xiaming
be1883f181 Merge pull request #202 from megansquire/patch-2
Added FLOSSmole
2016-02-16 01:30:02 +08:00
Megan Squire
feb840727c Update README.rst
Added FLOSSmole (60,000 data sets about free, libre, and open source software development practices) with corrected link
2016-02-14 12:58:38 -05:00
Prayag Verma
b259eb2a3f Fix typos
`Interations` → `Interactions`
`Longitudnal` → `Longitudinal`
2016-02-14 23:07:37 +05:30
anatoly techtonik
c9a3a0affc Add Crystallography Open Database 2016-02-14 07:12:18 +03:00
Xiaming Chen
38ecc63b95 Change GeoSpace/GIS to GIS/Environment;
Add IMOS data;
2016-02-14 01:25:23 +08:00
Xiaming Chen
fb909aa46f Move ArchiveIt! to PublicDomains; 2016-02-14 01:18:12 +08:00
Xiaming Chen
9dd7a97da3 Merge #189 2016-02-14 01:09:49 +08:00