459 Commits

Author SHA1 Message Date
Jos Polfliet
58a8710638 Add Ubuntu Dialogue Corpus
The Ubuntu Dialog Corpus (UDC) is one of the largest public dialog datasets available. It’s based on chat logs from the Ubuntu channels on a public IRC network.
2017-01-10 14:02:13 +01:00
Xiaming Chen
f90daee3c1 Merge pull request #267 from darabos/patch-1
Fix typo in "Common Craw"
2017-01-06 10:55:36 +08:00
Daniel Darabos
3ba773df2d Fix typo. 2017-01-05 17:07:31 +01:00
Xiaming Chen
d7e4e7f957 Merge pull request #265 from bghazy/patch-2
Added Tunisia government data site
2016-12-30 13:17:52 +08:00
ghazy ben ahmed
606189b55c Added Tunisia government data site 2016-12-28 20:56:27 +01:00
Xiaming Chen
7f779b7cad Merge pull request #259 from hellais/patch-1
Add OONI data
2016-12-19 10:04:54 +08:00
Xiaming Chen
2507ee7d21 Merge pull request #263 from victorlaerte/master
Adding TravisTorrent MSR2017 Mining Challenge.
2016-12-19 10:00:40 +08:00
Victor Laerte Oliveira
d5a61529bc Adding TravisTorrent MSR2017 Mining Challenge.
TravisTorrent, a GHTorrent partner project, provides free and easy-to-use Travis CI build analyses to the masses through its open database.
2016-12-18 20:57:22 -03:00
Xiaming Chen
4b285a07fc Merge pull request #256 from ssaamm/master
Remove dead link to GetGlue
2016-12-18 23:39:42 +08:00
Xiaming Chen
d2c2f914a7 Merge pull request #258 from dspinellis/master
Add Microsoft's Data Science for Research
2016-12-18 23:32:44 +08:00
Xiaming Chen
dd3a17cb02 Merge pull request #260 from MaxwellRebo/master
Add keyphrase extraction datasets
2016-12-18 23:25:20 +08:00
Xiaming Chen
0d0117a88a Update new image sets and three NLP sets
Images: Chars74K dataset and MNIST; NLP: Google MC-AFP, MS-MACRO, and MDST
2016-12-18 16:08:36 +08:00
Maxwell Rebo
4dc886ac00 Update README.rst 2016-12-11 15:17:54 +04:00
Arturo Filastò
6b7120dad2 Add OONI data
Add a link to data provided by the Open Observatory of Network Interference on internet censorship
2016-12-08 18:44:01 +00:00
Diomidis Spinellis
80ecc66409 Add Microsoft's Data Science for Research 2016-11-27 10:47:59 +02:00
Samuel Taylor
57d9c7bff7 Remove dead link to GetGlue 2016-11-12 09:41:05 -06:00
Xiaming Chen
0954d9aa6b Add Kaggle link to Titanic data 2016-11-11 09:48:18 +08:00
Sammy X Chen
2530bbf133 Update README.rst 2016-08-15 14:04:32 +08:00
Sammy X Chen
9d1f4fb10d Add AQUASTAT and category Earth Science
Earth Science maintains data from geoscience and earth-related fields, like environment, water etc.
2016-08-15 13:59:28 +08:00
Xiaming Chen
87df786d26 Disable fake reports of links 2016-08-15 11:26:55 +08:00
Sammy X Chen
980b0564eb Merge pull request #226 from JackKelly/master
Update README.rst
2016-08-15 11:22:03 +08:00
Xiaming Chen
e2e48c39a0 #230 2016-08-15 11:18:24 +08:00
Xiaming Chen
7154fc4095 Merge branch 'jpellman-master' 2016-08-15 11:11:04 +08:00
Xiaming Chen
25abbff1b6 #231 merge to add category Neuroscience 2016-08-15 11:10:46 +08:00
Sammy X Chen
059a2b974d Merge pull request #233 from arademaker/master
wordnet and the corpora from UD project
2016-08-15 11:02:29 +08:00
Sammy X Chen
90825c7654 Merge pull request #235 from handmadeby/patch-1
Updated TFL to current API link.
2016-08-15 10:58:47 +08:00
Sammy X Chen
a54b091c21 Merge pull request #236 from hckiang/master
Added Uppsala Conflict Data Program
2016-08-15 10:56:35 +08:00
Sammy X Chen
8d59ba8dfe Merge pull request #237 from stsievert/master
New Yorker caption contest ratings
2016-08-15 10:53:50 +08:00
Sammy X Chen
7270eb0b1f Merge pull request #239 from jeremie-seguin/master
Fix broken link: Netflix prize
2016-08-15 10:49:34 +08:00
Sammy X Chen
86fe0cf6dc add AWC 2016-08-11 10:51:08 +08:00
Sammy X Chen
71d9c2466d add International Economics Database 2016-08-11 10:45:55 +08:00
jeremie
9bb6ab1e89 Fix broken link: Netflix prize 2016-08-10 11:04:50 +02:00
Scott Sievert
2bf5f661f4 adds caption contest dataset 2016-07-22 10:52:48 -05:00
Haochi Kiang
21ffee83e3 Added Uppsala Conflict Data Program
"The Uppsala Conflict Data Program (UCDP) offers a number of datasets on organised violence and peacemaking, all of which can be downloaded for free through the links below."
2016-07-20 10:39:51 +08:00
handmadeby
af605c3869 Updated TFL to current API link.
The Transport for London API link was pointing to a legacy page - I updated to the current valid page.
2016-07-07 14:33:06 +01:00
Alexandre Rademaker
a3bde36abb wordnet and the corpora from UD project 2016-07-05 05:34:44 -03:00
John Pellman
7e00e1a52b Neuroscience data added; new section for neuroscience 2016-07-04 11:05:14 -04:00
John Pellman
2f40e980d2 Added Brain Catalogue. 2016-06-23 05:24:21 -04:00
Jack Kelly
4400bf5a80 Update README.rst
Adding more Energy datasets.  And fixing capitalisation for UK-DALE and PLAID
2016-06-08 13:19:18 +01:00
Xiaming
e725a66ba6 Merge pull request #219 from fenollp/patch-1
Add NYSE
2016-04-28 00:02:39 +08:00
Pierre Fenoll
b59f3bbb65 Add NYSE 2016-04-26 20:54:35 +02:00
Xiaming
8a09814e77 Add EMPIAR to bio. cat #215 2016-04-15 14:02:08 +08:00
Xiaming
7634e43010 Merge pull request #214 from daviddao/patch-1
Adding Broad Bioimage Benchmark Collection (BBBC)
2016-04-15 13:53:18 +08:00
David Dao
0f85053046 Adding Broad Bioimage Benchmark Collection (BBBC)
The Broad Bioimage Benchmark Collection (BBBC) is a large curated collection of published data sets in bio imaging. It includes all the images, metadata and ground truths. The BBBC resource is described in the following publication: Ljosa V, Sokolnicki KL, Carpenter AE (2012). Annotated high-throughput microscopy image sets for validation. Nature Methods 9(7):637 / doi. PMID: 22743765 PMCID: PMC3627348. Available at http://dx.doi.org/10.1038/nmeth.2083
2016-03-18 09:36:16 -04:00
Xiaming Chen
a355d0ef93 Clean TOC 2016-02-26 11:14:07 +08:00
Xiaming Chen
5c55314427 Add OpenDataSoft's portal list #208;
Move collected government to separated file to make the list short and clean.
2016-02-26 11:06:00 +08:00
Xiaming
bc0ba9e7b8 Merge pull request #209 from ReadmeCritic/patch-1
Fix Travis Build
2016-02-26 10:55:09 +08:00
ReadmeCritic
febb09ef8b [travis] white lis arcgis,bixi 2016-02-25 15:41:05 -08:00
ReadmeCritic
f85d519589 [travis] allow timeout 2016-02-25 15:40:02 -08:00
Xiaming
ddc77bdf69 Add AMiner Citation Network Dataset 2016-02-25 19:28:36 +08:00