Jos Polfliet
58a8710638
Add Ubuntu Dialogue Corpus
...
The Ubuntu Dialogue Corpus (UDC) is one of the largest public dialogue datasets available. It’s based on chat logs from the Ubuntu channels on a public IRC network.
2017-01-10 14:02:13 +01:00
Xiaming Chen
f90daee3c1
Merge pull request #267 from darabos/patch-1
...
Fix typo in "Common Craw"
2017-01-06 10:55:36 +08:00
Daniel Darabos
3ba773df2d
Fix typo.
2017-01-05 17:07:31 +01:00
Xiaming Chen
d7e4e7f957
Merge pull request #265 from bghazy/patch-2
...
Added Tunisia government data site
2016-12-30 13:17:52 +08:00
ghazy ben ahmed
606189b55c
Added Tunisia government data site
2016-12-28 20:56:27 +01:00
Xiaming Chen
7f779b7cad
Merge pull request #259 from hellais/patch-1
...
Add OONI data
2016-12-19 10:04:54 +08:00
Xiaming Chen
2507ee7d21
Merge pull request #263 from victorlaerte/master
...
Adding TravisTorrent MSR2017 Mining Challenge.
2016-12-19 10:00:40 +08:00
Victor Laerte Oliveira
d5a61529bc
Adding TravisTorrent MSR2017 Mining Challenge.
...
TravisTorrent, a GHTorrent partner project, provides free and easy-to-use Travis CI build analyses to the masses through its open database.
2016-12-18 20:57:22 -03:00
Xiaming Chen
4b285a07fc
Merge pull request #256 from ssaamm/master
...
Remove dead link to GetGlue
2016-12-18 23:39:42 +08:00
Xiaming Chen
d2c2f914a7
Merge pull request #258 from dspinellis/master
...
Add Microsoft's Data Science for Research
2016-12-18 23:32:44 +08:00
Xiaming Chen
dd3a17cb02
Merge pull request #260 from MaxwellRebo/master
...
Add keyphrase extraction datasets
2016-12-18 23:25:20 +08:00
Xiaming Chen
0d0117a88a
Update new image sets and three NLP sets
...
Images: Chars74K dataset and MNIST; NLP: Google MC-AFP, MS MARCO, and MDST
2016-12-18 16:08:36 +08:00
Maxwell Rebo
4dc886ac00
Update README.rst
2016-12-11 15:17:54 +04:00
Arturo Filastò
6b7120dad2
Add OONI data
...
Add a link to data provided by the Open Observatory of Network Interference on internet censorship
2016-12-08 18:44:01 +00:00
Diomidis Spinellis
80ecc66409
Add Microsoft's Data Science for Research
2016-11-27 10:47:59 +02:00
Samuel Taylor
57d9c7bff7
Remove dead link to GetGlue
2016-11-12 09:41:05 -06:00
Xiaming Chen
0954d9aa6b
Add Kaggle link to Titanic data
2016-11-11 09:48:18 +08:00
Sammy X Chen
2530bbf133
Update README.rst
2016-08-15 14:04:32 +08:00
Sammy X Chen
9d1f4fb10d
Add AQUASTAT and category Earth Science
...
Earth Science maintains data from geoscience and earth-related fields, such as environment and water.
2016-08-15 13:59:28 +08:00
Xiaming Chen
87df786d26
Disable fake reports of links
2016-08-15 11:26:55 +08:00
Sammy X Chen
980b0564eb
Merge pull request #226 from JackKelly/master
...
Update README.rst
2016-08-15 11:22:03 +08:00
Xiaming Chen
e2e48c39a0
#230
2016-08-15 11:18:24 +08:00
Xiaming Chen
7154fc4095
Merge branch 'jpellman-master'
2016-08-15 11:11:04 +08:00
Xiaming Chen
25abbff1b6
#231 merge to add category Neuroscience
2016-08-15 11:10:46 +08:00
Sammy X Chen
059a2b974d
Merge pull request #233 from arademaker/master
...
wordnet and the corpora from UD project
2016-08-15 11:02:29 +08:00
Sammy X Chen
90825c7654
Merge pull request #235 from handmadeby/patch-1
...
Updated TFL to current API link.
2016-08-15 10:58:47 +08:00
Sammy X Chen
a54b091c21
Merge pull request #236 from hckiang/master
...
Added Uppsala Conflict Data Program
2016-08-15 10:56:35 +08:00
Sammy X Chen
8d59ba8dfe
Merge pull request #237 from stsievert/master
...
New Yorker caption contest ratings
2016-08-15 10:53:50 +08:00
Sammy X Chen
7270eb0b1f
Merge pull request #239 from jeremie-seguin/master
...
Fix broken link: Netflix prize
2016-08-15 10:49:34 +08:00
Sammy X Chen
86fe0cf6dc
add AWC
2016-08-11 10:51:08 +08:00
Sammy X Chen
71d9c2466d
add International Economics Database
2016-08-11 10:45:55 +08:00
jeremie
9bb6ab1e89
Fix broken link: Netflix prize
2016-08-10 11:04:50 +02:00
Scott Sievert
2bf5f661f4
adds caption contest dataset
2016-07-22 10:52:48 -05:00
Haochi Kiang
21ffee83e3
Added Uppsala Conflict Data Program
...
"The Uppsala Conflict Data Program (UCDP) offers a number of datasets on organised violence and peacemaking, all of which can be downloaded for free through the links below."
2016-07-20 10:39:51 +08:00
handmadeby
af605c3869
Updated TFL to current API link.
...
The Transport for London API link pointed to a legacy page; I updated it to the current valid page.
2016-07-07 14:33:06 +01:00
Alexandre Rademaker
a3bde36abb
wordnet and the corpora from UD project
2016-07-05 05:34:44 -03:00
John Pellman
7e00e1a52b
Neuroscience data added; new section for neuroscience
2016-07-04 11:05:14 -04:00
John Pellman
2f40e980d2
Added Brain Catalogue.
2016-06-23 05:24:21 -04:00
Jack Kelly
4400bf5a80
Update README.rst
...
Adding more Energy datasets and fixing capitalisation for UK-DALE and PLAID.
2016-06-08 13:19:18 +01:00
Xiaming
e725a66ba6
Merge pull request #219 from fenollp/patch-1
...
Add NYSE
2016-04-28 00:02:39 +08:00
Pierre Fenoll
b59f3bbb65
Add NYSE
2016-04-26 20:54:35 +02:00
Xiaming
8a09814e77
Add EMPIAR to bio. cat #215
2016-04-15 14:02:08 +08:00
Xiaming
7634e43010
Merge pull request #214 from daviddao/patch-1
...
Adding Broad Bioimage Benchmark Collection (BBBC)
2016-04-15 13:53:18 +08:00
David Dao
0f85053046
Adding Broad Bioimage Benchmark Collection (BBBC)
...
The Broad Bioimage Benchmark Collection (BBBC) is a large curated collection of published data sets in bio imaging. It includes all the images, metadata and ground truths. The BBBC resource is described in the following publication: Ljosa V, Sokolnicki KL, Carpenter AE (2012). Annotated high-throughput microscopy image sets for validation. Nature Methods 9(7):637. PMID: 22743765, PMCID: PMC3627348. Available at http://dx.doi.org/10.1038/nmeth.2083
2016-03-18 09:36:16 -04:00
Xiaming Chen
a355d0ef93
Clean TOC
2016-02-26 11:14:07 +08:00
Xiaming Chen
5c55314427
Add OpenDataSoft's portal list #208
...
Move the collected government listings to a separate file to keep the list short and clean.
2016-02-26 11:06:00 +08:00
Xiaming
bc0ba9e7b8
Merge pull request #209 from ReadmeCritic/patch-1
...
Fix Travis Build
2016-02-26 10:55:09 +08:00
ReadmeCritic
febb09ef8b
[travis] whitelist arcgis, bixi
2016-02-25 15:41:05 -08:00
ReadmeCritic
f85d519589
[travis] allow timeout
2016-02-25 15:40:02 -08:00
Xiaming
ddc77bdf69
Add AMiner Citation Network Dataset
2016-02-25 19:28:36 +08:00