From 58a871063820c0dbc6d74fe41050d94c7728bb6e Mon Sep 17 00:00:00 2001 From: Jos Polfliet Date: Tue, 10 Jan 2017 14:02:13 +0100 Subject: [PATCH] Add Ubuntu Dialogue Corpus MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit The Ubuntu Dialog Corpus (UDC) is one of the largest public dialog datasets available. It’s based on chat logs from the Ubuntu channels on a public IRC network. --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 6a03677..fa5efdd 100755 --- a/README.rst +++ b/README.rst @@ -358,6 +358,7 @@ Natural Language * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ +* `Ubuntu Dialogue Corpus `_ * `USENET postings corpus of 2005~2011 `_ * `Wikidata - Wikipedia databases `_ * `Wikipedia Links data - 40 Million Entities in Context `_