20 Newsgroups (free)

jsaifc 17 2021-09-02 Text corpus

Resource description

The original form of this dataset is available at http://qwone.com/~jason/20Newsgroups/. The 20 Newsgroups data set is a collection of approximately 19K newsgroup documents. This is the "bydate" version, which contains 18846 documents sorted by date and split into training (60%) and test (40%) sets. Changes to the dataset: all files have been converted to txt format.
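Since the files are plain txt and already split into training and test sets, loading them can be sketched as below. This is a minimal sketch, not the layout documented by this page: it assumes a hypothetical directory structure of `<root>/<split>/<newsgroup>/<document>.txt`, and the names `load_split`, `root`, and `split` are illustrative only. Adjust the paths to match the archive you actually download.

```python
from pathlib import Path


def load_split(root, split):
    """Return (texts, labels) for one split.

    Assumed (hypothetical) layout: root/split/<newsgroup>/<document>.txt
    """
    texts, labels = [], []
    split_dir = Path(root) / split
    # Each subdirectory is treated as one newsgroup, used as the class label.
    for group_dir in sorted(p for p in split_dir.iterdir() if p.is_dir()):
        for doc in sorted(group_dir.glob("*.txt")):
            # Newsgroup posts are not reliably UTF-8; latin-1 can decode any byte.
            texts.append(doc.read_text(encoding="latin-1"))
            labels.append(group_dir.name)
    return texts, labels
```

With that layout, `load_split("20news-bydate", "train")` and `load_split("20news-bydate", "test")` would return the two date-ordered splits; the directory name here is also an assumption.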
