CN-Celeb

Resource Introduction

CN-Celeb is a large-scale speaker recognition dataset collected 'in the wild'. It consists of two subsets, CN-Celeb1 and CN-Celeb2. All audio files are single-channel, sampled at 16 kHz with 16-bit precision. CN-Celeb1 contains more than 130,000 utterances from 1,000 Chinese celebrities, and CN-Celeb2 contains more than 520,000 utterances from 2,000 Chinese celebrities; each subset covers 11 different real-world genres. The data collection was organized by the Center for Speech and Language Technologies (CSLT), Tsinghua University, and funded by the National Natural Science Foundation of China (No. 61633013) and the Postdoctoral Science Foundation of China (No. 2018M640133). You can cite the data using the following BibTeX entry:

@misc{fan2019cnceleb,
  title={CN-CELEB: a challenging Chinese speaker recognition dataset},
  author={Yue Fan and Jiawen Kang and Lantian Li and Kaicheng Li and Haolin Chen and Sitong Cheng and Pengyuan Zhang and Ziya Zhou and Yunqi Cai and Dong Wang},
  year={2019},
  eprint={1911.01799},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}
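
The audio format stated in the description (single channel, 16 kHz, 16-bit) can be checked on a local copy of the data. The following is a minimal sketch using only Python's standard `wave` module. It assumes the file in question is an uncompressed PCM WAV file; the function name `matches_described_format` is illustrative and not part of any CN-Celeb tooling.

```python
import wave

# Expected values taken from the dataset description.
EXPECTED_CHANNELS = 1          # single channel
EXPECTED_SAMPLE_RATE = 16000   # 16 kHz
EXPECTED_SAMPLE_WIDTH = 2      # bytes per sample, i.e. 16-bit


def matches_described_format(path):
    """Return True if the WAV file at `path` is mono, 16 kHz, 16-bit.

    Assumption: `path` points to a PCM WAV file; other containers
    would need a different reader.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getframerate() == EXPECTED_SAMPLE_RATE
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH
        )
```

Calling `matches_described_format("some_utterance.wav")` on a downloaded file (the path here is a placeholder) returns `False` for any file that deviates from the described format, which can help catch accidental resampling or stereo conversion before training.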

PEOPLE

Dong Wang, Yue Fan, Hao Cui, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai

CONTACT

Dong Wang: wangdong99@mails.tsinghua.edu.cn
Lantian Li: lilt@cslt.org
Yue Fan: fanyue@cslt.org
Jiawen Kang: kangjw@cslt.org
Zhiyuan Tang: tangzy@cslt.org

Address: ROOM 1-303, BLDG FIT, CSLT, Tsinghua University

Homepage: http://cslt.org or http://cslt.riit.tsinghua.edu.cn
