VGG-Sound免费

jsaifc 119 2021-08-24 语音识别

资源介绍

VGG-Sound (http://ds.jsai.org.cn/) 语音识别 第1张

VGG-Sound是一个视听对应数据集,由从上传到YouTube的视频中提取的音频短片组成.

VGG-Sound (http://ds.jsai.org.cn/) 语音识别 第2张

Citation

@InProceedings{Chen20,
  author       = "Honglie Chen and Weidi Xie and Andrea Vedaldi and Andrew Zisserman",
  title        = "VGGSound: A Large-scale Audio-Visual Dataset",
  booktitle    = "International Conference on Acoustics, Speech, and Signal Processing (ICASSP)",
  year         = "2020",
}

License

The VGG-Sound dataset is available to download for commercial/research purposes under a Creative Commons Attribution 4.0 International License. The copyright remains with the original owners of the video.

END

发表评论