Context: The Eurovision Song Contest, which originated in 1956, is present on YouTube through uploads of songs performed in the Contest. Any user can freely comment on these songs. This dataset is made of up a collection of comments made on four YouTube videos of Eurovision entries by Belgium. The comments are in a number of languages. Content: The YouTube online forums associated with the Eurovision Song Contest have a large number of users from varied linguistic backgrounds who, because of their interests in song performance, are particularly attentive to language-related issues, such as the accent of the performers and the choice of language of the songs. Commentaries are made by forum participants from disparate locations on a variety of topics, one of the most prominent being language, including language features and perceptions of language use. Acknowledgements: This dataset was collected by Dejan Ivkovi? for the purpose of linguistic research. If you made use of this data, please cite the following article: [Ivkovi?, D. (2013). The Eurovision Song Contest on YouTube: A corpus-based analysis of language attitudes. Language@Internet, 10, article 1. (urn:nbn:de:0009-7-35977)][1] Inspiration: * This dataset contains multiple languages. Can you identify and the language of each comment? * Can you automatically find positive and negative comments about different country’s songs? * Are some commenters more positive or more negative than others? [1]: http://www.languageatinternet.org/articles/2013/Ivkovic/?searchterm=eurovision