The dataset was extracted from social media and contains 43,313 tokens. The classification task is to label each token with one of three classes: arabizi, foreign, or emotag. lang: Tunisian, iterations: 4,790, file_type: TSV, tasks: Classification, Part-of-Speech (POS)
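
As a rough illustration of what token-level classification data in TSV form might look like in practice, the sketch below reads token/label pairs and counts the classes. The column layout (token in the first column, label in the second), the helper name `read_token_labels`, and the sample rows are all assumptions made for illustration; they are not taken from the dataset itself.

```python
import csv
import io
from collections import Counter

# The three classes named in the task description.
LABELS = {"arabizi", "foreign", "emotag"}


def read_token_labels(handle):
    """Yield (token, label) pairs from a two-column TSV stream.

    Assumption: column 0 holds the token and column 1 holds its class label.
    The real file may be laid out differently.
    """
    # QUOTE_NONE keeps quote characters inside tokens as literal text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 2:
            continue  # skip blank or incomplete lines
        token, label = row[0], row[1]
        if label not in LABELS:
            raise ValueError(f"unexpected label: {label!r}")
        yield token, label


# Made-up sample rows, for illustration only (not drawn from the dataset).
sample = "3lech\tarabizi\nmerci\tforeign\n:)\temotag\nbarcha\tarabizi\n"

pairs = list(read_token_labels(io.StringIO(sample)))
counts = Counter(label for _, label in pairs)
print(counts)
```

To run this against the actual file, the `io.StringIO(sample)` argument could be replaced with a handle from `open(path, encoding="utf-8", newline="")`, once the real column order has been checked.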