A Corpus for Multilingual Document Classification in Eight Languages

Holger Schwenk, Xian Li


Anthology ID:
L18-1560
Volume:
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Month:
May
Year:
2018
Address:
Miyazaki, Japan
Editors:
Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
https://aclanthology.org/L18-1560
DOI:
Bibkey:
Cite (ACL):
Holger Schwenk and Xian Li. 2018. A Corpus for Multilingual Document Classification in Eight Languages. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA).
Cite (Informal):
A Corpus for Multilingual Document Classification in Eight Languages (Schwenk & Li, LREC 2018)
Copy Citation:
PDF:
https://aclanthology.org/L18-1560.pdf
Code
 facebookresearch/MLDoc +  additional community code
Data
MLDoc