Age Group Classification with Speech and Metadata Multimodality Fusion

Denys Katerenchuk


Abstract
Children comprise a significant proportion of TV viewers and it is worthwhile to customize the experience for them. However, identifying who is a child in the audience can be a challenging task. We present initial studies of a novel method which combines utterances with user metadata. In particular, we develop an ensemble of different machine learning techniques on different subsets of data to improve child detection. Our initial results show an 9.2% absolute improvement over the baseline, leading to a state-of-the-art performance.
Anthology ID:
E17-2030
Volume:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Mirella Lapata, Phil Blunsom, Alexander Koller
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
188–193
Language:
URL:
https://aclanthology.org/E17-2030
DOI:
Bibkey:
Cite (ACL):
Denys Katerenchuk. 2017. Age Group Classification with Speech and Metadata Multimodality Fusion. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 188–193, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Age Group Classification with Speech and Metadata Multimodality Fusion (Katerenchuk, EACL 2017)
Copy Citation:
PDF:
https://aclanthology.org/E17-2030.pdf