Estimation of Speaking Style in Speech Corpora Focusing on speech transcriptions

Raymond Shen, Hideaki Kikuchi


Abstract
Recent developments in computer technology have allowed the construction and widespread application of large-scale speech corpora. To foster ease of data retrieval for people interested in utilising these speech corpora, we attempt to characterise speaking style across some of them. In this paper, we first introduce the 3 scales of speaking style proposed by Eskenazi in 1993. We then use morphological features extracted from speech transcriptions that have proven effective in style discrimination and author identification in the field of natural language processing to construct an estimation model of speaking style. More specifically, we randomly choose transcriptions from various speech corpora as text stimuli with which to conduct a rating experiment on speaking style perception; then, using the features extracted from those stimuli and the rating results, we construct an estimation model of speaking style by a multi-regression analysis. After the cross validation (leave-1-out), the results show that among the 3 scales of speaking style, the ratings of 2 scales can be estimated with high accuracies, which prove the effectiveness of our method in the estimation of speaking style.
Anthology ID:
L14-1493
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2747–2752
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/616_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Raymond Shen and Hideaki Kikuchi. 2014. Estimation of Speaking Style in Speech Corpora Focusing on speech transcriptions. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2747–2752, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Estimation of Speaking Style in Speech Corpora Focusing on speech transcriptions (Shen & Kikuchi, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/616_Paper.pdf