Spoken Turkish in television news and debates: Some acoustic and morphological aspects relevant to respeaking
Tarih
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Erişim Hakkı
Özet
This paper presents a qualitative analysis of spoken Turkish based on approximately 77 minutes of transcribed content from television news and debates with special focus on acoustic and morphological aspects relevant to respeaking, where an edited form of the original verbal content is dictated to a speaker-dependent automatic speech recognition (ASR) engine, whose output is further edited for broadcast-quality subtitles. The data suggest that respeaking can solve only some of the potential problems unscripted speech presents for ASR. On the acoustic level, a respeaker can overcome segmental and supra-segmental variation as well as degraded acoustic conditions, and can partially resolve overlapping speech. On the morphological level, disfluency and deviant morphology can be handled. When paraphrasing the text and dictating punctuation marks under time pres-sure, a respeaker can hardly control her own pronunciation; deletion, reduced morphemes, etc., might lead to misrecognition. As standard orthography, capitalization, and punctuation are required in subtitles, named entities, figures, morphophonemics and use of apos-trophe will require satisfactory solutions during ASR and/or post-editing. © 2021, Otto Harrassowitz GmbH. Co.KG. All rights reserved.












