Multilingual Text-to-Speech Synthesis: The Bell Labs Approach is the first monograph-length description of the Bell Labs work on multilingual text-to-speech synthesis. Every important aspect of the system is described, including text analysis, segmental timing, intonation and synthesis. There is also a discussion of evaluation methodologies, as well as a chapter outlining some future areas of research. While the book focuses on the Bell Labs approach to the various problems of converting from text into speech, other approaches are discussed and compared. Thus, this book serves both the function of providing a single reference to an important strand of research in multilingual synthesis, while at the same time providing a source of information on current trends in the field.
Ulum Ö (2020). A critical deconstruction of computer-based test application in Turkish State University, Education and Information Technologies , 25 :6 , (4883-4896), Online publication date: 1-Nov-2020 .
Zhang H, Sproat R, Ng A, Stahlberg F, Peng X, Gorman K and Roark B (2019). Neural models of text normalization for speech applications, Computational Linguistics , 45 :2 , (293-337), Online publication date: 1-Jun-2019 .
Vainio M Phonetics and Machine Learning: Hierarchical Modelling of Prosody in Statistical Speech Synthesis Statistical Language and Speech Processing, (37-54)
Moreno-Daniel A, Wilpon J and Juang B (2012). Index-based incremental language model for scalable directory assistance, Speech Communication , 54 :3 , (351-367), Online publication date: 1-Mar-2012 .
Rojc M, Rotovnik T, Brus M, Jan D and Kačič Z Embodied conversational agents in Wizard-of-Oz and multimodal interaction applications Proceedings of the 2007 COST action 2102 international conference on Verbal and nonverbal communication behaviours, (294-309)
Šef T Automatic accentuation of words for Slovenian TTS system Proceedings of the 5th WSEAS international conference on Signal processing, (155-160)
Escudero-Mancebo D and Cardeñoso-Payo V Mining intonation corpora using knowledge driven sequential clustering Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence, (360-369)
Escudero-Mancebo D and Cardeñoso-Payo V Visualization of prosodic knowledge using corpus driven MEMOInt intonation modelling Proceedings of the 9th international conference on Text, Speech and Dialogue, (645-652)
Müller K Improving syllabification models with phonotactic knowledge Proceedings of the Eighth Meeting of the ACL Special Interest Group on Computational Phonology and Morphology, (11-20)
Ivanecky J, Fischer J, Mast M, Kunzmann S, Ross T and Fischer V Multi-lingual and multi-modal speech processing and applications Proceedings of the 27th DAGM conference on Pattern Recognition, (149-159)
Sef T, Skrjanc M and Gams M Automatic Lexical Stress Assignment of Unknown Words for Highly Inflected Slovenian Language Proceedings of the 5th International Conference on Text, Speech and Dialogue, (165-172)
Jansche M Re-engineering letter-to-sound rules Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies, (1-7)
Müller K Automatic detection of syllable boundaries combining the advantages of treebank and bracketed corpora training Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, (410-417)
Toole J A hybrid approach to the identification and expansion of abbreviations Content-Based Multimedia Information Access - Volume 1, (725-736)
Kiraz G (2000). Multitiered nonlinear morphology using multitape finite automata, Computational Linguistics , 26 :1 , (77-105), Online publication date: 1-Mar-2000 .
Nakatani C and Chu-Carroll J Using dialogue representations for concept-to-speech generation Proceedings of the ANLP-NAACL 2000 Workshop on Conversational Systems, (48-53)
Chu-Carroll J MIMIC Proceedings of the sixth conference on Applied natural language processing, (97-104)
Nakatani C and Chu-Carroll J Using dialogue representations for concept-to-speech generation Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems - Volume 3, (48-53)
Pan S and Hirschberg J Modeling local context for pitch accent prediction Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, (233-240)
Müller K, Möbius B and Prescher D Inducing probabilistic syllable classes using multivariate clustering Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, (225-232)
Chu-Carroll J and Carpenter B (1999). Vector-based natural language call routing, Computational Linguistics , 25 :3 , (361-388), Online publication date: 1-Sep-1999 .
Chu-Carroll J and Carpenter B Dialogue management in vector-based call routing Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1, (256-262)
Nakatani C Constituent-based accent prediction Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 2, (939-945)