Systems and Means of Informatics
2022, Volume 32, Issue 1, pp 55-62
- Yu. I. Butenko
- Yu. V. Stroganov
- A. V. Kvasnikov
- N. V. Slavnov
The article describes a phonetic-acoustic base of Russian trigrams for the analysis and synthesis of Russian speech. A classification of Russian trigrams is given, and trigrams that are easy or difficult to pronounce are identified. It is noted that trigrams within a word coincide fully or partially with morphemes of the Russian language. Variants of annotating speech recordings in the speech markup system are illustrated. Variability in the pronunciation of Russian trigrams by different speakers is analyzed and illustrated with oscillograms. It is shown that the speech markup system makes it possible to take into account the speaker's individual characteristics that affect pronunciation quality. The influence of a phoneme's position in the word on the quality of its recognition is studied. It is suggested that frequency of use and the position of the phoneme in the word be used as weights when applying trigrams to speech recognition and synthesis tasks.
Institute of Informatics Problems, Russian Academy of Sciences
Key words
phonetic-acoustic base; trigram; speaker; annotation; oscillogram; pronunciation; variability
Yu. I. Butenko, Yu. V. Stroganov, A. V. Kvasnikov, and N. V. Slavnov
N. E. Bauman Moscow State Technical University, 5-1, 2nd Baumanskaya Str., Moscow 105005, Russian Federation