Systems and Means of Informatics
2022, Volume 32, Issue 1, pp 55-62
PHONETIC-ACOUSTIC DATABASE OF RUSSIAN TRIGRAMS
- Yu. I. Butenko
- Yu. V. Stroganov
- A. V. Kvasnikov
- N. V. Slavnov
Abstract
The article describes a phonetic-acoustic database of Russian trigrams intended for the analysis and synthesis of Russian speech. A classification of Russian trigrams is given, and trigrams that are easy or difficult to pronounce are highlighted. It is noted that trigrams within a word fully or partially coincide with morphemes of the Russian language. Options for annotating speech recordings in the speech annotation system are illustrated. Variability in the pronunciation of Russian trigrams by different speakers is analyzed and illustrated with oscillograms. It is shown that the speech annotation system makes it possible to take into account the speaker's individual characteristics that affect pronunciation quality. The influence of a phoneme's position in the word on the quality of its recognition is studied. It is suggested that frequency of use and the position of the phoneme in the word be used as weights when trigrams are applied in speech recognition and synthesis tasks.
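The weighting idea in the last sentence of the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes orthographic letter trigrams as a stand-in for phoneme trigrams, and the `POSITION_FACTOR` values, the three position classes, and the function names are hypothetical choices made only for illustration.

```python
from collections import Counter

# Hypothetical position factors; the article gives no numeric values.
POSITION_FACTOR = {"initial": 1.0, "medial": 0.8, "final": 0.6}


def trigrams_with_position(word):
    """Yield (trigram, position) for every 3-letter window in a word."""
    last = len(word) - 3  # index of the last window; negative for short words
    for i in range(last + 1):
        if i == 0:
            position = "initial"
        elif i == last:
            position = "final"
        else:
            position = "medial"
        yield word[i:i + 3], position


def trigram_weights(words):
    """Weight each (trigram, position) pair by relative frequency
    multiplied by the assumed position factor."""
    counts = Counter()
    for word in words:
        counts.update(trigrams_with_position(word.lower()))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        key: (n / total) * POSITION_FACTOR[key[1]]
        for key, n in counts.items()
    }
```

Calling `trigram_weights(["молоко", "око"])` returns one weight per (trigram, position) pair, so the same trigram at the start and at the end of a word is scored separately. A real system would work on phonemic transcriptions and on frequencies estimated from a large corpus.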
About this article
Cover Date
2022-05-10
DOI
10.14357/08696527220105
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Key words
phonetic-acoustic base; trigram; speaker; annotation; oscillogram; pronunciation; variability
Authors
Yu. I. Butenko, Yu. V. Stroganov, A. V. Kvasnikov, and N. V. Slavnov
Author Affiliations
N. E. Bauman Moscow State Technical University, 5-1, 2nd Baumanskaya Str., Moscow 105005, Russian Federation