Informatics and Applications
2017, Volume 11, Issue 3, pp 123-131
STATISTICAL DATA AS INFORMATION SOURCE FOR LINGUISTIC ANALYSIS OF RUSSIAN CONNECTORS
Abstract
The aim of this paper is to describe statistical data gathered from the supracorpora database (SCDB) of connectors for further analysis of their formal and functional properties. Until now, these properties have usually been described applying semantic analysis, while corpus data, if used at all, have not been subject to statistical processing. It is automatically generated and verifiable information, collected from texts corpora that can be one of the most reliable tools in the analysis of linguistic units, including connectors. The paper shows what statistics one may obtain from the SCDB and how to use it in the linguistic analysis in case of tol’ko, a polyfunctional linguistic unit that can be a part of multicomponent and two-place connectors.
[+] References (16)
- In’kova-Manzotti, O. Yu. 2001. Konnektory protivopostavleniya vo frantsuzskom i russkom yazykakh. Sopostavitel’noe issledovanie [Connectors of opposition in French and Russian. A comparative study]. Moscow: Informelektro. 434 p.
- Chzhon, Kh. Kh. 2003. Prisoedinitel’nye skrepy v sovremennom russkom yazyke: sintaksis i semantika [Conjunctive ties in modern Russian: Syntax and semantics]. Moscow: Lomonosov Moscow State University. PhD Thesis. 190 p.
- Zav’yalov, V. N. 2009. Morfologicheskie i sintaksicheskie aspekty opisaniya struktury soyuzov v sovremennom russkom yazyke [Morphological and syntactical aspects of the conjunctions’ structure description in modern Russian]. Vladivostok: DGU. D.Sc. Thesis. 393 p.
- Natsional’nyy korpus russkogo yazyka [Russian National Corpus]. Available at: http://www.ruscorpora.ru (accessed April 23, 2017).
- Russkaya korpusnaya grammatika [Russian Corpus Grammar]. Available at: http://rusgram.ru/ (accessed April28, 2017).
- Apresyan, V. Yu., andO. E. Pekelis. 2011. Soyuz [Conjunction]. Available at: http://rusgram.ru/ (accessed April28, 2017).
- Zaliznyak Anna A., I. M. Zatsman, O.Yu. In’kova, and M. G. Kruzhkov. 2015. Nadkorpusnye bazy dannykh kak lingvisticheskiy resurs [Subcorpora databases as linguistic resource]. Corpus Linguistics: 7th Conference (International) Proceedings. St. Petersburg: St. Petersburg State University. 211-218.
- Zatsman, I.M., O.Yu. In’kova, M.G. Kruzhkov, and N. A. Popkova. 2016. Predstavlenie krossyazykovykh znaniy o konnektorakh v nadkorpusnykh bazakh dannykh [Representation of cross-lingual knowledge about connectors in supracorpora databases] Informatika i ee Primeneniya — Inform. Appl. 10(1):106—118.
- In’kova, O.Yu., and M. G. Kruzhkov. 2016. Nadkorpusnye russko-frantsuzskie basy dannykh glagol’nykh form i konnektorov [Supracorpora databases of Russian and French verbal forms and connectors]. Lingue slave a confronto [Slavic languages in comparison]. Eds. O. In’kova and A. Trovesi. Bergamo: Bergamo University Press. 365-392.
- Zaliznyak, Anna A., I. M. Zatsman, and O.Yu. In’kova. 2017. Nadkorpusnaya baza dannykh konnektorov: postroenie sistemy terminov [Supracorpora database of connectors: Developing a terminology]. Informatika i ee Primeneniya — Inform. Appl. 11(1):100—108.
- Dobrovol’skiy, D. O., A. A. Kretov, and S. A. Sharov. 2005. Korpus parallel’nykh tekstov: Arkhitektura i vozmozhnosti ispol’zovaniya [Corpus of parallel texts: Architecture and applications]. Natsional’nyy korpus russkogo yazyka: 2003—2005 [Russian National Corpus: 2003-2005]. Moscow: Indrik. 263-296.
- In’kova, O.Yu., and N.A. Popkova. 2016. Struktura dvukhmestnykh konnektorov russkogo yazyka v svete korpusnykh dannykh [The structure of two-part correlative connectors as an object of corpus analysis]. Computational Linguistics and Intellectual Technologies: Conference (International) “Dialogue”Proceedings. Moscow: RGGU. 15(22):200-213.
- In’kova, O.Yu. 2016. Kprobleme opisaniya mnogokomponentnykh konnektorov russkogo yazyka: ne tol’ko... no i [Towards the problem of the description of multiword connectives of Russian language: Ne tol’ko... no i (not only... but also)]. Voprosy yazykoznaniya [Topics in the Study of Language] 2:37-60.
- Kobozeva, I. M. 2016. Kognitivno-semanticheskiy podkhod k opisaniyu sredstv svyazi predlozheniy (na primere konnektorov so znacheniem neposredstvennogo sledovaniya) [Cognitive-semantical approach to the description of ways to connect sentences (case of connectors of immediate consecution)]. Tr. Instituta russkogo yazyka im. V. V. Vinogradova [V. V. Vinogradov Russian Language Institute of the Russian Academy of Sciences Collections] 11:118-131.
- Popkova, N.A., O.Yu. In’kova, I. M. Zatsman, and M. G. Kruzhkov. 2015. Metodika postroeniya monoekvivalentsiy v nadkorpusnoy baze dannykh konnektorov [Methodology of constructing monoequivalences in the supracorpora database of connectors]. Tr. 2-y nauchn. konf. “Zadachi sovremennoy informatiki” [2nd Scientific Conference “Modern Informatics’ Problems” Proceedings]. Moscow: FRC CSC RAS. 143-153.
- Uryson, E. V. 2012. Soyuzy, konnektory i teoriya valentnostey [Conjunctions, connectors, and the valence theory]. Computational Linguistics and Intellectual Technologies: Conference (International) “Dialogue” Proceedings. Moscow: RGGU. 11(1):627-637.
[+] About this article
Title
STATISTICAL DATA AS INFORMATION SOURCE FOR LINGUISTIC ANALYSIS OF RUSSIAN CONNECTORS
Journal
Informatics and Applications
2017, Volume 11, Issue 3, pp 123-131
Cover Date
2017-09-30
DOI
10.14357/19922264170314
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
annotation of connectors; corpus linguistics; supracorpora databases; parallel texts; statistical
Authors
O. Inkova and N. Popkova
Author Affiliations
Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|