Systems and Means of Informatics
2023, Volume 33, Issue 1, pp 24-34
INTEGRATION CAPACITIES OF SUPRACORPORA DATABASES
- A. A. Durnovo
- O. Yu. Inkova
- V. A. Nuriev
Abstract
The paper centers on integration capacities of supracorpora databases developed at the Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences. It is shown how three databases - the Supracorpora database of hierarchal logical-semantic relations (SDBH LSR), the Database of parallel texts (DBT), and the Supracorpora database of connectives (SDBC) - are integrated between themselves. The information system of hierarchal logical-semantic relations uses a specially designed database (SDBH LSR) in which annotations of logical-semantic relations are presented as trees,
i. e., directed connected acyclic graphs where nodes contain data and edges depict the subordination between nodes. Along with the SDBH LSR, ISHLSR uses data from the DBT and SDBC. Such integration makes it possible to combine the methodological strengths of the informatics, contrastive and corpus linguistics, and theory and practice of translation without losing sight of the factors that may adversely affect the validity and reliability of the final data.
[+] References (16)
- Czulo, O., and S. Hansen-Schirra, eds. 2017. Crossroads between contrastive linguistics, translation studies and machine translation: TC3 II (Translation and Multilingual Natural Language Processing 4). Berlin: Language Science Press. 208 p.
- Bystrova-McIntyre, T. 2007. Looking at the overlooked: A corpora study of punctuation use in Russian and English. Transl. Interpret. Stu. 2(1): 137-162.
- Gast, V. 2015. On the use of translation corpora in contrastive linguistics: A case study of impersonalization in English and German. Languages Contrast 15(1):4-33.
- Fantinuoli, C., and F. Zanettin, eds. 2015. New directions in corpus-based translation studies. Berlin: Language Science Press. 168 p.
- Mikhailov, M., and R. Cooper. 2016. Corpus linguistics for translation and contrastive studies: A guide for research. Abingdon, New York: Routledge. 258 p.
- Doval, I., and M.T. Sanchez Nieto, eds. 2019. Parallel corpora for contrastive and translation studies: New resources and applications. Amsterdam, Philadelphia: John Benjamins. 310 p.
- Granger, S., and M.-A. Lefer, eds. 2020. Translating and comparing languages: Corpus-based Insights. Louvain-la-Neuve, Belgique: Presses universitaires de Louvain. 298 p.
- Lavid-Lopez, J., C. Maiz-Arevalo, and J.R. Zamorano-Mansilla, eds. 2021. Corpora in translation and contrastive research in the digital age: Recent advances and explorations. Amsterdam, Philadelphia: John Benjamins. 351 p.
- Granger, S., and M.-A. Lefer, eds. 2022. Extending the scope of corpus-based translation studies. London: Bloomsbury Academic. 288 p.
- Inkova, O. Yu., ed. 2018. Semantika konnektorov: Kontrastivnoe issledovanie [Semantics of connectives: Contrastive study]. Moscow: TORUS PRESS. 396 p.
- Inkova, O. Yu., ed. 2019. Struktura konnektorov i metody ee opisaniya [Structure of connectives and methods for its description]. Moscow: TORUS PRESS. 316 p.
- Durnovo, A. A., O. Yu. Inkova, and N. A. Popkova. 2022. Arkhitekturabazy dannykh ierarkhii logiko-semanticheskikh otnosheniy [Database of hierarchies of logical-semantic relations: Architecture]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 32(1):114-125. doi: 10.14357/08696527220111.
- Inkova, O.Yu., and N. Popkova. 2017. Statistical data as information source for linguistic analysis of Russian connectors. Informatika i ee Primeneniya - Inform. Appl. 11 (3): 123-131.
- Natsional'nyy korpus russkogo yazyka [Russian national corpus]. Available at: http://www.ruscorpora.ru (accessed March 17, 2022).
- Inkova, O. Yu. 2019. Struktura konnektorov: lingvisticheskie metody opisaniya [Structure of connectives: Linguistic methods of description]. Struktura konnektorov i metody ee opisaniya [Structure of connectives and methods of its description]. Ed. O. Yu. Inkova. Moscow: TORUSS PRESS. 5-47.
- Durnovo, A. A., O. Yu. Inkova, and N. A. Popkova. 2022. Interfeys pol'zovatelya Nadkorpusnoy bazy dannykh ierarkhiy logiko-semanticheskikh otnosheniy [Database of hierarchies of logical-semantic relations: User interface]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 32(3): 103-115. doi: 10.14357/08696527220310
[+] About this article
Title
INTEGRATION CAPACITIES OF SUPRACORPORA DATABASES
Journal
Systems and Means of Informatics
Volume 33, Issue 1, pp 24-34
Cover Date
2023-05-11
DOI
10.14357/08696527230103
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
supracorpora database; integration of databases; multilingual corpus; parallel corpus; corpus-based information resources; translation studies; contrastive linguistics; machine translation
Authors
A. A. Durnovo , O. Yu. Inkova , , and V. A. Nuriev
Author Affiliations
Federal Research Center "Computer Science and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
University of Geneva, 22 Bd des Philosophes, CH-1205 Geneva 4, Switzerland
|