Informatics and Applications
2020, Volume 14, Issue 4, pp 108-116
EVOLUTION OF CLASSIFICATIONS IN SUPRACORPORA DATABASES
- A. A. Goncharov
- I. M. Zatsman
- M. G. Kruzhkov
Abstract
The paper examines the task of recording changes to the descriptions of meanings of German modal verbs during the annotation of parallel German-Russian texts in a supracorpora database. This task serves as a case study for analyzing the specifics of using dynamic classification systems (DCS) in information systems. The distinctive feature of a DCS is that the semantic content of its concepts may change during annotation, which often entails the need to reclassify previously annotated data according to the changes made. The paper aims to answer two questions: (i) What factors affect the need to edit and/or reclassify annotations created before a concept was changed? and (ii) What kinds of operations can be used to represent changes to concepts in the DCS? The paper describes seven types of possible changes and enumerates the corresponding operations applied to DCS concepts during annotation. The operations are grouped into three categories depending on how they affect the need to reclassify previously created annotations.
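The reclassification problem described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: every name in it (`DynamicClassification`, `Impact`, `rename`, `redefine`, `stale`) is hypothetical, and the two operations shown are illustrative assumptions rather than the paper's seven change types or its three categories. The idea is only that each annotation records the version of the concept it was made under, so that a change in a concept's meaning makes earlier annotations detectable as candidates for review.

```python
from dataclasses import dataclass, field
from enum import Enum


class Impact(Enum):
    """Hypothetical effect of a concept change on existing annotations."""
    NONE = "no reclassification needed"
    REVIEW = "earlier annotations must be reviewed"


@dataclass
class Concept:
    name: str
    definition: str
    version: int = 1


@dataclass
class Annotation:
    text: str
    concept: str
    concept_version: int  # version of the concept at annotation time


@dataclass
class DynamicClassification:
    concepts: dict = field(default_factory=dict)
    annotations: list = field(default_factory=list)

    def add_concept(self, name, definition):
        self.concepts[name] = Concept(name, definition)

    def annotate(self, text, concept):
        current = self.concepts[concept]
        self.annotations.append(Annotation(text, concept, current.version))

    def rename(self, old, new):
        # Label-only change (assumed): the meaning is intact, so existing
        # annotations are simply re-pointed to the new label.
        concept = self.concepts.pop(old)
        concept.name = new
        self.concepts[new] = concept
        for annotation in self.annotations:
            if annotation.concept == old:
                annotation.concept = new
        return Impact.NONE

    def redefine(self, name, definition):
        # Meaning change (assumed): bump the version so that annotations
        # made under the old definition can be found later.
        concept = self.concepts[name]
        concept.definition = definition
        concept.version += 1
        return Impact.REVIEW

    def stale(self):
        # Annotations created before the latest change to their concept.
        return [a for a in self.annotations
                if a.concept_version < self.concepts[a.concept].version]


dcs = DynamicClassification()
dcs.add_concept("obligation", "the subject is required to act")
dcs.annotate("Er muss gehen.", "obligation")
dcs.rename("obligation", "necessity")
dcs.redefine("necessity", "the action is required by circumstances")
dcs.annotate("Du musst das lesen.", "necessity")
stale = dcs.stale()
```

In this sketch the rename leaves the first annotation valid under its new label, whereas the redefinition leaves it listed in `stale`, since it was created under the earlier version of the concept.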
About this article
Cover Date
2020-12-30
DOI
10.14357/19922264200415
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Key words
dynamic classification; faceted classification; reclassification; supracorpora databases
Authors
A. A. Goncharov, I. M. Zatsman, and M. G. Kruzhkov
Author Affiliations
Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation