Systems and Means of Informatics
2022, Volume 32, Issue 4, pp 32-44
METHODS FOR RETRIEVAL OF IMPLICIT LOGICAL-SEMANTIC RELATIONS FROM PARALLEL TEXTS
Abstract
The paper addresses the problem of targeted retrieval of text fragments with implicit logical-semantic relations (LSRs) from parallel texts. To determine whether such retrieval is possible, the author considers the potential of parallel texts as a source of new linguistic knowledge and reviews works that report the results of studies on implicit LSRs. The aim of the survey is to identify the methods used to create arrays of text fragments with implicit LSRs. The survey shows that parallel texts offer extensive opportunities for retrieving such fragments, because an explicit LSR in language A may signal an implicit LSR in language B (a bilingual explicit/implicit mismatch). The following methods have been successfully applied to detect such mismatches: (i) discourse parsing; (ii) reverse machine translation; and (iii) bilingual exclusion search. It can therefore be argued that parallel texts, unlike monolingual ones, make it possible to solve the task of targeted retrieval of fragments with implicit LSRs.
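The bilingual explicit/implicit mismatch described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation and not a definition of any of the three listed methods. It only assumes that aligned fragment pairs are available and that a connective list exists for each language. The connective lists, the sample sentences, and the names `find_connectives` and `explicit_implicit_mismatches` are hypothetical and introduced purely for illustration.

```python
import re

# Hypothetical, deliberately tiny connective lists (assumptions for illustration).
CONNECTIVES_A = ["because", "therefore", "however"]  # language A (English)
CONNECTIVES_B = ["parce que", "donc", "cependant"]   # language B (French)


def find_connectives(text, connectives):
    """Return the connectives that occur in text as whole words or phrases."""
    lowered = text.lower()
    return [c for c in connectives
            if re.search(r"\b" + re.escape(c) + r"\b", lowered)]


def explicit_implicit_mismatches(pairs):
    """Keep aligned pairs with a connective in language A but none in language B.

    Such pairs are only candidates: the language B fragment may express the
    relation implicitly, which still has to be checked by a linguist.
    """
    hits = []
    for text_a, text_b in pairs:
        found_a = find_connectives(text_a, CONNECTIVES_A)
        found_b = find_connectives(text_b, CONNECTIVES_B)
        if found_a and not found_b:
            hits.append((text_a, text_b, found_a))
    return hits


# Invented sample of aligned fragments (not taken from any corpus).
PAIRS = [
    ("He left because the train was late.",
     "Il est parti : le train était en retard."),
    ("He left because the train was late.",
     "Il est parti parce que le train était en retard."),
    ("She smiled.", "Elle a souri."),
]

if __name__ == "__main__":
    for text_a, text_b, found in explicit_implicit_mismatches(PAIRS):
        print(found, "|", text_a, "|", text_b)
```

Only the first pair is returned: the English fragment contains an explicit causal connective, while its French counterpart has no listed connective. A real study would need far larger connective inventories and manual verification of every candidate.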
About this article
Cover Date
2022-11-30
DOI
10.14357/08696527220404
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Key words
implicitness; implicit logical-semantic relations; targeted retrieval; natural language processing; knowledge extraction from texts; parallel texts
Authors
A. A. Goncharov
Author Affiliations
Federal Research Center "Computer Science and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation