Informatics and Applications

2018, Volume 12, Issue 4, pp 96-105

USING SUPRACORPORA DATABASES FOR QUANTITATIVE ANALYSIS OF MACHINE TRANSLATIONS

  • N. V. Buntman
  • A. A. Goncharov
  • I. M. Zatsman
  • V. A. Nuriev

Abstract

The paper discusses an information technology that supports expertise of machine translations. The technology has been developed to meet the following conditions: (i) there are connectives in all translated contexts; (ii) the connectives may be both one-word (khotya 'although,' a 'and') and multiword (da esche 'and beside this,' no zato 'but instead'); and (iii) between words making up a given connective, there may be a space (esli (space) tak 'if (space) then'). With this technology, expertise of machine translations develops through three main stages: (i) linguistic annotation of machine translations in a supracorpora database; (ii) quantitative processing of annotations; and (iii) linguistic analysis of annotations and quantitative data. The paper describes technological aspects of the first two stages. The examples given are only those with multiword connectives. Source sentences chosen for machine translation have been collected from literary texts.

[+] References (22)

[+] About this article