Systems and Means of Informatics
2016, Volume 26, Issue 3, pp 148-161
DISTRIBUTED AUTOMATED TECHNOLOGY OF HISTORICAL TEXTS ANALYSIS
- I. M. Adamovich
- O. I. Volkov
Abstract
The article focuses on the further development of the technology of such part of biographic investigation automation as texts surfing which is searching useful information, the character of which cannot be foreseen and, therefore, the appropriate web search query cannot be formulated. The faults of such technology based on the T-parser automatic facts extraction system are described and analyzed. The ways of its elimination with the aid of researchers' joint work support tools are proposed. The possibility of using Semantic Web decisions for this purpose was analyzed. The domain knowledge representation form based on semantic network is suggested. The advantage of such form over the hierarchical ontology which is used in Semantic Web is demonstrated. The main terms and principles of the new distributed technology of texts surfing in the biographic investigation with the aid of T-parser using some Semantic Web ideas are described. The implementation of the technology is described. The ways of its development are planned.
[+] References (15)
- Adamovich, I.M., and O. I. Volkov. 2015. Sistema izvlecheniya biograficheskikh faktov iz tekstov istoricheskoy napravlennosti [The system of facts extraction from historical texts]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 2(25):235-250.
- Adamovich, I. M., and O. I. Volkov. 2016. Ierarkhicheskaya forma predstavleniya biograficheskogo fakta [Hierarchial format of biohraphical fact]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 2(26): 147-161.
- Markova, N. A. 2015. Tekhnologiya podderzhki konkretno-istoricheskikh issledovaniy na osnove modeli faktopodobnykh vyskazyvaniy [Support technology for specific his-torical studies on the base of fact-like propositions model]. Programmnaya Inzheneriya [Software Engineering] 5:43-48.
- Adamovich, I.M., and O. I. Volkov. 2014. Sredstva podderzhki internet-poiska pri provedenii biograficheskikh issledovaniy [The technology of Internet search as a part of biographic investigation]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 2(24): 178-192.
- Khoroshevsky, V. F. 2008. Prostranstva znaniy vseti Internet i Semantic Web (Chast' 1) [Knowledge spaces in Internet and Semantic Web (Part 1)]. Iskusstvennyy Intellekt i Prinyatie Resheniy [Artificial Intelligence and Decision Making] 1:80-97.
- Rasin, V.V., and A.F. Tuzovsky. 2013. Predstavlenie znaniy o vremeni s uchetom neopredelennosti v ontologiyakh Semantic Web [Representation of temporal knowledge in Semantic Web ontologies considering knowledge uncertainty]. Dokl. TUSURa [TUSUR Proceedings] 2(28):157-162.
- Anatoliev, A. G. 2013. Razvitie veb-tekhnologiy: Osnovnye tendentsii i perspek- tivy. UMK po distsipline 'Veb-programmirovanie' dlya studentov 3-go kursa kafedry ASOIU OmGTU [Development of web technology: The main trends and prospects. Teaching materials on the subject 'Web-programming' for students of the 3rd year of the ASIPM Department of OmSTU]. Uchebno-metodicheskie materialy dlya studentov kafedry ASOIU [The educational materials for the students of ASIPM]. Available at: http://www.4stud.info/web-programming/lecture9.html (accessed January 01, 2016).
- Solovev, V. D., B. V. Dobrov, V. V. Ivanov, andN. V. Lukashevich. 2006. Ontologii i tezaurusy [Ontologies and thesaurus]. Kazan, Moscow: Kazan State University, M. V. Lomonosov Moscow State University. 156 p.
- MacDonald, M. 2011. HTML5. The missing manual. O'Reilly Media. 448 p.
- Andon, P. I., I.Y. Grishanova, and V.A. Reznichenko. 2008. Semantic Web kak novaya model' informatsionnogo prostranstva Internet [Semantic Web as a new model of Internet information space]. Problemy Programmirovaniya [Problems in Programming] Special edition, 2-4:417-430.
- Andreev, A. M., D. V. Berezkin, V. S. Ryimar, andK. V. Simakov. 2006. Ispol'zovanie tekhnologii Semantic Web v sisteme poiska nesootvetstviy v tekstakh dokumentov [Using Semantic Web technology for the task of inconsistency detection in natural lan-guage texts]. Elektronnye biblioteki: Perspektivnye metody i tekhnologii, elektronnye kollektsii: Tr. 8-y Vseross. nauchn. konf. RCDL'2006 [Digital Libraries: Advanced Methods and Technologies, Digital Collections: 8th All-Russian Scientific Conference RCDL'2006 Proceedings]. Yaroslavl: P. G. Demidov Yaroslavl State University. 263-269.
- Markova, N. A. 2012. Elektronnayakollektsiyabiograficheskikhfaktov [Digital collec-tion of biographic facts]. Elektronnye biblioteki: Perspektivnye metody i tekhnologii, elektronnye kollektsii: Tr. 14-y Vseross. nauchnoy konf. RCDL'2012 [Digital Libraries: Advanced Methods and Technologies, Digital Collections: 14th All-Russian Scientific Conference RCDL'2012 Proceedings]. Pereslavl-Zalessky. 287-293.
- Adamovich, I.M., O. I. Volkov, and N. A. Markova. 2012. Metod klassifikatsii informatsii na osnove ierarkhicheskikh tegov i ego realizatsiya na primere semeynogo arkhivnogo fonda [Method of information classification based on hierarchical tags and its implementation on the example of a family archive]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 2 (22): 134-144.
- Krasnoshchekov, E. E. 2005. Preimushchestva nechetkogo poiska relevantnoy infor-matsii [The advantages of fuzzy search for relevant information]. Scientific and Practical Conference (International) "Computer Technologies in Science, Manufacture, Social and Economic Processes" Proceedings. Novocherkassk. 44-46.
- Fitzgerald, M. 2012. Introducing regular expressions. O'Reilly Media. 154 p.
[+] About this article
Title
DISTRIBUTED AUTOMATED TECHNOLOGY OF HISTORICAL TEXTS ANALYSIS
Journal
Systems and Means of Informatics
Volume 26, Issue 3, pp 148-161
Cover Date
2016-08-30
DOI
10.14357/08696527160311
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
biographic investigation; distributed technology of texts surfing; semantic network; automatic facts extraction; Semantic Web
Authors
I. M. Adamovich and O. I. Volkov
Author Affiliations
Institute of Informatics Problems, Federal Research Center "Computer Science
and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|