Systems and Means of Informatics
2023, Volume 33, Issue 1, pp 135-145
- I. M. Adamovich
- O. I. Volkov
This article continues a series of works on a technology for supporting concrete historical research. The technology is based on the principles of co-creation and crowdsourcing and is designed for a wide range of users who are not professional historians or biographers. The article develops the technology further by integrating a mechanism that automatically identifies potentially promising directions of research. The proposed approach automatically fills information gaps in the set of facts describing the object of research by means of incomplete induction. The basis for inductive generalization is analyzed, and ways of forming it are shown. The article substantiates the use for this purpose of data imputation, a procedure commonly applied in data analysis and machine learning tasks. Data imputation methods are analyzed with regard to the features of the technology and the specifics of concrete historical research. The analysis shows that it is expedient to build the mechanism for automatic hypothesis formation on the classification tree method of data imputation based on the CHAID (Chi-squared Automatic Interaction Detection) algorithm.
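As a rough illustration of filling an information gap with a chi-squared-driven classification split, the Python sketch below imputes a missing categorical attribute and reports each imputed value as a hypothesis to be checked. It is not the authors' implementation. The record layout and the names `facts`, `estate`, `region`, `occupation`, `chi_squared`, and `impute` are hypothetical. The sketch is also heavily simplified relative to full CHAID: it makes a single split, ranks predictors by the raw Pearson chi-squared statistic rather than by Bonferroni-adjusted p-values, and does not merge predictor categories.

```python
from collections import Counter, defaultdict


def chi_squared(rows, predictor, target):
    """Pearson chi-squared statistic of the predictor x target contingency table."""
    observed = Counter((r[predictor], r[target]) for r in rows)
    row_totals = Counter(r[predictor] for r in rows)
    col_totals = Counter(r[target] for r in rows)
    n = len(rows)
    stat = 0.0
    for p_val, p_count in row_totals.items():
        for t_val, t_count in col_totals.items():
            expected = p_count * t_count / n
            stat += (observed[(p_val, t_val)] - expected) ** 2 / expected
    return stat


def impute(facts, target, predictors):
    """Propose values for missing `target` attributes using a one-level split.

    Assumes at least one record has a known target and that every
    predictor value is present in every record (a simplification).
    Returns (person, attribute, proposed value, predictor used) tuples.
    """
    complete = [f for f in facts if f.get(target) is not None]
    # Choose the predictor most strongly associated with the target.
    best = max(predictors, key=lambda p: chi_squared(complete, p, target))
    # Count target values within each category of the chosen predictor.
    by_category = defaultdict(Counter)
    for f in complete:
        by_category[f[best]][f[target]] += 1
    # Fallback for categories never seen among the complete records.
    overall = Counter(f[target] for f in complete).most_common(1)[0][0]
    hypotheses = []
    for f in facts:
        if f.get(target) is None:
            counts = by_category.get(f[best])
            guess = counts.most_common(1)[0][0] if counts else overall
            hypotheses.append((f["person"], target, guess, best))
    return hypotheses


# Hypothetical biographical facts; person E has an unknown occupation.
facts = [
    {"person": "A", "estate": "merchant", "region": "Moscow", "occupation": "trade"},
    {"person": "B", "estate": "merchant", "region": "Tula", "occupation": "trade"},
    {"person": "C", "estate": "clergy", "region": "Moscow", "occupation": "priest"},
    {"person": "D", "estate": "clergy", "region": "Tula", "occupation": "priest"},
    {"person": "E", "estate": "merchant", "region": "Moscow", "occupation": None},
]

for hypothesis in impute(facts, "occupation", ["estate", "region"]):
    print(hypothesis)
```

In this toy data set, `estate` separates the known occupations perfectly while `region` carries no information about them, so the split is made on `estate`. The proposed value is only a hypothesis obtained by incomplete induction and still has to be confirmed or rejected by the researcher.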
About this article
Institute of Informatics Problems, Russian Academy of Sciences
Key words
concrete historical investigation; distributed technology; formation of hypotheses; information gap; data imputation
I. M. Adamovich and O. I. Volkov
Author Affiliations
Federal Research Center "Computer Science and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation