Systems and Means of Informatics
2022, Volume 32, Issue 3, pp 136-146
AN APPROACH TO SEARCHING FOR ANOMALIES IN CONCRETE HISTORICAL DATA
- I. M. Adamovich
- O. I. Volkov
Abstract
The article continues the series of works devoted to the technology of concrete historical research supporting. The technology is based on the principles of co-creation and crowdsourcing and is designed for a wide range of users which are not professional historians and biographers. The article is devoted to the further development of the technology by integrating the mechanism of automated search for anomalies in concrete-historical information.
The analysis of the existing approach to the search for contradictions in historical and biographical facts has been carried out and its limitations and shortcomings have been revealed. The expediency of the transition from the search for obvious contradictions to the search for anomalies is justified within the development of this approach. The causes of anomalies have been analyzed, their classification has been carried out, and the features of anomalies in concrete-historical data have been determined. The analysis of the known methods for point anomalies detecting has been carried out and the impossibility of using the methods based on supervised learning as well as metric methods in the technology has been justified. The most promising method based on clustering has been found and further steps for its implementation have been determined. The necessary changes to the technology object model are described and justified.
[+] References (12)
- Gribach, S. V. 2010. Issledovanie semeynykh krizisov posredstvom psikholingvisticheskogo eksperimenta [The study of family crises through a psycholinguistic experiment]. Sborniki konferentsiy NITs Sotsiosfera [Conference NIC Sociosfera Proceedings] 6:4554.
- Adamovich, I. M., and O.I. Volkov. 2016. Tekhnologiya raspredelennogo avtomatizirovannogo analiza istoricheskikh tekstov [The distributed automated technology of historical texts analysis]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 26(3):148-161. doi: 10.14357/08696527160311.
- Adamovich, I. M., and O. I. Volkov. 2019. Edinaya tekhnologiya podderzhki konkretno-istoricheskikh issledovaniy [Unified technology of concrete historical research support]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 29(1): 194-205. doi: 10.14357/08696527190116.
- Adamovich, I. M., and O. I. Volkov. 2019. Printsipy organizatsii dannykh dlya tekhnologii podderzhki konkretno-istoricheskikh issledovaniy [The principles of data organization for the technology of concrete historical research support]. Sistemy
i Sredstva Informatiki - Systems and Means of Informatics 29(2): 161-171. doi: 10.14357/08696527190214.
- Adamovich, I. M., and O. I. Volkov. 2016. Ierarkhicheskaya forma predstavleniya biograficheskogo fakta [Hierarchial format of a biographical fact]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 26(2): 108-122. doi: 10.14357/ 08696527160207.
- Adamovich, I. M., and O. I. Volkov. 2020. Avtomatizirovannyy poisk protivorechiy v konkretno-istoricheskoy informatsii [Automated search for contradictions in concrete- historical information]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 30(3):145-153. doi: 10.14357/08696527200313.
- Adamovich, I. M., and O.I. Volkov. 2015. Sistema izvlecheniya biograficheskikh faktov iz tekstov istoricheskoy napravlennosti [The system of facts extraction from historical texts]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 25(3):235-250. doi: 10.14357/08696527150315.
- Kislyakov, A.N., and S.V. Polyakov. 2020. Iyerarkhicheskie metody klasterizatsii v zadache poiska anomal'nykh nablyudeniy na osnove grupp s narushennoy simmetriey [Hierarchical clustering methods in a task to find abnormal observations based on groups with broken symmetry]. Upravlencheskoe konsul'tirovanie [Administrative Consulting] 5:116-127.
- Adamovich, I. M., and O.I. Volkov. 2021. Ustoychivost' tekhnologii podderzhki konkretno-istoricheskikh issledovaniy k popytkam iskazheniya istorii [The resistance of technology of concrete historical investigation support to attempts of history distortion]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 31 (2): 152-162. doi: 10.14357/08696527210214.
- Chandola, V., A. Banerjee, and V. Kumar. 2009. Anomaly detection: A survey. ACM Comput. Surv. 41 (3): 1-58.
- Shkodyrev, V., K. Yagafarov, V. Bashtovenko, and E. Ilyina. 2017. Obzor metodov obnaruzheniya anomaliy v potokakh dannykh [The overview of anomaly detection methods in data streams]. CEUR Workshop Procee. 1864:8. 7 p.
- Kokoreva, Ya. V., and A. A. Makarov. 2015. Poetapnyy protsess klasternogo analiza dannykh na osnove algoritma klasterizatsii k-means [A phased process of cluster data analysis based on the k-means clustering algorithm]. Molodoy uchenyy [Young Scientist] 13(93): 126-128.
[+] About this article
Title
AN APPROACH TO SEARCHING FOR ANOMALIES IN CONCRETE HISTORICAL DATA
Journal
Systems and Means of Informatics
Volume 32, Issue 3, pp 136-146
Cover Date
2022-06-11
DOI
10.14357/08696527220313
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
concrete historical investigation; distributed technology; anomaly; historical-biographical fact; automated procedure
Authors
I. M. Adamovich and O. I. Volkov
Author Affiliations
Federal Research Center "Computer Science and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|