Systems and Means of Informatics
2021, Volume 31, Issue 4, pp 157-167
THE USE OF WEB-CRAWLERS IN TECHNOLOGY OF CONCRETE HISTORICAL INVESTIGATION SUPPORT
- I. M. Adamovich
- O. I. Volkov
Abstract
The article is devoted to the further development of the distributed technology of concrete historical investigation support based on the principles of crowdsourcing and focused on a wide range of users which are nonprofessional historians and biographers. Development is carried out through the automation of one of the main types of Internet searches (indirect Internet search) used in biographical research. The article analyzes the possible approaches to the automation of Internet search taking into account the specifics of concrete historical investigation. The use of web-crawlers is substantiated and the requirements for them arising from the distinctive of this technology are formulated. The possibility of using ready-made solutions is estimated. The necessary changes in the object model of the technology and the modifications of its algorithms related to indirect Internet search are described. As an additional measure to reduce the difficulty of indirect Internet search, the new mechanism for automating of the interaction of the technology users which execute their investigations in similar directions is proposed and described in detail.
[+] References (11)
- Pomnikova, A. Yu. 2019. Semeynaya istoriya v diskursivnom prostranstve [Family stories in different types of discourse]. Vestnik of Minin University 7(1):9. 22 p.
- Ikonnikova, S.N. 2012. Biografika kak chast' istoricheskoy kul'turologii [Biografical studies as part of the historical cultural studies]. Vestnik SPbGUKI [Bull. Saint Petersburg State University of Culture and Art] 2(11): 6- 10.
- Adamovich, I.M., and O. I. Volkov. 2014. Sredstva podderzhki internet-poiska pri provedenii biograficheskikh issledovaniy [The technology of Internet search as a part of biographic investigation]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 24(2): 178-192.
- Adamovich, I.M., and O. I. Volkov. 2015. Sistema izvlecheniya biograficheskikh faktov iz tekstov istoricheskoy napravlennosti [The system of facts extraction from historical texts]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 25(3):235-250.
- Adamovich, I. M., and O.I. Volkov. 2016. Tekhnologiya raspredelennogo avtomatizirovannogo analiza istoricheskikh tekstov [The distributed automated technology of historical texts analysis]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 26(3): 148-161.
- Adamovich, I. M., and O.I. Volkov. 2019. Edinaya tekhnologiya podderzhki konkretno-istoricheskikh issledovaniy [Unified technology of concrete historical research support]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 29(1): 194-205.
- Minakov, V. F. 2016. Znaniya v tsifrovom obshchestve [Knowledge in digital society]. Nauka-rastudent.ru 11:21. 10 p.
- Kiryushin, K. 2013. Monitoring sotsial'nykh media - vozmozhnosti i real'nost' [Social media monitoring - opportunities and reality]. Cossa. Available at: https://www.cossa.ru/trends/43220 (accessed September 12, 2021).
- Andreeva, K. A., andR. S. Shaydurov. 2014. Razrabotkapersonal'noy dokumental'noy informatsionno-poiskovoy sistemy dlya seti Internet [Development of personalized documentary information retrieval system for the Internet]. Reshetnevskie chteniya [Reshetnev Readings] 2:223-224.
- Gudkov, K. V., and M. V. Tonkushin. 2018. Analiz avtomatizirovannykh sistem sbora informatsii v seti Internet [Analysis of automated systems for collecting information on the Internet]. Sovremennye informatsionnye tekhnologii [Contemporary Information Technologies] 28:27-31.
- Pudikova, E. M. 2016. Obzor veb-kraulerov dlya resheniya zadachi sbora dannykh
o predstavitel'skikh saytakh zadannoy predmetnoy oblasti [A review of web crawlers for solving the problem of collecting data on representative sites of a given subject area]. Sistemnyy analiz [System Analysis] 20:1-16.
[+] About this article
Title
THE USE OF WEB-CRAWLERS IN TECHNOLOGY OF CONCRETE HISTORICAL INVESTIGATION SUPPORT
Journal
Systems and Means of Informatics
Volume 31, Issue 4, pp 157-167
Cover Date
2021-12-10
DOI
10.14357/08696527210413
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
concrete historical investigation; distributed technology; web- crawler; data model; Internet search
Authors
I. M. Adamovich and O. I. Volkov
Author Affiliations
Federal Research Center "Computer Science and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|