Informatics and Applications
2021, Volume 15, Issue 2, pp 36-43
INTELLIGENT ANALYSIS OF BIG DATA EXTENDIBLE COLLECTIONS UNDER THE LIMITS OF PROCESS-REALTIME
- A. A. Grusho
- M. I. Zabezhailo
- D. V. Smirnov
- E. E. Timonina
Abstract
The problem how to extract relevant to the fixed goal data from regularly extended by new information collections of Big Data not braking given limits for data analysis and decision making (being in agreement with so- called process-real time restrictions) is discussed. The proposed approach is based on implementation of modern artificial intelligence techniques including knowledge representation and reasoning formalization for so-called Intelligent Data Analysis (IDA) computer systems. Some critical barriers preventing efficient application of this type IDA (e. g., computational complexity of some related to IDA combinatorial problems, including provable getting some of them in well-known classes of computationally hard problems, some characteristic features of knowledge representation and search iteration enumeration control, optimization of accuracy, and completeness of search results) are analyzed. A formalized description for the designed IDA set of procedures is presented. The discussed approach is illustrated by examples of its implementation in a corporate computer system of malicious insider activities identification and counteraction operating in a large Russian commercial bank.
[+] References (16)
- The flexible AI-powered Search & Discovery platform. Available at: https://www.algolia.com (accessed May 12, 2021).
- IBM Watson Discovery. Available at: https://www.ibm. com/cloud/watson-discovery (accessed May 12, 2021).
- Power your website with the world's best search. Available at: https://www.yext.com (accessed May 12, 2021).
- A powerful search experience for your website - without the learning curve. Available at: https://swiftype.com (accessed May 12, 2021).
- SearchUnify wins two silver Stevies - one in collaboration with Bluebeam - in 2021 Stevie Awards for Sales & Customer Service. Available at: https://www. searchunify.com (accessed May 12, 2021).
- ELASTIC: Search more, spend less. Available at: https: // www.elastic.co (accessed May 12, 2021).
- Solr is the popular, blazing-fast, open source enterprise search platform built on Apache LuceneTM. Available at: https://lucene.apache.org/solr/ (accessed May 12, 2021).
- Introduction to Search with SPHINX. Available at: http://sphinxsearch.com (accessed May 12, 2021).
- Korporativnyy poisk "Sputnik" [Corporate Search "Sputnik"]. Available at: https://www.sputnik.ru/searchbox (accessed May 12, 2021).
- Arkhitektura platformy 1S-Predpriyatie: global'nyy poisk [1C-Enterprise Platform Architecture: Global search]. Available at: https://v8.1c.ru/platforma/globalnyy- poisk/ (accessed May 12, 2021).
- Cohn, P.M. 1965. Universal algebra. New York, NY Harper and Row. 333 p.
- Zabezhailo, M. I. 2015. O nekotorykh otsenkakh slozhnosti vichisleniy v DSM-rassuzhdeniyakh [To the computational complexity of hypotheses generation in JSM-method]. Iskusstvennyy intellect iprinyatie resheniy [Artificial Intelligence and Decision Making]. Part I. 1:3-17; Part II. 2:3-17.
- Grusho, A. A., M.I. Zabezhailo, A. A. Zatsarinny, and E. E. Timonina. 2018. O nekotorykh vozmozhnostyakh upravleniya resursami pri organizatsii proaktivnogo protivodeystviya komp'yuternym atakam [On some pos-sibilities of resource management for organizing active counteraction to computer attacks]. Informatika i ee Primeneniya - Inform. Appl. 12(1):62-70.
- Simon, J. 1977. On the difference between one and many Automata, languages and programming. Eds. A. Salomaa and M. Steinby. Lecture notes in computer science ser. Berlin-Heidelberg: Springer. 52:480-491.
- Valiant, L. G. 1979. The complexity of enumeration and reliability problems. SIAM J. Comput. 8:410-421.
- Valiant, L. G. 1979. The complexity of computing the permanent. Theor. Comput. Sci. 8:189-201.
[+] About this article
Title
INTELLIGENT ANALYSIS OF BIG DATA EXTENDIBLE COLLECTIONS UNDER THE LIMITS OF PROCESS-REALTIME
Journal
Informatics and Applications
2021, Volume 15, Issue 2, pp 36-43
Cover Date
2021-06-30
DOI
10.14357/19922264210206
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
Big Data; process-real time; intelligent data analysis; information security; insider malicious activities
Authors
A. A. Grusho , M. I. Zabezhailo , D. V. Smirnov , and E. E. Timonina
Author Affiliations
Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
A. A. Dorodnicyn Computing Center, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 40 Vavilov Str., Moscow 119333, Russian Federation
Sberbank of Russia, 19 Vavilov Str., Moscow 117999, Russian Federation
|