Informatics and Applications
2023, Volume 17, Issue 3, pp 2-7
ON THE FORMATION OF SETS OF PRECEDENTS BASED ON TABLES OF HETEROGENEOUS FEATURE DESCRIPTIONS BY METHODS OF TOPOLOGICAL THEORY OF DATA ANALYSIS
Abstract
Factoring out the contributions of individual variables when analyzing heterogeneous feature descriptions is a pressing problem in complex data mining. The paper develops the lattice formalism of the topological theory of data analysis and, within it, obtains new methods for generating parametric estimates and metrics on lattices formed over the topologies of sets of objects. The formalism was tested on the problem of forming sets of precedents for chemomicrobiome analysis. When the initial information was generated from regression coefficients and differences in the values of the training data, the resulting algorithms generalized very poorly (correlation coefficient on the control set 0.32 ± 0.20). Using the proposed estimates to generate sets of precedents in chemomicrobiomics problems substantially improved the generalizing ability of the corresponding algorithms (correlation coefficient on the control set 0.79 ± 0.21).
About this article
Cover Date
2023-10-10
DOI
10.14357/19922264230301
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Key words
topological data analysis; lattice theory; parametrization of lattice terms; human microbiome; pharmacoinformatics; algebraic approach of Yu. I. Zhuravlev
Authors
I. Yu. Torshin
Author Affiliations
Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation