Informatics and Applications
2019, Volume 13, Issue 4, pp 27-29
DATA MODEL SELECTION IN MEDICAL DIAGNOSTIC TASKS
Abstract
Effective solution of medical diagnostics tasks requires the use of complex probabilistic models which
allow one to adequately describe real data and permit the use of analytical methods of the supervised learning
classification. Choosing a model of a mixture of normal distributions solves the posed problems but leads to the
curse of dimensionality. The transition to the model of a mixture of probabilistic principal component analyzers
allows one to formally set the task of choosing its structural parameters. The solution is proposed to search by
combining the application of information criteria for the formation of initial approximations followed by refinement
of the resulting estimates. Using the example of experiments to diagnose liver diseases and to predict the chemical
composition of urinary stones, the capabilities of the described data analysis procedures are demonstrated. The
proposed solutions give a source of improving the accuracy of classification, impetus to experts in the subject area
to clarify the essence of the processes.
[+] References (7)
- Harris, E. K., and J. C. Boyd. 1995. Statistical bases of reference values in laboratory medicine. New York, NY: Marcel
Dekker. 361 p.
- Krivenko, M. P. 2015. Modeli dlya predstavleniya i obrabotki referensnykh znacheniy [Models for representation and
treatment of reference values]. Informatika i ee Prime-
neniya - Inform. Appl. 9(2):63-74.
- Jensen, R., and Q. Shen. 2008. Computational intelligence
and feature selection. Rough and fuzzy approaches. Hoboken,
NJ: John Wiley & Sons. 340 p.
- Krivenko, M. P. 2019. Vybor razmernostey dlya smesi
veroyatnostnykh analizatorov glavnykh component [The
choice of dimensions for a mixture of probabilistic analyzers
Received August 19, 2019
of the main components]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 29(3):4-15.
- Tipping, M. E., and C. M. Bishop. 1999. Mixtures of probabilistic principal component analyzers. Neural Comput. 11(2):443-482.
- Krivenko, M.P. 2019. Snizhenie razmernosti dlya smesi veroyatnostnykh analizatorov glavnykh komponent primenitel'no k zadacham meditsinskoy diagnostiki [Dimension reduction for a mixture of probabilistic analyzers of the main components as applied to medical diagnostic tasks]. Sistemy i Sredstva Informatiki - Systems and Means of Informatics 29(4):4-13.
- Albert, A., and E. K. Harris. 1987. Multivariate interpretation of clinical laboratory data. New York, NY: CRC Press. 328 p.
[+] About this article
Title
DATA MODEL SELECTION IN MEDICAL DIAGNOSTIC TASKS
Journal
Informatics and Applications
2019, Volume 13, Issue 4, pp 27-29
Cover Date
2019-12-30
DOI
10.14357/19922264190404
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
medical diagnostics; mixture of probabilistic principal component analyzers; model selection criterion; cross validation
Authors
M. P. Krivenko
Author Affiliations
Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|