Informatics and Applications

2023, Volume 17, Issue 2, pp 50-56

CRITERIA FOR CHOOSING THE FACTORIZATION MODEL DIMENSIONALITY

  • M. P. Krivenko

Abstract

The paper is devoted to the choice ofmodel dimension ofmatrix factorization in the presence ofmissing elements. The problem of estimating the parameters of the adopted data model is solved by multidimensional optimization. Estimating the value of reduced dimensionality is a typical example of the problem of choosing a model when an alternative arises during data analysis and the choice means either finding out the preferences of individual options or highlighting the "best" representative. Typically, applied selection criteria are based on likelihood function which requires probabilistic assumptions about the data. But when evaluating the parameters of the factor model under consideration, they are not set and it is impractical to introduce them, so as not to violate the commonality of the formulated task of reducing dimensionality. Therefore, an attempt was made to turn to the idea of reusing the available data for the statistical output. None of the existing approaches (bootstrap, folding knife, rechecks, as well as permutation tests) is suitable; so, an original method for generating new data by additional omissions ofelements ofthe original matrix was proposed. To process the formed samples, it is suggested to use a combination of the model of a mixture of normal distributions in conjunction with nuclear smoothing.
The proposed solutions make it possible to correctly carry out the procedure for justifying the dimensionality ofthe adopted factorization model. The exposition is illustrated by an example ofsynthetic data processing.

[+] References (7)

[+] About this article