Systems and Means of Informatics

2024, Volume 34, Issue 2, pp 123-133

METHOD FOR SEARCHING FOR OPTIMAL PARAMETER VALUES OF THE ENTITY RESOLUTION ALGORITHM FOR CONCRETE HISTORICAL DATA

  • I. M. Adamovich
  • O. I. Volkov

Abstract

The article is devoted to the use of the collective entity resolution method based on a new relational clustering algorithm, which is a modification of the greedy agglomerative clustering algorithm, in concrete historical investigation when processing nominative sources. The article proposes the method for searching for optimal values of parameters of the collective entity resolution algorithm for tasks related to concrete historical investigation. The method is based on the analysis of the specifics of concrete historical data, their comparison with test data for which there are estimates of the effectiveness of the algorithm, and the procedure for finding the optimal process parameters according to the Gauss-Seidel scheme that consists in sequentially searching for the function optimum alternately for each variable. The application of the proposed method makes it possible to use the considered entity resolution algorithm in real concrete historical research in the tasks of automated record linkage in nominative sources.

[+] References (10)

[+] About this article