Systems and Means of Informatics
2015, Volume 25, Issue 1, pp 168-185
TECHNOLOGY FOR PREVENTION OF DUPLICATION
OF BIBLIOGRAPHIC DESCRIPTIONS IN THE SCIENTIFIC
DATABASE BIAS IPI RAS
- M. Yu. Zaikin
- V. S. Dolgopolov
- O. L. Obuhova
- I. V. Soloviev
Abstract
The paper considers the developed technology aimed at avoiding
duplication of bibliographic descriptions in the scientific database Bibliographic
Information-Analytical System (BIAS) of IPI RAS. The analysis of the reasons
of duplications is given. The constituent parts of the developed software are the
modules of definition of similarity using the methods of fuzzy search based on
the Oliver algorithm and the modules of visualization of the results which are
built into the system at the level of formation of the database content. Program
modules of visualization allow moderators of BIAS IPI RAS to receive full
information about the conflicts. They will be able to decide on further action
using additional information. The concept of similarity index used in the software
modules of definition of similarity is introduced. The paper considers the formal
data model underlying construction of the database, built on the principles of
facet navigation. Application of the developed software made it possible to detect
and remove duplicate bibliographic descriptions in the scientific database.
[+] References (8)
- Scientific database Bibliographic Information-Analytical System (BIAS) of IPI RAS.
Available at: http://bias.ipiran.ru (accessed April 7, 2015).
- GOST 7.1-2003. Bibliograficheskaya zapis'. Bibliograficheskoe opisanie [Bibliographic record. Bibliographic description]. Available at: http://diss.rsl.ru/datadocs/
doc 291wu.pdf (accessed April 7, 2015).
- Database SpringerLink. Available at: http://link.springer.com/ (accessedApril 7, 2015).
- Oliver, I. 1994. Programming classics: Implementing the World's best algorithms.
Prentice Hall PTR. 386 à.
- Levenshteyn, V. I. 1965. Dvoichnye kody s ispravleniemvypadeniy, vstavok i zameshcheniy simvolov [Binary codes with correction for deletions, insertions, and substitutions of
symbols]. Dokl. Akad. Nauk SSSR 163(4):845-833.
- Zaikin, M.Yu., O. L. Obuhova, and I.V. Soloviev. 2014. Bibliograficheskaya
informatsionno-analiticheskaya sistema IPI RAN [Bibliographic information-analytical
system (BIAS) of IPI RAS]. Sistemy i Sredstva Informatiki - Systems and Means of
Informatics 24(1):244-258.
- Chochia, A.P., I.V. Soloviev, O. L. Obuhova, T.K. Biryukova, and M.M. Gershkovich. 2008. Model' adaptivnoy fasetnoy navigatsii v otkrytykh elektronnykh kollek-
tsiyakh [The model for adaptive facet navigation in open digital collections]. Sistemy
i Sredstva Informatiki - Systems and Means of Informatics 18(1):171-185.
- Obuhova, O., I. Soloviev, T. Biryukova , M. Gershkovich, and A. Chochia. 2009.
Model' fasetnogo informatsionnogo poiska v kollektsii nauchnykh materialov [A model
of facet information retrieval within the collection of scientific materials]. Sistemy
i Sredstva Informatiki - Systems and Means of Informatics 19(2):163-174.
[+] About this article
Title
TECHNOLOGY FOR PREVENTION OF DUPLICATION
OF BIBLIOGRAPHIC DESCRIPTIONS IN THE SCIENTIFIC
DATABASE BIAS IPI RAS
Journal
Systems and Means of Informatics
Volume 25, Issue 1, pp 168-185
Cover Date
2013-11-30
DOI
10.14357/08696527150111
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
similarity index; software modules of definition of similarity; method
of fuzzy search on the Oliver algorithm; facet navigation
Authors
M. Yu. Zaikin , V. S. Dolgopolov ,
O. L. Obuhova , and I. V. Soloviev
Author Affiliations
Institute of Informatics Problems, Federal Research Center "Computer Science
and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|