Informatics and Applications
December 2013, Volume 7, Issue 4, pp 52-65
METHOD OF BIBLIOGRAPHIC INFORMATION EXTRACTION FROM FULL-TEXT DESCRIPTIONS OF INVENTIONS
- I. M. Zatsman
- V. A. Havanskov
- S. K. Shubnikov
Abstract
This paper considers a method for extracting bibliographic information from full-text descriptions of inventions, which is
necessary for analyzing thematic linkages between science and technology. The research objective
is to develop principles for creating domestic information systems that calculate indicators
of thematic linkages. Information systems of this type are new to the Russian scientific and technical sphere. Their
creation is necessary for monitoring and evaluating research and development programs and for decision-making
at all stages of program activities. The proposed method of extracting bibliographic information from
natural language texts differs substantially from existing foreign and domestic analogs. First, the method takes into account
that bibliographic information can be found inside the natural language text of descriptions of inventions. Second,
the bibliographic information for a paper is treated as a structured information object, which is generally multilingual.
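
The two features named in the abstract (references embedded in running text, and a structured, multilingual bibliographic object) can be illustrated with a minimal sketch. It is not the authors' method: it assumes a regex-based approach and one hypothetical citation layout, "Authors Title // Source. Year.", and the names `CITATION`, `Reference`, and `extract_references` are invented for this illustration.

```python
import re
from dataclasses import dataclass
from typing import List

# One author: a surname followed by one or two initials,
# e.g. "Narin F." or "Кузнецов И.П.". [^\W\d_] matches one letter in any script.
_AUTHOR = r"[^\W\d_]+ [^\W\d_]\.(?:[^\W\d_]\.)?"

# Hypothetical citation layout (an assumption): "Authors Title // Source. Year."
CITATION = re.compile(
    rf"\b(?P<authors>{_AUTHOR}(?:, {_AUTHOR})*) "
    r"(?P<title>[^/]+?) // "
    r"(?P<source>[^.]+)\. "
    r"(?P<year>(?:19|20)\d{2})\."
)


@dataclass
class Reference:
    """Structured bibliographic object recovered from running text."""
    authors: List[str]
    title: str
    source: str
    year: int
    language: str  # "ru" if the citation contains Cyrillic, else "other"


def extract_references(text: str) -> List[Reference]:
    """Find every citation in `text` that follows the assumed layout."""
    refs = []
    for m in CITATION.finditer(text):
        # Crude script check standing in for real language identification.
        language = "ru" if re.search(r"[\u0400-\u04FF]", m.group(0)) else "other"
        refs.append(Reference(
            authors=m.group("authors").split(", "),
            title=m.group("title"),
            source=m.group("source"),
            year=int(m.group("year")),
            language=language,
        ))
    return refs
```

Real descriptions of inventions cite sources in many layouts, so a single pattern like this one would miss most references. The sketch only shows how a reference found inside prose can be turned into a structured record with a language tag.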
About this article
Cover Date
2013-12-31
DOI
10.14357/19922264130406
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Key words
linkages between science and technologies; methodology of indicator calculation; information systems;
architectural decisions; bibliographic information; patent documents
Authors
I. M. Zatsman, V. A. Havanskov, and S. K. Shubnikov
Author Affiliations
Institute of Informatics Problems, Russian Academy of Sciences, Moscow, Russia