Systems and Means of Informatics
2014, Volume 24, Issue 4, pp 29-44
METHODS FOR MAPPING OF COLLECTIONS PRESENTED IN NONTRADITIONAL DATA MODELS INTO THE INTEGRATED REPRESENTATION
- S. A. Stupnikov
- A. E. Vovchenko
Abstract
The paper considers transformations of collections presented in nontraditional data models, such as graph, triplet, and key-value models, into an integrated representation in the relational data model. The context of the paper is the development of a combined virtual and materialized environment for integrating heterogeneous collections of unstructured, semistructured, and structured data.
The proposed techniques constitute the basis for materialized integration of information resources in relational data warehouses over Hadoop.
About this article
Cover Date
2013-11-30
DOI
10.14357/08696527140402
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Key words
database integration; graph data models; RDF; NoSQL; data collections transformation
Author Affiliations
Institute of Informatics Problems, Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation