Main Architectures Used in ETL as a Tool, Necessary for the Integration of Data in Large Volumes - a Task in the Field of Digital Preservation of Cultural Heritage

Authors

  • Kamen Angelov, Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria

DOI:

https://doi.org/10.55630/dipp.2016.6.12

Keywords:

Cultural Heritage, Digitization, Data Integration, Optimization, Performance, ETL, Multiple Queries, Data Warehouse, Data Flow, Data Set, Quantum, Computational, Approach, Simulation, Uncertain, Predicates, Combinatorial, NP-Complete, Graph, Hamiltonian

Abstract

The digital recording of Cultural Heritage (CH) and its metadata is closely related to the concept of data warehousing systems and their lifecycle, and it uses the models and development tools of ETL packages. The tasks used to extract metadata from raw digitized CH data often involve hard computational problems, including NP-complete ones. This paper describes the current state of digital preservation of Cultural Heritage, with an emphasis on data integration and the performance optimization of processes. We present our vision for further solutions that automate the optimization of resource utilization in ETL jobs and thereby cut development cost. In parallel, we describe our experiment that uses quantum computation (in simulation mode) to solve hard combinatorial problems such as multiple query optimization.

Published

2016-09-30

How to Cite

Angelov, K. (2016). Main Architectures Used in ETL as a Tool, Necessary for the Integration of Data in Large Volumes - a Task in the Field of Digital Preservation of Cultural Heritage. Digital Presentation and Preservation of Cultural and Scientific Heritage, 6, 129–136. https://doi.org/10.55630/dipp.2016.6.12