Main Architectures Used in ETL as a Tool, Necessary for the Integration of Data in Large Volumes - a Task in the Field of Digital Preservation of Cultural Heritage
DOI:
https://doi.org/10.55630/dipp.2016.6.12
Keywords:
Cultural Heritage, Digitization, Data Integration, Optimization, Performance, ETL, Multiple Queries, Data Warehouse, Data Flow, Data Set, Quantum, Computational, Approach, Simulation, Uncertain, Predicates, Combinatorial, NP-Complete, Graph, Hamiltonian
Abstract
The digital recording of Cultural Heritage (CH) and its metadata is closely related to the concept of data warehousing systems and their lifecycle, and it relies on the models and development tools of ETL packages. The tasks used to extract metadata from raw digitized CH data often involve computationally hard problems, including NP-complete ones. This paper describes the current state of digital preservation of Cultural Heritage, with an emphasis on data integration and on performance optimization of the processes involved. We present our vision for further solutions that automate the optimization of resource utilization in ETL jobs and thereby cut development cost. In parallel, we describe our experiment that uses quantum computation (in simulation mode) to solve hard combinatorial problems such as multiple query optimization.
References
Access to digital culture heritage, Krassimira Ivanova, Milena Dobreva, Peter Stanchev, George Totkov, Plovdiv University, 2012
http://www.oclc.org/research/publications/library/2000/lavoie-oais.html
James Connolly, Data warehouses: Tips for building a disaster recovery plan (http://searchcio.techtarget.com/tip/Data-warehouses-Tips-for-building-a-disasterrecovery-plan)
T. T. Lwin, T. Thein, High Availability Cluster System for Local Disaster Recovery with Markov Modeling Approach, IJCSI International Journal of Computer Science Issues, Vol. 6, No. 2, 2009, ISSN (online) 1694-0784
http://www.dwavesys.com/resources/publications
https://www.microsoft.com/en-us/research/project/language-integrated-quantumoperations-liqui/
Sellis, Multiple Query Optimization. TODS, 1988
Immanuel Trummer, Christoph Koch, MQO on the D-Wave 2x Adiabatic Quantum Computer, arXiv:1510.06437v1 [cs.DB]
http://www.cs.cornell.edu/~sudip/quantumdb.pdf
Public Product information, Informatica®, https://www.informatica.com/content/dam/informatica-com/global/amer/us/collateral/datasheet/velocity_data-sheet_6091