Applying Associative Classifier PGN for Digitised Cultural Heritage Resource Discovery

Authors

  • Krassimira Ivanova Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria
  • Iliya Mitov Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria
  • Peter L. Stanchev Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria; Kettering University, Flint, USA

DOI:

https://doi.org/10.55630/dipp.2011.1.13

Keywords:

Data Mining, Associative Classifier, Metadata Extraction, Cultural Heritage

Abstract

Resource discovery is one of the key services in digitised cultural heritage collections. It requires intelligent mining in heterogeneous digital content as well as capabilities in large scale performance; this explains the recent advances in classification methods. Associative classifiers are convenient data mining tools used in the field of cultural heritage, by applying their possibilities to taking into account the specific combinations of the attribute values. Usually, the associative classifiers prioritize the support over the confidence. The proposed classifier PGN questions this common approach and focuses on confidence first by retaining only 100% confidence rules. The classification tasks in the field of cultural heritage usually deal with data sets with many class labels. This variety is caused by the richness of accumulated culture during the centuries. Comparisons of classifier PGN with other classifiers, such as OneR, JRip and J48, show the competitiveness of PGN in recognizing multi-class datasets on collections of masterpieces from different West and East European Fine Art authors and movements.

References

Greenberg, J.: Metadata extraction and harvesting: A comparison of two automatic metadata generation applications. Journal of Internet Cataloging, 6(4), 59 – 82 (2004)

Martines, F., Morale, F.: Investigation of metadata applications at Palermo astronomical observatory". Library and Information Services in Astronomy IV, July 2-5 (2002)

Shi, R., Maly, F., Zubair, M.: Automatic metadata discovery from non-cooperative digital libraries. In Proc. of IADIS Int. Conf. on e-Society 2003, Lisbon, Portugal (2003)

Cardinaels, K., Meire, M., Duval, E.: Automating metadata generation: the simple indexing interface. In Proc. of 14th Int. Conf. on WWW, Chiba, Japan, ACM, NY, 548-556 (2005)

Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery: an overview. In Advances in Knowledge Discovery and Data Mining. American Association for AI, Menlo Park, CA, USA, 1-34, (1996)

Klosgen, W, Zytkow, J.: Knowledge discovery in databases terminology. In Advances in Knowledge Discovery and Data Mining. AAAI Press, 573-592 (1996)

Kouamou, G.: A software architecture for data mining environment. Ch.13 in New Fundamental Technologies in Data Mining, InTech Publ., 241-258 (2011)

Maimon, O., Rokach, L.: Decomposition Methodology for Knowledge Discovery and Data Mining. Vol. 61 of Series in Machine Perception and Artificial Intelligence, WSP (2005)

Mitchell, T.: Machine Learning, McGraw-Hill (1997)

Berry, P., Harrison, I., Lowrance, J., Rodriguez, A., Ruspini, E., Thomere, J., Wolverton, M.: Link Analysis Workbench. Technical Report for Air Force Res. Lab. IFOIPA (2004)

Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In Knowledge Discovery and Data Mining, 80-86 (1998)

Li, W., Han, J., Pei, J.: CMAR: Accurate and efficient classification based on multiple class-association rules. In: Proc. of the IEEE ICDM, 369-376 (2001) K.Ivanova, I.Mitov, P.Stanchev, M.Dobreva, K.Vanhoof, B.Depaire

Zaiane, O., Antonie, M.-L.: Classifying text documents by associating terms with text categories. J. Australian Computer Science Communications, 24(2), 215-222 (2002)

Yin, X., Han, J.: CPAR: Classification based on predictive association rules. In SIAM Int. Conf. on Data Mining, 331-335 (2003)

Zimmermann, A., De Raedt, L.: CorClass: Correlated association rule mining for classification. In Discovery Science, LNCS, Vol. 3245, 60-72 (2004)

Rak, R., Stach, W., Zaiane, O., Antonie M.-L.: Considering re-occurring features in associative classifiers. In Advances in Knowledge Discovery and Data Mining, LNCS, Vol. 3518, 240-248 (2005)

Coenen, F., Leng, P.: Obtaining best parameter values for accurate classification. In Proc. IEEE ICDM, 597-600 (2005)

Wang, J., Karypis, G.: HARMONY: Eficiently mining the best rules for classification. In Proc. of SDM, 205-216 (2005)

Thabtah, F., Cowling, P., Peng, Y.: MCAR: multi-class classification based on association rule. In Proc. of the IEEE ACS, 33-33 (2005)

Antonie, M.-L., Zaiane, O., Holte, R.: Learning to use a learned model: A two-stage approach to classification. In Proc. of IEEE, 33-42, (2006)

Tang, Z., Liao, Q.: A new class based associative classification algorithm. Int. Journal of Applied Mathematics, 36(2), 15-19 (2007)

Depaire, B., Vanhoof, K., Wets, G.: ARUBAS: an association rule based similarity framework for associative classifiers. In Proc. of IEEE ICDM Workshops, 692-699 (2008)

Mitov, I., Ivanova, K., Markov, K., Velychko, V., Vanhoof, K., and Stanchev, P.: PaGaNe – a classification machine learning system based on the multidimensional numbered information spaces. In WSPS on CEIS, No. 2, 279-286 (2009)

Mitov, I.: Class Association Rule Mining Using Multi-Dimensional Numbered Information Spaces. PhD Thesis, Hasselt University, Belgium (2011)

Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. 2nd Edition, Morgan Kaufmann, San Francisco (2005)

Holte, R.: Very simple classification rules perform well on most commonly used datasets. Machine Learning, Vol. 11, 63-91 (1993)

Ivanova, K., Stanchev, P., Vanhoof, K.: Automatic tagging of art images with color harmonies and contrasts characteristics in art image collections. Int. Journal on Advances in Software, 3 (3&4), 474-484 (2010)

Downloads

Published

2011-09-30

How to Cite

Ivanova, K., Mitov, I., & L. Stanchev, P. (2011). Applying Associative Classifier PGN for Digitised Cultural Heritage Resource Discovery. Digital Presentation and Preservation of Cultural and Scientific Heritage, 1, 117–126. https://doi.org/10.55630/dipp.2011.1.13

Most read articles by the same author(s)