Skip to main content
Log in

An analytical review of XML association rules mining

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

Over the past decade, there has been increasing interest in using extensible markup language (XML) which has made it a de facto standard for representing and exchanging data over different systems and platforms (specifically the internet). Due to the popularity of XML and with increasing numbers of XML documents, the process of knowledge discovery from this type of data has found more attention. Although in the last decade several different methods have been proposed for mining XML documents, this research field still is in its infancy compared to traditional data mining. As in relational techniques, in the case of XML documents, association rule mining has a strong research interest. In this paper we have performed a comprehensive study on all of the major works so far done on mining association rules from XML documents. The main contribution of the paper is to provide a reference point for future researches by collecting different techniques and methods concerning the topic; classifying them into a number of categories and creating a complete bibliography of the major published works. We think that this paper can help researchers in XML association rules mining domains to quickly find the current work as the basis for the future activities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Abbreviations

XML:

eXtensible markup language

ARs:

Association rules

References

  • Abazeed A, Mamat A, Nasir M, Ibrahim H (2009a) Mining association rules from structured XML data. In: Proceedings of international conference on electrical engineering and informatics (ICEEI ’09), vol 02, pp 376–379

  • Abazeed A, Mamat A, Sulaiman MN, Ibrahim H (2009b) Scalable approach for mining association rules from structured XML data. In: Proceedings of the 2nd conference on data mining and optimization (DMO ’09), pp 5–9

  • Agrawal R, Izmielinski T, Swami A (1993a) Database mining: a performance perspective. IEEE Trans Knowl Data Eng 5:6:914–925

    Article  Google Scholar 

  • Agrawal R, Izmielinski T, Swami A (1993b) Mining association rules between sets of items in large database. In: Proceedings of the ACM SIGMOD, Washington, DC, pp 207–216

  • Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceeding of the 20th international conference on very large databases, pp 407–419

  • AliMohammadzadeh R, Rahgozar M, Zarnani A (2006a) A new model for discovering XML association rules from XML documents. Int J Appl Sci Eng Technol (IJASET), Trans Eng Comput Technol 14:365–369

    Google Scholar 

  • AliMohammadzadeh R, Soltan S, Rahgozar M (2006b) Template guided association rule mining from XML documents. In: Proceedings of the 15th international conference on World Wide Web (WWW ’06). ACM, New York, NY, USA, pp 963–964

  • Braga D, Campi A, Ceri S, Klemettinen M, Lanzi PL, (2002a) Mining association rules from XML data. In: Proceedings of DEXA, (2002) (DaWaK), LNCS 2454. Aixen- Provence, France, pp 21–30

  • Braga D, Campi A, Ceri S, Klemettinen M, Lanzi PL (2002b) A tool for extracting XML association rules. In: Proceedings of the 14th IEEE international conference on tools with artificial intelligence (ICTAI ’02). IEEE Computer Society, Washington, DC, USA, pp 57–64

  • Braga D, Campi A, Ceri S, Klemettinen M, Lanzi PL (2003) Discovering interesting information in XML data with association rules. In: Proceedings of the (2003) ACM symposium on applied computing (SAC ’03). ACM, New York, NY, USA, pp 450–454

  • Bray T, Paoli J, Sperberg-McQueen CM (1998) Extensible markup language (XML) 1.0. World Wide Web Consortium (W3C). http://www.w3.org/TR/REC-xml

  • Buddhakulsomsiri J, Siradeghyan Y, Zakarian A, Li X (2006) Association rule-generation algorithm for mining automotive warranty data. Int J Prod Res 44:14:2749–2770

    Article  Google Scholar 

  • Caneva E, Oliboni B, Quintarelli E (2009) Mining flexible association rules from XML. In: Proceedings of the (2009) EDBT/ICDT workshops (EDBT/ICDT ’09). ACM, New York, NY, USA, pp 85–92

  • Chen Y-L, Tang K, Shen R-J, Hu Y-H (2005) Market basket analysis in a multiple store environment. Decis Support Syst 40:2:339–354

    Article  Google Scholar 

  • Ding Q, Sundarraj G (2006) Association rule mining from XML data. In: Proceedings of international conference on data mining, Las Vegas, Nevada, pp 144–150

  • Ding Q, Ricords K, Lumpkin J (2003) Deriving general association rules from XML data. In: Proceedings of international conference on software engineering, artificial intelligence, networking, and parallel/distributed computing, Lübeck, Germany, pp 348–352

  • Exarchos TP, Papaloukas C, Fotiadis DI, Michalis LK (2006) An association rule mining-based methodology for automated detection of ischemic ECG beats. IEEE Trans Biomed Eng 53:8:1531–1540

    Article  Google Scholar 

  • Feng L, Dillon TS (2004) Mining XML-enabled association rule with templates. In: Proceedings of KDID

  • Feng L, Dillon TS, Weigand H, Chang E (2003) An XML-enabled association rule framework. In: Proceedings of DEXA, 2003, pp 88–97

  • Garofalakis M, Gionis A, Rastogi R, Seshadri S, Shim K (2003) XTRACT: learning document type descriptors from XML document collections. Data Min Knowl Discov 7(1):23–56

    Article  MathSciNet  Google Scholar 

  • Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: Proceedings of ACM SIGMOD international conference on management of data, pp 1–12

  • Khaing MM, Thein N (2006) An efficient association rule mining For XML data. In: Proceedings of international joint conference SICE-ICASE, pp 5782–5786

  • Li X-Y, Yuan J-S, Kong Y-H (2007) Mining association rules from XML data with index table. In: Proceedings of international conference on machine learning and cybernetics, vol 07, pp 3905–3910

  • Liu H-C, Zeleznikow J, Jamil HM (2006) Logic-based association rule mining in XML documents. In: Shen H T, Li J, Li M, Ni J, Wang W (ed) Proceedings of the international conference on advanced web and network technologies, and applications (APWeb’06). Springer, Berlin, Heidelberg, pp 97–106

  • Mazuran M, Quintarelli E, Tanca L (2009) Mining tree-based association rules from XML documents. In: Proceedings of SEBD, pp 109–116

  • Meo R, Psaila G, Ceri S (1998) An extension to SQL for mining association rules. Data Min Knowl Discov 2(2):195–224

    Article  Google Scholar 

  • Moh C-H, Lim E-P, Ng W-K (2000) DTD-miner: a tool for mining DTD from XML documents. In: Proceedings of the second international workshop on advance issues of E-commerce and web-based information systems (WECWIS ’00). IEEE Computer Society, Washington, DC, USA, pp 144–151

  • Mustapha N, Sulaiman MN, Othman M, Selamat MH (2003) Fast discovery of long patterns for association rules. Int J Comput Math 80(8):967–976

    Article  Google Scholar 

  • Nayak R (2009) Discovering knowledge from XML documents. In: Wang J (ed) Encyclopedia of data warehousing and mining, 2nd edn, IGI Global, pp 663–668. doi:10.4018/978-1-60566-010-3.ch103

  • Paik J, Nam J, Lee S, Kim UM (2007) A framework for data structure-guided extraction of XML association rules. In: Shi Y, Albada G D, Dongarra J, Sloot PM, (ed) Proceedings of the 7th international conference on computational science ((ICCS ’07). Springer, Berlin, Heidelberg, Part III, pp 709–716

  • Paik J, Nam J, Kim WY, Ryu JS, Kim UM (2009) Mining association rules in tree structured XML data. In: Proceedings of the 2nd international conference on interaction sciences: information technology, culture and human (ICIS ’09). ACM, New York, NY, USA, pp 807–811

  • Paik J, Youn HY, Kim U (2005) A new method for mining association rules from a collection of XML documents. In: Gervasi O, Gavrilova ML, Kumar V, Laganà A, Lee HP (ed) Proceedings of the international conference on computational science and its applications (ICCSA’05), vol. part II. Springer, Berlin, Heidelberg, pp 936–945

  • Porkodi R, Bhuvaneswari V, Rajesh R, Amudha T (2009) An improved association rule mining technique for Xml data using Xquery and apriori algorithm. In: Proceedings of IEEE international advance computing conference (IACC 2009), pp 1510–1514

  • Rusu LI, Rahayu W, Taniar D (2006a) Extracting variable knowledge from multiversioned XML documents. In: Proceedings of the sixth IEEE international conference on data mining—workshops (ICDMW ’06). IEEE Computer Society, Washington, DC, USA, pp 70–74

  • Rusu LI, Rahayu W, Taniar D (2006b) Mining changes from versions of dynamic XML documents. In: Proceedings of the 1st international workshop of knowledge discovery from XML documents (KDXD 2006), vol 3915. Singapore, LNCS, pp 3–12

  • Shahriar MdS, Liu J (2011) On mining association rules with semantic constraints in XML. In: Proceedings of sixth IEEE international conference on digital information management (ICDIM 2011), Melbourne, Australia, Sept 26–28

  • Shin J, Paik J, Kim U (2006) Mining association rules from a collection of XML documents using cross filtering algorithm. In: Proceedings of the international conference on hybrid information technology (ICHIT ’06), vol 1. IEEE Computer Society, Washington, DC, USA, pp 120–126

  • The World Wide Web Consortium (W3C) (2004) Extensible markup language (XML) 1.0, 3rd edn, W3C Recommendation. http://www.w3.org/TR/2004/RECxml-20040204/. Accessed 14 Feb 2012

  • Thompson HS, Beech D, Maloney M, Mendelsohn N (2000) XML schema part 1: structures, W3C working draft. http://www.w3.org/TR/xmlschema-1/

  • Tsoi AC, Zhang C, Hagenbuchner M (2005) Pattern discovery on Australian medical claims data—a systematic approach. IEEE Trans Knowl Data Eng 17:10:1420–1435

    Article  Google Scholar 

  • Wan JWW, Dobbie G (2004) Mining association rules from XML data using XQuery. In: Proceedings of the second workshop on Australasian information security, data mining and web intelligence, and software internationalisation (ACSW Frontiers ’04), vol 32. Australian Computer Society, Inc., Darlinghurst, Australia, Australia, pp 169–174

  • Wang X, Cao C (2008) Mining association rules from complex and irregular XML documents using XSLT and Xquery. In: Proceedings of the international conference on advanced language processing and web information technology (ALPIT ’08). IEEE Computer Society, Washington, DC, USA, pp 314–319

  • World Wide Web Consortium. XQuery 1.0:An XML query language (W3C Working Draft). http://www.w3.org/TR/2002/WDxquery-20020816, Aug 2002. Accessed 3 Feb 2012

  • Xiao Y, Yao FG, Li Z, Dunham MH (2003) Efficient data mining for maximal frequent subtrees. In: Proceedings of the third IEEE international conference on data mining (ICDM ‘03). IEEE computer Society, Washington, DC, USA, p 379

  • Zao-xin L (2008) Association rules mining method from XML based on ontology. J Comput Appl 28(9): 2318–2320

    Google Scholar 

  • Zhang M, He C (2010) Survey on association rules mining algorithms. Lect Notes Electr Eng 56:111–118

    Article  Google Scholar 

  • Zhang S, Zhang J, Liu H, Wang W (2005) XAR-miner: efficient association rules mining for XML data. In: Proceedings of special interest tracks and posters of the 14th international conference on World Wide Web (WWW ’05). ACM, New York, NY, USA, pp 894–895

  • Zhang J, Ling TW, Bruckner R, Tjoa AM, Liu H (2004) On efficient and effective association rule mining from XML data. In: 15th international conference on database and expert systems applications (DEXA’04), 30 Aug–3 Sept, Zaragoza, Spain

  • Zhang J, Liu H, Ling TW, Bruckner RM, Tjoa AM (2006) A framework for efficient association rule mining in XML data. J Database Manag (JDM) 17(3):19–40

    Article  Google Scholar 

  • Zhao Q, Chen L, Bhowmick SS, Madria S (2006) XML structural delta mining: issues and challenges. Data Knowl Eng 59(3):627–651

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Moradi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Moradi, M., Keyvanpour, M.R. An analytical review of XML association rules mining. Artif Intell Rev 43, 277–300 (2015). https://doi.org/10.1007/s10462-012-9376-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-012-9376-5

Keywords

Navigation