Abstract
The induction of knowledge from a data set relies in the execution of multiple data mining actions: to apply filters to clean and select the data, to train different algorithms (clustering, classification, regression, association), to evaluate the results using different approaches (cross validation, statistical analysis), to visualize the results, etc. In a real data mining process, previous actions are executed several times, sometimes in a loop, until an accurate result is obtained. However, performing previous tasks requires a data mining engineer or expert which supervises the design and evaluate the whole process. The goal of this paper is to describe MOLE, an architecture to automatize the data mining process. The architecture assumes that the data mining process can be seen from a classical planning perspective, and hence, that classical planning tools can be used to design the process. MOLE is built and instantiated on the basis of i) standard languages to describe the data set and the data mining process; ii) available tools to design, execute and evaluate the data mining processes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bernstein, F.P.A., Hill, S.: Toward intelligent assistance for a data mining process: An ontology-based approach for cost-sensitive classification. IEEE Transactions on Knowledge and Data Engineering 17(4) (2005)
Amant, R.S., Cohen, P.R.: Evaluation of a semi-autonomous assistant for exploratory data analysis. In: Proc. of the First Intl. Conf. on Autonomous Agents, Marina Del Rey, CA, pp. 355–362. ACM Press, New York (1997)
Chien, S.A., Mortensen, H.B.: Automating image processing for scientific data analysis of a large image database. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 854–859 (1996)
de la Rosa, T., García-Olaya, A., Borrajo, D.: Using cases utility for heuristic planning improvement. In: Weber, R.O., Richter, M.M. (eds.) ICCBR 2007. LNCS, vol. 4626, pp. 137–148. Springer, Heidelberg (2007)
de la Rosa, T., Jiménez, S., Borrajo, D.: Learning relational decision trees for guiding heuristic planning. In: Proceedings of ICAPS 2008, Sydney, Australia. AAAI Press, Menlo Park (2008)
Edelkamp, S., Hoffmann, J.: The language for the 2004 international planning competition (2004)
Engels, R.: Planning tasks for knowledge discovery in databases; performing task-oriented user-guidance. In: Proc. of the 2nd Int. Conf. on KDD (1996)
Fernández, S., Borrajo, D., Fuentetaja, R., Arias, J.D., Veloso, M.: PLTOOL. A KE tool for planning and learning. Knowledge Engineering Review Journal 22(2), 153–184 (2007)
García-Durán, R., Fernández, F., Borrajo, D.: Learning and transferring relational instance-based policies. In: Taylor, A.F.M., Driessens, K. (eds.) Working Notes of the AAAI 2008 workshop on Transfer Learning for Complex Tasks, Chicago, IL, USA, pp. 19–24. AAAI Press, Menlo Park (2008); Technical Report WS-08-13
Gerevini, A., Saetti, A., Serina, I.: Planning through stochastic local search and temporal action graphs. Journal of Artificial Intelligence Research 20, 239–290 (2003)
Hoffmann, J.: The Metric-FF planning system: Translating “ignoring delete lists” to numeric state variables. Journal of Artificial Intelligence Research 20, 291–341 (2003)
Michalski, R.S., Kaufman, K.A.: Discovery planning: Multistrategy learning in data mining. In: Proceedings of the Fourth International Workshop on Multistrategy Learning, pp. 14–20 (1998)
Morik, K., Scholz, M.: The MiningMart Approach to Knowledge Discovery in Databases. In: Intelligent Technologies for Information Analysis, pp. 47–65. Springer, Heidelberg (2004)
Rodríguez-Moreno, M.D., Oddi, A., Borrajo, D., Cesta, A.: IPSS: A hybrid approach to planning and scheduling integration. IEEE Transactions on Knowledge and Data Engineering 18(12), 1681–1695 (2006)
Witten, I., Frank, E.: Data mining: practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco (2000)
Zimmerman, T., Kambhampati, S.: Learning-assisted automated planning: Looking back, taking stock, going forward. AI Magazine 24(2), 73–96 (Summer 2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fernández, F., Borrajo, D., Fernández, S., Manzano, D. (2009). Assisting Data Mining through Automated Planning. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2009. Lecture Notes in Computer Science(), vol 5632. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03070-3_57
Download citation
DOI: https://doi.org/10.1007/978-3-642-03070-3_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03069-7
Online ISBN: 978-3-642-03070-3
eBook Packages: Computer ScienceComputer Science (R0)