Skip to main content

Assisting Data Mining through Automated Planning

  • Conference paper
Machine Learning and Data Mining in Pattern Recognition (MLDM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5632))

Abstract

The induction of knowledge from a data set relies in the execution of multiple data mining actions: to apply filters to clean and select the data, to train different algorithms (clustering, classification, regression, association), to evaluate the results using different approaches (cross validation, statistical analysis), to visualize the results, etc. In a real data mining process, previous actions are executed several times, sometimes in a loop, until an accurate result is obtained. However, performing previous tasks requires a data mining engineer or expert which supervises the design and evaluate the whole process. The goal of this paper is to describe MOLE, an architecture to automatize the data mining process. The architecture assumes that the data mining process can be seen from a classical planning perspective, and hence, that classical planning tools can be used to design the process. MOLE is built and instantiated on the basis of i) standard languages to describe the data set and the data mining process; ii) available tools to design, execute and evaluate the data mining processes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bernstein, F.P.A., Hill, S.: Toward intelligent assistance for a data mining process: An ontology-based approach for cost-sensitive classification. IEEE Transactions on Knowledge and Data Engineering 17(4) (2005)

    Google Scholar 

  2. Amant, R.S., Cohen, P.R.: Evaluation of a semi-autonomous assistant for exploratory data analysis. In: Proc. of the First Intl. Conf. on Autonomous Agents, Marina Del Rey, CA, pp. 355–362. ACM Press, New York (1997)

    Chapter  Google Scholar 

  3. Chien, S.A., Mortensen, H.B.: Automating image processing for scientific data analysis of a large image database. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 854–859 (1996)

    Article  Google Scholar 

  4. de la Rosa, T., García-Olaya, A., Borrajo, D.: Using cases utility for heuristic planning improvement. In: Weber, R.O., Richter, M.M. (eds.) ICCBR 2007. LNCS, vol. 4626, pp. 137–148. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  5. de la Rosa, T., Jiménez, S., Borrajo, D.: Learning relational decision trees for guiding heuristic planning. In: Proceedings of ICAPS 2008, Sydney, Australia. AAAI Press, Menlo Park (2008)

    Google Scholar 

  6. Edelkamp, S., Hoffmann, J.: The language for the 2004 international planning competition (2004)

    Google Scholar 

  7. Engels, R.: Planning tasks for knowledge discovery in databases; performing task-oriented user-guidance. In: Proc. of the 2nd Int. Conf. on KDD (1996)

    Google Scholar 

  8. Fernández, S., Borrajo, D., Fuentetaja, R., Arias, J.D., Veloso, M.: PLTOOL. A KE tool for planning and learning. Knowledge Engineering Review Journal 22(2), 153–184 (2007)

    Article  Google Scholar 

  9. García-Durán, R., Fernández, F., Borrajo, D.: Learning and transferring relational instance-based policies. In: Taylor, A.F.M., Driessens, K. (eds.) Working Notes of the AAAI 2008 workshop on Transfer Learning for Complex Tasks, Chicago, IL, USA, pp. 19–24. AAAI Press, Menlo Park (2008); Technical Report WS-08-13

    Google Scholar 

  10. Gerevini, A., Saetti, A., Serina, I.: Planning through stochastic local search and temporal action graphs. Journal of Artificial Intelligence Research 20, 239–290 (2003)

    MATH  Google Scholar 

  11. Hoffmann, J.: The Metric-FF planning system: Translating “ignoring delete lists” to numeric state variables. Journal of Artificial Intelligence Research 20, 291–341 (2003)

    MATH  Google Scholar 

  12. Michalski, R.S., Kaufman, K.A.: Discovery planning: Multistrategy learning in data mining. In: Proceedings of the Fourth International Workshop on Multistrategy Learning, pp. 14–20 (1998)

    Google Scholar 

  13. Morik, K., Scholz, M.: The MiningMart Approach to Knowledge Discovery in Databases. In: Intelligent Technologies for Information Analysis, pp. 47–65. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  14. Rodríguez-Moreno, M.D., Oddi, A., Borrajo, D., Cesta, A.: IPSS: A hybrid approach to planning and scheduling integration. IEEE Transactions on Knowledge and Data Engineering 18(12), 1681–1695 (2006)

    Article  Google Scholar 

  15. Witten, I., Frank, E.: Data mining: practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco (2000)

    Google Scholar 

  16. Zimmerman, T., Kambhampati, S.: Learning-assisted automated planning: Looking back, taking stock, going forward. AI Magazine 24(2), 73–96 (Summer 2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fernández, F., Borrajo, D., Fernández, S., Manzano, D. (2009). Assisting Data Mining through Automated Planning. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2009. Lecture Notes in Computer Science(), vol 5632. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03070-3_57

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03070-3_57

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03069-7

  • Online ISBN: 978-3-642-03070-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics