skip to main content
research-article

Report on the XML mining track at INEX 2007 categorization and clustering of XML documents

Published:01 June 2008Publication History
Skip Abstract Section

Abstract

This report concerns the last edition of the XML Mining Track at INEX 2007. A preceding report has been already published concerning the two preceding editions of the track. We present here the new corpus used for this third phase and briefly describe the models and the results obtained by the different participants.

References

  1. Denoyer, L., Gallinari, P.: Report on the xml mining track at inex 2005 and inex 2006: categorization and clustering of xml documents. SIGIR Forum 41(1) (2007) 79--90 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Denoyer, L., Gallinari, P.: The Wikipedia XML Corpus. SIGIR Forum (2006) Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. de Campos, L. M., Fernandez-Luna, J. M., Huete, J. F., Romero, A. E.: Probabilistic methods for structured document classification at inex '07. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar
  4. Murugeshan, M. S., Krishnamurthy, L., Mukherjee, S.: Lakshmi krishnamurthy and saswati mukherjee. an ncd based approach for wikipedia categorization task. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar
  5. Yang, J., Zhang, F.: Xml document classification using extended vsm. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar
  6. Hagenbuchner, M., Tsoi, A. C., Sperduti, A., Kc, M.: Efficient clustering of structured documents using graph self-organizing maps. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar
  7. Tran, T., Nayak, R., Bruza, P.: Document clustering using incremental and pairwise approaches. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar
  8. Yao, J., Zerida, N.: Rare patterns to improve path-based clusteringof wikipedia articles. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar
  9. Kutty, S., Tran, T., Nayak, R., Li, Y.: Clustering xml documents using closed frequent subtrees- a structure-only based approach. In: Workshop of the INitiative for the Evaluation of XML Retrieval. (2007)Google ScholarGoogle Scholar

Index Terms

  1. Report on the XML mining track at INEX 2007 categorization and clustering of XML documents

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGIR Forum
      ACM SIGIR Forum  Volume 42, Issue 1
      June 2008
      76 pages
      ISSN:0163-5840
      DOI:10.1145/1394251
      Issue’s Table of Contents

      Copyright © 2008 Authors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 June 2008

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader