Skip to main content

An Iterative Approach for Web Catalog Integration with Support Vector Machines

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3689))

Abstract

Web catalog integration is an emerging problem in current digital content management. Past studies show that more improvement on integration accuracy can be achieved with advanced classifiers. Because Support Vector Machine (SVM) has shown its supremeness in recent research, we propose an iterative SVM-based approach (SVM-IA) to improve the integration performance. We have conducted experiments of real-world catalog integration to evaluate the performance of SVM-IA and cross-training SVM. The results show that SVM-IA has prominent accuracy performance, and the performance is more stable.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: On Integrating Catalogs. In: Proc. the 10th WWW Conf (WWW 10), pp. 603–612 (May 2001)

    Google Scholar 

  2. Sarawagi, S., Chakrabarti, S., Godbole, S.: Cross-Training: Learning Probabilistic Mappings between Topics. In: Proc. the 9th ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining, pp. 177–186 (August 2003)

    Google Scholar 

  3. Chen, I.-X., Shih, C.-H., Yang, C.-Z.: Web Catalog Integration using Support Vector Machines. In: Proc. 1st IWT (October 2004)

    Google Scholar 

  4. Tsay, J.-J., Chen, H.-Y., Chang, C.-F., Lin, C.-H.: Enhancing Techniques for Efficient Topic Hierarchy Integration. In: Proc. the 3rd Int’l Conf. on Data Mining (ICDM 2003), pp. 657–660 (November 2003)

    Google Scholar 

  5. Zhang, D., Lee, W.S.: Web Taxonomy Integration using Support Vector Machines. In: Proc. WWW 2004, pp. 472–481 (May 2004)

    Google Scholar 

  6. Kim, D., Kim, J., Lee, S.: Catalog Integration for Electronic Commerce through Category-Hierarchy Merging Technique. In: Proc. the 12th Int’l Workshop on Research Issues in Data Engineering: Engineering e-Commerce/e-Business Systems (RIDE 2002), pp. 28–33 (February 2002)

    Google Scholar 

  7. Marron, P.J., Lausen, G., Weber, M.: Catalog Integration Made Easy. In: Proc. the 19th Int’l Conf. on Data Engineering (ICDE 2003), pp. 677–679 (March 2003)

    Google Scholar 

  8. Stonebraker, M., Hellerstein, J.M.: Content Integration for e-Commerce. In: Proc. the 2001 ACM SIGMOD Int’l Conf. on Management of Data, pp. 552–560 (May 2001)

    Google Scholar 

  9. Zadrozny, B.: Reducing Multiclass to Binary by Coupling Probability Estimates. In: Dietterich, T.G., Becker, S., Ghahramani, Z. (eds.) Advances in Neural Information Processing Systems 14 (NIPS 2001). MIT Press, Cambridge (2002)

    Google Scholar 

  10. Frakes, W., Baeza-Yates, R.: Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs (1992)

    Google Scholar 

  11. Joachims, T.: Making Large-Scale SVM Learning Practical. In: Scholkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods: Support Vector Learning. MIT Press, Cambridge (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, IX., Ho, JC., Yang, CZ. (2005). An Iterative Approach for Web Catalog Integration with Support Vector Machines. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_71

Download citation

  • DOI: https://doi.org/10.1007/11562382_71

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29186-2

  • Online ISBN: 978-3-540-32001-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics