The Ferrety algorithm for the KDD Cup 2005 problem

Published: 01 December 2005

Abstract

In this paper, we present a general solution to the KDD Cup 2005 problem. It uses the Internet as a source of knowledge and extends it to categorize very short documents (fewer than five words) with reasonable accuracy. Our approach consists of three main parts: (i) a central knowledge filter, (ii) an on-demand web crawler, and (iii) a very efficient categorizer system. Our solution obtained the Creativity and Precision Runner-up Awards at the competition. The main idea of the Ferrety algorithm can be generalized to map one taxonomy to another when training documents are available.
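The general idea of enriching a very short document with web text before categorizing it can be illustrated with a toy sketch. This is not the authors' implementation: the abstract gives no algorithmic details, so the stubbed "crawler" (`FAKE_WEB`), the category names and training texts (`CATEGORY_DOCS`), and the use of plain term-frequency cosine similarity are all assumptions made for illustration only.

```python
import math
from collections import Counter

# Hypothetical stand-in for an on-demand crawler: maps a query to text
# that a web search might return. A real system would fetch pages here.
FAKE_WEB = {
    "jaguar xk8": "jaguar xk8 coupe car engine review dealer price",
    "python tutorial": "python programming language tutorial code software",
}

# Hypothetical training text for each target category (assumed, not from the paper).
CATEGORY_DOCS = {
    "Autos": "car engine dealer vehicle price coupe sedan",
    "Computers/Software": "software programming code language computer",
}


def tf_vector(text):
    """Return a term-frequency vector for whitespace-separated text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def expand_query(query, fetch=FAKE_WEB.get):
    """Append fetched web text to the short query (empty if nothing is found)."""
    extra = fetch(query) or ""
    return f"{query} {extra}".strip()


def categorize(query, top_k=1):
    """Rank categories by similarity to the expanded query; drop zero scores."""
    expanded = tf_vector(expand_query(query))
    scored = [
        (cosine(expanded, tf_vector(doc)), name)
        for name, doc in CATEGORY_DOCS.items()
    ]
    scored.sort(reverse=True)
    return [name for score, name in scored[:top_k] if score > 0]


print(categorize("jaguar xk8"))
print(categorize("python tutorial"))
```

Without the expansion step, a two-word query such as "jaguar xk8" shares no terms with either category text in this toy setup, which is the motivation for drawing on an external knowledge source.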

References

[1] HITEC categorizer online. http://categorizer.tmit.bme.hu.
[2] G. Salton and M. J. McGill. An Introduction to Modern Information Retrieval. McGraw-Hill, 1983.
[3] D. Tikk, Gy. Biró, and J. D. Yang. A hierarchical text categorization approach and its application to FRT expansion. Australian Journal of Intelligent Information Processing Systems, 8(3):123–131, 2004.
[4] D. Tikk, Gy. Biró, and J. D. Yang. Experiments with a hierarchical text categorization method on WIPO patent collections. In N. O. Attoh-Okine and B. M. Ayyub, editors, Applied Research in Uncertainty Modelling and Analysis, number 20 in International Series in Intelligent Technologies, pages 283–302. Springer, 2005.

Published In

ACM SIGKDD Explorations Newsletter  Volume 7, Issue 2
December 2005
152 pages
ISSN:1931-0145
EISSN:1931-0153
DOI:10.1145/1117454

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 December 2005
Published in SIGKDD Volume 7, Issue 2

Qualifiers

  • Article

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months): 1
  • Downloads (Last 6 weeks): 0
Reflects downloads up to 05 Mar 2025

Citations

Cited By

  • (2020) Query Classification. Query Understanding for Search Engines (15-41). DOI: 10.1007/978-3-030-58334-7_2. Online publication date: 2-Dec-2020
  • (2019) A hybrid deep neural network model for query intent classification. Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology 36:6 (6413-6423). DOI: 10.3233/JIFS-182682. Online publication date: 1-Jan-2019
  • (2012) A feature-free search query classification approach using semantic distance. Expert Systems with Applications: An International Journal 39:12 (10739-10748). DOI: 10.1016/j.eswa.2012.02.191. Online publication date: 1-Sep-2012
  • (2012) Mapping user search queries to product categories. Proceedings of the American Society for Information Science and Technology 48:1 (1-10). DOI: 10.1002/meet.2011.14504801111. Online publication date: 11-Jan-2012
  • (2011) A Feature-Free Flexible Approach to Topical Classification of Web Queries. Proceedings of the 2011 Seventh International Conference on Semantics, Knowledge and Grids (59-66). DOI: 10.1109/SKG.2011.23. Online publication date: 24-Oct-2011
  • (2010) Mining Historic Query Trails to Label Long and Rare Search Engine Queries. ACM Transactions on the Web 4:4 (1-27). DOI: 10.1145/1841909.1841912. Online publication date: 1-Sep-2010
  • (2010) Classification-enhanced ranking. Proceedings of the 19th international conference on World wide web (111-120). DOI: 10.1145/1772690.1772703. Online publication date: 26-Apr-2010
  • (2009) Unsupervised query categorization using automatically-built concept graphs. Proceedings of the 18th international conference on World wide web (461-470). DOI: 10.1145/1526709.1526772. Online publication date: 20-Apr-2009
  • (2009) Classifying search queries using the Web as a source of knowledge. ACM Transactions on the Web 3:2 (1-28). DOI: 10.1145/1513876.1513877. Online publication date: 30-Apr-2009
  • (2009) Capturing the Meaning of Internet Search Queries by Taxonomy Mapping. Intelligent Engineering Systems and Computational Cybernetics (185-195). DOI: 10.1007/978-1-4020-8678-6_16. Online publication date: 2009
