Skip to main content

A Concurrent Neural Classifier for HTML Documents Retrieval

  • Conference paper
Neural Nets (WIRN 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2859))

Included in the following conference series:

Abstract

A neural based multi-agent system for automatic HTML pages retrieval is presented. The system is based on the EαNet architecture, a neural network having good generalization capabilities and able to learn the activation function of its hidden units. The starting hypothesis is that the HTML pages are stored in networked repositories. The system goal is to retrieve documents satisfying a user query and belonging to a given class (i.e. documents containing the word “football” and talking about “Sports”). The system is composed by three interacting agents: the EαNet Neural Classifier Mobile Agent, the Query Agent, and the Locator Agent. The whole system was successfully implemented exploiting the Jade platform features and facilities. The preliminary experimental results show a good classification rate: in the best case a classification error of 9.98% is reached.

This research has been partially funded by Engineering Ingegneria Informatica SpA within the Teschet project.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gaglio, S., Pilato, G., Sorbello, F., Vassallo, G.: Using the Hermite regression formula to design a neural architecture with automatic learning of the ‘hidden’ activation functions. In: Lamma, E., Mello, P. (eds.) AI*IA 1999. LNCS (LNAI), vol. 1792, pp. 226–237. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  2. Pilato, G., Sorbello, F., Vassallo, G.: An innovative way to measure the quality of a neural network without the use of the test set. International Journal of Artificial Computational Intelligence 5(1), 31–36 (2001)

    Google Scholar 

  3. Cirasa, A., Pilato, G., Sorbello, F., Vassallo, G.: An enhanced version of the αNet architecture: EαNet. In: Proc. of AI*IA Workshop of Robotics Parma, Italy (1999)

    Google Scholar 

  4. Cirasa, A., Pilato, G., Sorbello, F., Vassallo, G.: EαNet: A Neural Solution for Web Pages Classification. In: Proc. of 4th World MultiConference on Systemics, Cybernetics and Informatics - SCI 2000, Orlando, Florida, U.S.A., July 23-26 (2000)

    Google Scholar 

  5. Glitho, R.H., Olougouna, E., Pierre, S.: Mobile Agents and Their Use for Information Retrieval: a brief overview and an elaborate case study. IEEE Network, 34–41 (2002)

    Google Scholar 

  6. Brewington, B., Gray, R., Moizumi, K., Kotz, D., Cybenko, G., Rus, D.: Mobile agents in distributed information retrieval Intelligent Information Agents, pp. 1–32. Springer, Heidelberg (1999)

    Google Scholar 

  7. Kosala, R., Blockeel, H.: Web Mining research: a survey. ACM SIGKDD Explorations 2(1), 1–15 (2000)

    Article  Google Scholar 

  8. http://jade.cselt.it

  9. http://www.fipa.org

  10. Ng, Y.-K., Tang, J., Goodrich, M.: A Binary-Categorization Approach for Classifyng Multiple-Record Web Documents Using Application Ontologies and a Probabilistic Model. In: Proceedings of Seventh International Conference on Database Systems for Advanced Applications (2001)

    Google Scholar 

  11. Liang, J., Doermann, D., Ma, M., Guo, J.K.: Page Classification Through Logical Labelling. In: Proceedings of 16th International Conference on Pattern Recognition, vol. 3 (2002)

    Google Scholar 

  12. Soonthornphisaj, N., Kijsirikul, B.: The Effects of Different Feature Sets on the Web Page Categorization Problem Using the Iterative Cross-Training Algorithm. In: Proceeding of the International Conference on Enterprise and Information System (ICEIS), Setubal, Portugal (July 2001)

    Google Scholar 

  13. Pal, S.K., Talwar, V., Mitra, P.: Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Direction. IEEE Transaction on Neural Networks 13(9) (September 2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pilato, G., Vitabile, S., Vassallo, G., Conti, V., Sorbello, F. (2003). A Concurrent Neural Classifier for HTML Documents Retrieval. In: Apolloni, B., Marinaro, M., Tagliaferri, R. (eds) Neural Nets. WIRN 2003. Lecture Notes in Computer Science, vol 2859. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45216-4_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45216-4_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20227-1

  • Online ISBN: 978-3-540-45216-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics