In this chapter, we present iJADE InfoSeeker, an intelligent context-aware agent system designed to help users find, retrieve, and analyze news articles from the Internet and then present the content in a semantic web. We discuss the advantages of using multiple intelligent agents to mine news articles on the web, the benefits of using ontologies to analyze the semantics of Chinese text and to identify topics, and the advantages of using a semantic web to organize information semantically. We evaluated iJADE InfoSeeker on a Chinese document corpus and compared the results with those of other approaches. The system identified the topics of Chinese web articles with an accuracy of nearly 87% and processed each article in less than one second. Unlike traditional text classification systems, it also organizes content flexibly and understands knowledge accurately.
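The idea of using an ontology to identify the topic of an article can be illustrated with a minimal sketch. This is not the chapter's implementation: the two-topic ontology, the term lists, and the function names `identify_topic` and `accuracy` are all hypothetical, and the input is assumed to be already segmented into Chinese words. The sketch scores each topic by counting how many of its ontology terms occur in the article, then measures accuracy as the fraction of articles whose predicted topic matches the labeled one.

```python
from collections import Counter

# Hypothetical toy ontology mapping each topic to concept terms.
# It is an illustration only, not the ontology used by iJADE InfoSeeker.
TOPIC_ONTOLOGY = {
    "sport": {"足球", "比赛", "球队", "冠军"},
    "finance": {"股票", "银行", "市场", "利率"},
}


def identify_topic(tokens, ontology=TOPIC_ONTOLOGY):
    """Return (topic, score) for the best-matching topic, or (None, 0).

    `tokens` is a list of already segmented words; word segmentation
    is assumed to happen in an earlier step.
    """
    counts = Counter(tokens)
    # Score of a topic = number of token occurrences that are ontology terms.
    scores = {
        topic: sum(counts[term] for term in terms)
        for topic, terms in ontology.items()
    }
    best = max(scores, key=scores.get)
    if scores[best] == 0:
        return None, 0
    return best, scores[best]


def accuracy(predicted, expected):
    """Fraction of positions where the predicted topic equals the label."""
    if not expected:
        raise ValueError("expected labels must not be empty")
    hits = sum(1 for p, e in zip(predicted, expected) if p == e)
    return hits / len(expected)


if __name__ == "__main__":
    articles = [
        ["球队", "赢得", "冠军", "比赛"],
        ["银行", "调整", "利率"],
        ["今天", "天气", "很好"],
    ]
    labels = ["sport", "finance", "finance"]
    predictions = [identify_topic(a)[0] for a in articles]
    print(predictions, accuracy(predictions, labels))
```

A real system would replace the flat term sets with a structured ontology and weighted semantic relations; the term count here only shows where such a score would plug in.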
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Lim, E.H.Y., Lee, R.S.T. (2007). iJADE InfoSeeker: On Using Intelligent Context-Aware Agents for Retrieving and Analyzing Chinese Web Articles. In: Lee, R.S.T., Loia, V. (eds) Computational Intelligence for Agent-based Systems. Studies in Computational Intelligence, vol 72. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73177-1_5
DOI: https://doi.org/10.1007/978-3-540-73177-1_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73175-7
Online ISBN: 978-3-540-73177-1
eBook Packages: Engineering, Engineering (R0)