Web Feed Clustering and Tagging Aggregator Using Topological Tree-Based Self-Organizing Maps

Freeman, Richard T.

doi:10.1007/978-3-642-04394-9_45

Richard T. Freeman¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5788))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1848 Accesses
1 Citations

Abstract

With the rapid and dramatic increase in web feeds published by different publishers, providers or websites via Really Simple Syndication (RSS) and Atom, users cannot be expected to scan, select and consume all the content manually. This is leading to an information overload for consumers as the amount of content increases. With this growth there is a need to make the content more accessible and allow it to be efficiently searched and explored. This can be partially achieved by structuring and organising the content dynamically into topics or categories. Typical approaches make use of categorisation or clustering, however these approaches have a number of limitations such as the inability to represent the connections between topics and being heavy dependent on fixed parameters.

In this paper we apply the topological tree method, to dynamically identify categories, on financial and business news feed dataset. The topological tree method is used to automatically organise an aggregation of the financial news feeds into self-discovered topics and allows a drill down into sub-topics. The news feeds, organised using the topological tree method, are discussed against those of typical web aggregators. A discussion is made on the criterions of representing news feeds, and the advantages of presenting underlying topics and providing a clear view of the connections between news topics. The topological tree has been found to be a superior representation, and well suited for organising financial news content and could be applied to categorise and filter news more efficiently for market abuse detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Freeman, R.T.: Web Document Search, Organisation and Exploration Using Self-Organising Neural Networks, PhD Thesis, Faculty of Engineering and Physical Sciences, School of Electrical & Electronic Engineering, University of Manchester, Manchester (2004)
Google Scholar
Qamra, A., Tseng, B., Chang, E.Y.: Mining blog stories using community-based and temporal clustering. In: 15th ACM international conference on Information and knowledge management CIKM, pp. 58–67. ACM, New York (2006)
Google Scholar
Paliouras, G., Alexandros, M., Ntoutsis, C., Alexopoulos, A., Skourlas, C.: PNS: Personalized Multi-Source News Delivery. In: Gabrys, B., Howlett, R.J., Jain, L.C. (eds.) KES 2006. LNCS (LNAI), vol. 4252, pp. 1152–1161. Springer, Heidelberg (2006)
Chapter Google Scholar
Li, X., Yan, J., Deng, Z.-H., Ji, L., Fan, W., Zhang, B., Chen, Z.: A Novel Clustering-based RSS Aggregator. In: 16th international conference on World Wide Web, pp. 1309–1310. ACM, New York (2007)
Chapter Google Scholar
Agarwal, N., Galan, M., Liu, H., Subramanya, S.: Clustering Blogs with Collective Wisdom. In: 8th International Conference on Web Engineering, ICWE 2008, pp. 336–339. IEEE, Los Alamitos (2008)
Chapter Google Scholar
Huang, W., Webster, D.: Enabling Context-Aware Agents to Understand Semantic Resources on The WWW and The SemanticWeb. In: International Conference on Web Intelligence (WI 2004), pp. 138–144. IEEE, Los Alamitos (2004)
Google Scholar
Webster, D., Huang, W., Mundy, D., Warren, P.: Context-Orientated News Filtering for Web 2.0 and Beyond. In: 15th International World Wide Web Conference, pp. 1001–1002. ACM, New York (2006)
Chapter Google Scholar
Thelwall, M., Prabowo, R.: Identifying and Characterizing Public Science-Related Fears From RSS Feeds. Journal of the American Society for Information Science and Technology 58(3), 379–390 (2007)
Article Google Scholar
Kohonen, T.: Self-Organizing Maps. Third Extended edn. Springer, Heidelberg (2001)
Google Scholar
Freeman, R.T.: Topological Tree Clustering of Web Search Results. In: Corchado, E., Yin, H., Botti, V., Fyfe, C. (eds.) IDEAL 2006. LNCS, vol. 4224, pp. 789–797. Springer, Heidelberg (2006)
Chapter Google Scholar
Freeman, R.T.: Topological Tree Clustering of Social Network Search Results. In: Yin, H., Tino, P., Corchado, E., Byrne, W., Yao, X. (eds.) IDEAL 2007. LNCS, vol. 4881, pp. 760–769. Springer, Heidelberg (2007)
Chapter Google Scholar
Salton, G.: Automatic text processing - the transformation, analysis, and retrieval of information by computer. Addison-Wesley, Reading (1989)
Google Scholar
Freeman, R.T., Yin, H.: Web content management by self-organization. IEEE Transactions on Neural Networks 16(5), 1256–1268 (2005)
Article Google Scholar
Freeman, R.T., Yin, H.: Adaptive topological tree structure for document organisation and visualisation. Neural Networks 17(8-9), 1255–1271 (2004)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Capgemini, Financial Services GBU, Technology Consulting Group, No. 1 Forge End, Woking, Surrey, GU21 6DB, United Kingdom
Richard T. Freeman

Authors

Richard T. Freeman
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Escuela Politécnica Superior, Universidad de Burgos, Calle Francisco de Vitoria, S/N, Edifico C, 09006, Burgos, Spain
Emilio Corchado
School of Electrical and Electronic Engineering, University of Manchester, Sackville Street Building, Sackville Street, M60 1QD, Manchester, UK
Hujun Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Freeman, R.T. (2009). Web Feed Clustering and Tagging Aggregator Using Topological Tree-Based Self-Organizing Maps. In: Corchado, E., Yin, H. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2009. IDEAL 2009. Lecture Notes in Computer Science, vol 5788. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04394-9_45

Download citation

DOI: https://doi.org/10.1007/978-3-642-04394-9_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04393-2
Online ISBN: 978-3-642-04394-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics