skip to main content
research-article

Constructing Maintainable Semantic Relation Network from Ambiguous Concepts in Web Content

Published: 11 February 2016 Publication History

Abstract

The semantic network is a form of knowledge that represents various relationships between concepts with ambiguity. The knowledge can be employed to identify semantically related objects. It helps, for example, a recommender system to generate effective recommendations to the users. We propose to study a new semantic network, namely, the Concept Relation Network (CRN), which is efficiently constructed and maintained using existing web search engines. CRN tackles the uncertainty and dynamics of web content, and thus is optimized for many important web applications, such as social networks and search engines. It is a large semantic network for the collection, analysis, and interpretation of web content, and serves as a cornerstone for applications such as web search engines, recommendation systems, and social networks that can benefit from a large-scale knowledge base. In this article, we present two applications for CRN: (1) search engine and web analytic and (2) semantic information retrieval. Experimental results show that CRN effectively enhances these applications by considering the heterogenous and polysemous nature of web content.

References

[1]
Appendix. 2014. Index terms. Retrieved from http://www.cse.ust.hk/∼kwtleung/appendix.pdf.
[2]
Abdur Chowdhury and Ian Soboroff. 2002. Automatic evaluation of world wide web search services. In Proc. of the SIGIR Conference.
[3]
Kenneth Church, William Gale, Patrick Hanks, and Donald Hindle. 1991. Using statistics in lexical analysis. Bell Laboratories and Oxford University Press.
[4]
Christiane Fellbaum. 1998. WordNet: An Electronic Lexical Database. MIT Press.
[5]
Jianfeng Gao, Gu Xu, and Jinxi Xu. 2013. Query expansion using path-constrained random walks. In Proc. of the SIGIR Conference.
[6]
Geographic Names for Geopolitical Areas from GNS. 2013. Retreived from http://earth-info.nga.mil/gns/html/namefiles.htm.
[7]
J. Huang and E. N. Efthimiadis. 2009. Analyzing and evaluating query reformulation strategies in web search logs. In Proc. of the CIKM Conference.
[8]
Alpa Jain, Umut Ozertem, and Emre Velipasaoglu. 2011. Synthesizing high utility suggestions for rare web search queries. In Proc. of the SIGIR Conference.
[9]
Steve Lawrence and C. Lee Giles. 1998. Searching the World Wide Web. Science 280, 5360, 98--100.
[10]
Steve Lawrence and C. Lee Giles. 2000. Accessibility of information on the web. Intelligence 11, 1, 32--39.
[11]
Kenneth Wai-Ting Leung, Hing Yuet Fung, and Dik Lun Lee. 2011a. Constructing concept relation network and its application to personalized web search. In Proc. of the EDBT Conference.
[12]
Kenneth Wai-Ting Leung, Dik Lun Lee, Wilfred Ng, and Hing Yuet Fung. 2011b. A framework for personalizing web search with concept-based user profiles. ACM TOIT 11, 4, 17:1--17:29.
[13]
Kenneth Wai-Ting Leung, Wilfred Ng, and Dik Lun Lee. 2008. Personalized concept-based clustering of search engine queries. IEEE TKDE 20, 11, 1505--1518.
[14]
Zhen Liao, Daxin Jiang, Enhong Chen, Jian Pei, Huanhuan Cao, and Hang Li. 2011. Mining concept sequences from large-scale search logs for context-aware query suggestion. ACM TSIT 3, 1, 17:1--17:40.
[15]
H. Liu and P. Singh. 2004. Focusing on conceptnet’s natural language knowledge representation. In Proc. of the KES Conference.
[16]
Jose G. Moreno, Gaël Dias, and Guillaume Cleuziou. 2014. Query log driven web search results clustering. In Proc. of the SIGIR Conference.
[17]
L. Page, S. Brin, R. Motwani, and T. Winograd. 1999. The pagerank citation ranking: Bringing order to the web. Technique Report, Computer Science Department, Stanford University (1999).
[18]
Roget's Thesaurus. 2013. http://www.thesaurus.com/.
[19]
Fabian Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. YAGO: A core of semantic knowledge - Unifying WordNet and Wikipedia. In Proc. of the WWW Conference.
[20]
Jan Vosecky, Kenneth Wai-Ting Leung, and Wilfred Ng. 2014. Collaborative personalized twitter search with topic-Language models. In Proc. of the SIGIR Conference.
[21]
World Gazetteer. 2013. http://www.grida.no/geo/GEO/Geo-2-234.htm.
[22]
Yabo Xu, Ke Wang, Benyu Zhang, and Zheng Chen. 2007. Privacy--enhancing personalized web search. In Proc. of the WWW Conference.
[23]
Lina Yao, Quan Z. Sheng, Anne H. H. Ngu, Helen Ashman, and Xue Li. 2014. Exploring recommendations in internet of things. In Proc. of the SIGIR Conference.
[24]
Majid Yazdani and Andrei Popescu-Belis. 2013. Computing text semantic relatedness using the contents and links of a hypertext encyclopedia. Artificial Intelligence 194 (Jan. 2013), 176--202.

Cited By

View all
  • (2022)Process-Based Knowledge OrganizationJournal of Database Management10.4018/JDM.29955833:1(1-18)Online publication date: 13-May-2022
  • (2022)Enhancing N-Gram Based Metrics with Semantics for Better Evaluation of Abstractive Text SummarizationJournal of Computer Science and Technology10.1007/s11390-022-2125-637:5(1118-1133)Online publication date: 30-Sep-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Internet Technology
ACM Transactions on Internet Technology  Volume 16, Issue 1
February 2016
129 pages
ISSN:1533-5399
EISSN:1557-6051
DOI:10.1145/2869768
  • Editor:
  • Munindar P. Singh
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 February 2016
Accepted: 01 August 2015
Revised: 01 April 2015
Received: 01 November 2011
Published in TOIT Volume 16, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Concept network
  2. query suggestion
  3. search engine
  4. semantic network
  5. web analytic
  6. web search

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • HKSAR GRF

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Process-Based Knowledge OrganizationJournal of Database Management10.4018/JDM.29955833:1(1-18)Online publication date: 13-May-2022
  • (2022)Enhancing N-Gram Based Metrics with Semantics for Better Evaluation of Abstractive Text SummarizationJournal of Computer Science and Technology10.1007/s11390-022-2125-637:5(1118-1133)Online publication date: 30-Sep-2022

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media