Skip to main content
Log in

Knowledge fusion framework based on Web page texts

  • Research Article
  • Published:
Frontiers of Computer Science in China Aims and scope Submit manuscript

Abstract

With the proliferation ofWeb page texts, it is important to fuse these texts to useful documents that users need. However, there is still no complete and unified theoretical model for studying the research issues including redundancy, localization, and fuzziness existing in the process of fusing Web page texts. This paper proposes a fusion framework calledWeb Pages Knowledge Fusion Framework (WPKFF) to synthesize the knowledge of Web page texts. First, sentences in Web page texts are extracted and transformed into triple semantic net as knowledge representation. Then a semantic description of attribute fusion rules, description information fusion rules and attribute-value and description information fusion rules are defined in WPKFF. These rules are used to fuse the attributes of same domain concepts in triple semantic net. The features of attributes include description (string) and value data (number). The results of the experiments indicate that the fusion framework is a feasible model in terms of precision and recall.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Feigenbaun E. Some challenges and grand challenges for computational intelligence. Journal of the ACM, 2003, 50(1): 32–40

    Article  MathSciNet  Google Scholar 

  2. Lin C Y, Hovy E. From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th AnnualMeeting of the Association for Computational Linguistics (ACL). Philadelphia, 2002, 457–464

  3. Radev D R, Jing H Y, Budzikowska M. Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies. In: Proceedings of ANLP/NAACL 2000 Workshop. Seattle, 2000, 21–29

  4. Fung P, Ngai G. Combining optimal clustering and hidden markov model for extractive. In: Proceedings of the ACL 2003 workshop on multilingual summarization and question answering. Sapporo, 2003: 21–28

  5. Hunter A, Summerton R. Fusion rules for context-dependent aggregation of structured news reports. Journal of Applied Non-classical Logic, 2004, 14(3): 329–366

    Article  MATH  Google Scholar 

  6. Hunter A, Liu W. Fusion rules for merging uncertain information. Information Fusion, 2006, 7(1): 97–134

    Google Scholar 

  7. Hunter A, Summerton R. A knowledge-based approach to merging information, Knowledge-Based Systems. 2006, 19(8): 647–674

    Article  Google Scholar 

  8. Hunter A, Summerton R. Propositional fusion rules. In: Proceedings of LNCS. Springer, 2003, 2711: 502–514

    MathSciNet  Google Scholar 

  9. Chuang W T, Yang J. Extracting sentence segments for text summarization: a machine learning approach. In: Proceedings of the 23rd annual international ACM SIGIR, 2000, 152–159

  10. Grégoire É, Sofiane A. Fusing syntax and semantics in knowledge fusion. In: Proceedings of EUSFLAT Conference, 2001, 414–417

  11. Sui Y F, Gao Y, Cao C G. Ontologies, frames and logical theories in NKI. Journal of Software, 2005, 12(16): 2045–2053

    Article  MathSciNet  Google Scholar 

  12. Anokhin P, Motro A. Fusionplex: resolution of data inconsistencies in the integration of heterogeneous Iinformation sources. Technical Report ISE-TR-03-06, Information and Software Engineering Dept., George Mason Univ., Fairfax, Virginia, 2003

    Google Scholar 

  13. Berberich K, Vazirgiannis M, Weikum G. T-Rank: time-aware authority ranking. In: Proceedings of WAW, 2004, 131–142

  14. Xie N F. Knowledge Fusion and Synchronization Methods Based on Semantic Web Technologies. Ph. D. dissertation, Graduate School of the Chinese Academy of Sciences, 2005 (in Chinese)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sikang Hu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hu, S., Cao, Y. Knowledge fusion framework based on Web page texts. Front. Comput. Sci. China 3, 457–464 (2009). https://doi.org/10.1007/s11704-009-0035-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11704-009-0035-1

Keywords

Navigation