Abstract
This paper presents the structure and functionality of the Semantically Enhanced Intellectual Property Protection System (SEIPro2S). The system uses an extensive set of semantic net algorithms for Polish and English, which allows it to detect similarities between compared documents at a level far beyond simple text matching. SEIPro2S draws on both a local document repository and Web-based resources. It uses a semantic compression mechanism developed to generalize concepts when documents are compared. The main aim of this work is to give the reader an overview of the architecture, the applied mechanisms, and some actual results.
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Ceglarek, D. (2013). Architecture of the Semantically Enhanced Intellectual Property Protection System. In: Burduk, R., Jackowski, K., Kurzynski, M., Wozniak, M., Zolnierek, A. (eds) Proceedings of the 8th International Conference on Computer Recognition Systems CORES 2013. Advances in Intelligent Systems and Computing, vol 226. Springer, Heidelberg. https://doi.org/10.1007/978-3-319-00969-8_70
DOI: https://doi.org/10.1007/978-3-319-00969-8_70
Publisher Name: Springer, Heidelberg
Print ISBN: 978-3-319-00968-1
Online ISBN: 978-3-319-00969-8
eBook Packages: Engineering, Engineering (R0)