Distributed Representations to Detect Higher Order Term Correlations in Textual Content

Öztürk, Pinar; Prasath, R. Rajendra; Moen, Hans

doi:10.1007/978-3-642-13529-3_78

Pinar Öztürk²⁴,
R. Rajendra Prasath²⁴ &
Hans Moen²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6086))

Included in the following conference series:

International Conference on Rough Sets and Current Trends in Computing

1474 Accesses
3 Citations

Abstract

Case Based Reasoning(CBR), an artificial intelligence technique, solves new problem by reusing solutions of previously solved similar cases. In conventional CBR, cases are represented in terms of structured attribute-value pairs. Acquisition of cases, either from domain experts or through manually crafting attribute-value pairs from incident reports, constitutes the main reason why CBR systems have not been more common in industries. Manual case generation is a laborious, costlier and time consuming task. Textual CBR (TCBR) is an emerging line that aims to apply CBR techniques on cases represented as textual descriptions. Similarity of cases is based on the similarity between their constituting features. Conventional CBR benefits from employing domain specific knowledge for similarity assessment. Correspondingly, TCBR needs to involve higher-order relationships between features, hence domain specific knowledge. In addition, the term order has also been contended to influence the similarity assessment. This paper presents an account where features and cases are represented using a distributed representation paradigm that captures higher-order relations among features as well as term order information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aamodt, A.: Knowledge-intensive case-based reasoning in creek. In: Funk, P., González Calero, P.A. (eds.) ECCBR 2004. LNCS (LNAI), vol. 3155, pp. 1–15. Springer, Heidelberg (2004)
Google Scholar
Buckley, C., Salton, G., Allan, J., Singhal, A.: Automatic query expansion using smart: Trec 3. In: TREC (1994)
Google Scholar
Chakraborti, S., Mukras, R., Lothian, R., Wiratunga, N., Watt, S., Harper, D.: Supervised latent semantic indexing using adaptive sprinkling. In: Proc. of the 20th Int. Joint Conf. on AI, pp. 1582–1587. Morgan Kaufmann, San Francisco (2007)
Google Scholar
Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. JASIS 41(6), 391–407 (1990)
Article Google Scholar
Díaz-Agudo, B., González-Calero, P.A.: Cbronto: A task/method ontology for cbr. In: Proc. of 15th Int. Florida AI Research Society, pp. 101–105. AAAI Press, Menlo Park (2002)
Google Scholar
Firth, J.R.: A synopsis of linguistic theory, 1930-1955. In: Studies in Linguistic Analysis, pp. 1–32 (1957)
Google Scholar
Gentner, D., Forbus, K.D.: MAC/FAC: A model of similarity-based retrieval. Cognitive Science 19, 141–205 (1991)
Google Scholar
Johnson, W., Lindenstrauss, L.: Extensions of lipschitz maps into a hilbert space. Contemporary Mathematics 26, 189–206 (1984)
MATH MathSciNet Google Scholar
Jones, M.N., Mewhort, D.J.K.: Representing word meaning and order information in a composite holographic lexicon. Psychological Review 114, 1–37 (2007)
Article Google Scholar
Kanerva, P., Kristofersson, J., Holst, A.: Random indexing of text samples for latent semantic analysis. In: Proc. of the 22nd Annual Conf. of the Cognitive Science Society, pp. 103–106. Erlbaum, Mahwah (2000)
Google Scholar
Öztürk, P., Aamodt, A.: A context model for knowledge-intensive case-based reasoning. Int. J. Hum.-Comput. Stud. 48(3), 331–355 (1998)
Article Google Scholar
Plate, T.: Holographic reduced representations. IEEE Transactions on Neural Networks 6(3), 623–641 (1995)
Article Google Scholar
Raghunandan, M.A., Wiratunga, N., Chakraborti, S., Massie, S., Khemani, D.: Evaluation measures for tcbr systems. In: Althoff, K.-D., Bergmann, R., Minor, M., Hanft, A. (eds.) ECCBR 2008. LNCS (LNAI), vol. 5239, pp. 444–458. Springer, Heidelberg (2008)
Chapter Google Scholar
Sahlgren, M.: Vector-based semantic analysis: Representing word meanings based on random labels. In: ESSLI Workshop on Semantic Knowledge Acquistion and Categorization. Kluwer Academic Publishers, Dordrecht (2001)
Google Scholar
Sahlgren, M.: An introduction to random indexing. In: Methods and Applications of Semantic Indexing Workshop at 7th Int. Conf. on Terminology and Knowledge Eng., TKE 2005 (2005)
Google Scholar
Singhal, A., Salton, G., Mitra, M., Buckley, C.: Document length normalization. Inform. Process. Manage. 32(5), 619–633 (1996)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer and Information Science (IDI), Norwegian University of Science and Technology (NTNU), Sem Sælands Vei 7-9, NO - 7491, Trondheim, Norway
Pinar Öztürk, R. Rajendra Prasath & Hans Moen

Authors

Pinar Öztürk
View author publications
You can also search for this author in PubMed Google Scholar
R. Rajendra Prasath
View author publications
You can also search for this author in PubMed Google Scholar
Hans Moen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Mathematics, Warsaw University, Banacha 2, 02-097, Warsaw, Poland
Marcin Szczuka
ICS, Warsaw University of Technology,,
Marzena Kryszkiewicz
Department of Applied Computer Science, University of Winnipeg, R3B 2E9, Winnipeg, Manitoba, Canada
Sheela Ramanna
Dept. of Computer Science, The University of Wales, Aberystwyth, UK
Richard Jensen
Harbin Institute of Technology, PO Box 458, 150006, Harbin, China
Qinghua Hu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Öztürk, P., Prasath, R.R., Moen, H. (2010). Distributed Representations to Detect Higher Order Term Correlations in Textual Content. In: Szczuka, M., Kryszkiewicz, M., Ramanna, S., Jensen, R., Hu, Q. (eds) Rough Sets and Current Trends in Computing. RSCTC 2010. Lecture Notes in Computer Science(), vol 6086. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13529-3_78

Download citation

DOI: https://doi.org/10.1007/978-3-642-13529-3_78
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13528-6
Online ISBN: 978-3-642-13529-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics