Abstract
In literature-based creative knowledge discovery, the goal is to identify interesting terms or concepts that relate different domains. We propose to support this cross-context link discovery process by inspecting outlier documents, which lie outside the mainstream domain literature. We explored the utility of outlier documents, discovered by combining three classification-based outlier detection methods, in terms of their potential for bridging concept discovery in the migraine-magnesium cross-domain discovery problem and in the autism-calcineurin domain pair. Experimental results show that outlier documents represent a small fraction of a domain pair dataset that is rich in concept-bridging terms. Therefore, by exploring only a small subset of documents, in which the great majority of bridging terms are present and more frequent, the effort needed for finding cross-domain links can be substantially reduced.
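The idea of a classification-based outlier can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the three detection methods, so a single leave-one-out nearest-centroid classifier stands in for them here. The toy documents, the function names (`find_outliers`, `bridging_term_coverage`), and the list of bridging terms are all assumptions made for illustration. A document counts as an outlier when a classifier trained on the remaining documents assigns it to the other domain.

```python
from collections import Counter

def centroid(docs):
    """Average term-frequency vector of a list of tokenized documents."""
    total = Counter()
    for tokens in docs:
        total.update(tokens)
    n = max(len(docs), 1)
    return {term: count / n for term, count in total.items()}

def similarity(tokens, cent):
    """Dot product between a document's term counts and a centroid."""
    return sum(cent.get(t, 0.0) * c for t, c in Counter(tokens).items())

def find_outliers(docs, labels):
    """Leave-one-out: flag documents classified into the other domain."""
    domains = sorted(set(labels))
    outliers = []
    for i, (tokens, label) in enumerate(zip(docs, labels)):
        # Train on every document except the one being tested.
        rest = [(d, l) for j, (d, l) in enumerate(zip(docs, labels)) if j != i]
        scores = {
            dom: similarity(tokens, centroid([d for d, l in rest if l == dom]))
            for dom in domains
        }
        predicted = max(domains, key=lambda dom: scores[dom])
        if predicted != label:
            outliers.append(i)
    return outliers

def bridging_term_coverage(docs, subset, bridging_terms):
    """Fraction of the given bridging terms that occur in the subset."""
    seen = {t for i in subset for t in docs[i]}
    return len(seen & set(bridging_terms)) / len(bridging_terms)

# Toy corpus (invented for illustration, not taken from the paper's data).
raw = [
    ("migraine headache pain aura", "migraine"),
    ("migraine headache attack pain", "migraine"),
    ("migraine patients calcium channel blockers", "migraine"),
    ("magnesium deficiency calcium channel", "magnesium"),
    ("magnesium serum calcium channel blockers", "magnesium"),
    ("magnesium deficiency serum levels", "magnesium"),
]
docs = [text.split() for text, _ in raw]
labels = [label for _, label in raw]

# Hypothetical bridging terms used only to exercise the coverage measure.
bridging_terms = ["calcium", "channel", "blockers"]

outliers = find_outliers(docs, labels)
coverage = bridging_term_coverage(docs, outliers, bridging_terms)
print(outliers, coverage)
```

In this toy setting the flagged subset is small relative to the corpus, and the coverage value indicates how many of the assumed bridging terms a reader would still encounter by inspecting only that subset.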
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sluban, B., Juršič, M., Cestnik, B., Lavrač, N. (2011). Evaluating Outliers for Cross-Context Link Discovery. In: Peleg, M., Lavrač, N., Combi, C. (eds) Artificial Intelligence in Medicine. AIME 2011. Lecture Notes in Computer Science, vol 6747. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22218-4_44
DOI: https://doi.org/10.1007/978-3-642-22218-4_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22217-7
Online ISBN: 978-3-642-22218-4
eBook Packages: Computer Science (R0)