Abstract
Business intelligence requires the collecting and merging of information from many different sources, both structured and unstructured, in order to analyse for example financial risk, operational risk factors, follow trends and perform credit risk management. While traditional data mining tools make use of numerical data and cannot easily be applied to knowledge extracted from free text, traditional information extraction is either not adapted for the financial domain, or does not address the issue of information integration: the merging of information from different kinds of sources. We describe here the development of a system for content mining using domain ontologies, which enables the extraction of relevant information to be fed into models for analysis of financial and operational risk and other business intelligence applications such as company intelligence, by means of the XBRL standard. The results so far are of extremely high quality, due to the implementation of primarily high-precision rules.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ahmad, K., Gillam, L., Cheng, D.: Sentiments on a grid: Analysis of streaming news and views. In: 5th Language Resources and Evaluation Conference (2006)
Appelt, D.E., et al.: Description of the JV-FASTUS system as used for MUC-5. In: Proceedings of the Fourth Message Understanding Conference MUC-5, pp. 221–235. Morgan Kaufmann, San Francisco (1993)
Baumgartner, R., et al.: Web data extraction for business intelligence: the lixto approach. In: Proc. of BTW (2005)
Chinchor, N.: Muc-4 evaluation metrics. In: Proceedings of the Fourth Message Understanding Conference, pp. 22–29 (1992)
Cunningham, H., et al.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL’02) (2002)
Cunningham, H., Maynard, D., Tablan, V.: JAPE: a Java Annotation Patterns Engine (2nd edn.). Research Memorandum CS–00–10, Department of Computer Science, University of Sheffield (November 2000)
Declerck, T., Krieger, H.: Translating XBRL into Description Logic: an approach using Protege, Sesame and OWL. In: Proceedings of Business Information Systems (BIS), Klagenfurt, Germany (2006)
Ellingsworth, M., Sullivan, D.: Text mining improves business intelligence and predictive modeling in insurance. DM Review Magazine (2003)
Nie, J.-Y., Paradis, F., Tajarobi, A.: Discovery of business opportunities on the internet with information extraction. In: Workshop on Multi-Agent Information Retrieval and Recommender Systems (IJCAI), Edinburgh, Scotland, pp. 47–54 (2005)
Fornasari, F., et al.: Xbrl web-based business intelligence services. In: Cunningham, P., Cunningham, M. (eds.) Innovation and the Knowledge Economy: Issues, Applications, Case Studies. Proceedings of eChallenge 2005, IOS Press, Amsterdam (2005)
Gaizauskas, R., Wilks, Y.: Information Extraction: Beyond Document Retrieval. Journal of Documentation 54(1), 70–105 (1998)
Jacobs, P.S., Rau, L.F.: Scisor: Extracting information from on-line news. Communications of the ACM 33(11), 88–97 (1990)
Maynard, D., Bontcheva, K., Cunningham, H.: Towards a semantic extraction of Named Entities. In: Recent Advances in Natural Language Processing, Bulgaria (2003)
Maynard, D., Peters, W., Li, Y.: Metrics for evaluation of ontology-based information extraction. In: WWW 2006 Workshop on “Evaluation of Ontologies for the Web” (EON), Edinburgh, Scotland (2006)
Maynard, D., et al.: Rapid customisation of an Information Extraction system for surprise languages. Special issue of ACM Transactions on Asian Language Information Processing: Rapid Development of Language Capabilities: The Surprise Languages (2003)
Maynard, D., et al.: Named Entity Recognition from Diverse Text Types. In: Recent Advances in Natural Language Processing 2001 Conference, Tzigov Chark, Bulgaria, pp. 257–274 (2001), http://gate.ac.uk/sale/ranlp2001/maynard-etal.pdf
Maynard, D., et al.: Ontology-based information extraction for market monitoring and technology watch. In: ESWC Workshop “End User Apects of the Semantic Web”, Heraklion, Crete (2005)
Montes, J.: Consumer entertainment software - industry trends. In: Stanford-Smith, B., Chozza, E. (eds.) E-Work and E-Commerce, IOS Press, Amsterdam (2001)
Popov, B., et al.: KIM – Semantic Annotation Platform. In: Natural Language Engineering (2004)
van Rijsbergen, C.J.: Information Retrieval. Butterworths, London (1979)
Wilks, Y., Catizone, R.: Can We Make Information Extraction More Adaptive? In: Pazienza, M.T. (ed.) SCIE 1999. LNCS (LNAI), vol. 1714, pp. 1–16. Springer, Heidelberg (1999)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Maynard, D., Saggion, H., Yankova, M., Bontcheva, K., Peters, W. (2007). Natural Language Technology for Information Integration in Business Intelligence. In: Abramowicz, W. (eds) Business Information Systems. BIS 2007. Lecture Notes in Computer Science, vol 4439. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72035-5_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-72035-5_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72034-8
Online ISBN: 978-3-540-72035-5
eBook Packages: Computer ScienceComputer Science (R0)