Natural Language Technology for Information Integration in Business Intelligence

Maynard, Diana; Saggion, Horacio; Yankova, Milena; Bontcheva, Kalina; Peters, Wim

doi:10.1007/978-3-540-72035-5_28

Diana Maynard¹,
Horacio Saggion¹,
Milena Yankova^2,1,
Kalina Bontcheva¹ &
…
Wim Peters¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4439))

Included in the following conference series:

International Conference on Business Information Systems

1961 Accesses
14 Citations

Abstract

Business intelligence requires the collecting and merging of information from many different sources, both structured and unstructured, in order to analyse for example financial risk, operational risk factors, follow trends and perform credit risk management. While traditional data mining tools make use of numerical data and cannot easily be applied to knowledge extracted from free text, traditional information extraction is either not adapted for the financial domain, or does not address the issue of information integration: the merging of information from different kinds of sources. We describe here the development of a system for content mining using domain ontologies, which enables the extraction of relevant information to be fed into models for analysis of financial and operational risk and other business intelligence applications such as company intelligence, by means of the XBRL standard. The results so far are of extremely high quality, due to the implementation of primarily high-precision rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Using Text Mining and Natural Language Processing to Support Business Decision: Towards a NooJ Application

Text Mining in Economics

VBSRL: A Semantic Frame-Based Approach for Data Extraction from Unstructured Business Documents

References

Ahmad, K., Gillam, L., Cheng, D.: Sentiments on a grid: Analysis of streaming news and views. In: 5th Language Resources and Evaluation Conference (2006)
Google Scholar
Appelt, D.E., et al.: Description of the JV-FASTUS system as used for MUC-5. In: Proceedings of the Fourth Message Understanding Conference MUC-5, pp. 221–235. Morgan Kaufmann, San Francisco (1993)
Chapter Google Scholar
Baumgartner, R., et al.: Web data extraction for business intelligence: the lixto approach. In: Proc. of BTW (2005)
Google Scholar
Chinchor, N.: Muc-4 evaluation metrics. In: Proceedings of the Fourth Message Understanding Conference, pp. 22–29 (1992)
Google Scholar
Cunningham, H., et al.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL’02) (2002)
Google Scholar
Cunningham, H., Maynard, D., Tablan, V.: JAPE: a Java Annotation Patterns Engine (2nd edn.). Research Memorandum CS–00–10, Department of Computer Science, University of Sheffield (November 2000)
Google Scholar
Declerck, T., Krieger, H.: Translating XBRL into Description Logic: an approach using Protege, Sesame and OWL. In: Proceedings of Business Information Systems (BIS), Klagenfurt, Germany (2006)
Google Scholar
Ellingsworth, M., Sullivan, D.: Text mining improves business intelligence and predictive modeling in insurance. DM Review Magazine (2003)
Google Scholar
Nie, J.-Y., Paradis, F., Tajarobi, A.: Discovery of business opportunities on the internet with information extraction. In: Workshop on Multi-Agent Information Retrieval and Recommender Systems (IJCAI), Edinburgh, Scotland, pp. 47–54 (2005)
Google Scholar
Fornasari, F., et al.: Xbrl web-based business intelligence services. In: Cunningham, P., Cunningham, M. (eds.) Innovation and the Knowledge Economy: Issues, Applications, Case Studies. Proceedings of eChallenge 2005, IOS Press, Amsterdam (2005)
Google Scholar
Gaizauskas, R., Wilks, Y.: Information Extraction: Beyond Document Retrieval. Journal of Documentation 54(1), 70–105 (1998)
Article Google Scholar
Jacobs, P.S., Rau, L.F.: Scisor: Extracting information from on-line news. Communications of the ACM 33(11), 88–97 (1990)
Article Google Scholar
Maynard, D., Bontcheva, K., Cunningham, H.: Towards a semantic extraction of Named Entities. In: Recent Advances in Natural Language Processing, Bulgaria (2003)
Google Scholar
Maynard, D., Peters, W., Li, Y.: Metrics for evaluation of ontology-based information extraction. In: WWW 2006 Workshop on “Evaluation of Ontologies for the Web” (EON), Edinburgh, Scotland (2006)
Google Scholar
Maynard, D., et al.: Rapid customisation of an Information Extraction system for surprise languages. Special issue of ACM Transactions on Asian Language Information Processing: Rapid Development of Language Capabilities: The Surprise Languages (2003)
Google Scholar
Maynard, D., et al.: Named Entity Recognition from Diverse Text Types. In: Recent Advances in Natural Language Processing 2001 Conference, Tzigov Chark, Bulgaria, pp. 257–274 (2001), http://gate.ac.uk/sale/ranlp2001/maynard-etal.pdf
Maynard, D., et al.: Ontology-based information extraction for market monitoring and technology watch. In: ESWC Workshop “End User Apects of the Semantic Web”, Heraklion, Crete (2005)
Google Scholar
Montes, J.: Consumer entertainment software - industry trends. In: Stanford-Smith, B., Chozza, E. (eds.) E-Work and E-Commerce, IOS Press, Amsterdam (2001)
Google Scholar
Popov, B., et al.: KIM – Semantic Annotation Platform. In: Natural Language Engineering (2004)
Google Scholar
van Rijsbergen, C.J.: Information Retrieval. Butterworths, London (1979)
Google Scholar
Wilks, Y., Catizone, R.: Can We Make Information Extraction More Adaptive? In: Pazienza, M.T. (ed.) SCIE 1999. LNCS (LNAI), vol. 1714, pp. 1–16. Springer, Heidelberg (1999)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield, S1 4DP, United Kingdom
Diana Maynard, Horacio Saggion, Milena Yankova, Kalina Bontcheva & Wim Peters
Onotext Lab, Sirma Group Corp., 135 Tazrigradsko Chassee, Fl.5, 1784 Sofia, Bulgaria
Milena Yankova

Authors

Diana Maynard
View author publications
You can also search for this author in PubMed Google Scholar
Horacio Saggion
View author publications
You can also search for this author in PubMed Google Scholar
Milena Yankova
View author publications
You can also search for this author in PubMed Google Scholar
Kalina Bontcheva
View author publications
You can also search for this author in PubMed Google Scholar
Wim Peters
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Witold Abramowicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Maynard, D., Saggion, H., Yankova, M., Bontcheva, K., Peters, W. (2007). Natural Language Technology for Information Integration in Business Intelligence. In: Abramowicz, W. (eds) Business Information Systems. BIS 2007. Lecture Notes in Computer Science, vol 4439. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72035-5_28

Download citation

DOI: https://doi.org/10.1007/978-3-540-72035-5_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72034-8
Online ISBN: 978-3-540-72035-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Natural Language Technology for Information Integration in Business Intelligence

Abstract

Access this chapter

Preview

Similar content being viewed by others

Using Text Mining and Natural Language Processing to Support Business Decision: Towards a NooJ Application

Text Mining in Economics

VBSRL: A Semantic Frame-Based Approach for Data Extraction from Unstructured Business Documents

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Natural Language Technology for Information Integration in Business Intelligence

Abstract

Access this chapter

Preview

Similar content being viewed by others

Using Text Mining and Natural Language Processing to Support Business Decision: Towards a NooJ Application

Text Mining in Economics

VBSRL: A Semantic Frame-Based Approach for Data Extraction from Unstructured Business Documents

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation