Abstract
As the financial report has entered the “era of annotation” (Fan and Zhang in Contemp Accout Res 29(1):38–42 2012), and the length of the unstructured information in the financial report has far exceeded the financial statements. In order to carry on the automatic analysis and processing to the text information in the financial report with the help of information technology, a combination of the information extraction and XBRL technology is needed to carry on the structured research to the text information in the Chinese financial report (Heidari and Felden in Proceedings of 10th international conference on DESRIST, LNCS 9073, 2015). Based on a self-built Chinese enterprise annual report corpus, this paper studies the characteristics and patterns of the disclosure language of accounting events in Chinese enterprises’ annual reports, which can be examplified in the “related transaction events”. This paper summarizes an expression framework of accounting event knowledge. The study is expected to lay the foundation for the study of annotated textual information extraction and automatic generation of XBRL financial reports, which shall be helpful to improve the quality of financial data and information sharing.




Similar content being viewed by others
References
Fan, Q.T., Zhang, X.J.: Accounting conservatism, aggregation, and information quality. Contemp. Accout. Res. 29(1), 38–42 (2012). https://doi.org/10.1111/j.1911-3846.2011.01069.x
Heidari, M., Felden, C.: Impact of text mining application on financial footnotes analysis. In: Proceedings of 10th International Conference on DESRIST, LNCS 9073, pp. 463–470 (2015). https://doi.org/10.1007/978-3-319-18714-3_39
Li, H.Q., Zhai, J.: Literature review of XBRL semantic research. In: International Conference on Computer Science and Intelligent Communication. HK: Atlantis, pp. 316–320. (2015). https://doi.org/10.2991/csic-15.2015.76
Radzimski, M., Sanchez-Cervantes, J., Garcia-Crespo, A.: Intelligent architecture for comparative analysis of public companies using semantics and XBRL data. Int. J. Softw. Eng. Knowl. Eng. 24(5), 801–823 (2014). https://doi.org/10.1142/s0218194014500314
Feng, H.Y., Wu, L.W.: Research on information disclosure of China ‘s listed companies based on XBRL financial statements. Financ. Newsl. 4, 14–17 (2011)
Ge, J.S., Liu, F.: On the nature and characteristics of financial reporting of business enterprises. Account. Res. 12, 3 (2011)
Sun, F., Yang, Z.N.: Linguistic analysis and improvement research of XBRL technical system structure. J. Account. Res. 7 (2013)
Santos, I., Castro, E., Velasco, M.: XBRL formula specification in the multidimensional data model. Inf. Syst. 57, 20–37 (2016). https://doi.org/10.1016/j.is.2015.11.001
Hirshleifer, D., Teoh, S.H.: Limited attention, information disclosure, and financial reporting. J. Accout. Econ. 36(1), 337–386 (2003). https://doi.org/10.1016/j.jacceco.2003.10.002
Rutherford, B.A.: Genre analysis of corporate annual report narratives a corpus linguistics-based approach. J. Bus. Commun. 42(4), 349–378 (2005). https://doi.org/10.1177/0021943605279244
Martin, W.: The digital divide: where we are (2012). http://ucrel.lancs.ac.uk/cfie/index.php
Wei, N.X.: Corous-based and corpus-driven approaches to the study of collocation. Contemp. Ling. 2(2), 101–114 (2002)
Cui, X.L., Zhang, B.L.: Global Chinese learners corpus construction program. Lang. Appl. (2), 100–108 (2011)
Simon, S., Wu, J.Y.: Design and use of Chinese sketch engine: a word collocation search interface. In: 2005 International Conference on Internet Chinese Education, pp. 19–27 (2005)
Yuan, Y.L.: Matching even-template with argument structure of verbs: towards a verb-driven approach of information extraction. J. Chin. Inf. Process. 19(5), 37–43 (2005)
Zhao, Y., Qin, B., Che, W.X.: Research on Chinese event extraction. J. Chin. Inf. Process. 22(1), 3–8 (2008)
Meng, L., Ding, X., Qin, B.: Financial event argument extraction based on dependency parsing and noun phrase parsing. In: The 11th Chinese National Conference on Computational Linguistics, CNCCL 2011 (2011)
Li, J.L., Li, X.Q., Zhou, J.S.: Event sentence extraction in financial field. Appl. Res. Comput. 34(10), 2915–2918 (2017). https://doi.org/10.3969/j.issn.1001-3695
Wang, W., Zhao, D.Y., Zhao, W.: Identification of topic sentence about key event in Chinese news. Acta Scientiarum Naturalium Universitatis Pekinensis (2011). https://doi.org/10.1109/isip.2010.112
Xu, R.H., Wu, G., Li, P.F., et al.: Topic event fusion based on event framework. Appl. Res. Comput. 26(12), 4542–4545 (2009)
Antonia, K., Camilla, M., Barbro, B.: Mining textual contents of financial reports. Int. J. Digit. Account. Res. 4(7), 1–29 (2004)
Mendez, N.S., Trivio, G.: Combining semantic web technologies and computational theory of perceptions for text generation in financial analysis. IEEE Int. Conf. on Fuzzy Syst. 2012, 1–8 (2010). https://doi.org/10.1109/fuzzy.2010.5583974
Chou, C., Lian, Z.: Enhancing effectiveness of business information retrieval and integration via text mining and XBRL technology. J. Contemp. Account. 12(1), 85–114 (2011). https://doi.org/10.6675/jca.2011.12.1.04
Antonina, K., Tomas, E., Jonas, K., et al.: Combining data and text mining techniques for analysing financial reports: research articles. Int. J. Intell. Syst. Account. Financ. Manag. 12(1), 29–41 (2004). https://doi.org/10.1002/isaf.v12:1
Apache (2009) The digital divide: where we are. https://pdfbox.apache.org/
Meng, Y.: Use of parallel text in translation of financial audit reports from english to Chinese. Dissertation of Shanghai International Studies University (2014)
He, B., Zhang, L.H.: Management Information Systems. TSINGHUA University Press, Beijing (2006)
Enterprise Accounting Standards Committee (2017). Guidelines for the Application of Accounting Standards for Business Enterprises. LiXin Accounting Press
Sun, M.S.: On the consistency of Chinese word—word corpus. Lang. Appl. (2), 87–90 (1992)
Che, W.X., Li, Z.H., Liu, T.: LTP: a Chinese language technology platform. In: Proceedings of the Coling 2010: Demonstrations. 08:13–16 (2010)
Liang, M.C., Li, W.Z., Xu, J.J.: Using Corpora: A Practical Coursebook. Foreign Language Teaching and Research Press (2010)
Anthony, L. (2011).The digital divide: where we are. http://www.laurenceanthony.net/software.html
Hsiao, F., Gibson, E.: Processing relative clauses in Chinese. Cognition 90(1), 3–27 (2003). https://doi.org/10.1016/S0010-0277(03)00124-0
Acknowledgements
The research is supported by National Natural Science Foundation of PR China (71771104, 61402197).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liang, Z., Pan, D. & Xu, R. Knowledge representation framework of accounting event in corpus-based financial report text. Cluster Comput 22 (Suppl 4), 9335–9346 (2019). https://doi.org/10.1007/s10586-018-2153-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-018-2153-8