Abstract
This article outlines the digitization process and methodology applied to the archive of parliamentary questions from the 1st Parliamentary Term (1974–1977) in the Hellenic Parliament. A collaborative pilot project involving parliament, academia, and a research center facilitated the conversion of printed material to open data. The main tasks of the project include capturing digital images, a custom Optical Character Recognition (OCR) software solution employing machine learning, and rigorous validation for accuracy of a fragmented and of variable quality polytonic corpus in a variety of modern Greek language called Katharevousa. The article discusses the approach and challenges as well as the initial results of the digitization effort, emphasizing ongoing research steps. Overall, 1,674 images were digitally processed corresponding to 1,338 questions. Following algorithmic training, character recognition accuracy is over 98.5%. Successful implementation streamlines further similar digitalization operations in the vast parliamentary archives, while enabling in-depth studies on parliamentary control in the turbulent period of the immediate post-junta era in Greece. A preliminary comparative analysis with a corpus of newer parliamentary questions (2009–2019) provides insights and incentives for the further study of the characteristics and evolution of the Greek language.
O. Rozenberg—Independent researcher.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Saalfeld, T.: Members of parliament and governments in Western Europe: agency relations and problems of oversight. Eur. J. Polit. Res. 37(3), 353–376 (2000)
Mavrias, K.: Syntagmatikó Díkaio [Constitutional Law], 6th edn. Ekdoseis P.N. Sakkoulas, Athens (2022)
Martin, S., Rozenberg, O. (eds.): Roles & Functions of Parliamentary Questions. Routledge, Abingdon (2012)
Kaliviotou, M.: Koinovouleftikós Élenchos – Syntagmatikó plaísio kai ória [Parliamentary Scrutiny – Constitutional framework and limits]. Ekdoseis Sakkoula, Athens (2017)
Lauvaux, P.: Le contrôle, source du régime parlementaire, priorité du régime présidentiel. Pouvoirs 134, 23–36 (2010)
Griglio, E.: Parliamentary Oversight of the Executives. Tools and Procedures in Europe. Hart, Oxford (2020)
Rozenberg, O., Martin, S.: Questioning parliamentary questions. J. Legislative Stud. 17(3), 394–404 (2011)
Raunio, T.: Parliamentary questions in the European Parliament: representation, information and control. J. Legislative Stud. 2(4), 356–382 (1996)
Lazardeux, S.: Une Question Ecrite, Pour Quoi Faire? The causes of the production of written questions in the French Assemblée Nationale. French Politics 3, 258–281 (2005)
Martin, S.: Parliamentary questions, the behavior of legislators, and the function of legislatures: an Introduction. J. Legislative Stud. 17(3), 259–270 (2011)
Cornacchione, T., Tuning, R.: Women behaving differently: anti-establishment party membership and female parliamentary activity. J. Women Polit. Policy 41(4), 457–476 (2020)
Brack, N., Costa, O.: Parliamentary questions and representation of territorial interests in the EP. In: Costa, O. (ed.) The European Parliament in Times of EU Crisis: Dynamics and Transformations, pp. 225–254. Palgrave Macmillan, Cham (2019)
Kaniok, P., Kominkova, M.: Parliamentary questions: expressions of opposition(s) within the European Parliament? Baltic J. Eur. Stud. 9(1), 34–56 (2019)
Navarro, J.: Il n’y a pas de question idiote? Les questions des deputés européens à la Commission européenne et au Conseil depuis 1979. Parliaments, Estates and Representation 39(2), 236–256 (2019)
Proksch, S.O., Slapin, J.B.: Parliamentary questions and oversight in the European Union. Eur. J. Polit. Res. 50(1), 53–79 (2010)
Jensen, C.B., Proksch, S.O., Slapin, J.B.: Parliamentary questions, oversight, and national opposition status in the European Parliament. Legis. Stud. Q. 38(2), 259–282 (2013)
Marx, M., Schuth, A.: DutchParl: a corpus of parliamentary documents in Dutch. In: Proceedings of the 10th Dutch-Belgian Information Retrieval Workshop, pp. 82–83, Nijmegen, Netherlands (2010)
Drobac, S., Sinikallio, L., Hyvönen, E: An OCR pipeline for transforming parliamentary debates into linked data: case ParliamentSampo-Parliament of Finland on the semantic web. Digit. Humanit. Nordic Baltic Countries Publ. 5(1), 287–296 (2023)
Ogrodniczuk, M.: Polish parliamentary corpus. In: Proceedings of the LREC 2018 Workshop ParlaCLARIN: Creating and Using Parliamentary Corpora, pp. 15–19. European Language Resources Association (2018)
Steingrímsson, S., Barkarson, S., Örnólfsson, G.T.: IGC-Parl: Icelandic Corpus of parliamentary proceedings. In: Proceedings of the Second ParlaCLARIN Workshop, pp. 11–17. European Language Resources Association (2020)
Fitsilis, F., Schwemmer, C., Saalfeld, T.: Content reconstruction of parliamentary questions - combining metadata with an OCR process. In: Proceedings of the 5th International Virtual Conference on Advanced Scientific Results, pp. 107–112 (2017)
Fitsilis, F., Mikros, G.: Development and validation of a corpus of written parliamentary questions in the Hellenic Parliament. J. Open Humanit. Data 7, 18 (2021)
Hellenic OCR Team. https://hellenicOCRteam.gr. Accessed 29 May 2024
Kaddas, P., Palaiologos, K., Gatos, B., Katsouros, V., Christopoulou, K.: A system for processing and recognition of Greek Byzantine and Post-Byzantine documents. In: Fink, G.A., Jain, R., Kise, K., Zanibbi, R. (eds.) Document Analysis and Recognition - ICDAR 2023, LNCS, vol. 14190, pp. 366–376. Springer, Cham (2023)
Kaddas, P., Gatos, B., Palaiologos, K., Christopoulou, K., Kritsis, K.: Text line detection and recognition of Greek polytonic documents. In: Coustaty, M., Fornés, A. (eds.) Document Analysis and Recognition – ICDAR 2023, International Workshop on Machine Learning (4th edition), LNCS, vol. 14194, pp. 213–225. Springer, Cham (2023)
Jocher, G., et al.: ultralytics/yolov5: v7.0 - YOLOv5 SOTA Realtime Instance Segmentation. https://doi.org/10.5281/zenodo.7347926. Accessed 29 May 2024
Yolov5 for Oriented Object Detection. https://github.com/hukaixuan19970627/yolov5_obb. Accessed 29 May 2024
Calamari OCR. https://github.com/Calamari-OCR/calamari. Accessed 29 May 2024
Mackridge, P.: The Modern Greek Language. Oxford University Press, Oxford (1985)
Malvern, D., Richards, B.: Investigating accommodation in language proficiency interviews using a new measure of lexical diversity. Lang. Test. 19(1), 85–104 (2002)
Acknowledgments
The authors would like to thank Angeliki Karapanou, Head of the Department of Parliamentary Archives, Hellenic Parliament, for her expert support during the final stages of the study. The research concerning the recognition platform has been partially co-financed by the European Union and Greek national funds through the Operational Program Attica 2014–2020, under the call “RESEARCH AND INNOVATION PARTNERSHIPS IN THE REGION OF ATTICA”, project reBook (Digital platform for re-publishing Historical Greek Books, project code: ATTP4-0331172).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
The authors have no competing interests to declare that are relevant to the content of this article.
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Fitsilis, F. et al. (2024). Digitization of Written Parliamentary Questions from the Historical Archive (1974–1977) of the Hellenic Parliament. In: Mouchère, H., Zhu, A. (eds) Document Analysis and Recognition – ICDAR 2024 Workshops. ICDAR 2024. Lecture Notes in Computer Science, vol 14935. Springer, Cham. https://doi.org/10.1007/978-3-031-70645-5_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-70645-5_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-70644-8
Online ISBN: 978-3-031-70645-5
eBook Packages: Computer ScienceComputer Science (R0)