Skip to main content

Digitization of Written Parliamentary Questions from the Historical Archive (1974–1977) of the Hellenic Parliament

  • Conference paper
  • First Online:
Document Analysis and Recognition – ICDAR 2024 Workshops (ICDAR 2024)

Abstract

This article outlines the digitization process and methodology applied to the archive of parliamentary questions from the 1st Parliamentary Term (1974–1977) in the Hellenic Parliament. A collaborative pilot project involving parliament, academia, and a research center facilitated the conversion of printed material to open data. The main tasks of the project include capturing digital images, a custom Optical Character Recognition (OCR) software solution employing machine learning, and rigorous validation for accuracy of a fragmented and of variable quality polytonic corpus in a variety of modern Greek language called Katharevousa. The article discusses the approach and challenges as well as the initial results of the digitization effort, emphasizing ongoing research steps. Overall, 1,674 images were digitally processed corresponding to 1,338 questions. Following algorithmic training, character recognition accuracy is over 98.5%. Successful implementation streamlines further similar digitalization operations in the vast parliamentary archives, while enabling in-depth studies on parliamentary control in the turbulent period of the immediate post-junta era in Greece. A preliminary comparative analysis with a corpus of newer parliamentary questions (2009–2019) provides insights and incentives for the further study of the characteristics and evolution of the Greek language.

O. Rozenberg—Independent researcher.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://users.iit.demokritos.gr/~bgat/DIA.htm.

  2. 2.

    https://www.hellenicparliament.gr/en/Vouli-ton-Ellinon/Kanonismos-tis-Voulis/.

  3. 3.

    https://www.hellenicparliament.gr/en/Vouli-ton-Ellinon/To-Politevma/Syntagma/.

References

  1. Saalfeld, T.: Members of parliament and governments in Western Europe: agency relations and problems of oversight. Eur. J. Polit. Res. 37(3), 353–376 (2000)

    Article  Google Scholar 

  2. Mavrias, K.: Syntagmatikó Díkaio [Constitutional Law], 6th edn. Ekdoseis P.N. Sakkoulas, Athens (2022)

    Google Scholar 

  3. Martin, S., Rozenberg, O. (eds.): Roles & Functions of Parliamentary Questions. Routledge, Abingdon (2012)

    Google Scholar 

  4. Kaliviotou, M.: Koinovouleftikós Élenchos – Syntagmatikó plaísio kai ória [Parliamentary Scrutiny – Constitutional framework and limits]. Ekdoseis Sakkoula, Athens (2017)

    Google Scholar 

  5. Lauvaux, P.: Le contrôle, source du régime parlementaire, priorité du régime présidentiel. Pouvoirs 134, 23–36 (2010)

    Article  Google Scholar 

  6. Griglio, E.: Parliamentary Oversight of the Executives. Tools and Procedures in Europe. Hart, Oxford (2020)

    Book  Google Scholar 

  7. Rozenberg, O., Martin, S.: Questioning parliamentary questions. J. Legislative Stud. 17(3), 394–404 (2011)

    Google Scholar 

  8. Raunio, T.: Parliamentary questions in the European Parliament: representation, information and control. J. Legislative Stud. 2(4), 356–382 (1996)

    Article  Google Scholar 

  9. Lazardeux, S.: Une Question Ecrite, Pour Quoi Faire? The causes of the production of written questions in the French Assemblée Nationale. French Politics 3, 258–281 (2005)

    Article  Google Scholar 

  10. Martin, S.: Parliamentary questions, the behavior of legislators, and the function of legislatures: an Introduction. J. Legislative Stud. 17(3), 259–270 (2011)

    Article  Google Scholar 

  11. Cornacchione, T., Tuning, R.: Women behaving differently: anti-establishment party membership and female parliamentary activity. J. Women Polit. Policy 41(4), 457–476 (2020)

    Article  Google Scholar 

  12. Brack, N., Costa, O.: Parliamentary questions and representation of territorial interests in the EP. In: Costa, O. (ed.) The European Parliament in Times of EU Crisis: Dynamics and Transformations, pp. 225–254. Palgrave Macmillan, Cham (2019)

    Chapter  Google Scholar 

  13. Kaniok, P., Kominkova, M.: Parliamentary questions: expressions of opposition(s) within the European Parliament? Baltic J. Eur. Stud. 9(1), 34–56 (2019)

    Article  Google Scholar 

  14. Navarro, J.: Il n’y a pas de question idiote? Les questions des deputés européens à la Commission européenne et au Conseil depuis 1979. Parliaments, Estates and Representation 39(2), 236–256 (2019)

    Article  Google Scholar 

  15. Proksch, S.O., Slapin, J.B.: Parliamentary questions and oversight in the European Union. Eur. J. Polit. Res. 50(1), 53–79 (2010)

    Article  Google Scholar 

  16. Jensen, C.B., Proksch, S.O., Slapin, J.B.: Parliamentary questions, oversight, and national opposition status in the European Parliament. Legis. Stud. Q. 38(2), 259–282 (2013)

    Article  Google Scholar 

  17. Marx, M., Schuth, A.: DutchParl: a corpus of parliamentary documents in Dutch. In: Proceedings of the 10th Dutch-Belgian Information Retrieval Workshop, pp. 82–83, Nijmegen, Netherlands (2010)

    Google Scholar 

  18. Drobac, S., Sinikallio, L., Hyvönen, E: An OCR pipeline for transforming parliamentary debates into linked data: case ParliamentSampo-Parliament of Finland on the semantic web. Digit. Humanit. Nordic Baltic Countries Publ. 5(1), 287–296 (2023)

    Google Scholar 

  19. Ogrodniczuk, M.: Polish parliamentary corpus. In: Proceedings of the LREC 2018 Workshop ParlaCLARIN: Creating and Using Parliamentary Corpora, pp. 15–19. European Language Resources Association (2018)

    Google Scholar 

  20. Steingrímsson, S., Barkarson, S., Örnólfsson, G.T.: IGC-Parl: Icelandic Corpus of parliamentary proceedings. In: Proceedings of the Second ParlaCLARIN Workshop, pp. 11–17. European Language Resources Association (2020)

    Google Scholar 

  21. Fitsilis, F., Schwemmer, C., Saalfeld, T.: Content reconstruction of parliamentary questions - combining metadata with an OCR process. In: Proceedings of the 5th International Virtual Conference on Advanced Scientific Results, pp. 107–112 (2017)

    Google Scholar 

  22. Fitsilis, F., Mikros, G.: Development and validation of a corpus of written parliamentary questions in the Hellenic Parliament. J. Open Humanit. Data 7, 18 (2021)

    Article  Google Scholar 

  23. Hellenic OCR Team. https://hellenicOCRteam.gr. Accessed 29 May 2024

  24. Kaddas, P., Palaiologos, K., Gatos, B., Katsouros, V., Christopoulou, K.: A system for processing and recognition of Greek Byzantine and Post-Byzantine documents. In: Fink, G.A., Jain, R., Kise, K., Zanibbi, R. (eds.) Document Analysis and Recognition - ICDAR 2023, LNCS, vol. 14190, pp. 366–376. Springer, Cham (2023)

    Chapter  Google Scholar 

  25. Kaddas, P., Gatos, B., Palaiologos, K., Christopoulou, K., Kritsis, K.: Text line detection and recognition of Greek polytonic documents. In: Coustaty, M., Fornés, A. (eds.) Document Analysis and Recognition – ICDAR 2023, International Workshop on Machine Learning (4th edition), LNCS, vol. 14194, pp. 213–225. Springer, Cham (2023)

    Google Scholar 

  26. Jocher, G., et al.: ultralytics/yolov5: v7.0 - YOLOv5 SOTA Realtime Instance Segmentation. https://doi.org/10.5281/zenodo.7347926. Accessed 29 May 2024

  27. Yolov5 for Oriented Object Detection. https://github.com/hukaixuan19970627/yolov5_obb. Accessed 29 May 2024

  28. Calamari OCR. https://github.com/Calamari-OCR/calamari. Accessed 29 May 2024

  29. Mackridge, P.: The Modern Greek Language. Oxford University Press, Oxford (1985)

    Google Scholar 

  30. Malvern, D., Richards, B.: Investigating accommodation in language proficiency interviews using a new measure of lexical diversity. Lang. Test. 19(1), 85–104 (2002)

    Article  Google Scholar 

Download references

Acknowledgments

The authors would like to thank Angeliki Karapanou, Head of the Department of Parliamentary Archives, Hellenic Parliament, for her expert support during the final stages of the study. The research concerning the recognition platform has been partially co-financed by the European Union and Greek national funds through the Operational Program Attica 2014–2020, under the call “RESEARCH AND INNOVATION PARTNERSHIPS IN THE REGION OF ATTICA”, project reBook (Digital platform for re-publishing Historical Greek Books, project code: ATTP4-0331172).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fotios Fitsilis .

Editor information

Editors and Affiliations

Ethics declarations

The authors have no competing interests to declare that are relevant to the content of this article.

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fitsilis, F. et al. (2024). Digitization of Written Parliamentary Questions from the Historical Archive (1974–1977) of the Hellenic Parliament. In: Mouchère, H., Zhu, A. (eds) Document Analysis and Recognition – ICDAR 2024 Workshops. ICDAR 2024. Lecture Notes in Computer Science, vol 14935. Springer, Cham. https://doi.org/10.1007/978-3-031-70645-5_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-70645-5_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-70644-8

  • Online ISBN: 978-3-031-70645-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics