Skip to main content

On Building a Full-Text Digital Library of Historical Documents

  • Conference paper
Asian Digital Libraries. Looking Back 10 Years and Forging New Frontiers (ICADL 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4822))

Included in the following conference series:

Abstract

The National Taiwan University Library has built a digital library of historical documents about Taiwan. The content is unique in that it covers about 80% of all primary Chinese historical materials about Taiwan before 1895, and that they are all available in searchable full text, in addition to metadata. To make these materials more accessible to the research community, we have developed, in addition to full-text search and retrieval, a concept of regarding the set of documents retrieved by a query as a sub-collection, and have designed post-query classification methods to help users find the inter-relationships among documents and the collective meaning of a sub-collection. We have also developed techniques for term extraction for old Chinese and a data format for representing governmental structures. We hope that our system will help advance research in Taiwanese history, and will set a model for other similar endeavor.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://en.wikipedia.org/wiki/Taiwan/History

  2. Chang, S.P.: A Word-Clip Algorithm for Named Entity Recognition: by Example of Historical Documents. Master Thesis, National Taiwan University, Taiwan (in Chinese) (2006)

    Google Scholar 

  3. Chiu, W.J.: The Digital Project of Taiwan-Related Archives in Ming and Qing Dynasty. The Library Yearbook of ROC 2006. National Central Library, Taiwan, 128–129 (in Chinese) (2006)

    Google Scholar 

  4. NTU Library (in Chinese), http://140.112.113.4/project/database1/database1_1.htm

  5. Dai, Y.H.: Preliminary Remarks on Putting in Order the Qing Danxin Archives. Taipei Cultural Relics (in Chinese) (1953)

    Google Scholar 

  6. Allee, M.: Law and Local Society in Late Imperial China: Northern Taiwan in the Nineteenth Century. Stanford University Press (1994)

    Google Scholar 

  7. Wu, M.C., Ang, K.I., Lee, W.L., Lin, H.Y.: A Brief Introduction to the Integrated Collections of Taiwan-related Historical Records. CCA and Yuan-Liou Publishing, Taiwan (in Chinese) (2005)

    Google Scholar 

  8. Hong, L.W.: A Study of Aboriginal Contractual Behavior and the Relationship between Aborigines and Han Immigrants in West-Central Taiwan, vol. 1. Taichung County Cultural Center, Taiwan 5 (in Chinese) (2002)

    Google Scholar 

  9. Pan, C.W. (ed.): Taiwan Geography and History, Taiwan Provincial Literature Committee, Taiwan 9(1) (in Chinese) (1980)

    Google Scholar 

  10. Chang, J.T.: Model and Implementation for Representing Governmental Structures and Officials. Master Thesis, National Taiwan University, Taiwan (in Chinese) (2007)

    Google Scholar 

  11. Chien, L.F.: PAT-Tree-Based Keyword Extraction for Chinese Information Retrieval. In: Proceedings of 1997 ACM SIGIR Conference (SIGIR 1997), Philadelphia, USA, pp. 50–58 (1997)

    Google Scholar 

  12. Chen, H.H., Lee, J.C.: Identification and Classification of Proper Nouns in Chinese Texts. In: Proceedings of 16th International Conference on Computational Linguistics, Copenhagen, Denmark, pp. 222–229 (1996)

    Google Scholar 

  13. Reddy, R., StClair, G.: The Million Book Digital Library Project (2001), http://www.rr.cs.cmu.edu/mbdl.htm

  14. National Digital Archives Program (2007), http://www.ndap.org.tw/index_en.php

Download references

Author information

Authors and Affiliations

Authors

Editor information

Dion Hoe-Lian Goh Tru Hoang Cao Ingeborg Torvik Sølvberg Edie Rasmussen

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, SP., Hsiang, J., Tu, HC., Wu, M. (2007). On Building a Full-Text Digital Library of Historical Documents. In: Goh, D.HL., Cao, T.H., Sølvberg, I.T., Rasmussen, E. (eds) Asian Digital Libraries. Looking Back 10 Years and Forging New Frontiers. ICADL 2007. Lecture Notes in Computer Science, vol 4822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77094-7_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77094-7_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77093-0

  • Online ISBN: 978-3-540-77094-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics