skip to main content
10.1145/2756406.2756929acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
tutorial

Topic Exploration with the HTRC Data Capsule for Non-Consumptive Research

Published:21 June 2015Publication History

ABSTRACT

In this half-day tutorial, we will show 1) how the HathiTrust Research Center (HTRC) Data Capsule can be used for non-consumptive research over collection of texts and 2) how integrated tools for LDA topic modeling and visualization can be used to drive formulation of new research questions. Participants will be given an account in the HTRC Data Capsule and taught how to use the workset manager to create a corpus, and then use the VM's secure mode to download texts and analyze their contents.

References

  1. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Murdock and C. Allen. Visualization techniques for topic model checking. In Proceedings of the 29th AAAI Conference on Artificial Intelligence (AAAI-15), 2015.Google ScholarGoogle Scholar
  3. F. Pérez and B. E. Granger. IPython: a system for interactive scientific computing, May 2007.Google ScholarGoogle Scholar
  4. J. Zeng, G. Ruan, A. Crowell, A. Prakash, and B. Plale. Cloud computing data capsules for non-consumptive use of texts. In Proceedings of the 5th ACM Workshop on Scientific Cloud Computing, ScienceCloud '14, pages 9--16, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Topic Exploration with the HTRC Data Capsule for Non-Consumptive Research

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              JCDL '15: Proceedings of the 15th ACM/IEEE-CS Joint Conference on Digital Libraries
              June 2015
              324 pages
              ISBN:9781450335942
              DOI:10.1145/2756406
              • General Chairs:
              • Paul Logasa Bogen,
              • Suzie Allard,
              • Holly Mercer,
              • Micah Beck,
              • Program Chairs:
              • Sally Jo Cunningham,
              • Dion Goh,
              • Geneva Henry

              Copyright © 2015 Owner/Author

              Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 21 June 2015

              Check for updates

              Qualifiers

              • tutorial

              Acceptance Rates

              JCDL '15 Paper Acceptance Rate18of60submissions,30%Overall Acceptance Rate415of1,482submissions,28%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader