skip to main content
10.1145/1096601.1096608acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
Article

Enabling massive scale document transformation for the semantic web: the universal parsing agent

Published: 02 November 2005 Publication History

Abstract

The Universal Parsing Agent (UPA) is a document analysis and transformation program that supports massive scale conversion of information into forms suitable for the semantic web. UPA provides reusable tools to analyze text documents; identify and extract important information elements; enhance text with semantically descriptive tags; and output the information that is needed in the format and structure that is needed.

References

[1]
Clark, J. XSL Transformations. W3C Recommendation, http://www.w3.org/TR/xslt/. November 1999.
[2]
Cunningham, H., Wilks, Y., and Gaizauskas, R. GATE -- a General Architecture for Text Engineering. In Proceedings of the 16th Conference on Computational Linguistics (COLING-96). (Copenhagen, 1996).
[3]
McIlraith, S., Son, T.C., and Zeng, H. Semantic Web Services. IEEE Intelligent Systems, 16, 2, 46--53.
[4]
Miller, G. WordNet: A Lexical Database for English. Communications of the ACM, 38, 11 (Nov. 1995), 39--41.
[5]
Pennebaker, J. and Francis, M. Linguistic Inquiry and Word Count: LIWC. Lawrence Erlbaum Associates (software program for text analysis, 2001).
[6]
Tapanainen, S. and Jarvinen T. A Nonprojective Dependency Parser. In Proceedings of the Conference on Applied Natural Language Processing Association for Computational Linguistics.(Washington, DC, 1997).
[7]
Thompson, K. Programming Techniques: Regular Expression Search Algorithm. Communications of the ACM, 11,6 (June 1968), 419--422.

Cited By

View all
  • (2008)No mining, no meaningProceedings of the eighth ACM symposium on Document engineering10.1145/1410140.1410164(110-118)Online publication date: 16-Sep-2008
  • (2008)Creating realistic, scenario-based synthetic data for test and evaluation of information analytics softwareProceedings of the 2008 Workshop on BEyond time and errors: novel evaLuation methods for Information Visualization10.1145/1377966.1377977(1-9)Online publication date: 5-Apr-2008
  • (2007)Genre driven multimedia document production by means of incremental transformationProceedings of the 2007 ACM symposium on Document engineering10.1145/1284420.1284452(111-120)Online publication date: 28-Aug-2007
  • Show More Cited By

Index Terms

  1. Enabling massive scale document transformation for the semantic web: the universal parsing agent

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      DocEng '05: Proceedings of the 2005 ACM symposium on Document engineering
      November 2005
      252 pages
      ISBN:1595932402
      DOI:10.1145/1096601
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 02 November 2005

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. XML
      2. XSLT
      3. document transformation
      4. natural language processing
      5. parsing
      6. regular expressions

      Qualifiers

      • Article

      Conference

      DocEng05
      Sponsor:
      DocEng05: ACM Symposium on Document Engineering
      November 2 - 4, 2005
      Bristol, United Kingdom

      Acceptance Rates

      Overall Acceptance Rate 194 of 564 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 25 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2008)No mining, no meaningProceedings of the eighth ACM symposium on Document engineering10.1145/1410140.1410164(110-118)Online publication date: 16-Sep-2008
      • (2008)Creating realistic, scenario-based synthetic data for test and evaluation of information analytics softwareProceedings of the 2008 Workshop on BEyond time and errors: novel evaLuation methods for Information Visualization10.1145/1377966.1377977(1-9)Online publication date: 5-Apr-2008
      • (2007)Genre driven multimedia document production by means of incremental transformationProceedings of the 2007 ACM symposium on Document engineering10.1145/1284420.1284452(111-120)Online publication date: 28-Aug-2007
      • (2007)A document engineering environment for clinical guidelinesProceedings of the 2007 ACM symposium on Document engineering10.1145/1284420.1284440(69-78)Online publication date: 28-Aug-2007
      • (2006)Mash-o-maticProceedings of the 2006 ACM symposium on Document engineering10.1145/1166160.1166214(205-214)Online publication date: 10-Oct-2006
      • (2006)Mapping physical formats to logical models to extract data and metadataProceedings of the 2006 international conference on Provenance and Annotation of Data10.1007/11890850_9(73-81)Online publication date: 3-May-2006

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media