poster

Autopedia: automatic domain-independent Wikipedia article generation

Authors:

Xu Jia,

Hongyan LiuAuthors Info & Claims

WWW '11: Proceedings of the 20th international conference companion on World wide web

Pages 161 - 162

https://doi.org/10.1145/1963192.1963274

Published: 28 March 2011 Publication History

Get Access

Abstract

This paper proposes a general framework, named Autopedia, to generate high-quality wikipedia articles for given concepts in any domains, by automatically selecting the best wikipedia template consisting the sub-topics to organize the article for the input concept. Experimental results on 4,526 concepts validate the effectiveness of Autopedia, and the wikipedia template selection approach which takes into account both the template quality and the semantic relatedness between the input concept and its sibling concepts, performs the best.

Reference

[1]

P. Turney. Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL. In Proceedings of the twelfth european conference on machine learning (ecml-2001), pages 491--502, 2001.

Digital Library

Google Scholar

Cited By

View all

Wang J(2020)AutoOverview: A Framework for Generating Structured Overviews over Many DocumentsComplexity and Approximation10.1007/978-3-030-41672-0_8(113-150)Online publication date: 21-Feb-2020
https://doi.org/10.1007/978-3-030-41672-0_8
Wang JZhang HZhang CYang WShao LWang JBorghoff USchimmler S(2019)An Effective Scheme for Generating An Overview Report over A Very Large Corpus of DocumentsProceedings of the ACM Symposium on Document Engineering 201910.1145/3342558.3345394(1-11)Online publication date: 23-Sep-2019
https://dl.acm.org/doi/10.1145/3342558.3345394
Pochampally YKarlapalem KBarrett RCummings RAgichtein EGabrilovich E(2017)Notability Determination for WikipediaProceedings of the 26th International Conference on World Wide Web Companion10.1145/3041021.3053361(1641-1646)Online publication date: 3-Apr-2017
https://dl.acm.org/doi/10.1145/3041021.3053361
Show More Cited By

Index Terms

Autopedia: automatic domain-independent Wikipedia article generation
1. Computing methodologies
  1. Machine learning
    1. Learning settings
2. Information systems
  1. Information systems applications

Recommendations

Two-stage approach to named entity recognition using Wikipedia and DBpedia
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication

In natural language understanding, extraction of named entity (NE) mentions in given text and classification of the mentions into pre-defined NE types are important processes. Most NE recognition (NER) relies on resources such as a training corpus or NE ...
DAWT: Densely Annotated Wikipedia Texts Across Multiple Languages
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion

In this work, we open up the DAWT dataset - Densely Annotated Wikipedia Texts across multiple languages. The annotations include labeled text mentions mapping to entities (represented by their Freebase machine ids) as well as the type of the entity. The ...
Learning multilingual named entity recognition from Wikipedia

We automatically create enormous, free and multilingual silver-standard training annotations for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner systems rely on statistical models of annotated data to identify ...

Comments

Information & Contributors

Information

Published In

WWW '11: Proceedings of the 20th international conference companion on World wide web

March 2011

552 pages

ISBN:9781450306379

DOI:10.1145/1963192

General Chairs:
S. Sadagopan
IIIT-Bangalore, India
,
Krithi Ramamritham
IIT-Bombay, India
,
Arun Kumar
IBM Research, India
,
M. P. Ravindra
Infosys E & R, India
,
Program Chairs:
Elisa Bertino
Purdue University, USA
,
Ravi Kumar
Yahoo! Research, USA

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
The International Institute of Information Technology Bangalore: The International Institute of Information Technology Bangalore

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 March 2011

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Poster

Conference

WWW '11

WWW '11: 20th International World Wide Web Conference

March 28 - April 1, 2011

Hyderabad, India

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
213
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 07 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Wang J(2020)AutoOverview: A Framework for Generating Structured Overviews over Many DocumentsComplexity and Approximation10.1007/978-3-030-41672-0_8(113-150)Online publication date: 21-Feb-2020
https://doi.org/10.1007/978-3-030-41672-0_8
Wang JZhang HZhang CYang WShao LWang JBorghoff USchimmler S(2019)An Effective Scheme for Generating An Overview Report over A Very Large Corpus of DocumentsProceedings of the ACM Symposium on Document Engineering 201910.1145/3342558.3345394(1-11)Online publication date: 23-Sep-2019
https://dl.acm.org/doi/10.1145/3342558.3345394
Pochampally YKarlapalem KBarrett RCummings RAgichtein EGabrilovich E(2017)Notability Determination for WikipediaProceedings of the 26th International Conference on World Wide Web Companion10.1145/3041021.3053361(1641-1646)Online publication date: 3-Apr-2017
https://dl.acm.org/doi/10.1145/3041021.3053361
Banerjee SMitra PVanoirbeek CGenevès P(2015)Filling the GapsProceedings of the 2015 ACM Symposium on Document Engineering10.1145/2682571.2797073(117-120)Online publication date: 8-Sep-2015
https://dl.acm.org/doi/10.1145/2682571.2797073
Banerjee SCaragea CMitra P(2014)Playscript Classification and Automatic Wikipedia Play Articles GenerationProceedings of the 2014 22nd International Conference on Pattern Recognition10.1109/ICPR.2014.624(3630-3635)Online publication date: 24-Aug-2014
https://dl.acm.org/doi/10.1109/ICPR.2014.624

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

Reference

Cited By

Index Terms

Recommendations

Two-stage approach to named entity recognition using Wikipedia and DBpedia

DAWT: Densely Annotated Wikipedia Texts Across Multiple Languages

Learning multilingual named entity recognition from Wikipedia

Comments

Information

Published In

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations