A Model for Evaluating the Quality of User-Created Documents

  • Conference paper
Information Retrieval Technology (AIRS 2008)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 4993)

Included in the following conference series: Asia Information Retrieval Symposium (AIRS)

Abstract

In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on a supervised classification approach in which the classifier's output score is treated as the quality of a given document. To exploit both textual and non-textual attributes of documents, we incorporate a number of objectively measurable, real-valued features selected according to predefined quality criteria. Experiments on two datasets of real-world documents show that textual features are stable indicators of document quality, and some features appear to be effective across general kinds of documents.
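The following is a minimal sketch of the scoring idea described in the abstract: train a supervised classifier on objectively measurable, real-valued document features and treat its output score (here, a class probability) as the document's quality. The specific feature names, the toy data, and the choice of scikit-learn's LogisticRegression are illustrative assumptions, not the paper's exact model or features.

import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical per-document features, all real-valued:
# [document length, readability score, author rating, view count]
# mixing textual and non-textual attributes as the abstract describes.
X_train = np.array([
    [850.0, 0.72, 4.5, 120.0],   # example of a high-quality document
    [120.0, 0.31, 1.0,   3.0],   # example of a low-quality document
    [600.0, 0.65, 3.8,  80.0],
    [ 90.0, 0.28, 0.5,   1.0],
])
y_train = np.array([1, 0, 1, 0])  # 1 = high quality, 0 = low quality

# Train a supervised classifier on the labeled documents.
model = LogisticRegression().fit(X_train, y_train)

# The probability assigned to the "high quality" class is used
# as the quality score of an unseen document.
new_doc = np.array([[400.0, 0.55, 2.9, 40.0]])
quality_score = model.predict_proba(new_doc)[0, 1]
print(f"estimated quality: {quality_score:.2f}")

In practice, any classifier that produces a real-valued output score (e.g., a maximum entropy model) could fill the same role; the sketch only illustrates how a classification score doubles as a quality estimate.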


Editor information

Hang Li, Ting Liu, Wei-Ying Ma, Tetsuya Sakai, Kam-Fai Wong, Guodong Zhou

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hoang, L., Lee, JT., Song, YI., Rim, HC. (2008). A Model for Evaluating the Quality of User-Created Documents. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_54

  • DOI: https://doi.org/10.1007/978-3-540-68636-1_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68633-0

  • Online ISBN: 978-3-540-68636-1

  • eBook Packages: Computer Science, Computer Science (R0)
