Abstract
Labor-intensity of resource quality assessment is a bottleneck for content management in metadata-driven health information portals. This research proposes an adaptive attribute-based approach to assist informed judgments when assessing the quality of online information resources. It employs intelligent learning techniques to predict values of resource quality attributes based on previous value judgments encoded in resource metadata descriptions. The proposed approach is implemented as an intelligent quality attribute learning component of a portal’s content management system. This paper introduces the required machine learning procedures for the implementation of the component. Its prediction performance was evaluated via a series of machine learning experiments, which demonstrated the feasibility and the potential usefulness of the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Benigeri, M., Pluye, P.: Shortcomings of Health Information on the Internet. Health Promotion International 18, 381–386 (2003)
Evans, J., Manaszewicz, R., Xie, J.: The Role of Domain Expertise in Smart, User-Sensitive, Health Information Portal. In: the 42nd Hawaii International Conference on System Sciences, HICSS-42 (2009)
Xie, J.: Sustaining Quality Assessment Processes in User-Centred Health Information Portals. In: The 15th Americas Conference on Information Systems, AMCIS 2009 (2009)
Stvilia, B., Gasser, L., Twidale, M.B., Smith, L.C.: A Framework for Information Quality Assessment. Journal of the American Society for Information Science and Technology (JASIST) 58, 1720–1733 (2007)
Wang, R.Y., Strong, D.M.: Beyond Accuracy: What Data Quality Means to Data Consumers. Journal of Management Information Systems 12, 5–33 (1996)
Griffiths, K.M., Tang, T.T., Hawking, D., Christensen, H.: Automated Assessment of the Quality of Depression Websites. Journal of Medical Internet Research 7, e59 (2005)
Sessions, V., Valtorta, M.: Towards a Method for Data Accuracy Assessment Utilizing a Bayesian Network Learning Algorithm. Journal of Data and Information Quality 1, 1–34 (2009)
Burstein, F., Fisher, J., McKemmish, S., Manaszewicz, R., Malhotra, P.: User Centred Quality Health Information Provision: Benefits and Challenges. In: The 38th Annual Hawaii International Conference on System Sciences, HICSS 2005 (2005)
McKemmish, S., Manaszewicz, R., Burstein, F., Fisher, J.: Consumer Empowerment through Metadata-Based Quality Reporting: The Breast Cancer Knowledge Online Portal. Journal of the American Society for Information Science and Technology (JASIST) 60, 1792–1807 (2009)
Wang, R.Y., Reddy, M.P., Kon, H.B.: Toward Quality Data: An Attribute-Based Approach. Decision Support Systems 13, 349–372 (1995)
Anderson, J., McKemmish, S., Manaszewicz, R.: Quality Criteria Models Used to Evaluate Health Websites. In: The 10th Asia Pacific Special Health and Law Librarians Conference, pp. 337–354 (2003)
Williamson, K., Manaszewicz, R.: Breast Cancer Information Needs and Seeking: Towards an Intelligent, User Sensitive Portal to Breast Cancer Knowledge Online. The New Review of Information Behaviour Research 3, 203–219 (2003)
Eysenbach, G., Diepgen, T.L.: Towards Quality Management of Medical Information on the Internet: Evaluation, Labelling, and Filtering of Information. British Medical Journal (BMJ) 317, 1496–1502 (1998)
Wang, Y., Liu, Z.: Automatic Detecting Indicators for Quality of Health Information on the Web. International Journal of Medical Informatics 76, 575–582 (2007)
McKemmish, S., Manaszewicz, R., Cheah, C.: Bckonline Metadata Schema Version 1.0 (2004), http://www.sims.monash.edu.au/research/eirg/BCKO_MetadataSchema_Version16.doc
Griffiths, K.M., Christensen, H.: Website Quality Indicators for Consumers. Journal of Medical Internet Research 7, e55 (2005)
Price, S.L., Hersh, W.R.: Filtering Web Pages for Quality Indicators: An Empirical Approach to Finding High Quality Consumer Health Information on the World Wide Web. In: AMIA 1999 Annual Symposium, pp. 911–915 (1999)
Katerattanakul, P., Siau, K.: Measuring Information Quality of Web Sites: Development of an Instrument. In: The 20th International Conference on Information Systems, pp. 279–285 (1999)
Zhu, J., Gauch, S.: Incorporating Quality Metrics in Centralized/Distributed Information Retrieval on the World Wide Web. In: The 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 288–295. ACM Press, New York (2000)
Conrad, J.G., Leidner, J.L., Schilder, F.: Professional Credibility: Authority on the Web. In: The 2nd ACM Workshop on Information Credibility on the Web. ACM, New York (2008)
Freeman, K.S., Spyridakis, J.H.: An Examination of Factors That Affect the Credibility of Online Health Information. Technical Communication 51, 239–263 (2004)
Aladwani, A.M., Palvia, P.C.: Developing and Validating an Instrument for Measuring User-Perceived Web Quality. Information and Management 39, 467–476 (2002)
Hatala, M., Richards, G.: Value-Added Metatagging: Ontology and Rule Based Methods for Smarter Metadata. In: Rules and Rule Markup Languages for the Semantic Web: Second International Workshop, pp. 65–80. Springer, Heidelberg (2003)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Technologies. Elsevier, Amsterdam (2005)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The Weka Data Mining Software: An Update. SIGKDD Explorations 11, 10–18 (2009)
Evangelista, P.F., Embrechts, M.J., Szymanski, B.K.: Taming the Curse of Dimensionality in Kernels and Novelty Detection. In: Abraham, A., de Baets, B., Köppen, M., Nickolay, B. (eds.) Applied Soft Computing Technologies: The Challenge of Complexity, pp. 425–438. Springer, Heidelberg (2006)
Akbani, R., Kwek, S., Japkowicz, N.: Applying Support Vector Machines to Imbalanced Datasets. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 39–50. Springer, Heidelberg (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xie, J., Burstein, F. (2011). Using Machine Learning to Support Resource Quality Assessment: An Adaptive Attribute-Based Approach for Health Information Portals. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds) Database Systems for Adanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20244-5_50
Download citation
DOI: https://doi.org/10.1007/978-3-642-20244-5_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20243-8
Online ISBN: 978-3-642-20244-5
eBook Packages: Computer ScienceComputer Science (R0)