Skip to main content

Using a Normalized Score Centroid-Based Classifier to Classify Multi-label Herbal Formulae

  • Conference paper
  • 916 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8271))

Abstract

The popularity of herbal medicines has greatly increased in worldwide countries over recent years. Herbal formula is a form of traditional medicine where herbs are combined to heal patient to heal faster and more efficiency. Herbal formulae can be divided into one or more therapeutic categories. The categories of a formula are usually based on decision from a group of experts. To support experts for classifying a formula, the normalized score centroid-based, is proposed for multi-label herbal formulae classification. The centroid-based classifier with more advanced term weight scheme is used. The normalized scores are calculated. The maximum number and cutoff point are set to adjust the decision for multi-label herbal formulae. The experiment is done using a mixed data set of herbal formulae collected from the Natural List of Essential Medicine and the list of common household remedies for traditional medicine. Moreover, a set of well-known commercial products are used for evaluating the effectiveness of the proposed method. From the results, the normalized score centroid-based classifier is an efficient method to classify multi-label herbal formulae. Its performance is depended on the set values of the maximum category and the cutoff point.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lovell-Smith, H.D.: In defence of ayurvedic medicine. The New Zealand Medical Journal 119, 1–3 (2006)

    Google Scholar 

  2. Aziz, Z., Peng, T.N.: Herbal medicines: prevalence and predictors of use among malaysian adults. Complementary Therapies in Medicine 44, 44–50 (2009)

    Article  Google Scholar 

  3. Roiger, R., Geatz, M.: Data Mining: A Tutorial Based Primer. Addison-Wesley, Boston (2002)

    Google Scholar 

  4. Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34, 1–47 (2002)

    Article  MathSciNet  Google Scholar 

  5. Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using em. Machine Learning 39, 103–134 (2000)

    Article  MATH  Google Scholar 

  6. Duwairi, R., Al-Zubaidi, R.: A hierarchical k-nn classifier for textual data. The International Arab Journal of Information Technology 8, 251–259 (2011)

    Google Scholar 

  7. Lertnattee, V., Theeramunkong, T.: Effect of term distributions on centroid-based text categorization. Information Sciences 158, 89–115 (2004)

    Article  Google Scholar 

  8. Joachims, T.: Learning to Classify Text using Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2002)

    Book  Google Scholar 

  9. Han, E.-H., Karypis, G., Kumar, V.: Text categorization using weight-adjusted k-nearest neighbor classification. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD 2001. LNCS (LNAI), vol. 2035, pp. 53–65. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  10. Schapire, R.E., Singer, Y., Singhal, A.: Boosting and Rocchio applied to text filtering. In Croft, W.B., Moffat, A., Van Rijsbergen, C.J., Wilkinson, R., Zobel, J., eds.: Proceedings of SIGIR-98, 21st ACM International Conference on Research and Development in Information Retrieval, Melbourne, AU, ACM Press, New York, US (1998) 215–223

    Google Scholar 

  11. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management 24, 513–523 (1988)

    Article  Google Scholar 

  12. Singhal, A., Salton, G., Buckley, C.: Length normalization in degraded text collections. Technical Report TR95-1507 (1995)

    Google Scholar 

  13. Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. International Journal Data Warehousing and Mining 3, 1–13 (2007)

    Article  Google Scholar 

  14. Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Machine Learning 85, 333–359 (2011)

    Article  MathSciNet  Google Scholar 

  15. Fujino, A., Isozaki, H., Suzuki, J.: Multi-label text categorization with model combination based on f1-score maximization. In: Proceeding of The 3rd International Joint Conference on Natural Language Processing, pp. 823–828 (2008)

    Google Scholar 

  16. Hua, L.: Research on multi-classification and multi-label in text categorization. In: Proceeding of International Conference on Intelligent Human-Machine Systems and Cybernetics, pp. 86–89 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lertnattee, V., Chomya, S., Sornlertlamvanich, V. (2013). Using a Normalized Score Centroid-Based Classifier to Classify Multi-label Herbal Formulae. In: Ramanna, S., Lingras, P., Sombattheera, C., Krishna, A. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2013. Lecture Notes in Computer Science(), vol 8271. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-44949-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-44949-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-44948-2

  • Online ISBN: 978-3-642-44949-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics