Skip to main content

Active Learning based on Random Forest and Its Application to Terrain Classification

  • Conference paper
Book cover Progress in Systems Engineering

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 366))

Abstract

In this paper, a novel active learning technique was proposed for solving multiclass classification problem with random forest classifier. By combining uncertainty, density, and diversity criteria, the most informative samples are selected for manually labeling. The uncertainty criterion is implemented by analyzing the difference between the most votes and second most votes from classifier’s output. Samples in dense regions are thought to be more informative than samples in sparse regions. The average distance of a sample to its k-nearest unlabeled neighbors is computed to describe the sample’s density. The distance between a sample and its nearest labeled sample is used to measure the diversity of the sample. The larger the distance is, the less redundancy the sample is. To assess the effectiveness of the proposed method, it was compared with other techniques like traditional active learning based on random forest and SVM. The results of the experiment on terrain classification have demonstrated the effectiveness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. D. A. Cohn, Z. Ghahramani, and M. I. Jordan, “Active learning with statistical models,” Journal of Artificial Intelligence Research, vol. 4, pp. 129–145, 1996.

    MATH  Google Scholar 

  2. B. Settles, “Active learning literature survey,” University of Wisconsin, Madison, 2010.

    Google Scholar 

  3. S. Tong and E. Chang, “Support vector machine active learning for image retrieval,” in Proceedings of the ninth ACM international conference on Multimedia. ACM, 2001, pp. 107–118.

    Google Scholar 

  4. S. Tong and D. Koller, “Support vector machine active learning with applications to text classification,” The Journal of Machine Learning Research, vol. 2, pp. 45–66, 2002.

    MATH  Google Scholar 

  5. S.-J. Huang, R. Jin, and Z.-H. Zhou, “Active learning by querying informative and representative examples.” in NIPS, vol. 23, 2010, pp. 892–900.

    Google Scholar 

  6. S. C. Hoi, R. Jin, J. Zhu, and M. R. Lyu, “Semi-supervised svm batch mode active learning for image retrieval,” in Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008, pp. 1–7.

    Google Scholar 

  7. D. Tuia, F. Ratle, F. Pacifici, M. F. Kanevski, and W. J. Emery, “Active learning methods for remote sensing image classification,” Geoscience and Remote Sensing, IEEE Transactions on, vol. 47, no. 7, pp. 2218–2232, 2009.

    Article  Google Scholar 

  8. S. Patra and L. Bruzzone, “A cluster-assumption based batch mode active learning technique,” Pattern Recognition Letters, vol. 33, no. 9, pp. 1042–1048, 2012.

    Article  Google Scholar 

  9. L. Shi, Y. Zhao, and J. Tang, “Batch mode active learning for networked data,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 3, no. 2, p. 33, 2012.

    Google Scholar 

  10. G. Chmaj, K. Walkowiak, M. Tarnawski, and M. Kucharzak, “Heuristic algorithms for optimization of task allocation and result distribution in peer-to-peer computing systems,” International Journal of Applied Mathematics and Computer Science, vol. 22, no. 3, pp. 733–748, 2012.

    Article  MathSciNet  Google Scholar 

  11. G. Chmaj and S. Latifi, “Decentralization of a multi data source distributed processing system using a distributed hash table.” International Journal of Communications, Network & System Sciences, vol. 6, no. 10, 2013.

    Google Scholar 

  12. D. DeBarr and H. Wechsler, “Spam detection using clustering, random forests, and active learning,” in Sixth Conference on Email and Anti-Spam. Mountain View, California, 2009.

    Google Scholar 

  13. L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, pp. 5–32, 2001.

    Article  MathSciNet  MATH  Google Scholar 

  14. J. Gall and V. Lempitsky, “Class-specific hough forests for object detection,” in Decision Forests for Computer Vision and Medical Image Analysis. Springer, 2013, pp. 143–157.

    Google Scholar 

  15. A. Yao, J. Gall, and L. Van Gool, “A hough transform-based voting framework for action recognition,” in Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2010, pp. 2061–2068.

    Google Scholar 

  16. C. Marsala and M. Detyniecki, “High scale video mining with forests of fuzzy decision trees,” in Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology. ACM, 2008, pp. 413–418.

    Google Scholar 

  17. Random forest packages. [Online]. Available: http://cran.r-project.org/web/packages/

  18. University of oulu texture database. [Online]. Available: http://www.outex.oulu.fi/temp/

  19. Hand-labeled darpa lagr datasets. [Online]. Available: http://www.mikeprocopio.com/labeledlagrdata.html

  20. M. Pietikäinen, T. Nurmela, T. Mäenpää, and M. Turtinen, “View-based recognition of real-world textures,” Pattern Recognition, vol. 37, no. 2, pp. 313–323, 2004.

    Article  MATH  Google Scholar 

  21. T. Ojala, M. Pietikainen, and T. Maenpaa, “Multiresolution gray-scale and rotation invariant texture classification with local binary patterns,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 24, no. 7, pp. 971–987, 2002.

    Article  MATH  Google Scholar 

  22. M. J. Procopio, J. Mulligan, and G. Grudic, “Learning terrain segmentation with classifier ensembles for autonomous robot navigation in unstructured environments,” Journal of Field Robotics, vol. 26, no. 2, pp. 145–175, 2009.

    Article  Google Scholar 

Download references

Acknowledgment

This work is partially supported by National Natural Science Foundation of China under Grant Nos. 61373063, 61233011, 61125305, 61375007, 61220301, and by National Basic Research Program of China under Grant No. 2014CB349303.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yingjie Gu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Gu, Y., Zydek, D., Jin, Z. (2015). Active Learning based on Random Forest and Its Application to Terrain Classification. In: Selvaraj, H., Zydek, D., Chmaj, G. (eds) Progress in Systems Engineering. Advances in Intelligent Systems and Computing, vol 366. Springer, Cham. https://doi.org/10.1007/978-3-319-08422-0_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08422-0_41

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08421-3

  • Online ISBN: 978-3-319-08422-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics