Active Learning based on Random Forest and Its Application to Terrain Classification

Gu, Yingjie; Zydek, Dawid; Jin, Zhong

doi:10.1007/978-3-319-08422-0_41

Yingjie Gu^5,6,
Dawid Zydek⁶ &
Zhong Jin⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 366))

3444 Accesses
4 Citations

Abstract

In this paper, a novel active learning technique was proposed for solving multiclass classification problem with random forest classifier. By combining uncertainty, density, and diversity criteria, the most informative samples are selected for manually labeling. The uncertainty criterion is implemented by analyzing the difference between the most votes and second most votes from classifier’s output. Samples in dense regions are thought to be more informative than samples in sparse regions. The average distance of a sample to its k-nearest unlabeled neighbors is computed to describe the sample’s density. The distance between a sample and its nearest labeled sample is used to measure the diversity of the sample. The larger the distance is, the less redundancy the sample is. To assess the effectiveness of the proposed method, it was compared with other techniques like traditional active learning based on random forest and SVM. The results of the experiment on terrain classification have demonstrated the effectiveness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

D. A. Cohn, Z. Ghahramani, and M. I. Jordan, “Active learning with statistical models,” Journal of Artificial Intelligence Research, vol. 4, pp. 129–145, 1996.
MATH Google Scholar
B. Settles, “Active learning literature survey,” University of Wisconsin, Madison, 2010.
Google Scholar
S. Tong and E. Chang, “Support vector machine active learning for image retrieval,” in Proceedings of the ninth ACM international conference on Multimedia. ACM, 2001, pp. 107–118.
Google Scholar
S. Tong and D. Koller, “Support vector machine active learning with applications to text classification,” The Journal of Machine Learning Research, vol. 2, pp. 45–66, 2002.
MATH Google Scholar
S.-J. Huang, R. Jin, and Z.-H. Zhou, “Active learning by querying informative and representative examples.” in NIPS, vol. 23, 2010, pp. 892–900.
Google Scholar
S. C. Hoi, R. Jin, J. Zhu, and M. R. Lyu, “Semi-supervised svm batch mode active learning for image retrieval,” in Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008, pp. 1–7.
Google Scholar
D. Tuia, F. Ratle, F. Pacifici, M. F. Kanevski, and W. J. Emery, “Active learning methods for remote sensing image classification,” Geoscience and Remote Sensing, IEEE Transactions on, vol. 47, no. 7, pp. 2218–2232, 2009.
Article Google Scholar
S. Patra and L. Bruzzone, “A cluster-assumption based batch mode active learning technique,” Pattern Recognition Letters, vol. 33, no. 9, pp. 1042–1048, 2012.
Article Google Scholar
L. Shi, Y. Zhao, and J. Tang, “Batch mode active learning for networked data,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 3, no. 2, p. 33, 2012.
Google Scholar
G. Chmaj, K. Walkowiak, M. Tarnawski, and M. Kucharzak, “Heuristic algorithms for optimization of task allocation and result distribution in peer-to-peer computing systems,” International Journal of Applied Mathematics and Computer Science, vol. 22, no. 3, pp. 733–748, 2012.
Article MathSciNet Google Scholar
G. Chmaj and S. Latifi, “Decentralization of a multi data source distributed processing system using a distributed hash table.” International Journal of Communications, Network & System Sciences, vol. 6, no. 10, 2013.
Google Scholar
D. DeBarr and H. Wechsler, “Spam detection using clustering, random forests, and active learning,” in Sixth Conference on Email and Anti-Spam. Mountain View, California, 2009.
Google Scholar
L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, pp. 5–32, 2001.
Article MathSciNet MATH Google Scholar
J. Gall and V. Lempitsky, “Class-specific hough forests for object detection,” in Decision Forests for Computer Vision and Medical Image Analysis. Springer, 2013, pp. 143–157.
Google Scholar
A. Yao, J. Gall, and L. Van Gool, “A hough transform-based voting framework for action recognition,” in Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2010, pp. 2061–2068.
Google Scholar
C. Marsala and M. Detyniecki, “High scale video mining with forests of fuzzy decision trees,” in Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology. ACM, 2008, pp. 413–418.
Google Scholar
Random forest packages. [Online]. Available: http://cran.r-project.org/web/packages/
University of oulu texture database. [Online]. Available: http://www.outex.oulu.fi/temp/
Hand-labeled darpa lagr datasets. [Online]. Available: http://www.mikeprocopio.com/labeledlagrdata.html
M. Pietikäinen, T. Nurmela, T. Mäenpää, and M. Turtinen, “View-based recognition of real-world textures,” Pattern Recognition, vol. 37, no. 2, pp. 313–323, 2004.
Article MATH Google Scholar
T. Ojala, M. Pietikainen, and T. Maenpaa, “Multiresolution gray-scale and rotation invariant texture classification with local binary patterns,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 24, no. 7, pp. 971–987, 2002.
Article MATH Google Scholar
M. J. Procopio, J. Mulligan, and G. Grudic, “Learning terrain segmentation with classifier ensembles for autonomous robot navigation in unstructured environments,” Journal of Field Robotics, vol. 26, no. 2, pp. 145–175, 2009.
Article Google Scholar

Download references

Acknowledgment

This work is partially supported by National Natural Science Foundation of China under Grant Nos. 61373063, 61233011, 61125305, 61375007, 61220301, and by National Basic Research Program of China under Grant No. 2014CB349303.

Author information

Authors and Affiliations

Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
Yingjie Gu & Zhong Jin
Department of Electrical Engineering, Idaho State University, Pocatello, 83209-8060, USA
Yingjie Gu & Dawid Zydek

Authors

Yingjie Gu
View author publications
You can also search for this author in PubMed Google Scholar
Dawid Zydek
View author publications
You can also search for this author in PubMed Google Scholar
Zhong Jin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yingjie Gu .

Editor information

Editors and Affiliations

University of Nevada at Las Vegas, Las Vegas, Nevada, USA
Henry Selvaraj
Department of Electrical Engineering, Idaho State University, Pocatello, Idaho, USA
Dawid Zydek
University of Nevada at Las Vegas, Las Vegas, Nevada, USA
Grzegorz Chmaj

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gu, Y., Zydek, D., Jin, Z. (2015). Active Learning based on Random Forest and Its Application to Terrain Classification. In: Selvaraj, H., Zydek, D., Chmaj, G. (eds) Progress in Systems Engineering. Advances in Intelligent Systems and Computing, vol 366. Springer, Cham. https://doi.org/10.1007/978-3-319-08422-0_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-08422-0_41
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08421-3
Online ISBN: 978-3-319-08422-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics