Abstract
Object recognition is a significant task in an industrial assembly line, where a robotic arm should pick a small, textureless, and mostly homogeneous object to place it in its designated location. Despite all the recent advancements in object recognition, the problem still remains challenging for textureless industrial parts with similar shapes. In this paper, we propose an effective and real-time system using a single RGB-D camera to recognize the industrial objects placed at arbitrary viewing direction around the vertical axis. First, we segment the region of interest using an improved watershed segmentation approach. Then, we extract low-level geometrical features. Finally, we train five models and compare their accuracy based on different rotation strategies. Our experimental results highlight the efficiency as well as real-time suitability of our approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Abbasi, S., Mokhtarian, F., Kittler, J.: Enhancing css-based shape retrieval for objects with shallow concavities. Image Vis. Comput. 18(3), 199–211 (2000)
Balakrishnama, S., Ganapathiraju, A.: Linear discriminant analysis-a brief tutorial. Inst. Signal Inf. Process. 18, 1–8 (1998)
Beucher, S., Meyer, F.: The morphological approach to segmentation: the watershed transformation. Math. Morphol. Image Process. 34, 433–433 (1992). Optical Engineering-New York-Marcel Dekker Incorporated-
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56
Celebi, M.E., Aslandogan, Y.A.: A comparative study of three moment-based shape descriptors. In: International Conference on Information Technology: Coding and Computing (ITCC’05)-Volume II, vol. 1, pp. 788–793. IEEE (2005)
Cevikalp, H., Triggs, B.: Efficient object detection using cascades of nearest convex model classifiers. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3138–3145. IEEE (2012)
Chen, Y.W., Chen, Y.Q.: Invariant description and retrieval of planar shapes using radon composite features. IEEE Trans. Signal Process. 56(10), 4762–4771 (2008)
Chuang, G.H., Kuo, C.C.: Wavelet descriptor of planar curves: theory and applications. IEEE Trans. Image Process. 5(1), 56–70 (1996)
Davies, E.: Machine Vision: Theory, Algorithms, Practicalities. Academic Press, Cambridge (1997)
Ding, W., Wang, X., Liu, H., Hu, B.: An empirical study of shape recognition in ensemble learning context. In: 2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR), pp. 256–261. IEEE (2018)
Feng, Y., An, X., Liu, X.: The application of scale invariant feature transform fused with shape model in the human face recognition. In: 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), pp. 1716–1720. IEEE (2016)
Flusser, J.: Invariant shape description and measure of object similarity. In: 1992 International Conference on Image Processing and its Applications, pp. 139–142. IET (1992)
Freeman, H.: On the encoding of arbitrary geometric configurations. IRE Trans. Electron. Comput. 2, 260–268 (1961)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Mach. Learn. 29(2–3), 131–163 (1997)
Girshick, R.: Fast r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2015)
Guru, D., Nagendraswamy, H.: Symbolic representation of two-dimensional shapes. Pattern Recogn. Lett. 28(1), 144–155 (2007)
Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc. Ser. C (Appl. Stat.) 28(1), 100–108 (1979)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Hearst, M.A., Dumais, S.T., Osuna, E., Platt, J., Scholkopf, B.: Support vector machines. IEEE Intell. Syst. Appl. 13(4), 18–28 (1998)
Hinterstoisser, S., Holzer, S., Cagniart, C., Ilic, S., Konolige, K., Navab, N., Lepetit, V.: Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: 2011 International Conference on Computer Vision, pp. 858–865. IEEE (2011)
Hu, M.K.: Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8(2), 179–187 (1962)
Huang, C., Huang, J.: A fast hog descriptor using lookup table and integral image (2017). arXiv preprint arXiv:1703.06256
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall, Upper Saddle River (1988)
Kohonen, T.: Learning vector quantization. In: Self-Organizing Maps, pp. 175–189. Springer (1995). https://doi.org/10.1007/978-3-642-97610-0_6
Kortli, Y., Jridi, M., Al Falou, A., Atri, M.: A comparative study of CFs, LBP, HOG, SIFT, SURF, and BRIEF for security and face recognition (2018)
Kpalma, K., Ronsin, J.: Multiscale contour description for pattern recognition. Pattern Recogn. Lett. 27(13), 1545–1559 (2006)
Kurnianggoro, L., Jo, K.H., et al.: A survey of 2d shape representation: methods, evaluations, and future research directions. Neurocomputing 300, 1–16 (2018)
Lee, S.M., Abbott, A.L., Clark, N.A., Araman, P.A.: A shape representation for planar curves by shape signature harmonic embedding. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), vol. 2, pp. 1940–1947. IEEE (2006)
Liang, M., Hu, X.: Recurrent convolutional neural network for object recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3367–3375 (2015)
Mountrakis, G., Im, J., Ogole, C.: Support vector machines in remote sensing: a review. ISPRS J. Photogram. Rem. Sens. 66(3), 247–259 (2011)
Pham, T.T., Do, T.T., Sünderhauf, N., Reid, I.: Scenecut: joint geometric and object segmentation for indoor scenes. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 1–9. IEEE (2018)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)
Rahim, M.A., Azam, M.S., Hossain, N., Islam, M.R.: Face recognition using local binary patterns (lbp). Glob. J. Comput. Sci. Technol. (2013)
Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 6, 1137–1149 (2017)
Ren, Z., Gao, S., Chia, L.T., Tsang, I.W.H.: Region-based saliency detection and its application in object recognition. IEEE Trans. Circ. Syst. Video Technol. 24(5), 769–779 (2013)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.R.: Orb: an efficient alternative to sift or surf. In: ICCV, vol. 11, p. 2. Citeseer (2011)
Smith, B.L., Williams, B.M., Oswald, R.K.: Comparison of parametric and nonparametric models for traffic flow forecasting. Transp. Res. Part C Emerg. Technol. 10(4), 303–321 (2002)
Sonka, M., Hlavac, V., Boyle, R.: Image Processing, Analysis, and Machine Vision. Cengage Learning, Boston (2014)
Verma, R., Kaur, R.: Enhanced character recognition using surf feature and neural network technique. IJCSIT Int. J. Comput. Sci. Inf. Technol. 5(4), 5565–5570 (2014)
Yang, M.: Extraction d’attributs et mesures de similarité basées sur la forme. Ph.D. thesis, INSA de Rennes (2008)
Yang, M., Kpalma, K., Ronsin, J.: Scale-controlled area difference shape descriptor. In: Document Recognition and Retrieval XIV, vol. 6500, p. 650003. International Society for Optics and Photonics (2007)
Zhang, D., Lu, G.: A comparative study of curvature scale space and fourier descriptors for shape-based image retrieval. J. Vis. Commun. Image Representation 14(1), 39–57 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Lugo, G., Hajari, N., Reddy, A., Cheng, I. (2020). Textureless Object Recognition Using an RGB-D Sensor. In: McDaniel, T., Berretti, S., Curcio, I., Basu, A. (eds) Smart Multimedia. ICSM 2019. Lecture Notes in Computer Science(), vol 12015. Springer, Cham. https://doi.org/10.1007/978-3-030-54407-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-54407-2_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-54406-5
Online ISBN: 978-3-030-54407-2
eBook Packages: Computer ScienceComputer Science (R0)