Abstract
This paper describes the system overview and biped walking mechanism of the humanoid robot, Spark. Spark is able to mimic various human expressions, hand gestures, walking actions, recognize various human faces and objects and perform vocal communication. Spark has a total of 66 Degrees of Freedom (DOF). Each arm has 20 DOF and each leg has 6 DOF. The head has 12 DOF and Trunk has 2 DOF. The Broadcom BCM2711 processor is used for all the major operations. It is also acting as a Master control unit for other microcontrollers present in different parts of Spark’s body. This system is based on the Raspbian Operating System. Mechanical assembly is based on a hybrid system which includes both serial and parallel operations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sakagami, Y., Watanabe, R., Aoyama, C., Matsunaga, S., Higaki, N., Fujimura, K.: The intelligent ASIMO: system overview and integration, vol. 3, pp. 2478–2483 (2002). https://doi.org/10.1109/IRDS.2002.1041641
Feng, S., Whitman, E., Xinjilefu, X., Atkeson, C.G.: Optimization based full body control for the atlas robot, pp. 120–127 (2014). https://doi.org/10.1109/HUMANOIDS.2014.7041347
Park, I.W., Kim, J.Y., Lee, J., Oh, J.H.: Mechanical design of humanoid robot platform KHR-3 (KAIST humanoid robot 3: HUBO), pp. 321–326 (2005). https://doi.org/10.1109/ICHR.2005.1573587
OpenCV Library. https://docs.opencv.org/master/d6/d00/tutorial_py_root.html
Speech recognition library. https://pypi.org/project/SpeechRecognition/
pyttsx3 library. https://pypi.org/project/pyttsx3/
Raspberry Pi 4 (BCM2711). https://datasheets.raspberrypi.org/bcm2711/bcm2711-peripherals.pdf
ATMEGA CH2560. https://docs.arduino.cc/hardware/mega-2560
Megapixel Sony IMX219PQ. http://www.camera-module.com/product/8mpcameramodule/
Gim, K.G., Kim, J., Yamane, K.: Design and fabrication of a bipedal robot using serial-parallel hybrid leg mechanism, pp. 5095–5100 (2018). https://doi.org/10.1109/IROS.2018.8594182
Serial communication. https://roboticsbackend.com/raspberry-pi-arduino-serial-communication/
Adafruit PCA9685 16-channel servo driver. https://learn.adafruit.com/16-channel-pwm-servo-driver?view=all
Pandey, V., Kumar, S., Kumar, V., Goel, P.: A review paper on I2C communication protocol. Int. J. Adv. Res. Ideas Innov. Technol. 4, 340–343 (2018). https://www.ijariit.com/manuscript/a-review-paper-on-i2c-communication-protocol/
Arduino Nano. https://docs.arduino.cc/hardware/nano
MPU 6050 gyroscope and accelerometer sensor. https://invensense.tdk.com/products/motion-tracking/6-axis/mpu-6050/
Kajita, S., Nagasaki, T., Kaneko, K., Hirukawa, H.: ZMP-based biped running control. IEEE Robot. Autom. Mag. 14(2), 63–72 (2007). https://doi.org/10.1109/MRA.2007.380655
Types of chatbots. https://www.freshworks.com/live-chat-software/chatbots/three-types-of-chatbots/
Dahiya, M.: A tool of conversation: chatbot. Int. J. Comput. Sci. Eng. 5, 158–161 (2017). https://www.researchgate.net/publication/321864990_A_Tool_of_Conversation_Chatbot
Real-time face detection and recognition with SVM and HOG features. https://www.eeweb.com/real-time- face- detection-and-recognition-with-svm-and-hog-features/
GitHub - chuanqi305/MobileNet-SSD: caffe implementation of Google MobileNet SSD detection network, with pretrained weights on VOC0712 and mAP = 0.727. https://github.com/chuanqi305/MobileNet-SSD
Machine learning is fun! part 4: modern face recognition with deeplearning. https://medium.com/@ageitgey/machine-learning-is-fun-part-4-modern-face-recognition-with-deep-learning-c3cffc121d78
dlib C++ library: high quality face recognition with deep metric learning. http://blog.dlib.net/2017/02/high-quality-face-recognition-with-deep.html
Github - davisking/dlib: a toolkit for making real world machine learning and data analysis applications in C++. https://github.com/davisking/dlib
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition, pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
COCO dataset - common objects in contexts. https://cocodataset.org/#home
SSD MobileNet OpenVINO toolkit. https://docs.openvinotoolkit.org/2021.2/omz_models_public_ssd_mobilenet_v1_coco_ssd_mobilenet_v1_coco.html
Acknowledgement
This research is mainly supported by PCCOER (Pimpri Chinchwad College of Engineering & Research). We thank Mr. Walmik R. Gawhale for the constant guidance, support and initial funding in this research. We thank Dr. H.U. Tiwari (Principal, PCCOER) and Dr. Rahul Mapari (H.O.D., E &TC, PCCOER) for their guidance and support throughout this research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Gawhale, K. et al. (2022). Humanoid Robot - Spark. In: Raman, B., Murala, S., Chowdhury, A., Dhall, A., Goyal, P. (eds) Computer Vision and Image Processing. CVIP 2021. Communications in Computer and Information Science, vol 1568. Springer, Cham. https://doi.org/10.1007/978-3-031-11349-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-11349-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11348-2
Online ISBN: 978-3-031-11349-9
eBook Packages: Computer ScienceComputer Science (R0)