Abstract
Deep learning has become a powerful technology for image recognition, gaming, information retrieval, and many other areas that require intelligent data processing. However, the huge amounts of data and complex computations involved prevent deep learning from being practical in mobile applications. In this paper, we propose a mobile cloud computing framework for deep learning. The architecture places the training process and the model repository on cloud platforms, and the recognition process and data gathering on mobile devices. Communication is carried out via the Git protocol to ensure successful data transmission in unstable network environments. As an example application, we used a smart car camera that detects objects in videos recorded while driving, and implemented the system on an NVIDIA Jetson TK1. Experimental results show that the detection rate can reach four frames per second with Faster R-CNN and the ZF model, and that the system works well even when the network connection is unstable. We also compared the performance of the system with and without a GPU, and found that the GPU still plays a critical role on the recognition side of deep learning.
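The abstract's key communication idea is that Git's pull semantics tolerate an unstable link: an interrupted transfer leaves the local checkout in its previous consistent state, so the device can simply back off and retry. The sketch below illustrates that retry loop on the mobile side; the function name, repository path, and retry parameters are illustrative assumptions, not part of the paper's implementation.

```python
import subprocess
import time


def sync_model(repo_dir, retries=5, delay=10, run=None):
    """Pull the latest trained model from the cloud-side Git repository.

    A failed pull leaves the local working tree in its previous
    consistent state, so on an unstable network we can back off and
    retry without risking a half-updated model. `run` is injectable
    for testing; by default it shells out to git.
    """
    if run is None:
        run = lambda: subprocess.call(
            ["git", "-C", repo_dir, "pull", "--ff-only"]
        )
    for _ in range(retries):
        if run() == 0:  # exit code 0 means the pull succeeded
            return True
        time.sleep(delay)  # wait for connectivity to come back
    return False
```

In this design the recognition process would call `sync_model` periodically (or before each drive), and keep using the last successfully pulled model whenever the network is down.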
References
LeCun, Y., Jackel, L., Bottou, L., Brunot, A., Cortes, C., Denker, J., Drucker, H., Guyon, I., Muller, U., Sackinger, E., et al.: Comparison of learning algorithms for handwritten digit recognition. In: International Conference on Artificial Neural Networks, vol. 60, pp. 53–60 (1995)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Socher, R., Perelygin, A., Wu, J.Y., Chuang, J., Manning, C.D., Ng, A.Y., Potts, C.: Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), vol. 1631, p. 1642. Citeseer (2013)
Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, pp. 160–167. ACM (2008)
Johnson, J., Karpathy, A., Fei-Fei, L.: DenseCap: fully convolutional localization networks for dense captioning. arXiv preprint arXiv:1511.07571 (2015)
Fakoor, R., Ladhak, F., Nazi, A., Huber, M.: Using deep learning to enhance cancer diagnosis and classification. In: Proceedings of the International Conference on Machine Learning (2013)
Dinh, H.T., Lee, C., Niyato, D., Wang, P.: A survey of mobile cloud computing: architecture, applications, and approaches. Wirel. Commun. Mob. Comput. 13, 1587–1611 (2013)
Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H., et al.: Greedy layer-wise training of deep networks. Adv. Neural Inform. Process. Syst. 19, 153 (2007)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. arXiv preprint arXiv:1506.02640 (2015)
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: Overfeat: integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229 (2013)
Hinton, G., Deng, L., Yu, D., Dahl, G.E., Mohamed, A.R., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath, T.N., et al.: Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Sig. Process. Mag. 29, 82–97 (2012)
Dahl, G.E., Yu, D., Deng, L., Acero, A.: Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. IEEE Trans. Audio Speech Lang. Process. 20, 30–42 (2012)
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1026–1034 (2015)
Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
Sutskever, I., Martens, J., Dahl, G.E., Hinton, G.E.: On the importance of initialization and momentum in deep learning. ICML 28(3), 1139–1147 (2013)
Huang, J., Qian, F., Gerber, A., Mao, Z.M., Sen, S., Spatscheck, O.: A close examination of performance and power characteristics of 4G LTE networks. In: Proceedings of the 10th International Conference on Mobile Systems, Applications, and Services, pp. 225–238. ACM (2012)
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093 (2014)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Sanneck, H.A., Carle, G.: Framework model for packet loss metrics based on loss runlengths. In: Proceedings of the SPIE, Multimedia Computing and Networking 2000, vol. 3969 (1999)
Han, S., Mao, H., Dally, W.J.: Deep compression: compressing deep neural networks with pruning, trained quantization and Huffman coding. arXiv preprint arXiv:1510.00149 (2015)
Acknowledgment
This study was conducted under The Core Technologies of Smart Handheld Devices (3/4) project of the Institute for Information Industry, which is subsidized by the Ministry of Economic Affairs, Taiwan. The authors thank the Institute for Information Industry for financial support under grant number 105-EC-17-A-24-0691.
Copyright information
© 2016 Springer International Publishing AG
Cite this paper
Chen, C.H., Lee, C.R., Lu, W.C.H. (2016). A Mobile Cloud Framework for Deep Learning and Its Application to Smart Car Camera. In: Hsu, C.H., Wang, S., Zhou, A., Shawkat, A. (eds) Internet of Vehicles – Technologies and Services. IOV 2016. Lecture Notes in Computer Science, vol. 10036. Springer, Cham. https://doi.org/10.1007/978-3-319-51969-2_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51968-5
Online ISBN: 978-3-319-51969-2
eBook Packages: Computer Science (R0)