Abstract
With the development of computer vision and artificial intelligence technologies, the “Intelligent Arena” is becoming one of the new-emerging applications and research topics. Different from conventional sports video highlight detection, the intelligent playground can supply real-time and automatic sport video broadcast, highlight video generation, and sport technological analysis. In this paper, we have proposed a deep learning based intelligent basketball arena system to automatically broadcast the basketball match. First of all, with multiple cameras around the playground, the proposed system can automatically select the best camera to supply real-time high-quality broadcast. Furthermore, with basketball energy image and deep conventional neural network, we can accurately capture the scoring clips as the highlight video clips to supply the wonderful actions replay and online sharing. Finally, evaluations on a built real-world basketball match dataset demonstrate that the proposed system can obtain 94.59% accuracy with only 45 ms processing time (i.e., 10 ms live camera selection, 30 ms hotspot area detection, and 5 ms BEI+CNN) for each frame. As the outstanding performance, the proposed system has already been integrated into the commercial intelligent basketball arena applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
“Intelligent Arena,” http://www.huiti.com.
- 2.
“Beikantai,” http://www.kantai.tv.
References
Chen, C., Wang, O., Heinzle, S., Carr, P., Smolic, A., Gross, M.: Computational sports broadcasting: automated director assistance for live sports. In: IEEE ICME, pp. 1–6 (2013)
Chu, L., Jiang, S., Wang, S., Zhang, Y., Huang, Q.: Robust spatial consistency graph model for partial duplicate image retrieval. IEEE Trans. Multimedia 15(8), 1982–1996 (2013)
Chu, L., Wang, S., Liu, S., Huang, Q., Pei, J.: Alid: scalable dominant cluster detection. Proc. VLDB Endowment 8(8), 826–837 (2015)
Dollár, P.: Piotr’s Computer Vision Matlab Toolbox (PMT). https://github.com/pdollar/toolbox
Foote, E., Carr, P., Lucey, P., Sheikh, Y., Matthews, I.: One-man-band: a touch screen interface for producing live multi-camera sports broadcasts. In: ACM Multimedia, pp. 163–172 (2013)
Friedman, J., Hastie, T., Tibshirani, R., et al.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28(2), 337–407 (2000)
Gan, C., Wang, N., Yang, Y., Yeung, D., Hauptmann, A.G.: Devnet: a deep event network for multimedia event detection and evidence recounting. In: IEEE CVPR, pp. 2568–2577 (2015)
Gygli, M., Grabner, H., Riemenschneider, H., Gool, L.: Creating summaries from user videos. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8695, pp. 505–520. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10584-0_33
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: NIPS, pp. 1106–1114 (2012)
Liu, A., Xu, N., Su, Y., Lin, H., Hao, T., Yang, Z.: Single/multi-view human action recognition via regularized multi-task learning. Neurocomputing 151, 544–553 (2015)
Liu, C.: Beyond pixels: exploring new representations and applications for motion analysis. Ph.D. thesis, Cambridge, MA, USA (2009)
Liu, W., Mei, T., Zhang, Y.: Instant mobile video search with layered audio-video indexing and progressive transmission. IEEE Trans. Multimedia 16(8), 2242–2255 (2014)
Liu, W., Mei, T., Zhang, Y., Che, C., Luo, J.: Multi-task deep visual-semantic embedding for video thumbnail selection. In: IEEE CVPR, pp. 3707–3715 (2015)
Lucey, P., Bialkowski, A., Carr, P., Morgan, S., Matthews, I.A., Sheikh, Y.: Representing and discovering adversarial team behaviors using player roles. In: IEEE CVPR, pp. 2706–2713 (2013)
Maksai, A., Wang, X., Fua, P.: What Players do with the ball: a physically constrained interaction modeling. In: IEEE CVPR (2016)
Oldfield, R., Shirley, B., Cullen, N.: Demo paper: audio object extraction for live sports broadcast. In: IEEE ICME Workshop, pp. 1–2 (2013)
Sun, M., Farhadi, A., Seitz, S.: Ranking domain-specific highlights by analyzing edited videos. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 787–802. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10590-1_51
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: IEEE ICCV, pp. 4489–4497 (2015)
Yang, H., Wang, B., Lin, S., Wipf, D., Guo, M., Guo, B.: Unsupervised extraction of video highlights via robust recurrent auto-encoders. In: IEEE ICCV, pp. 4633–4641 (2015)
Yao, T., Mei, T., Rui, Y.: Highlight detection with pairwise deep ranking for first-person video summarization. In: IEEE CVPR, pp. 982–990 (2016)
Acknowledgement
This work is partially supported by the National High Technology Research and Development Program of China (2014AA015101), the National Natural Science Foundation of China (No. 61602049), the Funds for Creative Research Groups of China (No. 61421061), the Beijing Training Project for the Leading Talents in S&T (ljrc 201502), and the Fundamental Research Funds for the Central University (No. 2016RC43).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Liu, W., Liu, J., Gu, X., Liu, K., Dai, X., Ma, H. (2017). Deep Learning Based Intelligent Basketball Arena with Energy Image. In: Amsaleg, L., Guðmundsson, G., Gurrin, C., Jónsson, B., Satoh, S. (eds) MultiMedia Modeling. MMM 2017. Lecture Notes in Computer Science(), vol 10132. Springer, Cham. https://doi.org/10.1007/978-3-319-51811-4_49
Download citation
DOI: https://doi.org/10.1007/978-3-319-51811-4_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51810-7
Online ISBN: 978-3-319-51811-4
eBook Packages: Computer ScienceComputer Science (R0)