A trajectory-based analysis of coordinated team activity in a basketball game

doi:10.1016/j.cviu.2008.03.001

Computer Vision and Image Understanding

Volume 113, Issue 5, May 2009, Pages 612-621

https://doi.org/10.1016/j.cviu.2008.03.001 Get rights and content

Abstract

This paper proposes a novel, trajectory-based approach to the automatic recognition of complex multi-player behavior in a basketball game. First, a probabilistic play model is applied to the player-trajectory data in order to segment the play into game phases (offense, defense, time out). In this way, both the temporal boundaries of the observed activity and its broader context are obtained. Next, the team’s activity is analyzed in more detail by detecting the key elements of basketball play. Following basketball theory, these key elements (starting formation, screen, and move) are the building blocks of basketball play, and therefore their temporal order is used to produce a semantic description of the observed activity. Finally, the activity is recognized by comparing its semantic description with the descriptions of manually defined templates, stored in a database. The effectiveness and robustness of the proposed approach is demonstrated on two championship games and 71 examples of three types of basketball offense.

Introduction

One of the main challenges in sports science is an objective analysis of player performance. While an individual player’s physical abilities can be readily tested in the laboratory, a team’s performance can only be observed during an actual game. This process may include advanced analysis methods, such as video recording and statistical analysis, but it nevertheless relies on observation and manual annotation by sports experts, with the potential risk of becoming too subjective. Additionally, manual annotation is a time-consuming and tedious task, mostly limited either to academic research or to the small number of teams that can afford a sufficient number of qualified experts. Moreover, some researchers have found [1] that even sports experts often cannot observe and recall all the details that can prove crucial for the correct interpretation of the results. This is the reason for an increasing volume of research concerned with the automatic or semi-automatic recognition and analysis of human behavior in sports. The ultimate goal of such research is to develop methods for the automatic interpretation and analysis of a team’s performance, which would present a concise summary of the team’s and the players’ strengths, weaknesses, and mistakes.

The main focus of this article is the challenge of observing a basketball game and interpreting the activity of the team on the court. The quality of the team has two important components. The first component encompasses the skills of the players, and is expressed as their technical knowledge. The second component is expressed as the overall team tactics. In order for a team to be successful, it needs individuals with excellent technical skills. Nevertheless, these individuals have to be able to act together as a group—a task that requires good coordination between the individual players and can only be achieved with a lot of training. Following this challenge, the focus of this article is on an analysis of coordinated activity in team sports, in particular basketball.

It is widely accepted [2] that the most successful players have the capability to react differently in similar situations. On the team level, the situation is similar—good teams can quickly change their tactics if needed. Such behavior prevents the opposing teams from preparing a good defense, and keeps the play interesting, as both teams have to continuously adapt to the situation on the court. This introduces a certain level of complexity and randomness to a team’s performance, and makes the design of an entirely automatic analysis system, which would recognize, understand, and grade every possible situation on the court, extremely difficult. However, due to the nature of a sport’s rules and rigorous player training, the motion of the players across the court is not entirely random, and it is reasonable to expect that it is possible to extract some common features of the team, especially when considering that coordinated team play is usually practised in advance.

The remainder of this paper is structured as follows. In the rest of this section, we give a short overview of the related work and present the concept of our approach. The methods for the segmentation and recognition of the complex multi-agent behavior are presented in Sections 2 and 3 the experimental results are presented. Finally, Section 4 draws the main conclusions and describes our future work.

A lot of work concerning trajectory-based motion modeling and analysis has been presented in recent years. For example, Johnson and Hogg [3], [4] presented two approaches for modeling variable, non-linear behavior. In their first approach a competitive learning, neural network was used on flow vectors from image sequences of pedestrians [3], and in the other, a probabilistic motion model was obtained using a Gaussian mixture model, representing the system state changes of pedestrian motion [4].

The more sophisticated, trajectory-based, multi-agent, action-recognition, and analysis approaches involve the compact representation and modeling of actions and interactions and their logical and temporal relations [5], [6], [7], [8], [9], [10]. Rao et al. [5] presented an approach to view invariant action-recognition that is capable of explaining an action in terms of meaningful action units called dynamic instants and intervals. Li and Woodham [6] presented a system for representing and reasoning about selected hockey plays based on trajectory data, augmented with domain-specific knowledge, such as forward/backward skating, puck possession, etc. Intille and Bobick [7] built models of football plays using belief networks and temporal graphs. A similar approach was used by Jug et al. [9] to assess team performance in basketball offense. The main contribution of the latter two approaches is the representation of multi-agent activity and recognition from noisy trajectory data. This is done by dividing the multi-agent activity into individual visually grounded, goal-based primitives that are probabilistically integrated with the low-order temporal and logical relationships. However, there are two main problems with such an approach. The first one is the need for a precise temporal segmentation of the analyzed trajectories. The second problem is the difficulty of building temporal and logical relationships, especially because of the many different parameters that need to be defined manually. Therefore, such an approach is not particularly suitable for cases when either a large quantity of data has to be analyzed or many different behavior models are used in the analysis.

Other sport-related research has focused on the content-based indexing of video footage [11], [12], learning motion models in golf [13], soccer [14], baseball [15], or choreography in dance [16], and acrobatics [17].

The long-term aim of the research described above is to provide athletes with the feedback they need to improve their kinematic skills. It can also be used to develop and monitor the correct execution of certain predefined mechanical actions (e.g. ball handling) or to learn the correct execution of more complex team activities.

In our work, we address the problem of a trajectory-based analysis of basketball, with the aim of overcoming the described problems. We chose our approach with expert sports knowledge in mind. Like with the procedure used in sports research, we perform a two-step analysis process, where the game is first segmented according to the phases of play (offense, defense, time out), and then each of the segments is analyzed in detail.

In the first step, a Gaussian mixture model is used to segment the continuous player trajectories into shorter game segments, corresponding to offense, defense, and time outs. This step provides us with both the roughly segmented boundaries and with the broader context of the game inside a particular segment. In the second step, we are able to recognize a particular type of basketball activity. In this article, we only focus on the recognition of the organized activity in the offensive part of the game. The basketball offense is the most interesting and widely studied [2], [18] part of the game for coaches and basketball experts. It is trained in advance and the set of learnt offenses for a particular team does not change significantly during the course of a single game.

To perform the recognition of activity, we developed three different trajectory-based detectors of the key basketball elements, which are the theoretical building blocks of any basketball play. These detectors are used to transform player trajectories, which represent the observed activity, into the sequence of symbols—the semantic description. The obtained symbols describe the actions and interactions between the players, and contain the notation of the observed element and its observed position on the court. To determine the similarity between the observed activity and the predefined activity template, we simply compute a Levenshtein distance between the sequence of symbols obtained from the trajectories of the players and the sequence of symbols obtained from the activity template. By repeating the template-trajectory matching procedure for every template available, we can find the template with the shortest distance to the sequence of symbols, obtained from the trajectory data. If the distance is below a certain threshold, we assign the label of the most similar template to the observed trajectory segment.

Besides removing the need for manual segmentation, an important contribution of our approach is the simplified process of providing expert knowledge in machine-suitable format. The method used by Intille and Bobick [7] requires the provision of expert knowledge in the form of temporal and logical relationships in a belief network for every activity that has to be recognized. This process is inappropriate for field experts (i.e., sports coaches), slow, and can be inaccurate or subjective because of the non-obvious relationships between the activity and the network structure [7], [9]. In our case, the specific structure of basketball activity is encoded in the form of activity templates, which can be represented graphically and are significantly better understood by an average sports expert.

Section snippets

Methods

This section describes the methods developed for analyzing the player motion data in the context of cooperative basketball play. The analysis is conducted in two steps. First, the players’ trajectories are temporally segmented into three game phases—offense, defense, and time out. The segmentation is achieved using the probabilistic game model, presented in Section 2.1. Next, the template-based recognition procedure is applied to every individual phase of the game (Section 2.2).

Experiments and results

Our approach is general enough to be applied to any kind of (sufficiently accurate) trajectory data. However, to test the approach, we obtained the trajectory data using computer-vision-based tracking methods. We provide details of our experimental setup here to document the nature of the input data on which our trajectory-based activity analysis methodology was tested.

Conclusion and future work

We presented a two-step approach to the analysis of a basketball game. Our ultimate goal was an automatic segmentation of the trajectory data into meaningful game phases (offense, defense, and time outs) and automatic recognition of the team activity from these phases.

We have demonstrated that by observing only the average position of all the players in the team it is possible to segment the basketball game with reasonable accuracy. We modeled every phase as a two-component Gaussian mixture

References (30)

S.S. Intille et al.
Recognizing planned, multiperson action
Computer Vision and Image Understanding: CVIU
(2001)
Y. Luo et al.
Object-based analysis and interpretation of human motion in sports video sequences by dynamic Bayesian networks
Computer Vision and Image Understanding
(2003)
I.M. Franks et al.
Eyewitness testimony in sport
Journal of Sports Behavior
(1986)
J. Kresse, R. Jablonski, The Complete Book of Man-To-Man Offense, Coaches Choice, second ed.,...
H. Johnson et al.
Learning the distribution of object trajectories for event recognition
Image and Vision Computing
(1996)
H. Johnson et al.
Representation and synthesis of behavior using gaussian mixtures
Image and Vision Computing
(2002)
C. Rao et al.
View-invariant representation and recognition of actions
International Journal of Computer Vision
(2002)
F. Li, R.J. Woodham, Analysis of player actions in selected hockey game situations, in: Proceedings of the Second...
R. Hongeng, S. Nevatia, Multi-agent event recognition, in: Proceedings of the Eighth IEEE International Conference on...
M. Jug, J. Perš, B. Dežman, S. Kovačič, Trajectory based assessment of coordinated human activity, in: Proceedings of...

N. Vaswani, A. RoyChowdhury, R. Chellappa, Activity recognition using the dynamics of the configuration of interacting...

P. Xu, L. Xie, S. Chang, A. Divakaran, A. Vetro, H. Sun, Algorithms and systems for segmentation and structure analysis...

H. Pan, P. Van Beek, M.I. Sezan, Detection of slowmotion replay segments in sports video for highlights generation, in:...

R. Urtasun, D.J. Fleet, P. Fua, Monocular 3d tracking of the golf swing, in: IEEE Computer Society Conference on...

A.A. Efros, A.C. Berg, G. Mori, J. Malik, Recognizing action at a distance, in: Ninth IEEE International Conference on...

Cited by (102)

An overview of Human Action Recognition in sports based on Computer Vision
2022, Heliyon
Human Action Recognition (HAR) is a challenging task used in sports such as volleyball, basketball, soccer, and tennis to detect players and recognize their actions and teams' activities during training, matches, warm-ups, or competitions. HAR aims to detect the person performing the action on an unknown video sequence, determine the action's duration, and identify the action type. The main idea of HAR in sports is to monitor a player's performance, that is, to detect the player, track their movements, recognize the performed action, compare various actions, compare different kinds and skills of acting performances, or make automatic statistical analysis.
As an action that can occur in the sports field refers to a set of physical movements performed by a player in order to complete a task using their body or interacting with objects or other persons, actions can be of different complexity. Because of that, a novel systematization of actions based on complexity and level of performance and interactions is proposed.
The overview of HAR research focuses on various methods performed on publicly available datasets, including actions of everyday activities. That is just a good starting point; however, HAR is increasingly represented in sports and is becoming more directed towards recognizing similar actions of a particular sports domain. Therefore, this paper presents an overview of HAR applications in sports primarily based on Computer Vision as the main contribution, along with popular publicly available datasets for this purpose.
Simulation of basketball motion image resolution based on FPGA and gaussian mixture model
2021, Microprocessors and Microsystems
This article has been retracted: please see Elsevier Policy on Article Withdrawal (http://www.elsevier.com/locate/withdrawalpolicy).
This article has been retracted at the request of the Editor-in-Chief.
Significant similarities were noticed post-publication between this special issue article and other published sources. There is an indication that attempts have been made to disguise the copying using automated paraphrasing. Subsequent to acceptance of these special issue papers by the responsible guest editor, Hameem Shanavas, the integrity and rigor of the peer-review process of the Special Issue were investigated and confirmed to fall beneath the high standards expected by Microprocessors & Microsystems. Due to a configuration error in the editorial system, unfortunately neither the Editor in Chief nor the designated Handling Editors received these papers for approval as per the journal’s standard workflow.
Method and implementation of micro Inertial Measurement Unit (IMU) in sensing basketball dynamics
2019, Sensors and Actuators, A: Physical
Citation Excerpt :
Researchers have found success using different technology and sensors in basketball such as vision systems for tracking different aspects of the sport. Various methods of player movement tracking have been presented using computer vision and algorithms [27–29]. Ball tracking was also possible using computer vision, with the ability to reconstruct ball trajectories [30].
In this paper, an Inertial Measurement Unit (IMU) is integrated with a basketball and used to measure basketball dynamics during various training and game exercises. We implemented an IMU composed of three-axis MEMS accelerometers and three-axis MEMS gyroscopes interfaced with a microcontroller on a basketball to measure basketball dynamics. Herein, methods are outlined for determining the rotational velocity of the basketball, namely backspin, and its axis of rotation during a shot which can be directly related to player shot form. Information about a player’s dribbling ability can also be discerned from the players dribbling frequency, captured through acceleration information. Our experimental validation shows that methods of angular velocity determination from both gyroscopes and accelerometers individually yield less than 2% error, making both methods viable in use. We also showed a method to detect the angle of the rotational axis relative to the ground. The methods outlined in this paper could be utilized to collect dynamic data that can be used to effectively train a basketball player and assist in skill improvements and benchmarking; all of which is valuable information for players and coaches alike.
Offensive Lineup Analysis in Basketball with Clustering Players Based on Shooting Style and Offensive Role
2024, arXiv
Recognition of Basketball Tactics Based on Vision Transformer and Track Filter
2024, Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology
Visualizing Skiers’ Trajectories in Monocular Videos
2023, arXiv

View all citing articles on Scopus

^☆: This research was supported in part by Slovenian Research Agency (ARRS) contracts P2-0232 and L5-6274.

View full text

A trajectory-based analysis of coordinated team activity in a basketball game☆

Abstract

Introduction

Section snippets

Methods

Experiments and results

Conclusion and future work

Computer Vision and Image Understanding: CVIU

Computer Vision and Image Understanding

Eyewitness testimony in sport

Journal of Sports Behavior

Learning the distribution of object trajectories for event recognition

Image and Vision Computing

Representation and synthesis of behavior using gaussian mixtures

Image and Vision Computing

View-invariant representation and recognition of actions

International Journal of Computer Vision