Original papers
Real-time sow behavior detection based on deep learning

https://doi.org/10.1016/j.compag.2019.104884

Highlights

  • The SBDA-DL network model is constructed based on the SSD and the MobileNet.

  • The model accelerates detection by compressing the SSD network model: the last two convolutional layers in the detection network are deleted.

  • The algorithm was used for real-time detection of three typical sow behaviors: drinking, urination, and mounting.

  • The model is applied to real farm video detection, and its advantages for sow behavior detection are compared against alternative algorithms.

Abstract

Recording sow behaviors allows tracking their health status, enables the timely detection of abnormalities, and helps improve their health both physically and mentally. In recent years, detecting sow behavior using machine vision technology has become a popular research topic. However, current detection methods are based on the premise that an individual pig can be accurately identified. In this paper, a Real-Time Sow Behavior Detection Algorithm based on Deep Learning (SBDA-DL) is proposed. The algorithm was used for real-time detection of three typical sow behaviors: drinking, urination, and mounting. The experimental results show that the average precision (AP) of the algorithm when detecting drinking, urination, and mounting behaviors is 96.5%, 91.4%, and 92.3%, respectively. The mean average precision (mAP) across the categories is 93.4%, and the algorithm can reach 7 frames per second on commonly configured microcomputers. The algorithm uses an optimized deep learning network structure to directly detect sow behavior. This improves the accuracy of behavior detection at a processing speed sufficient for real-time detection and meets the requirements of daily monitoring by auxiliary staff in most pig breeding farms.

Introduction

In 2017, China produced 68.861 million live pigs, an increase of 0.8% over 2016, and produced 52.99 million tons of pork, an increase of 0.5% over 2016, which accounted for more than half the global pork production (Chen et al., 2011). A sow is one of the most important members of a pig farm. Various types of behavioral data for sows are important when accurately predicting the estrus time (Xia et al., 2010).

Monitoring sow behavior helps reflect their health status. Feeding and drinking behaviors can help determine their food intake. When pigs have diarrhea and other diseases, their drinking behaviors can become abnormal (Kruse et al., 2011). Recording their urination behaviors can determine their urination frequency, and observed changes in sow urination allow a preliminary judgment of their health status (Xia et al., 2015). Studying the mounting behavior of sows can determine their excitement level (Liu et al., 2013) and predict their estrus time.

There has been some research about observing or predicting pig behavior. Oczak et al. (2013) used a multilayer feedforward neural network to train five characteristic pairs of activity indices to identify aggressive behaviors in pigs. Jonguk et al. (2016) used a Kinect depth sensor to identify aggressive behaviors in pigs. Kashiha et al. (2013) used a water meter to monitor the water use rate of each pigsty and used a camera to capture their drinking behaviors. They then studied the relationship between the water use rate and the drinking behavior. Madelyne et al. (2016) found that the drinking time of pigs measured using Radio Frequency Identification (RFID) was highly correlated with the observed time and quantity of drinking. Nasirahmadi et al. (2016) used an elliptical fit to locate pigs in pens and compared the relationship between the fitting proportion of two ellipses and the mounting behavior of pigs to determine whether mounting occurred and measure its duration. Costa et al. (2013) analyzed the relationship between pig activity and climate change through image processing technology and found a significant correlation between the pig occupation index and the ventilation rate, temperature, and humidity. Yang et al. (2017) established a behavior recognition model of captive porcupines based on the structure of breeding pools and the actual porcupine activity. They recognized seven basic porcupine behaviors, such as resting, feeding, drinking, excretion, and biting on the iron gate and sink.

With the growth of deep learning, object detection techniques have developed extensively in recent years (Yu et al., 2013). Traditional target detection methods extract image features using Haar-like features (Viola and Jones, 2001), histograms of oriented gradients (HOG) (Dalal et al., 2005), or the scale-invariant feature transform (SIFT) (Lowe, 2004), and perform target classification using a support vector machine (SVM) (Andrew, 2000) or the adaptive boosting (AdaBoost) algorithm (Freund et al., 1997). The deformable part model (DPM) was proposed by Girshick et al. (2013) and uses a structure similar to a convolutional neural network (CNN) for target detection, which reached 43% mAP in the PASCAL VOC2007 competition (Everingham et al., 2015). The region-based CNN (R-CNN) proposed by Girshick et al. (2013) uses a selective search, a deep learning network, and an SVM to perform detection, feature extraction, and classification, respectively. This approach achieved 58% mAP on the PASCAL VOC2007, which is more than 15% better than using the selective search alone for target detection (Uijlings et al., 2013). This also shows that deep learning can be successfully used for target detection.

The spatial pyramid pooling in deep convolutional network (SPP-Net) was proposed by He et al. (2015) and uses a spatial pyramid pooling layer to output a fixed-size feature image for an input image of arbitrary size. However, this method reduces the detection accuracy due to deformations in the detected image. The fast R-CNN proposed by Girshick et al. (2015) maps a region box to the feature map of the previous convolutional layer. Such a detection scheme only needs to extract features once, which greatly improves the detection speed. The faster R-CNN proposed by Ren et al. (2017) replaces the detection part with a deep CNN and directly performs detection and recognition within the feature map, which improves the accuracy on the PASCAL VOC2007 to 73%. However, these network frameworks have slow detection speeds, making it difficult to apply them in real cases. Redmon et al. (2016) proposed the you only look once (YOLO) network framework, which differs from previous target detection approaches because it uses a single deep CNN to detect and recognize the target simultaneously. This comes with a slight reduction in mAP but a greatly accelerated speed of 21 frames per second. The single shot multibox detector (SSD) network model was proposed by Liu et al. (2016), which improves the processing speed to 46 frames per second while maintaining an accuracy similar to the faster R-CNN.

Current detection methods take the accurate identification of individual pigs as a premise and generally only detect a single pig behavior. There have been no scientific reports on detecting multiple pig behaviors simultaneously. Therefore, this paper proposes a Real-Time Sow Behavior Detection Algorithm based on Deep Learning (SBDA-DL), which is the first time sow behavior recognition has been described using a target detection method based on deep learning. The method can train multiple categories with large differences on a small amount of data. The main sow behaviors of interest are drinking, urination, and mounting, which are detected in real-time. Moreover, the processing speed is accelerated while ensuring detection accuracy, so that the algorithm can operate smoothly on conventional computing equipment.

Section snippets

Experimental device

This study was conducted at a sow farm of the Guangzhou Lizhi Agriculture Co., Ltd (http://www.lzpig.com/). A 3-megapixel (Haikang DS-2CD3355-I) infrared network camera was placed on a beam directly above the center of the pig pen. The length-to-width ratio of the pigsty is 6:5, the overall area is approximately 20 square meters, and the beam is around 5 m above the ground. The webcam is wired to a network video recorder (NVR) device and operates 24 h a day with a frame rate of 25 frames

SBDA-DL network model results

To verify the effectiveness of the proposed algorithm, the mAP, the most commonly used evaluation criterion in the object detection field, is considered as the accuracy metric. The mAP is the mean of the AP values over the individual categories. The calculations are given in Eqs. (7), (8).

AP = \frac{\text{bounding box} \cap \text{ground truth}}{\text{ground truth}} \quad (7)

mAP = \frac{1}{n} \sum_{i=1}^{n} (AP)_i \quad (8)

The bounding box refers to the detection results of the image obtained using the object detection algorithm, and the ground truth
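The two equations above can be sketched in code. This is a minimal illustration only: it assumes boxes are given as (x1, y1, x2, y2) tuples and reads Eq. (7) as the overlap area between predicted and ground-truth boxes divided by the ground-truth area; the paper's exact matching protocol may differ.

```python
def box_area(box):
    """Area of an axis-aligned box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)

def intersection_area(a, b):
    """Overlap area of two axis-aligned boxes."""
    x1 = max(a[0], b[0]); y1 = max(a[1], b[1])
    x2 = min(a[2], b[2]); y2 = min(a[3], b[3])
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)

def average_precision(pred_boxes, gt_boxes):
    """Eq. (7): AP = overlap(bounding box, ground truth) / ground truth area."""
    gt_total = sum(box_area(g) for g in gt_boxes)
    if gt_total == 0:
        return 0.0
    # For each ground-truth box, take its best overlap with any prediction.
    overlap = sum(
        max(intersection_area(p, g) for p in pred_boxes) if pred_boxes else 0.0
        for g in gt_boxes
    )
    return overlap / gt_total

def mean_average_precision(per_category_aps):
    """Eq. (8): mAP = (1/n) * sum of the per-category AP values."""
    return sum(per_category_aps) / len(per_category_aps)
```

For example, averaging the paper's reported per-category APs, `mean_average_precision([0.965, 0.914, 0.923])`, gives 0.934, matching the reported 93.4% mAP.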

Conclusions

In this paper, an object detection method based on deep learning is proposed for the first time to detect the drinking, urination, and mounting behaviors of sows in real-time using videos of their behavior. The experimental results show that the average precisions of the proposed algorithm when detecting these sow behaviors are 96.5%, 91.4%, and 92.3%, respectively, and the mAP across the categories is 93.4%, while the algorithm can reach 7 frames per second. The average category accuracy for the SBDA-DL

Acknowledgements

This work was supported by the National Key Research and Development Program of China (grant number 2017YFD0701601) and Guangdong Science and Technology Program of China (grant numbers 2019B020215004 and 2019B020215002).

References (38)

  • D. Erhan et al.

    Scalable object detection using deep neural networks

    IEEE Conf. Comp. Vision Pattern Recogn.

    (2014)
  • R. Girshick et al.

    Training deformable part models with decorrelated features

    IEEE Int. Conf. Comp. Vision

    (2013)
  • R. Girshick

    Fast R-CNN

    ICCV

    (2015)
  • K. He et al.

    Spatial pyramid pooling in deep convolutional networks for visual recognition

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2015)
  • T. Huang et al.

    A fast two-dimensional median filtering algorithm

    IEEE Trans. Acoust. Speech Signal Process.

    (1979)
  • A.G. Howard et al.

    MobileNets: efficient convolutional neural networks for mobile vision applications

    CoRR

    (2017)
  • G.E. Hinton et al.

    Replicated softmax: An undirected topic model

    International Conference on Neural Information Processing Systems

    (2009)
  • S. Ioffe et al.

    Batch normalization: Accelerating deep network training by reducing internal covariate shift

    ICML,...
  • L. Jonguk et al.

    Automatic recognition of aggressive behavior in pigs using a Kinect depth sensor

    Sensors

    (2016)
1 These authors contributed to the work equally.