Convolutional Neural Networks Based Dictionary Pair Learning for Visual Tracking

Chenchen MENG; Jun WANG; Chengzhi DENG; Yuanyun WANG; Shengqian WANG

doi:10.1587/transfun.2021EAP1150

Regular Section



Keywords: visual tracking, hand-crafted feature, convolutional neural networks, dictionary pair learning


2022 Volume E105.A Issue 8 Pages 1147-1156



Abstract

Feature representation is a key component of most visual tracking algorithms. Low-level hand-crafted features have weak representation capacity, which makes it difficult to handle complex appearance changes. In this paper, we propose a novel tracking algorithm that combines joint dictionary pair learning with convolutional neural networks (CNNs). We use a CNN model trained on ImageNet-Vid to extract target features. The CNN includes three convolutional layers and two fully connected layers, and the dictionary pair learning follows the second fully connected layer. The joint dictionary pair is learned on the deep features extracted by the trained CNN model, and the dictionary learning captures the temporal variations of the target appearance. We use the learned dictionaries to encode target candidates: each candidate is represented as a linear combination of atoms in the learned dictionary. Extensive experimental evaluations on OTB2015 demonstrate superior performance against state-of-the-art (SOTA) trackers.
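
To make the encoding step concrete, the following is a minimal NumPy sketch of how a dictionary pair can represent candidates as linear combinations of atoms. It is not the authors' formulation: it assumes the pair consists of a synthesis dictionary D and an analysis dictionary P, learned by a simple alternating ridge-regression scheme, and it omits the temporal terms mentioned in the abstract. The function names, the parameters `tau` and `lam`, and the random stand-in features (used in place of CNN features) are all hypothetical.

```python
import numpy as np

def learn_dictionary_pair(X, n_atoms, tau=1.0, lam=1e-2, n_iter=20, seed=0):
    """Learn a synthesis dictionary D and an analysis dictionary P (illustrative).

    X has shape (d, n): one feature vector per column.
    Assumed objective (not taken from the paper):
        ||X - D A||^2 + tau ||P X - A||^2 + lam (||D||^2 + ||P||^2)
    """
    rng = np.random.default_rng(seed)
    d, _ = X.shape
    D = rng.standard_normal((d, n_atoms))
    D /= np.linalg.norm(D, axis=0, keepdims=True)   # unit-norm initial atoms
    P = 0.01 * rng.standard_normal((n_atoms, d))
    I_k, I_d = np.eye(n_atoms), np.eye(d)
    for _ in range(n_iter):
        # Codes: closed-form minimizer over A with D and P fixed.
        A = np.linalg.solve(D.T @ D + tau * I_k, D.T @ X + tau * (P @ X))
        # Analysis dictionary: ridge regression from features to codes.
        P = tau * np.linalg.solve(tau * (X @ X.T) + lam * I_d, X @ A.T).T
        # Synthesis dictionary: ridge regression from codes to features.
        D = np.linalg.solve(A @ A.T + lam * I_k, A @ X.T).T
    return D, P

def reconstruction_errors(C, D, P):
    """Encode each candidate (column of C) as P c, rebuild it as D (P c),
    and return the per-candidate reconstruction error."""
    codes = P @ C                  # coefficients of the linear combination
    return np.linalg.norm(C - D @ codes, axis=0)

# Stand-in "deep features": target samples lie in a 3-dimensional subspace.
rng = np.random.default_rng(0)
B = rng.standard_normal((16, 3))
X = B @ rng.standard_normal((3, 40))
D, P = learn_dictionary_pair(X, n_atoms=8)

# Candidate 0 resembles the target; candidate 1 is unrelated noise.
candidates = np.column_stack([B @ rng.standard_normal(3),
                              rng.standard_normal(16)])
errors = reconstruction_errors(candidates, D, P)
best = int(np.argmin(errors))      # index of the best-matching candidate
```

In this sketch, a candidate that the learned atoms reconstruct well is treated as the better match. A tracker built along these lines would presumably rank candidates by such a score, although the scoring rule actually used in the paper is not stated in the abstract.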
