A Deformable Part Model for One-Shot Object Tracking

Georg Nebehay

Abstract

As cameras become ubiquitously available, the need for analyzing video sequences on-the-fly arises. An important class of applications requires algorithms that are able to continuously track an a-priori unknown object of interest as it makes its way through the scene. This problem is difficult, as no training data can be used beforehand to create an object model. In this thesis, this problem is referred to as one-shot object tracking. Extensive literature about the topic of one-shot object tracking is available, still the performance of state-of-the-art one-shot tracking algorithms on realistic sequences leaves much to be desired. In this thesis the viewpoint is taken that the deformation of objects of interest acts as a major obstacle for achieving satisfactory results. While approaches have been proposed in the literature for dealing with this challenge, they either are too simple to be of use for complex objects or require a considerable amount of training data to work. However, in one-shot object tracking there is by definition only one training example available. More training examples can be collected from the video sequence in an online manner, however this process is error-prone and can lead to the undesired effect of accumulating errors so that the object model is no longer a good representation of the object of interest. In this thesis, a deformable part model for one-shot object tracking is proposed, aiming at providing a robust model for deformable objects that does not rely on model updates to work. Instead, it operates on the basic assumption that object parts are connected by mediating parts, like an arm might connect a hand to the torso of a person. One advantage of the proposed model is the independence on the actual parts representation. We suggest to leverage the synergies between two very different methods for establishing parts correspondences. These methods consist on the one hand of static correspondences, which are based on training information only. This type of correspondences is robust, but unable to adapt to new object appearances. A complementary method can be found in adaptive correspondences, which are computed from a very recent appearance of the object. Adaptive correspondences lack the robustness of static correspondences, but can provide necessary accuracy. To assess the usefulness of the proposed model in practice, we conduct a rigorous evaluation on a dataset of 77 sequences. This evaluation includes a comparison to state-of-the art tracking algorithms, the effect of employing different part representations as well as additional experiments that reveal insights about internal workings of the proposed model. We find that the proposed deformable part model gives a significant performance improvement over the state of the art.

Published In

PhD thesis, 2016.
Download the PDF PDF logo

Code

Download the code from our GitHub repository.GitHub logo

Project Page

More information is available at the Project page.

BibTex

@phdthesis{Nebehay2016PHD,
    author = {Nebehay, Georg},
    month = sep,
    school = {Graz University of Technology},
    title = {A Deformable Part Model for {One-Shot} Object Tracking},
    year = {2016}
}
Back to publication list