By Raghuraman Gopalan, AT&T Labs-Research, USA, raghuram@research.att.com | Ruonan Li, Harvard University, USA, ruonanli@seas.harvard.edu | Vishal M. Patel, University of Maryland, College Park, USA, pvishalm@umd.edu | Rama Chellappa, University of Maryland, College Park, USA, rama@umiacs.umd.edu
Domain adaptation is an active, emerging research area that attempts to address the changes in data distribution across training and testing datasets. With the availability of a multitude of image acquisition sensors, and variations due to illumination and viewpoint, among others, computer vision applications present a very natural test bed for evaluating domain adaptation methods. In this monograph, we provide a comprehensive overview of domain adaptation solutions for visual recognition problems. Starting with the problem description and illustrations, we discuss three adaptation scenarios: (i) unsupervised adaptation, where the “source domain” training data is partially labeled and the “target domain” test data is unlabeled; (ii) semi-supervised adaptation, where the target domain also has partial labels; and (iii) multi-domain, heterogeneous adaptation, which studies the previous two settings with the source and/or target having more than one domain, and accounts for cases where the features used to represent the data in each domain differ. For all of these scenarios, we discuss existing adaptation techniques in the literature, motivated by the principles of max-margin discriminative learning, manifold learning, sparse coding, and low-rank representations. These techniques have shown improved performance on a variety of applications such as object recognition, face recognition, activity analysis, concept classification, and person detection. We conclude by analyzing the challenges posed by the realm of “big visual data”, in terms of the generalization ability of adaptation algorithms to unconstrained data acquisition as well as issues related to their computational tractability, and draw parallels with efforts from the vision community on image transformation models and invariant descriptors, so as to facilitate improved understanding of vision problems under uncertainty.
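As a minimal illustration of the unsupervised setting described above (a labeled source domain and an unlabeled target domain), the sketch below aligns second-order feature statistics of the source to the target in the spirit of correlation alignment. It is not drawn from the monograph itself; the function names and toy data are assumptions made purely for illustration.

```python
import numpy as np

def coral_align(Xs, Xt, eps=1e-6):
    """Map source features Xs so their covariance matches that of the target Xt
    (whitening followed by re-coloring); rows are samples."""
    Cs = np.cov(Xs, rowvar=False) + eps * np.eye(Xs.shape[1])
    Ct = np.cov(Xt, rowvar=False) + eps * np.eye(Xt.shape[1])

    def sym_power(C, p):
        # power of a symmetric positive-definite matrix via eigendecomposition
        w, V = np.linalg.eigh(C)
        return (V * np.clip(w, eps, None) ** p) @ V.T

    # whiten with the source covariance, then re-color with the target covariance
    return Xs @ sym_power(Cs, -0.5) @ sym_power(Ct, 0.5)

# Toy data: labeled source, unlabeled target with a shifted distribution.
rng = np.random.default_rng(0)
Xs = rng.normal(size=(200, 10))               # source features
ys = (Xs[:, 0] > 0).astype(int)               # source labels
Xt = 2.0 * rng.normal(size=(150, 10)) + 1.0   # target features, no labels

Xs_aligned = coral_align(Xs, Xt)
# Any standard classifier trained on (Xs_aligned, ys) is then applied to Xt.
```

The semi-supervised scenario differs only in that a few labeled target samples would additionally be available when training the classifier.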