论文信息 - Domain Adaptation in Computer Vision Applications

Domain Adaptation in Computer Vision Applications

The aim of this chapter is to give an overview of domain adaptation and transfer learning with a specific view to visual applications. After a general motivation, we first position domain adaptation in the more general transfer learning problem. Second, we try to address and analyze briefly the state-of-the-art methods for different types of scenarios, first describing the historical shallow methods, addressing both the homogeneous and heterogeneous domain adaptation methods. Third, we discuss the effect of the success of deep convolutional architectures which led to the new type of domain adaptation methods that integrate the adaptation within the deep architecture. Fourth, we review DA methods that go beyond image categorization, such as object detection, image segmentation, video analyses or learning visual attributes. We conclude the chapter with a section where we relate domain adaptation to other machine learning solutions.

Gabriela Csurka | G. Csurka | Marius Leordeanu

[1] Roland Kuhn,et al. Rapid speaker adaptation in eigenvoice space , 2000, IEEE Trans. Speech Audio Process..

[2] Daphne Koller,et al. Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[3] Jian Yu,et al. Learning Transferred Weights From Co-Occurrence Data for Heterogeneous Transfer Learning , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[4] Rama Chellappa,et al. Subspace Interpolation via Dictionary Learning for Unsupervised Domain Adaptation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Ling Shao,et al. Enhancing Action Recognition by Cross-Domain Dictionary Learning , 2013, BMVC.

[6] Andrew Y. Ng,et al. Zero-Shot Learning Through Cross-Modal Transfer , 2013, NIPS.

[7] Ingo Steinwart,et al. On the Influence of the Kernel on the Consistency of Support Vector Machines , 2002, J. Mach. Learn. Res..

[8] Kilian Q. Weinberger,et al. Unsupervised Learning of Image Manifolds by Semidefinite Programming , 2004, CVPR.

[9] Leonidas J. Guibas,et al. 3D-Assisted Feature Synthesis for Novel Views of an Object , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[10] Cordelia Schmid,et al. Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.

[11] Charu C. Aggarwal,et al. Towards cross-category knowledge propagation for learning visual concepts , 2011, CVPR 2011.

[12] Daniel Kondermann,et al. Synthesizing Real World Stereo Challenges , 2013, GCPR.

[13] Jitendra Malik,et al. Discriminative Decorrelation for Clustering and Classification , 2012, ECCV.

[14] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[15] Rajat Raina,et al. Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[16] Anders Krogh,et al. Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[17] Anuj Srivastava,et al. Optimal linear representations of images for object recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18] Wei Fan,et al. Actively Transfer Domain Knowledge , 2008, ECML/PKDD.

[19] Thorsten Joachims,et al. Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[20] Stephen Tyree,et al. Learning with Marginalized Corrupted Features , 2013, ICML.

[21] Jason Weston,et al. Large scale image annotation: learning to rank with joint word-image embeddings , 2010, Machine Learning.

[22] Shotaro Akaho,et al. TrBagg: A Simple Transfer Learning Method and its Application to Personalization in Collaborative Tagging , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[23] Yin Li,et al. Learning Deep Structure-Preserving Image-Text Embeddings , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Lorenzo Bruzzone,et al. Semisupervised Transfer Component Analysis for Domain Adaptation in Remote Sensing Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[25] Gilles Blanchard,et al. On the Convergence of Eigenspaces in Kernel Principal Component Analysis , 2005, NIPS.

[26] Aram Kawewong,et al. Online incremental attribute-based zero-shot learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[27] Xian-Sheng Hua,et al. Two-Dimensional Active Learning for image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[28] Alexei A. Efros,et al. Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[29] Dieter Fox,et al. A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[30] Jeff A. Bilmes,et al. On Deep Multi-View Representation Learning , 2015, ICML.

[31] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[32] Bernt Schiele,et al. Articulated people detection and pose estimation: Reshaping the future , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33] Wei-Ying Ma,et al. Annotating Images by Mining Image Search Results , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34] Kate Saenko,et al. Correlation Alignment for Unsupervised Domain Adaptation , 2016, Domain Adaptation in Computer Vision Applications.

[35] Song-Chun Zhu,et al. Human Attribute Recognition by Rich Appearance Dictionary , 2013, 2013 IEEE International Conference on Computer Vision.

[36] D. Jacobs,et al. Bypassing synthesis: PLS for face recognition with pose, low-resolution and sketch , 2011, CVPR 2011.

[37] Jakob Verbeek,et al. Heterogeneous Face Recognition with CNNs , 2016, ECCV Workshops.

[38] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[39] Shai Ben-David,et al. Detecting Change in Data Streams , 2004, VLDB.

[40] Yishay Mansour,et al. Domain Adaptation: Learning Bounds and Algorithms , 2009, COLT.

[41] Antonio Torralba,et al. LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[42] Christoph H. Lampert,et al. Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[43] Motoaki Kawanabe,et al. Direct Importance Estimation with Model Selection and Its Application to Covariate Shift Adaptation , 2007, NIPS.

[44] Yishay Mansour,et al. Multiple Source Adaptation and the Rényi Divergence , 2009, UAI.

[45] Michael I. Jordan,et al. Deep Transfer Learning with Joint Adaptation Networks , 2016, ICML.

[46] Andrea Vedaldi,et al. I Have Seen Enough: Transferring Parts Across Categories , 2016, BMVC.

[47] Shai Shalev-Shwartz,et al. Online learning: theory, algorithms and applications (למידה מקוונת.) , 2007 .

[48] Thomas Brox,et al. A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49] Trevor Darrell,et al. Continuous Manifold Based Adaptation for Evolving Visual Domains , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[50] Trevor Darrell,et al. Deep Domain Confusion: Maximizing for Domain Invariance , 2014, CVPR 2014.

[51] Trevor Darrell,et al. Learning with Side Information through Modality Hallucination , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52] Andrew W. Fitzgibbon,et al. Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[53] Bernt Schiele,et al. Analyzing appearance and contour based methods for object categorization , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[54] Venkatesh Saligrama,et al. Person Re-identification via Structured Prediction , 2014, ArXiv.

[55] Jitendra Malik,et al. Pose Induction for Novel Object Categories , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[56] Ivor W. Tsang,et al. Hybrid Heterogeneous Transfer Learning through Deep Learning , 2014, AAAI.

[57] David Zhang,et al. Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[58] Alexei A. Efros,et al. Undoing the Damage of Dataset Bias , 2012, ECCV.

[59] Barbara Caputo,et al. The More You Know, the Less You Learn: From Knowledge Transfer to One-shot Learning of Object Categories , 2009, BMVC.

[60] Manuel Glez Bedia,et al. Artificial Intelligence approaches for the generation and assessment of believable human-like behaviour in virtual characters , 2014, Expert Syst. Appl..

[61] Tinne Tuytelaars,et al. Subspace Alignment Based Domain Adaptation for RCNN Detector , 2015, BMVC.

[62] Yi Yao,et al. Boosting for transfer learning with multiple sources , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[63] Li Zhang,et al. Collaborative Facial Landmark Localization for Transferring Annotations Across Datasets , 2014, ECCV.

[64] Ivor W. Tsang,et al. Domain Adaptation via Transfer Component Analysis , 2009, IEEE Transactions on Neural Networks.

[65] Trevor Darrell,et al. Spatial Semantic Regularisation for Large Scale Object Detection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[66] Xiaogang Wang,et al. A Deep Sum-Product Architecture for Robust Facial Attributes Analysis , 2013, 2013 IEEE International Conference on Computer Vision.

[67] James Hays,et al. SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[68] Trevor Darrell,et al. Simultaneous Deep Transfer Across Domains and Tasks , 2015, ICCV.

[69] David Vázquez,et al. Virtual worlds and active learning for human detection , 2011, ICMI '11.

[70] Fei-Fei Li,et al. Shifting Weights: Adapting Object Detectors from Image to Video , 2012, NIPS.

[71] Burr Settles,et al. Active Learning Literature Survey , 2009 .

[72] Xun Xu,et al. Cross-domain traffic scene understanding by motion model transfer , 2013, ARTEMIS '13.

[73] Terrance E. Boult,et al. Multi-attribute spaces: Calibration for attribute fusion and similarity search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[74] Shih-Fu Chang,et al. Cross-domain learning methods for high-level visual concept classification , 2008, 2008 15th IEEE International Conference on Image Processing.

[75] John Shawe-Taylor,et al. Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[76] Bingpeng Ma,et al. Local Descriptors Encoded by Fisher Vectors for Person Re-identification , 2012, ECCV Workshops.

[77] Visvanathan Ramesh,et al. Model-driven Simulations for Deep Convolutional Neural Networks , 2016, ArXiv.

[78] Philip S. Yu,et al. Adaptation Regularization: A General Framework for Transfer Learning , 2014, IEEE Transactions on Knowledge and Data Engineering.

[79] Philip S. Yu,et al. Transfer Learning on Heterogenous Feature Spaces via Spectral Transformation , 2010, 2010 IEEE International Conference on Data Mining.

[80] Jonathan Tompson,et al. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[81] Ahmed M. Elgammal,et al. Learning Hypergraph-regularized Attribute Predictors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[82] Le Song,et al. A Hilbert Space Embedding for Distributions , 2007, Discovery Science.

[83] Anil K. Jain,et al. Towards automated caricature recognition , 2012, 2012 5th IAPR International Conference on Biometrics (ICB).

[84] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[85] Fei-Fei Li,et al. Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[86] Trevor Darrell,et al. Active Learning with Gaussian Processes for Object Categorization , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[87] Bernhard Schölkopf,et al. Domain Generalization via Invariant Feature Representation , 2013, ICML.

[88] Antonio M. López,et al. Virtual and Real World Adaptation for Pedestrian Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[89] Yongxin Yang,et al. Multivariate Regression on the Grassmannian for Predicting Novel Domains , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[90] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.

[91] Rainer Lienhart,et al. Synthetically trained multi-view object class and viewpoint detection for advanced image retrieval , 2011, ICMR '11.

[92] Svetlana Lazebnik,et al. Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.

[93] Trevor Darrell,et al. LSDA: Large Scale Detection through Adaptation , 2014, NIPS.

[94] Xiaogang Wang,et al. Locally Aligned Feature Transforms across Views , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[95] Hai Tao,et al. Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[96] Barbara Caputo,et al. Safety in numbers: Learning categories from few examples with multi model knowledge transfer , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[97] Xiaojin Zhu,et al. Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[98] Ramakant Nevatia,et al. Automatic Concept Discovery from Parallel Text and Visual Corpora , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[99] Paul A. Viola,et al. Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[100] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[101] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[102] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[103] Kate Saenko,et al. From Virtual to Reality: Fast Adaptation of Virtual Object Detectors to Real Domains , 2014, BMVC.

[104] Qiang Ji,et al. A Unified Probabilistic Approach Modeling Relationships between Attributes and Objects , 2013, 2013 IEEE International Conference on Computer Vision.

[105] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[106] Trevor Darrell,et al. Learning cross-modality similarity for multinomial data , 2011, 2011 International Conference on Computer Vision.

[107] Antonio M. López,et al. Adapting Pedestrian Detection from Synthetic to Far Infrared Images , 2013 .

[108] David Vázquez,et al. Interactive Training of Human Detectors , 2013, Multimodal Interaction in Image and Video Applications.

[109] Bianca Zadrozny,et al. Learning and evaluating classifiers under sample selection bias , 2004, ICML.

[110] S. Meister,et al. Real versus realistically rendered scenes for optical flow evaluation , 2011, 2011 14th ITG Conference on Electronic Media Technology.

[111] Xiaoou Tang,et al. Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[112] H. Sebastian Seung,et al. Query by committee , 1992, COLT '92.

[113] Kristen Grauman,et al. Decorrelating Semantic Visual Attributes by Resisting the Urge to Share , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[114] Ivor W. Tsang,et al. Heterogeneous Domain Adaptation for Multiple Classes , 2014, AISTATS.

[115] Christoph H. Lampert,et al. Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer , 2012, ACCV.

[116] Wen Gao,et al. Manifold Alignment via Corresponding Projections , 2010, BMVC.

[117] Rong Yan,et al. Cross-domain video concept detection using adaptive svms , 2007, ACM Multimedia.

[118] Gabriela Csurka,et al. Adapted Vocabularies for Generic Visual Categorization , 2006, ECCV.

[119] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[120] Trevor Darrell,et al. Efficient Learning of Domain-invariant Image Representations , 2013, ICLR.

[121] Hal Daumé,et al. Learning Task Grouping and Overlap in Multi-task Learning , 2012, ICML.

[122] Frédéric Jurie,et al. Improving object classification using semantic attributes , 2010, BMVC.

[123] Juergen Gall,et al. Adaptation of Synthetic Data for Coarse-to-Fine Viewpoint Refinement , 2015, BMVC.

[124] Angel Domingo Sappa,et al. Synthetic sequences and ground-truth flow field generation for algorithm validation , 2015, Multimedia Tools and Applications.

[125] Ivan Oseledets,et al. Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[126] Olivier Sigaud,et al. Gated networks: an inventory , 2015, ArXiv.

[127] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[128] Anuj Srivastava,et al. Tools for application-driven linear dimension reduction , 2005, Neurocomputing.

[129] Kate Saenko,et al. Deep CORAL: Correlation Alignment for Deep Domain Adaptation , 2016, ECCV Workshops.

[130] Trevor Darrell,et al. One-Shot Adaptation of Supervised Deep Convolutional Models , 2013, ICLR.

[131] Trevor Darrell,et al. Part-Based R-CNNs for Fine-Grained Category Detection , 2014, ECCV.

[132] Raghuraman Gopalan,et al. Learning Cross-Domain Information Transfer for Location Recognition and Clustering , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[133] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[134] Stefan Roth,et al. Efficient Multi-cue Scene Segmentation , 2013, GCPR.

[135] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[136] Ali Farhadi,et al. Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[137] Trevor Darrell,et al. PANDA: Pose Aligned Networks for Deep Attribute Modeling , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[138] Naila Murray,et al. Revisiting the Fisher vector for fine-grained classification , 2014, Pattern Recognit. Lett..

[139] Kevin Chen-Chuan Chang,et al. Unifying learning to rank and domain adaptation: enabling cross-task document scoring , 2014, KDD.

[140] Shih-Fu Chang,et al. Attributes and categories for generic instance search from one example , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[141] Stephen Milborrow. The MUCT Landmarked Face Database , 2010 .

[142] Yongxin Yang,et al. Trace Norm Regularised Deep Multi-Task Learning , 2016, ICLR.

[143] David Vázquez. Cool world : domain adaptation of virtual and real worlds for human detection using active learning , 2012 .

[144] Joachim Denzler,et al. Learning with Few Examples by Transferring Feature Relevance , 2009, DAGM-Symposium.

[145] Andrew W. Fitzgibbon,et al. Efficient Object Category Recognition Using Classemes , 2010, ECCV.

[146] Yoshua Bengio,et al. Zero-data Learning of New Tasks , 2008, AAAI.

[147] Peng Zhang,et al. Nonlinear Dimensionality Reduction by Locally Linear Inlaying , 2009, IEEE Transactions on Neural Networks.

[148] Maayan Harel,et al. Learning from Multiple Outlooks , 2010, ICML.

[149] C. Lawrence Zitnick,et al. Adopting Abstract Images for Semantic Scene Understanding , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[150] Leonid Sigal,et al. A Unified Semantic Embedding: Relating Taxonomies and Attributes , 2014, NIPS.

[151] Gang Hua,et al. Detection by detections: Non-parametric detector adaptation for a video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[152] Kristen Grauman,et al. Relative attributes , 2011, 2011 International Conference on Computer Vision.

[153] Jieping Ye,et al. An accelerated gradient method for trace norm minimization , 2009, ICML '09.

[154] Stefan Carlsson,et al. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[155] Zhengyou Zhang,et al. Taylor expansion based classifier adaptation: Application to person detection , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[156] Michael Goesele,et al. Back to the Future: Learning Shape Models from 3D CAD Data , 2010, BMVC.

[157] Daumé,et al. Domain Adaptation meets Active Learning , 2010, HLT-NAACL 2010.

[158] Bernt Schiele,et al. Transfer Learning in a Transductive Setting , 2013, NIPS.

[159] Geoffrey E. Hinton,et al. Zero-shot Learning with Semantic Output Codes , 2009, NIPS.

[160] Kristen Grauman,et al. Sharing features between objects and their attributes , 2011, CVPR 2011.

[161] Barbara Caputo,et al. A Deeper Look at Dataset Bias , 2015, Domain Adaptation in Computer Vision Applications.

[162] Tao Xiang,et al. Weakly Supervised Learning of Objects, Attributes and Their Associations , 2014, ECCV.

[163] Markus Schoeler,et al. Semantic Pose Using Deep Networks Trained on Synthetic RGB-D , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[164] 乔宇. Motionlets: Mid-Level 3D Parts for Human Motion Recognition , 2013 .

[165] Qiang Yang,et al. Transitive Transfer Learning , 2015, KDD.

[166] Slobodan Ilic,et al. Framework for Generation of Synthetic Ground Truth Data for Driver Assistance Applications , 2013, GCPR.

[167] Ping Li,et al. Cross-Domain Person Reidentification Using Domain Adaptation Ranking SVMs , 2015, IEEE Transactions on Image Processing.

[168] Jeff G. Schneider,et al. Active Transfer Learning under Model Shift , 2014, ICML.

[169] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[170] Geoffrey J. Gordon,et al. Relational learning via collective matrix factorization , 2008, KDD.

[171] Matt Post,et al. Domain Adaptation , 2017, Encyclopedia of Machine Learning and Data Mining.

[172] Dieter Fox,et al. 3D laser scan classification using web data and domain adaptation , 2009, Robotics: Science and Systems.

[173] Andrew Zisserman,et al. A Statistical Approach to Material Classification Using Image Patch Exemplars , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[174] Robert Tibshirani,et al. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[175] Ali Farhadi,et al. Multi-attribute Queries: To Merge or Not to Merge? , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[176] Erik G. Learned-Miller,et al. Online domain adaptation of a pre-trained cascade of classifiers , 2011, CVPR 2011.

[177] Rama Chellappa,et al. Domain adaptation for object recognition: An unsupervised approach , 2011, 2011 International Conference on Computer Vision.

[178] Pietro Perona,et al. Entropy-based active learning for object recognition , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[179] Horst Bischof,et al. Person Re-identification by Descriptive and Discriminative Classification , 2011, SCIA.

[180] James J. Little,et al. Play and Learn: Using Video Games to Train Computer Vision Models , 2016, BMVC.

[181] Devi Parikh,et al. Interactively Guiding Semi-Supervised Clustering via Attribute-Based Explanations , 2014, ECCV.

[182] Ivan Laptev,et al. Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[183] 河原達也. Automatic Speech Recognition and Understanding Workshop(ASRU99) , 2000 .

[184] Stan Z. Li,et al. Deep Metric Learning for Practical Person Re-Identification , 2014, ArXiv.

[185] Adriana Kovashka,et al. WhittleSearch: Image search with relative attribute feedback , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[186] Anuj Srivastava,et al. Optimal linear projections for enhancing desired data statistics , 2010, Stat. Comput..

[187] Shai Shalev-Shwartz,et al. Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..

[188] Krystian Mikolajczyk,et al. Deep correlation for matching images and text , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[189] Takafumi Kanamori,et al. Efficient Direct Density Ratio Estimation for Non-stationarity Adaptation and Outlier Detection , 2008, NIPS.

[190] Peter V. Gehler,et al. Teaching 3D geometry to deformable part models , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[191] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[192] David Vázquez,et al. Learning appearance in virtual scenarios for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[193] Bernhard Schölkopf,et al. Correcting Sample Selection Bias by Unlabeled Data , 2006, NIPS.

[194] Ming-Yu Liu,et al. Coupled Generative Adversarial Networks , 2016, NIPS.

[195] Adriana Kovashka,et al. Actively selecting annotations among objects and attributes , 2011, 2011 International Conference on Computer Vision.

[196] Rama Chellappa,et al. Domain Adaptive Dictionary Learning , 2012, ECCV.

[197] Avishek Saha,et al. Active Supervised Domain Adaptation , 2011, ECML/PKDD.

[198] L. Tucker,et al. Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[199] Shiliang Sun,et al. Multi-source Transfer Learning with Multi-view Adaboost , 2012, ICONIP.

[200] David J. Hand,et al. Classifier Technology and the Illusion of Progress , 2006, math/0606441.

[201] Roberto Cipolla,et al. Understanding RealWorld Indoor Scenes with Synthetic Data , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[202] Jitendra Malik,et al. Viewpoints and keypoints , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[203] Philip S. Yu,et al. Transfer Sparse Coding for Robust Image Representation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[204] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[205] Nikolaos Papanikolopoulos,et al. Multi-class active learning for image classification , 2009, CVPR.

[206] Shree K. Nayar,et al. Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[207] Takeo Kanade,et al. Learning scene-specific pedestrian detectors without real data , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[208] Shaogang Gong,et al. Re-id: Hunting Attributes in the Wild , 2014, BMVC.

[209] Antonio Torralba,et al. Sharing Visual Features for Multiclass and Multiview Object Detection , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[210] Nello Cristianini,et al. Inferring a Semantic Representation of Text via Cross-Language Correlation Analysis , 2002, NIPS.

[211] Devi Parikh,et al. Attributes for Classifier Feedback , 2012, ECCV.

[212] C. V. Jawahar,et al. Relative Parts: Distinctive Parts for Learning Relative Attributes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[213] Larry S. Davis,et al. Image ranking and retrieval based on multi-attribute queries , 2011, CVPR 2011.

[214] Charless C. Fowlkes,et al. Do We Need More Training Data? , 2015, International Journal of Computer Vision.

[215] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.

[216] Roger Levy,et al. A new approach to cross-modal multimedia retrieval , 2010, ACM Multimedia.

[217] Visvanathan Ramesh,et al. Model Validation for Vision Systems via Graphics Simulation , 2015, ArXiv.

[218] Thomas Mensink,et al. Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.

[219] Kristen Grauman,et al. Interactively building a discriminative vocabulary of nameable attributes , 2011, CVPR 2011.

[220] Ronan Sicre,et al. Particular object retrieval with integral max-pooling of CNN activations , 2015, ICLR.

[221] Benno Stein,et al. Cross-Language Text Classification Using Structural Correspondence Learning , 2010, ACL.

[222] G. Terrell. The Maximal Smoothing Principle in Density Estimation , 1990 .

[223] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[224] Mario Fritz,et al. Recognizing Materials from Virtual Examples , 2012, ECCV.

[225] Roberto Cipolla,et al. SynthCam3D: Semantic Understanding With Synthetic Indoor Scenes , 2015, ArXiv.

[226] Trevor Darrell,et al. Discovering Latent Domains for Multisource Domain Adaptation , 2012, ECCV.

[227] Xiaodong Yu,et al. Attribute-Based Transfer Learning for Object Categorization with Zero/One Training Example , 2010, ECCV.

[228] Charu C. Aggarwal,et al. Towards semantic knowledge propagation from text corpus to web images , 2011, WWW.

[229] Dit-Yan Yeung,et al. Transfer metric learning by learning task relationships , 2010, KDD.

[230] Yi Yang,et al. Unsupervised Video Adaptation for Parsing Human Motion , 2014, ECCV.

[231] Pramod Sharma,et al. Efficient Detector Adaptation for Object Detection in a Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[232] Leonidas J. Guibas,et al. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[233] Takeo Kanade,et al. How Useful Is Photo-Realistic Rendering for Visual Learning? , 2016, ECCV Workshops.

[234] Antonio Manuel López Peña,et al. Procedural Generation of Videos to Train Deep Action Recognition Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[235] Qiang Yang,et al. Heterogeneous Transfer Learning for Image Clustering via the SocialWeb , 2009, ACL.

[236] Kristen Grauman,et al. Zero-shot recognition with unreliable attributes , 2014, NIPS.

[237] Meng Wang,et al. Automatic adaptation of a generic pedestrian detector to a specific traffic scene , 2011, CVPR 2011.

[238] Antonio Torralba,et al. Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes , 2003, NIPS.

[239] Bernhard Schölkopf,et al. Hilbert Space Embeddings and Metrics on Probability Measures , 2009, J. Mach. Learn. Res..

[240] Fei-Fei Li,et al. Best of both worlds: Human-machine collaboration for object annotation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[241] Sethuraman Panchanathan,et al. A Two-Stage Weighting Framework for Multi-Source Domain Adaptation , 2011, NIPS.

[242] Michael K. Ng,et al. Mixed-Transfer: Transfer Learning over Mixed Graphs , 2014, SDM.

[243] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[244] Lorenzo Torresani,et al. C3D: Generic Features for Video Analysis , 2014, ArXiv.

[245] Visvanathan Ramesh,et al. Simulations for Validation of Vision Systems , 2015, ArXiv.

[246] Yi Yang,et al. Articulated Human Detection with Flexible Mixtures of Parts , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[247] Michael I. Jordan,et al. Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[248] Luc Van Gool,et al. Exploring context to learn scene specific object detectors , 2009 .

[249] Christoph H. Lampert,et al. Augmented Attribute Representations , 2012, ECCV.

[250] Rongrong Ji,et al. Weak attributes for large-scale image retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[251] Antonio Torralba,et al. Evaluation of image features using a photorealistic virtual world , 2011, 2011 International Conference on Computer Vision.

[252] Trevor Darrell,et al. Do Convnets Learn Correspondence? , 2014, NIPS.

[253] Pramod Sharma,et al. Unsupervised incremental learning for improved object detection in a video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[254] C. Lawrence Zitnick,et al. Learning Common Sense through Visual Abstraction , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[255] Michael Goesele,et al. A shape-based object class model for knowledge transfer , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[256] Philip S. Yu,et al. Transfer Joint Matching for Unsupervised Domain Adaptation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[257] Konstantinos G. Derpanis,et al. Evaluation of deep convolutional nets for document image classification and retrieval , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[258] Sanja Fidler,et al. The Role of Context for Object Detection and Semantic Segmentation in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[259] Ramakant Nevatia,et al. Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images , 2015, ACM Multimedia.

[260] Anton van den Hengel,et al. Learning to rank in person re-identification with metric ensembles , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[261] Bernhard Schölkopf,et al. A kernel view of the dimensionality reduction of manifolds , 2004, ICML.

[262] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[263] Andrew J. Chosak,et al. OVVV: Using Virtual Worlds to Design and Evaluate Surveillance Systems , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[264] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[265] Mathieu Aubry,et al. Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[266] 한보형,et al. Learning Deconvolution Network for Semantic Segmentation , 2015 .

[267] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.

[268] Paul A. Viola,et al. Learning from one example through shared densities on transforms , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[269] Pascal Fua,et al. Beyond Sharing Weights for Deep Domain Adaptation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[270] Bernt Schiele,et al. Learning people detection models from few training samples , 2011, CVPR 2011.

[271] Ming Yang,et al. Regionlets for Generic Object Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[272] Antonio Criminisi,et al. Harvesting Image Databases from the Web , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[273] Florent Perronnin,et al. Large-scale image categorization with explicit data embedding , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[274] Chang Wang,et al. Heterogeneous Domain Adaptation Using Manifold Alignment , 2011, IJCAI.

[275] Tinne Tuytelaars,et al. A Testbed for Cross-Dataset Analysis , 2014, ECCV Workshops.

[276] H. Shimodaira,et al. Improving predictive inference under covariate shift by weighting the log-likelihood function , 2000 .

[277] Sergey Levine,et al. Towards Adapting Deep Visuomotor Representations from Simulated to Real Environments , 2015, ArXiv.

[278] Abhinav Gupta,et al. Constrained Semi-Supervised Learning Using Attributes and Comparative Attributes , 2012, ECCV.

[279] Jun Huan,et al. Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue , 2011, IEEE Transactions on Knowledge and Data Engineering.

[280] Jinhui Tang,et al. Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation , 2015, ACM Multimedia.

[281] Qiang Yang,et al. Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[282] Yejin Choi,et al. From Large Scale Image Categorization to Entry-Level Categories , 2013, 2013 IEEE International Conference on Computer Vision.

[283] Philip S. Yu,et al. Transfer Feature Learning with Joint Distribution Adaptation , 2013, 2013 IEEE International Conference on Computer Vision.

[284] Barbara Caputo,et al. Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[285] Dacheng Tao,et al. Bregman Divergence-Based Regularization for Transfer Subspace Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[286] Pietro Perona,et al. Multiclass recognition and part localization with humans in the loop , 2011, 2011 International Conference on Computer Vision.

[287] Mubarak Shah,et al. Online detection and classification of moving objects using progressively improving detectors , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[288] Marc'Aurelio Ranzato,et al. Fast Inference in Sparse Coding Algorithms with Applications to Object Recognition , 2010, ArXiv.

[289] Trevor Darrell,et al. What you saw is not what you get: Domain adaptation using asymmetric kernel transforms , 2011, CVPR 2011.

[290] Michael S. Bernstein,et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations , 2016, International Journal of Computer Vision.

[291] Qiang Yang,et al. Heterogeneous Transfer Learning for Image Classification , 2011, AAAI.

[292] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[293] Larry S. Davis,et al. Domain adaptive object detection , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).

[294] Ivor W. Tsang,et al. Learning With Augmented Features for Supervised and Semi-Supervised Heterogeneous Domain Adaptation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[295] Koen E. A. van de Sande,et al. Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[296] Rama Chellappa,et al. Generalized Domain-Adaptive Dictionaries , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[297] Kate Saenko,et al. Generating Large Scale Image Datasets from 3 D CAD Models , 2015 .

[298] David W. Jacobs,et al. Generalized Multiview Analysis: A discriminative latent space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[299] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[300] Qingyao Wu,et al. Online Heterogeneous Transfer Learning by Knowledge Transition , 2019, ACM Trans. Intell. Syst. Technol..

[301] Andrew Zisserman,et al. Multiple kernels for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[302] Florent Perronnin,et al. Large-scale image retrieval with compressed Fisher vectors , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[303] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[304] Bernhard Schölkopf,et al. Domain Adaptation under Target and Conditional Shift , 2013, ICML.

[305] Yongxin Yang,et al. Deep Multi-task Representation Learning: A Tensor Factorisation Approach , 2016, ICLR.

[306] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[307] Geoffrey E. Hinton,et al. Unsupervised Learning of Image Transformations , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[308] Yamada Makoto,et al. No Bias Left Behind: Covariate Shift Adaptation for Discriminative 3D Pose Estimation , 2012 .

[309] Barbara Caputo,et al. Frustratingly Easy NBNN Domain Adaptation , 2013, 2013 IEEE International Conference on Computer Vision.

[310] Iasonas Kokkinos,et al. Understanding Objects in Detail with Fine-Grained Attributes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[311] Chong-Wah Ngo,et al. Semi-supervised Domain Adaptation with Subspace Learning for visual recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[312] Vinod Nair,et al. A joint learning framework for attribute models and object descriptions , 2011, 2011 International Conference on Computer Vision.

[313] Trevor Darrell,et al. Adapting Visual Category Models to New Domains , 2010, ECCV.

[314] G. Griffin,et al. Caltech-256 Object Category Dataset , 2007 .

[315] Deepak S. Turaga,et al. Cross domain distribution adaptation via kernel mapping , 2009, KDD.

[316] Alexei A. Efros,et al. Unbiased look at dataset bias , 2011, CVPR 2011.

[317] Tieniu Tan,et al. Robust view transformation model for gait recognition , 2011, 2011 18th IEEE International Conference on Image Processing.

[318] Dong Liu,et al. Robust visual domain adaptation with low-rank reconstruction , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[319] Dong Xu,et al. Visual recognition by learning from web data: A weakly supervised domain generalization approach , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[320] F. L. Hitchcock. The Expression of a Tensor or a Polyadic as a Sum of Products , 1927 .

[321] Trevor Darrell,et al. Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[322] M.J.F. Gales. Maximum Likelihood Linear Regression 32 . 1 Maximum likelihood linear regression , 2007 .

[323] Kate Saenko,et al. Return of Frustratingly Easy Domain Adaptation , 2015, AAAI.