TY - JOUR
T1 - Tight Source Domain Match for Partial Domain Adaptation Based on Maximum Density
AU - Ni, Zi Ao
AU - Su, Zixian
AU - Yang, Xi
AU - Wang, Qiufeng
AU - Huang, Kaizhu
N1 - Publisher Copyright:
© Published under licence by IOP Publishing Ltd.
PY - 2022/6/1
Y1 - 2022/6/1
N2 - Unsupervised domain adaptation (UDA) aims at transferring knowledge between a well-labelled 'source domain' and an unlabelled 'target domain' by decreasing distribution discrepancy. In real scenario, partial domain adaptation (PDA), where target domain only includes part of the classes of source domain, is adopted as fully-shared label space is often unavailable. Non-identical label spaces across domains lead to performance degradation due to source-unique classes being mis-matched to the target domain, i.e. negative transfer of the target domain. Although existing PDA approaches have produced promising results, they still confront with negative transfer problem without rigorous generalization bounds. In our work, a novel PDA model has been proposed based on margin disparity and maximum source intra-class density divergence (MDSD). It matches the feature distributions with shared labels and congregates source samples in the source with affirmative labels. Removing maximum target density with pseudo labels, it effectively avoids over-fitting and accelerates learning speed. In addition, we construct a new garbage classification dataset, which is comprised of source domain - Product (Pr) and target domain - Garbage (Ga) for PDA validation. Experiments show that our proposed unsupervised partial domain adaptation method has a good performance on Office-31, Office-Home and Pr-Ga datasets.
AB - Unsupervised domain adaptation (UDA) aims at transferring knowledge between a well-labelled 'source domain' and an unlabelled 'target domain' by decreasing distribution discrepancy. In real scenario, partial domain adaptation (PDA), where target domain only includes part of the classes of source domain, is adopted as fully-shared label space is often unavailable. Non-identical label spaces across domains lead to performance degradation due to source-unique classes being mis-matched to the target domain, i.e. negative transfer of the target domain. Although existing PDA approaches have produced promising results, they still confront with negative transfer problem without rigorous generalization bounds. In our work, a novel PDA model has been proposed based on margin disparity and maximum source intra-class density divergence (MDSD). It matches the feature distributions with shared labels and congregates source samples in the source with affirmative labels. Removing maximum target density with pseudo labels, it effectively avoids over-fitting and accelerates learning speed. In addition, we construct a new garbage classification dataset, which is comprised of source domain - Product (Pr) and target domain - Garbage (Ga) for PDA validation. Experiments show that our proposed unsupervised partial domain adaptation method has a good performance on Office-31, Office-Home and Pr-Ga datasets.
UR - http://www.scopus.com/inward/record.url?scp=85132024934&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/2278/1/012032
DO - 10.1088/1742-6596/2278/1/012032
M3 - Conference article
AN - SCOPUS:85132024934
SN - 1742-6588
VL - 2278
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 1
M1 - 012032
T2 - 2022 6th International Conference on Machine Vision and Information Technology, CMVIT 2022
Y2 - 25 February 2022
ER -