Knowledge source selection by estimating distance between datasets

Yi Ting Chiang*, Wen Chieh Fang, Jane Yung Jen Hsu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Most traditional machine learning methods make an assumption that the distribution of the training dataset is the same as the applied domain. Transfer learning omits this assumption and is able to transfer knowledge between different domains. It is a promising method to make machine learning technology become more practical. However, negative transfer can hurt the performance of the model, therefore, it should be avoided. In this paper, we focus on how to select a good knowledge source when there are multiple labelled datasets available. A method to estimate the divergence between two labelled datasets is given. In addition, we also provide a method to decide the mappings between features in different datasets. The experimental results show that the divergence estimated by our method is highly related to the performance of the model.

Original languageEnglish
Title of host publicationProceedings - 2012 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2012
Pages44-49
Number of pages6
DOIs
StatePublished - 2012
Externally publishedYes
Event2012 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2012 - Tainan, Taiwan
Duration: 16 11 201218 11 2012

Publication series

NameProceedings - 2012 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2012

Conference

Conference2012 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2012
Country/TerritoryTaiwan
CityTainan
Period16/11/1218/11/12

Keywords

  • Machine learning
  • Similarity measure
  • Transfer learning

Fingerprint

Dive into the research topics of 'Knowledge source selection by estimating distance between datasets'. Together they form a unique fingerprint.

Cite this