Abstract
Machine Learning algorithms are often as good as the data they can learn from. Enormous amount of unlabeled data is readily available and the ability to efficiently use such amount of unlabeled data holds a significant promise in terms of increasing the performance of various learning tasks. We consider the task of supervised Domain Adaptation and present a Self-Taught learning based framework which makes use of the K-SVD algorithm for learning sparse representation of data in an unsupervised manner. To the best of our knowledge this is the first work that integrates K-SVD algorithm into the self-taught learning framework. The K-SVD algorithm iteratively alternates between sparse coding of the instances based on the current dictionary and a process of updating/adapting the dictionary to better fit the data so as to achieve a sparse representation under strict sparsity constraints. Using the learnt dictionary, a rich feature representation of the few labeled instances is obtained which is fed to a classifier along with class labels to build the model. We evaluate our framework on the task of domain adaptation for sentiment classification. Both self-domain (requiring very few domain-specific training instances) and cross-domain classification (requiring 0 labeled instances of target domain and very few labeled instances of source domain) are performed. Empirical comparisons of self-domain and cross-domain results establish the efficacy of the proposed framework.
Original language | English |
---|---|
Title of host publication | CIKM 2012 - Proceedings of the 21st ACM International Conference on Information and Knowledge Management |
Pages | 2395-2398 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 19-12-2012 |
Externally published | Yes |
Event | 21st ACM International Conference on Information and Knowledge Management, CIKM 2012 - Maui, HI, United States Duration: 29-10-2012 → 02-11-2012 |
Conference
Conference | 21st ACM International Conference on Information and Knowledge Management, CIKM 2012 |
---|---|
Country/Territory | United States |
City | Maui, HI |
Period | 29-10-12 → 02-11-12 |
All Science Journal Classification (ASJC) codes
- Human-Computer Interaction
- Computer Networks and Communications
- Computer Vision and Pattern Recognition
- Software