• semi-supervised learning with density-ratio estimation

    جزئیات بیشتر مقاله
    • تاریخ ارائه: 1392/07/24
    • تاریخ انتشار در تی پی بین: 1392/07/24
    • تعداد بازدید: 943
    • تعداد پرسش و پاسخ ها: 0
    • شماره تماس دبیرخانه رویداد: -
     in this paper we study statistical properties of semi-supervised learning, which is considered to be an important problem in the field of machine learning. in standard supervised learning only labeled data is observed, and classification and regression problems are formalized as supervised learning. on the other hand, in semi-supervised learning, unlabeled data is also obtained in addition to labeled data. hence, the ability to exploit unlabeled data is important to improve prediction accuracy in semi-supervised learning. this problem is regarded as a semiparametric estimation problem with missing data. under discriminative probabilistic models, it was considered that unlabeled data is useless to improve the estimation accuracy. recently, the weighted estimator using unlabeled data achieves a better prediction accuracy compared to the learning method using only labeled data, especially when the discriminative probabilistic model is misspecified. that is, improvement under the semiparametric model with missing data is possible when the semiparametric model is misspecified. in this paper, we apply the density-ratio estimator to obtain the weight function in semi-supervised learning. our approach is advantageous because the proposed estimator does not require well-specified probabilistic models for the probability of the unlabeled data. based on statistical asymptotic theory, we prove that the estimation accuracy of our method outperforms supervised learning using only labeled data. some numerical experiments present the usefulness of our methods.

سوال خود را در مورد این مقاله مطرح نمایید :

با انتخاب دکمه ثبت پرسش، موافقت خود را با قوانین انتشار محتوا در وبسایت تی پی بین اعلام می کنم
مقالات جدیدترین رویدادها
مقالات جدیدترین ژورنال ها