Lei Feng*, Senlin Shu, Nan Lu, Bo Han, Miao Xu, Gang Niu, Bo An, Masashi Sugiyama. (*Corresponding author)
Proceedings of the 38th International Conference on Machine Learning (ICML'21).
To alleviate the data requirement for training effective binary classifiers, many weakly supervised learning settings have been proposed. Among them, some use pairwise rather than pointwise labels when pointwise labels are not accessible due to privacy, confidentiality, or security reasons. However, because a pairwise label denotes whether or not two data points share a pointwise label, it cannot be easily collected if either point is equally likely to be positive or negative. Thus, in this paper, we propose a novel setting called pairwise comparison (Pcomp) classification, where we have only pairs of unlabeled data for which we know that one point is more likely to be positive than the other. Firstly, we give a Pcomp data generation process, derive an unbiased risk estimator (URE) with a theoretical guarantee, and further improve the URE using correction functions. Secondly, we link Pcomp classification to noisy-label learning to develop a progressive URE and improve it by imposing consistency regularization. Finally, we demonstrate by experiments the effectiveness of our methods, which suggests that Pcomp is a valuable and practically useful type of pairwise supervision besides the pairwise label.
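To make the Pcomp setting concrete, the following is a minimal sketch of how such pairs might be simulated. It is an illustration under stated assumptions, not the generation process defined in the paper: the function name `make_pcomp_pairs`, the 1-D Gaussian class-conditionals, the class prior of 0.5, and the rule of discarding pairs whose hidden labels are (-1, +1) are all assumptions made for this example.

```python
import numpy as np

def make_pcomp_pairs(n_pairs, prior=0.5, seed=0):
    """Draw pairs (x, x') whose hidden labels satisfy y >= y' (toy assumption)."""
    rng = np.random.default_rng(seed)
    firsts, seconds = [], []
    while len(firsts) < n_pairs:
        # Hidden pointwise labels in {+1, -1}; never shown to the learner.
        y = rng.choice([1, -1], size=2, p=[prior, 1 - prior])
        if y[0] < y[1]:
            # Discard (-1, +1): the first point must not be the less positive one.
            continue
        # Toy class-conditional densities: unit-variance Gaussians at +1 and -1.
        x = rng.normal(loc=y.astype(float), scale=1.0)
        firsts.append(x[0])
        seconds.append(x[1])
    return np.array(firsts), np.array(seconds)

# Only the ordered, unlabeled pairs are returned; the labels stay hidden.
x_first, x_second = make_pcomp_pairs(1000)
```

In this toy setup the learner sees only `x_first` and `x_second`, and the only supervision is the ordering within each pair: the first element is at least as likely to be positive as the second.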
Figure: (a) Ordinary binary classification. (b) Pairwise comparison classification.