Classification by estimating the cumulative distribution function for small data

10/12/2022
by Meng-Xian Zhu, et al.

In this paper, we study the classification problem by estimating the conditional probability function of the given data. Unlike traditional expected risk estimation theory, which works on empirical data, we compute the probability via a Fredholm equation, which leads to estimating the distribution of the data. Based on the Fredholm equation, we present a new expected risk estimation theory built on estimating the cumulative distribution function. The main characteristic of the new expected risk estimation is that it measures the risk on the distribution of the input space. We also present the corresponding empirical risk estimation and propose an ε-insensitive L_1 cumulative support vector machine (ε-L_1VSVM) by introducing an insensitive loss. Notably, the classification models and the classification evaluation indicators based on the new mechanism differ from the traditional ones. Experimental results show the effectiveness of the proposed ε-L_1VSVM and of the corresponding cumulative distribution function indicator with respect to the validity and interpretability of small data classification.
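To make the central object concrete, the sketch below computes a standard multivariate empirical cumulative distribution function, the usual sample-based estimate of the distribution of the input space. It is only an illustration under stated assumptions: the function name `empirical_cdf` and the toy data are hypothetical, and this is not the paper's ε-L_1VSVM or its risk functional, whose details the abstract does not give.

```python
import numpy as np

def empirical_cdf(samples, points):
    """Standard multivariate empirical CDF (illustrative, not the paper's estimator).

    samples: array of shape (n, d), the observed data.
    points:  array of shape (m, d), the locations where the CDF is evaluated.
    Returns an array of shape (m,), where entry i is the fraction of samples
    that are <= points[i] in every coordinate.
    """
    samples = np.asarray(samples, dtype=float)
    points = np.asarray(points, dtype=float)
    # dominated[i, j] is True when sample j is <= point i in all d coordinates.
    dominated = np.all(samples[None, :, :] <= points[:, None, :], axis=2)
    return dominated.mean(axis=1)

# Hypothetical toy data: four points in the unit square.
X = np.array([[0.1, 0.2], [0.4, 0.9], [0.5, 0.5], [0.8, 0.3]])
F = empirical_cdf(X, X)
print(F)
```

With only four samples the estimate takes values in steps of 0.25, which hints at why small data makes distribution estimation delicate.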
