Generation and Simulation of Synthetic Datasets with Copulas

03/30/2022
by   Régis Houssou, et al.
0

This paper proposes a new method to generate synthetic data sets based on copula models. Our goal is to produce surrogate data resembling real data in terms of marginal and joint distributions. We present a complete and reliable algorithm for generating a synthetic data set comprising numeric or categorical variables. Applying our methodology to two datasets shows better performance compared to other methods such as SMOTE and autoencoders.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset