We present a methodology for using unlabeled data to design semi supervi...
Efron's two-group model is widely used in large scale multiple testing. ...
Modern approaches to supervised learning like deep neural networks (DNNs...
This paper proposes parametric and non-parametric hypothesis testing
alg...
A central goal in designing clinical trials is to find the test that
max...
This paper presents a new approach for trees-based regression, such as s...
We present a general methodology for using unlabeled data to design semi...
K-fold cross-validation (CV) with squared error loss is widely used for
...
Interpolators -- estimators that achieve zero training error -- have
att...
The highly influential two group model in testing a large number of
stat...
Cross-validation of predictive models is the de-facto standard for model...
We consider the problem of predicting several response variables using t...
Typically, real-world stochastic processes are not easy to analyze. In t...
Ensemble methods are among the state-of-the-art predictive modeling
appr...
Independent Component Analysis (ICA) is a statistical tool that decompos...
Multiple testing problems are a staple of modern statistical analysis. T...
The problem of handling adaptivity in data analysis, intentional or not,...
Common model selection criteria, such as AIC and its variants, are based...
Modern data sets in various domains often include units that were sample...
Recursive partitioning approaches producing tree-like models are a long
...
Regularization aims to improve prediction performance of a given statist...
We consider the generic regularized optimization problem
β̂(λ)=_βL(y,Xβ)...