Inference with many correlated weak instruments and summary statistics

05/04/2020
by   Ashish Patel, et al.
0

This paper concerns inference in instrumental variable models with a high-dimensional set of correlated weak instruments. Our focus is motivated by Mendelian randomization, the use of genetic variants as instrumental variables to identify the unconfounded effect of an exposure on disease. In particular, we consider the scenario where a large number of genetic instruments may be exogenous, but collectively they explain a low proportion of exposure variation. Additionally, we assume that individual-level data are not available, but rather summary statistics on genetic associations with the exposure and outcome, as typically published from meta-analyses of genome-wide association studies. In a two-stage approach, we first use factor analysis to exploit structured correlations of genetic instruments as expected in candidate gene analysis, and estimate an unknown vector of optimal instruments. The second stage conducts inference on the parameter of interest under scenarios of strong and weak identification. Under strong identification, we consider point estimation based on minimization of a limited information maximum likelihood criterion. Under weak instrument asymptotics, we generalize conditional likelihood ratio and other identification-robust statistics to account for estimated instruments and summary data as inputs. Simulation results illustrate favourable finite-sample properties of the factor-based conditional likelihood ratio test, and we demonstrate use of our method by studying the effect of interleukin-6 signaling on glycated hemoglobin levels.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset