Skip to contents

Constrained maximum likelihood (cML) based Mendelian Randomization method robust to both correlated and uncorrelated pleiotropy.

Usage

mr_cML(
  object,
  MA = TRUE,
  DP = TRUE,
  K_vec = 0:(length(object@betaX) - 2),
  random_start = 0,
  num_pert = 200,
  random_start_pert = 0,
  maxit = 100,
  random_seed = 314,
  n,
  Alpha = 0.05
)

# S4 method for MRInput
mr_cML(
  object,
  MA = TRUE,
  DP = TRUE,
  K_vec = 0:(length(object@betaX) - 2),
  random_start = 0,
  num_pert = 200,
  random_start_pert = 0,
  maxit = 100,
  random_seed = 314,
  n,
  Alpha = 0.05
)

Arguments

object

An MRInput object.

MA

Whether model average is applied or not. Default is TRUE.

DP

Whether data perturbation is applied or not. Default is TRUE.

K_vec

Set of candidate K's, the constraint parameter representing number of invalid IVs. Default is from 0 to (#IV - 2).

random_start

Number of random starting points for cML, default is 0.

num_pert

Number of perturbation when DP is TRUE, default is 200.

random_start_pert

Number of random start points for cML with data perturbation, default is 0.

maxit

Maximum number of iterations for each optimization. Default is 100.

random_seed

Random seed, default is 314. When random_seed=NULL, no random seed will be used and the results may not be reproducible.

n

Sample size. When sample sizes of GWAS for exposure and outcome are different, and/or when sample sizes of different SNPs are different, the smallest sample size is recommended to get conservative result and avoid type-I error. See reference for more discussions.

Alpha

Significance level for the confidence interval for estimate, default is 0.05.

Value

The output from the function is an MRcML object containing:

Exposure

A character string giving the name given to the exposure.

Outcome

A character string giving the name given to the outcome.

Estimate

Estimate of theta.

StdError

Standard error of estimate.

Pvalue

p-value of estimate.

BIC_invalid

Set of selected invalid IVs if cML-BIC is performed, i.e. without MA or DP.

GOF1_p

p-value of the first goodness-of-fit test.

GOF2_p

p-value of the second goodness-of-fit test.

SNPs

The number of SNPs that were used in the calculation.

Alpha

Significance level for the confidence interval for estimate, default is 0.05.

CILower

Lower bound of the confidence interval for estimate.

CIUpper

Upper bound of the confidence interval for estimate.

MA

Indicator of whether model average is applied.

DP

Indicator of whether data perturbation is applied.

Details

The MRcML method selects invalid IVs with correlated and/or uncorrelated peliotropic effects using constrained maximum likelihood. cML-BIC gives results of the selected model with original data, while cML-MA-BIC averages over all candidate models. cML-BIC-DP and cML-MA-BIC-DP are the versions with data-perturbation to account for selection uncertainty when many invalid IVs have weak pleiotropic effects.

When DP is performed, two goodness-of-fit (GOF) tests are developed to check whether the model-based and DP- based variance estimates converge to the same estimate. Small p-values of GOF tests indicate selection uncertainty is not ignorable, and results from DP is more reliable. See reference for more details.

As the constrained maximum likelihood function is non-convex, multiple random starting points could be used to find a global minimum. For some starting points the algorithm may not converge and a warning message will be prompted, typically this will not affect the results.

References

Xue, H., Shen, X., & Pan, W. (2021). Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. The American Journal of Human Genetics, 108(7), 1251-1269.

Examples

# Perform cML-MA-BIC-DP:
mr_cML(mr_input(bx = ldlc, bxse = ldlcse, by = chdlodds,
byse = chdloddsse), num_pert=5, MA = TRUE, DP = TRUE, n = 17723)
#> 
#> Constrained maximum likelihood method (MRcML) 
#> Number of Variants:  28 
#> Results for:  cML-MA-BIC-DP 
#> ------------------------------------------------------------------
#>         Method Estimate    SE Pvalue        95% CI
#>  cML-MA-BIC-DP    3.073 0.278  0.000 [2.528,3.617]
#> ------------------------------------------------------------------
# num_pert is set to 5 to reduce computational time
# the default value of 200 is recommended in practice

# Perform cML-BIC-DP:
mr_cML(mr_input(bx = ldlc, bxse = ldlcse, by = chdlodds,
byse = chdloddsse), MA = TRUE, DP = FALSE,, n = 17723)
#> 
#> Constrained maximum likelihood method (MRcML) 
#> Number of Variants:  28 
#> Results for:  cML-MA-BIC 
#> ------------------------------------------------------------------
#>      Method Estimate    SE Pvalue        95% CI
#>  cML-MA-BIC    2.895 0.321  0.000 [2.265,3.525]
#> ------------------------------------------------------------------