Modeling with Possible Non-Ignorable Dropout in

Several programs in one, fully integrated: ... on the relationship between dropout and outcomes, but also ... Muthén-Roy model extended to include an ultimate.
294KB taille 36 téléchargements 338 vues
Modeling with Possible Non-Ignorable Dropout in Longitudinal Studies Bengt Muth´en [email protected] www.statmodel.com

Presentation at Inserm Atelier de formation 205, Saint-Raphael, June 3, 2010

Bengt Muth´en

Non-Ignorable Dropout

1/ 25

Overview

Overview of key missing data models presented in the general latent variable framework of Mplus Current approaches: Shared set of latent variables for the outcome and missing data processes New approach: Disentangling the outcome process and the missing data process

Bengt Muth´en

Non-Ignorable Dropout

2/ 25

Example: Longitudinal Data From An Antidepressant Trial (STAR*D) n = 4041 Subjects treated with citalopram (Level 1). No placebo group 20

Overall

18

Observed

QIDS Depression Score

16

Dropout

14 12 10 8 6 4 2 0 0

2

4

6

8

10

12

Week

Bengt Muth´en

Non-Ignorable Dropout

3/ 25

14

MCAR, MAR, And NMAR

y: outcome vector - but y is not the only data m: missing data indicator vector MCAR mis P(mi |yobs i , yi ) = P(mi )

(1)

mis obs P(mi |yobs i , yi ) = P(mi |yi )

(2)

ML [y, m] = ML [y]

(3)

MAR

NMAR (non-ignorable or informative missingness): None of the above. Missingness predicted by latent variables

Bengt Muth´en

Non-Ignorable Dropout

4/ 25

NMAR Modeling: Joint Analysis Of The Outcome And Missing Data Processes y: outcome vector m: missing data indicator vector Focus has typically been on dropout with m replaced by dropout indicators d. Often, d’s are scored as discrete-time survival (event history) indicators. For example, dropout after 3rd visit: 0 0 0 1 999 999. But d’s can also be dropout dummy variables. Mplus creates m and d using Data Missing

c: latent class variable [y, m] = [y] [m|y], Selection modeling

(4)

= [m] [y|m], Pattern − mixture modeling =

∑[c] [m|c] [y|c]

(5)

Shared − parameter modeling (6)

c

Each NMAR model involves untestable assumptions - comparing results from several models gives a sensitivity analysis Overviews in Albert and Follman (2008), Little (2008) Bengt Muth´en

Non-Ignorable Dropout

5/ 25

The unifying theme of latent variables

Continuous Latent Variables

Categorical Latent Variables

Factors

Latent classes

Random effects

Clusters

Frailties, liabilities

Finite mixtures

Variance components

Missing data

Missing data Bayesian parameter priors

Bengt Muth´en

Non-Ignorable Dropout

6/ 25

The Mplus general latent variable modeling framework

Mplus support from National Institutes of Health Small Business Innovation Research contracts and grants Several programs in one, fully integrated: Exploratory factor analysis Structural equation modeling Item response theory analysis Latent class analysis Latent transition analysis (Hidden Markov modeling) Survival analysis

Bengt Muth´en

Growth modeling Multilevel analysis Complex survey data analysis Bayesian analysis using MCMC Monte Carlo simulation

Non-Ignorable Dropout

7/ 25

NMAR analysis in Mplus

A large set of NMAR models can be estimated in the general latent variable modeling framework of Mplus (www.statmodel.com) Maximum-likelihood estimation using EM in combination with FS and QN Number of classes informed by BIC and bootstrapped likelihood-ratio test Bayesian MCMC analysis available as well using DIC and PPC

Bengt Muth´en

Non-Ignorable Dropout

8/ 25

Diggle-Kenward Selection Model (d’s: dropout indicators in line with discrete-time survival)

Diggle-Kenward (1994) in Applied Statistics Logistic regression of dt on observed yt−1 and observed/unobserved yt Bengt Muth´en

Non-Ignorable Dropout

9/ 25

Pattern-Mixture Model (d’s: dropout pattern dummies)

Hedeker-Gibbons (1997) in Psychological Methods, Demirtas-Schafer (2003) in Statistics in Medicine Bengt Muth´en

Non-Ignorable Dropout

10/ 25

Roy-Dantan Latent Class Dropout Model (d’s: dropout pattern dummies)

Roy (2003) Biometrics: c on dropout time Dantan et al (2008) Int’l J. of Biostat: c on dropout dummies Bengt Muth´en

Non-Ignorable Dropout

11/ 25

Summary Of Mixture Modeling Of STAR*D Data Using Dropout Pattern Dummies

Model Pattern-mixture Roy 2c Roy 3c Roy 4c Roy 5c

Loglikelihood

#par.s

BIC

-44946 -44871 -44777 -44728 -44698

27 24 33 42 51

90117 89942 89828 89806 89820

Bengt Muth´en

Non-Ignorable Dropout

12/ 25

An aside: BIC curves for Roy-GMM and Roy-LCGA

91000 90900 90800 90700 90600 90500 90400 90300 90200 90100 90000 89900 89800 89700 89600 89500 89400 89300 89200 89100 89000

GMM LCGA

1

2

3

4

5

6

7

8

9

10

Number of Classes

Bengt Muth´en

Non-Ignorable Dropout

13/ 25

Depression Mean Curves Estimated Under MAR, Diggle-Kenward NMAR, And Roy NMAR 25

QUIDS Depression Score

20

MAR DK

15

Roy

10

5

0 0

2

4

6

8

10

12

Week

Bengt Muth´en

Non-Ignorable Dropout

14/ 25

14

Disadvantages Of Pattern-Mixture And Roy Latent Class Dropout Modeling

For both approaches the intention is to mix the parameter estimates over the patterns/classes to obtain an overall estimated growth curve This mixing may hide substantively interesting trajectory classes

Bengt Muth´en

Non-Ignorable Dropout

15/ 25

4-Class Roy-Dantan Latent Class Dropout Model

Bengt Muth´en

Non-Ignorable Dropout

16/ 25

4-Class MAR Model (Y Outcomes Only)

Bengt Muth´en

Non-Ignorable Dropout

17/ 25

Disadvantages Of Pattern-Mixture And Roy Latent Class Dropout Modeling

Roy latent class dropout modeling forms classes based not only on the relationship between dropout and outcomes, but also based on the development of the outcomes over time This may confound dropout classes with trajectory classes

Bengt Muth´en

Non-Ignorable Dropout

18/ 25

Muth´en-Roy Model Using Two Latent Class Variables

Bengt Muth´en

Non-Ignorable Dropout

19/ 25

Model Comparisons

Model

Loglikelihood

#par.s

BIC

-44946 -44728 -44662

27 42 44

90117 89806 89689

Pattern-mixture Roy 4c Muthen 4c-Roy 2c

Bengt Muth´en

Non-Ignorable Dropout

20/ 25

Comparing Trajectory Class Percentages Across Models Adding dropout information gives a less favorable conclusion regarding drug response than assuming MAR

Model

MAR 4 classes

Response class

Temporary response class

Non-response class

55 %

3%

15 %

43 % 32 %

18 % 19 %

28 % 32 %

Bengt Muth´en

Non-Ignorable Dropout

NMAR models: Roy 4 classes Muthen 4c-Roy 2c

21/ 25

Selection models with latent classes: Generalized Diggle-Kenward and Beunckens et al.

Bengt Muth´en

Non-Ignorable Dropout

22/ 25

Muth´en-Roy model extended to include an ultimate outcome

Bengt Muth´en

Non-Ignorable Dropout

23/ 25

Technical Papers: See Papers at www.statmodel.com Muth´en & Shedden (1999). Finite mixture modeling with mixture outcomes using the EM algorithm. Biometrics. Muth´en (2002). Beyond SEM: General latent variable modeling. Behaviormetrika. Muth´en et al. (2002). General growth mixture modeling for randomized preventive interventions. Biostatistics. Muth´en & Asparouhov (2009). Growth mixture modeling: Analysis with non-Gaussian random effects. In Fitzmaurice et al. (eds.), Longitudinal Data Analysis. Chapman & Hall. Muth´en & Brown (2009). Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling. Statistics in Medicine. Muth´en et al. (2010). Growth modeling with non-ignorable dropout: Alternative analyses of the STAR*D antidepressant trial. Under review.

Bengt Muth´en

Non-Ignorable Dropout

24/ 25

Applied Papers: See Papers at www.statmodel.com Kreuter & Muth´en (2008). Analyzing criminal trajectory profiles: Bridging multilevel and group-based approaches using growth mixture modeling. Journal of Quantitative Criminology, 24, 1-31. Muth´en (2001). Latent variable mixture modeling. In Marcoulides & Schumacker (eds.), New Developments and Techniques in Structural Equation Modeling (pp. 1-33). Lawrence Erlbaum Associates. Muth´en (2004). Latent variable analysis: Growth mixture modeling and related techniques for longitudinal data. In D. Kaplan (ed.), Handbook of quantitative methodology for the social sciences (pp. 345-368). Newbury Park, CA: Sage Publications. Muth´en (2008). Latent variable hybrids: Overview of old and new models. In Hancock & Samuelsen (Eds.), Advances in latent variable mixture models, pp. 1-24. Charlotte, NC: Information Age Publishing, Inc. Muth´en & Muth´en (2000). Integrating person-centered and variable-centered analyses: Growth mixture modeling with latent trajectory classes. Alcoholism: Clinical and Experimental Research, 24, 882-891. Bengt Muth´en

Non-Ignorable Dropout

25/ 25