In Progress

 

Principal Investigators: 

Prof Katherine Lee

Prof Margarita Moreno-Betancur

Researchers: 

Dr Cattram Nguyen

Dr Ghazaleh Dashti

Dr Rheanna Mainzer

Dr Tom Sullivan

Dr Anurika De Silva

Dr Rushani Wijesuriya

Dr Melissa Middleton

Ms Jiaxin Zhang

PhD Students:

Mr Cameron Patrick

Ms Jessica Xu

Collaborators:

Prof John Carlin

Prof Julie Simpson

Prof Ian White

Prof James Carpenter

Prof Kate Tilling

Dr Rachel Hughes

Addressing new challenges with missing data in complex epidemiological studies: methods, guidance and software

 

There are large and growing investments in life-course epidemiological studies, which are central to understanding disease aetiology and progression, and thus to developing interventions to improve population health. The broadening scope and complexity of such studies generate numerous statistical challenges.

Firstly it is important that the analysis of data from these studies accounts for their complex design features, such as non-equal probability sampling and multilevel structures. Secondly researchers are posing increasingly intricate research questions around the causal relations between exposures and outcomes, giving rise to a range of sophisticated new analytic methods. Missing data are inevitable in all research studies, but this is especially important in longitudinal studies, where there are multiple opportunities for drop-out and sporadic non-response. It is critical that missing data are handled appropriately in the analysis to minimise the risk of biased findings and maximise precision. Multiple imputation (MI) is now widely used – and called for by editors and reviewers – for dealing with missing data, but in the context of complex designs and modern causal methods, it is not yet clear how to implement MI, or whether it is the best approach.

The aims of this research are to develop and evaluate novel approaches for the implementation of MI, in comparison with alternative approaches such as inverse probability weighting and direct Bayesian methods, and to provide software and guidance for handling missing data in the context of:

1) complex study designs, including weighted sampling and multilevel data, and
2) modern causal modelling approaches, namely mediation analysis, marginal structural models, principal stratification and instrumental variable methods.
Our research spans a comprehensive range of approaches, from examining the underlying mathematics and performing simulation experiments to empirical validation through application to case studies.

We are members of the Missing Data, Imputation & Analysis (MiDIA) group, an international group including key missing data researchers at the London School of Hygiene & Tropical Medicine, MRC Biostatistics Unit Cambridge, UCL/MRC Clinical Trials Unit, and University of Bristol.

 

     

    Related People

    Lead Investigator
    Prof John Carlin
    Lead Investigator
    Prof Katherine Kate Lee
    Lead Investigator
    Post-doctoral Biostatistician
    Post-doctoral Biostatistician
    Post-doctoral Biostatistician
    Ghazaleh Dashti
    Post-doctoral Biostatistician
    Post-doctoral Biostatistician
    Affiliated Investigator
    Cameron Patrick ViCBiostat
    PhD Student
    Jiaxin Zhang
    PhD Student