Key words: Statistical and deep learning, Longitudinal analysis, Clustering, Mixed effect model, variational autoencoders, Biomarkers.
Host Laboratory : CREATIS Lyon – MYRIAD team
Scientific context:
Anomaly detection in medical imaging is a challenging task in contexts where abnormalities are not annotated and difficult to detect even for experts. This problem can be addressed through unsupervised anomaly detection (UAD) methods, which identify features that do not match with a reference model of normal profiles.
The goal of this project is to further improve the reliability of the UAD by leveraging additional information coming from longitudinal data. Longitudinal data [Hedeker & Gibbons 2006] consist in the repeated observations of patients over time. In practice, we expect to analyse image data at a few different times corresponding to successive visits of patients. Their analysis informs us on the progression of the disease through the evolution of abnormalities, both in size, numbers, or locations. More specifically, when applied to anomaly detection, the expectation is the confirmation of uncertain detections or the discovery of new ones, not visible at early stages.
Modelling longitudinal data presents different types of challenges. First are the methodological challenges related to the design of relevant models to handle all the data and disease’s characteristics in order to answer the statistical and medical questions. These modelling difficulties cannot be separated from challenges arising from data with very different modalities and time dependencies, in particular involving different acquisition time-sets and different scales of patient screening, resulting on possibly partially missing data [Couronne et al 2019].
Young et al. data [Young et al 2024] recently performed an exhaustive review of data-driven generative models of how neurodegenerative diseases evolve over time. Such models use a generative disease progression model and a set of constraints informed by human insight to infer a data-driven disease time axis and the shape of biomarker trajectories along it.
Within this framework, Sauty et al [Sauty et al 2022] recently investigated a way to model such longitudinal effects directly in the medical images by training a linear mixed effect model in the latent representation space of a longitudinal variational autoencoder. This design enables to combine the robustness of mixed-effects modelling of clinical biomarkers progression with missing data with that of autoencoders both to learn efficient and compact representations of 3D images and reconstruct the image from the latent variable. This model was shown to successfully model 3D T1w MRI normal brain aging and disease progression in Alzheimer patients.
In the same line, Puglisi et al. [Puglisi et al 2024] also recently tackled the issue of progression modelling on medical images by introducing a novel spatiotemporal model that combines a latent diffusion model (LDM) with a ControlNet to generate individualized brain MRIs conditioned on subject-specific data. Similarly to Sauty et al, this model was shown to successfully model healthy and Alzheimer patients’ brains.
Targeted applications:
This project will evaluate the relevance of such methods for brain and cardiac imaging applications.
For brain imaging, anomalies will consist of markers of neurodegenerative diseases, mainly Alzheimer disease to be identified from magnetic resonance (MR) brain scans. In both cases, we will focus on newly diagnosed patients, where the detection task is all the more challenging as abnormalities may be subtle and hardly visible in structural magnetic resonance (MR) brain scans.
For cardiac imaging, we target the detection of myocardial shape and/or deformation abnormalities during a stress protocol (e.g. during effort) from 2D echocardiographic sequences. This may be critical for trained athletes, who have an increased risk of arrhythmias, for which geometrical and functional remodeling may be the underlying substrate [Gabrielli et al. 2016]
Directions of research:
Review the state-of-the art in the domain of deep generative progression models, e.g. based on the review by Young et al.
Select and implement some promising models of the literature and try to replicate reported performance on T1w MRI of the public ADNI database.
Evaluate the performance of unsupervised anomaly detection: learn a model of normal aging evolution on control subject population, infer normal progression of ADNI patient, and derive anomaly maps by comparing pseudo-normative and patient MR exam at the same time point.
Compare with standard UAD based on reconstruction error or support estimation of the normative distribution
Transfer to the study of shape and functional remodelling on 2D echocardiographic data.
Skills and working environment
The selected candidate will be co-supervised by C. Lartizien (CREATIS CNRS Senior Researcher) and N. Duchateau (CREATIS Associate Professor)
The candidate should have a background either in machine learning and/or deep learning or image processing, as well as good programming skills. He/she will have access to computing resources (CREATIS and/or CNRS supercomputer) as well as to public brain datasets (ADNI) and to a private cardiac databases. The candidate will benefit from the stimulating research environment of the MYRIAD team working in the field of deep machine learning for medical image analysis.
Application
Interested applicants are required to send a cover letter, CV and any other relevant documents (reference letter, recent transcripts of marks, ...) to: carole.lartizien@creatis.insa-lyon.fr and Nicolas.duchateau@creatis.insa-lyon.fr