[Paper Review] Deep multimodality-disentangled association analysis network for imaging genetics in neurodegenerative diseases

Posted Dec 4, 2025

By Jaehyuk Lee

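This paper performs representation imputation with an adversarial autoencoder (AAE). It studies two neurodegenerative diseases, Alzheimer's disease (AD) and Parkinson's disease (PD), and carries out the imputation using metadata and SNP data.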

In clinical practice most samples come without SNP data, so real-world applicability seems limited.

Introduction

Image data

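sMRI is effective for capturing structural changes in the brain
PET is effective for detecting amyloid beta and tau (AD)
DTI is effective for capturing white matter changes, which in PD relate to cognition, gait, and posture
Previous studies used feature extraction based on imaging-derived phenotypes (IDPs) or ROIs
- IDP extraction has a high preprocessing cost
- ROI-based studies are the majority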

Genetic data

→ Imaging and genetic data are used together as the multimodal input

Challenges

MLMM (Multimodal Learning with Modality Missing)
Common and complementary information in multimodal data
→ Finding modality-shared and modality-specific biomarkers is the core task of multimodal imaging genetics
Complexity of the relationship between imaging and genetic data
- multi-genetic, multi-imaging
- correlation among genetic data, correlation among imaging data

Proposal of DMAAN

Contribution

Encoder
- Each modality's data {x_i}_{i=1,…,M} is fed to its encoder E^{Img}_i, which produces the latent imaging representations {v_i}_{i=1,…,M}
- v_i = E^{Img}_i(x_i)
Discriminator
- Adversarial learning & Discriminator learning
- The discriminator forces the representations to approximate a prior distribution (Gaussian)
- The discriminator is an MLP
- Its parameters are shared across modalities
- It judges whether v_i follows the prior distribution
Disentangle layer
- After adversarial learning, an FC layer splits each representation into a common and a specific representation
- The fully connected layer is the layer that performs the disentanglement
- Trained so that the L2 distance between the common and specific representations increases
Decoder
- Reconstructs the image from the common representations v^c_j of all modalities in the sample and the specific representation v^s_i of the current modality
- The common representation of each modality is paired with the current specific representation as the reconstruction input
  → with 2 modalities, reconstruction runs twice
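
To make the decoder pairing and the disentangle objective concrete, here is a minimal sketch in plain Python. It is my reading of the notes above, not the paper's implementation: combining the two representations by concatenation, using the negative mean L2 distance as the loss term, and the names `decoder_inputs` and `l2_distance` are all assumptions.

```python
import math

def l2_distance(a, b):
    """Euclidean (L2) distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def decoder_inputs(common, specific):
    """For each target modality i, pair its specific representation v^s_i
    with the common representation v^c_j of every modality j.
    Concatenation is an assumption; the notes do not say how they are combined."""
    pairs = {}
    for i, v_s in enumerate(specific):
        pairs[i] = [(j, v_c + v_s) for j, v_c in enumerate(common)]
    return pairs

# Toy representations for M = 2 modalities (values are made up).
common = [[0.1, 0.2], [0.3, 0.1]]     # v^c_1, v^c_2
specific = [[1.0, -1.0], [0.5, 2.0]]  # v^s_1, v^s_2

pairs = decoder_inputs(common, specific)

# Assumed loss term: minimizing the negative mean distance pushes the
# common and specific representations of each modality apart.
separation = [l2_distance(c, s) for c, s in zip(common, specific)]
disentangle_loss = -sum(separation) / len(separation)
```

With two modalities, `pairs[i]` holds two decoder inputs for each modality i, which is how I read the "reconstruction runs twice" remark.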

Consists of an AAE and two association networks (one network per imaging modality)

Adversarial autoencoder, AAE
- Generates a genetic latent representation constrained to the prior distribution
- adversarial learning, gene representation reconstruction

Association network

Maps the genetic representation to the imaging representation

⚠️ Mapping?

  It means the network is trained to output a representation similar to the latent representation of the imaging data (this is the objective)

  _**→ It does not mean applying some operation to the image representation**_
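
A small sketch of what "mapping" means here: the imaging representation appears only as the training target, never as an input. The notes only say the output should be "similar", so the mean squared error below is an assumed choice, and the linear `association_network` with hand-picked weights is a hypothetical stand-in for the real network.

```python
def association_network(genetic_repr, weights):
    """Hypothetical one-layer linear map from the genetic latent space
    to the imaging latent space (stand-in for the real network)."""
    return [sum(w * g for w, g in zip(row, genetic_repr)) for row in weights]

def similarity_loss(predicted, target):
    """Mean squared error; an assumed similarity measure."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)

genetic_repr = [1.0, 2.0]              # latent from the genetic AAE (toy values)
imaging_repr = [0.5, 1.0]              # latent from the imaging encoder (toy values)
weights = [[0.5, 0.0], [0.0, 0.25]]    # toy parameters

predicted = association_network(genetic_repr, weights)  # uses genetic input only
loss = similarity_loss(predicted, imaging_repr)         # imaging repr is the target
```

Training would adjust `weights` to shrink `loss`, so that at inference time the network can produce an imaging-like representation from genetics alone.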

Flow

Feature embedding

💡 **e.g.**

 Suppose that, in the training set, the probabilities of dosage 0/1/2 at one SNP locus are 0.1/0.7/0.2, respectively

 → a sample whose dosage is 1 is embedded as 0.7
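
The example above can be sketched in a few lines of Python. The function names `fit_dosage_frequencies` and `embed_dosage` are hypothetical, and the toy training column is constructed to reproduce the 0.1/0.7/0.2 frequencies; the paper may compute or smooth the frequencies differently.

```python
from collections import Counter

def fit_dosage_frequencies(train_dosages):
    """Estimate, for one SNP locus, how often each dosage (0/1/2)
    occurs in the training set."""
    counts = Counter(train_dosages)
    total = len(train_dosages)
    return {dosage: counts.get(dosage, 0) / total for dosage in (0, 1, 2)}

def embed_dosage(dosage, freqs):
    """Replace a dosage value with its training-set frequency."""
    return freqs[dosage]

# Toy training column for one locus: 10 samples with frequencies 0.1/0.7/0.2.
train_dosages = [0] * 1 + [1] * 7 + [2] * 2
freqs = fit_dosage_frequencies(train_dosages)

embedded = embed_dosage(1, freqs)  # a sample with dosage 1
```

Here `embedded` is 0.7, matching the example. Fitting the frequencies on the training set only keeps test-set information out of the embedding.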

Adversarial learning
- Adversarial learning follows the same procedure as in the multimodality-disentangled module
- The genetic AAE is a standard AAE with no disentangle layer
Association Network
- Input: latent representation + age, sex, years of education
- Each association network generates a representation similar to the imaging representation → used when a modality is missing
- Generates the mask (attention weight) used in the diagnosis module
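
A rough sketch of how the association network's input and its role under a missing modality could look. Everything here is an assumption made for illustration: the "+" is read as concatenation, sex is encoded as a number, and the helper names are hypothetical. The mask generation is left out because the notes do not describe it.

```python
def build_association_input(genetic_latent, age, sex, education_years):
    """Concatenate the genetic latent representation with the metadata.
    Concatenation and the numeric encoding of sex are assumptions."""
    return list(genetic_latent) + [float(age), float(sex), float(education_years)]

def select_imaging_representation(encoded, imputed):
    """Use the imaging encoder output when the modality was acquired;
    otherwise fall back to the association network's output."""
    return imputed if encoded is None else encoded

net_input = build_association_input([0.2, -0.4], age=71, sex=1, education_years=16)

# Modality present: keep the encoder output. Modality missing: use the imputed one.
present = select_imaging_representation([0.9, 0.1], imputed=[0.8, 0.2])
missing = select_imaging_representation(None, imputed=[0.8, 0.2])
```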