TY  - JOUR
AU  - Yan, Pingkun
AB  - Chest X-rays (CXRs) are the most widely used medical imaging modality and play a pivotal role in diagnosing diseases. However, as 2D projection images, CXRs are limited by structural superposition, which constrains their effectiveness in precise disease diagnosis and risk prediction. To address the limitations of 2D CXRs, this study introduces Xray2Xray, a novel World Model that learns latent representations encoding 3D structural information from chest X-rays. Xray2Xray captures the latent representations of the chest volume by modeling the transition dynamics of X-ray projections across different angular positions with a vision model and a transition model. We employed the latent representations of Xray2Xray for downstream risk prediction and disease diagnosis tasks. Experimental results showed that Xray2Xray outperformed both supervised methods and self-supervised pretraining methods for cardiovascular disease risk estimation and achieved competitive performance in classifying five pathologies in CXRs. We also assessed the quality of Xray2Xray's latent representations through synthesis tasks and demonstrated that the latent representations can be used to reconstruct volumetric context.
TI  - Xray2Xray: World Model from Chest X-rays with Volumetric Context
JF  - Electrical Engineering and Systems Science
DO  - 10.48550/arxiv.2506.19055
DA  - 2025-06-17
UR  - https://www.deepdyve.com/lp/arxiv-cornell-university/xray2xray-world-model-from-chest-x-rays-with-volumetric-context-00STOR1E4z
VL  - 2025
IS  - 2506
DP  - DeepDyve
ER  - 