Heliyon 2024
Fig. 1. The proposed Usformer is a single-stage 3D method: it captures the inter-slice correlation that 2D methods miss and avoids the error propagation introduced by two-stage methods.
Abstract: Left atrial (LA) fibrosis significantly influences the progression of atrial fibrillation, and 3D late gadolinium-enhancement (LGE) MRI is a proven method for identifying LA fibrosis. However, manual segmentation of the LA wall from 3D LGE MRI is time-consuming and difficult. Automated segmentation is also challenging because of varying data intensities, limited contrast between the LA and surrounding tissues, and the complex anatomy of the LA. Traditional 3D network approaches are computationally intensive and often have to fall back on two-stage methods. To address these issues, we propose Usformer, a lightweight, transformer-based 3D architecture for precise, single-stage LA segmentation. Usformer's transposed attention captures global context efficiently, and the model outperforms state-of-the-art methods in both accuracy and speed, with a Dice score of 93.1% on the 2018 Atrial Segmentation Challenge and 92.0% on our local dataset. Usformer also reduces parameter count and computational complexity by factors of 2.8 and 3.8, respectively, and achieves a 92.1% Dice score using only 16 labeled MRI scans. This method may enhance the clinical translation of LA LGE for catheter ablation planning in atrial fibrillation.
Fig. 2. The architecture of Usformer, designed for end-to-end left atrium segmentation from 3D LGE MRIs. In the final two stages, the U-Net architecture integrates transformer blocks, represented by the orange boxes. Each transformer block includes a transposed attention module (shown in Fig. 3) and a feed-forward network made up of fully connected layers. H × W × Z denotes the size of a 3D LGE scan. All feature maps are 3D volumes rather than 2D images. See Section 2 for further details on Usformer.
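To make the transposed attention module named in the caption above more concrete, here is a minimal NumPy sketch. It assumes that "transposed attention" means the channel-wise formulation popularized by Restormer, where the attention map is computed between channels rather than between voxels. The text here does not spell out Usformer's exact module, so the function names, the L2 normalization, and the `temperature` parameter are illustrative assumptions, not the authors' implementation. Under this assumption, a feature map with C channels and N = H·W·Z voxels yields a C × C attention map instead of an N × N one, which is what keeps global context affordable on 3D volumes.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def transposed_attention(q, k, v, temperature=1.0):
    """Channel-wise ("transposed") attention sketch.

    q, k, v: arrays of shape (C, N), where N = H * W * Z is the number of
    flattened voxels. Returns an array of shape (C, N).
    Assumption: Restormer-style formulation; Usformer's details may differ.
    """
    # L2-normalize each channel over the voxel axis (assumed, as in Restormer).
    q = q / (np.linalg.norm(q, axis=1, keepdims=True) + 1e-8)
    k = k / (np.linalg.norm(k, axis=1, keepdims=True) + 1e-8)
    # Attention between channels: (C, N) @ (N, C) -> (C, C), not (N, N).
    attn = softmax((q @ k.T) * temperature, axis=-1)
    # Each output channel is a weighted mix of the value channels.
    return attn @ v


# Hypothetical sizes for illustration only.
C, H, W, Z = 4, 2, 3, 5
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((C, H * W * Z)) for _ in range(3))
out = transposed_attention(q, k, v)
print(out.shape)
```

The attention map here has C² entries regardless of the scan size, whereas voxel-wise self-attention would need (H·W·Z)² entries.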
Fig. 6. Results of LA segmentation in the axial view by Usformer, nnU-Net [10], UNeXt [36], and TMS-Net [35]. Cases are randomly selected from the challenge and NU datasets. Each visualization includes the 2D Dice score in the top left corner. Red and green delineate the contours of the manual and predicted segmentations, respectively. Arrows highlight regions where Usformer performs notably better than the baselines. Viewing this figure in color is advised for the printed edition.
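The Dice scores quoted in the caption above and in the abstract follow the standard overlap definition, Dice = 2|A ∩ B| / (|A| + |B|), for a predicted mask A and a manual mask B. The sketch below shows one way to compute it for a whole 3D volume and per 2D slice. The function names, the choice of slice axis, and the convention of returning 1.0 when both masks are empty are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np


def dice_score(pred, target):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two binary masks of equal shape."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    if pred.shape != target.shape:
        raise ValueError("masks must have the same shape")
    intersection = np.logical_and(pred, target).sum()
    total = pred.sum() + target.sum()
    if total == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return float(2.0 * intersection / total)


def slicewise_dice(pred_vol, target_vol, axis=2):
    """2D Dice for every slice taken along `axis` of a 3D volume."""
    pred_slices = np.moveaxis(np.asarray(pred_vol), axis, 0)
    target_slices = np.moveaxis(np.asarray(target_vol), axis, 0)
    return [dice_score(p, t) for p, t in zip(pred_slices, target_slices)]


# Toy 4 x 4 x 2 volume: the prediction covers half of the manual mask
# in slice 0, and slice 1 is empty in both masks.
pred = np.zeros((4, 4, 2), dtype=bool)
target = np.zeros((4, 4, 2), dtype=bool)
pred[:2, :2, 0] = True   # 4 voxels
target[:2, :, 0] = True  # 8 voxels, 4 of them shared with pred

volume_dice = dice_score(pred, target)        # 2 * 4 / (4 + 8)
per_slice_dice = slicewise_dice(pred, target)
print(volume_dice, per_slice_dice)
```

A per-slice score like the one shown in each panel can differ noticeably from the volume-level score, since slices with small LA cross-sections are penalized more heavily by a few misplaced voxels.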