GW Authored Works

DXM-TransFuse U-net: Dual cross-modal transformer fusion U-net for automated nerve identification

Baijun Xie, Department of Biomedical Engineering, School of Engineering and Applied Science, The George Washington University, 800 22nd Street NW, Science & Engineering Hall Suite 5000, Washington 20052, DC, USA.
Gary Milam, Department of Biomedical Engineering, School of Engineering and Applied Science, The George Washington University, 800 22nd Street NW, Science & Engineering Hall Suite 5000, Washington 20052, DC, USA.
Bo Ning, Children's National Hospital, 111 Michigan Avenue NW, Washington 20010, DC, USA.
Jaepyeong Cha, Children's National Hospital, 111 Michigan Avenue NW, Washington 20010, DC, USA.
Chung Hyuk Park, Department of Biomedical Engineering, School of Engineering and Applied Science, The George Washington University, 800 22nd Street NW, Science & Engineering Hall Suite 5000, Washington 20052, DC, USA. Electronic address: chpark@gwu.edu.

Document Type

Journal Article

Publication Date

6-10-2022

Journal

Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society

Volume

DOI

10.1016/j.compmedimag.2022.102090

Keywords

Deep learning; Image segmentation; Medical imaging; Multi-modal fusion

Abstract

Accurate nerve identification is critical during surgical procedures to prevent damage to nerve tissues. Nerve injury can cause long-term adverse effects for patients, as well as financial overburden. Birefringence imaging is a noninvasive technique derived from polarized images that have successfully identified nerves that can assist during intraoperative surgery. Furthermore, birefringence images can be processed under 20 ms with a GPGPU implementation, making it a viable image modality option for real-time processing. In this study, we first comprehensively investigate the usage of birefringence images combined with deep learning, which can automatically detect nerves with gains upwards of 14% over its color image-based (RGB) counterparts on the F2 score. Additionally, we develop a deep learning network framework using the U-Net architecture with a Transformer based fusion module at the bottleneck that leverages both birefringence and RGB modalities. The dual-modality framework achieves 76.12 on the F2 score, a gain of 19.6 % over single-modality networks using only RGB images. By leveraging and extracting the feature maps of each modality independently and using each modality's information for cross-modal interactions, we aim to provide a solution that would further increase the effectiveness of imaging systems for enabling noninvasive intraoperative nerve identification.

APA Citation

Xie, Baijun; Milam, Gary; Ning, Bo; Cha, Jaepyeong; and Park, Chung Hyuk, "DXM-TransFuse U-net: Dual cross-modal transformer fusion U-net for automated nerve identification" (2022). GW Authored Works. Paper 1133.
https://hsrc.himmelfarb.gwu.edu/gwhpubs/1133

Department

Pediatrics

Link to Full Text

COinS

See more details

GW Authored Works

DXM-TransFuse U-net: Dual cross-modal transformer fusion U-net for automated nerve identification

Document Type

Publication Date

Journal

Volume

DOI

Keywords

Abstract

APA Citation

Department

Search

Browse

Author Corner

Links

GW Authored Works

DXM-TransFuse U-net: Dual cross-modal transformer fusion U-net for automated nerve identification

Authors

Document Type

Publication Date

Journal

Volume

DOI

Keywords

Abstract

APA Citation

Department

Share

Search

Browse

Author Corner

Links