A multimodal medical image contrastive learning algorithm with domain adaptive denormalization.
10.7507/1001-5515.202302050
- Author:
Han WEN
1
;
Ying ZHAO
2
;
Xiuding CAI
1
;
Ailian LIU
2
;
Yu YAO
1
;
Zhongliang FU
1
Author Information
1. Chengdu Institute of Computer Applications, Chinese Academy of Sciences, Chengdu 610213, P. R. China.
2. The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning 116011, P. R. China.
- Publication Type:Journal Article
- Keywords:
Disease diagnosis;
Domain adaptive denormalization;
Multimodal medical image;
Self-supervised learning
- MeSH:
Humans;
Algorithms;
Brain/diagnostic imaging*;
Brain Neoplasms/diagnostic imaging*;
Recognition, Psychology
- From:
Journal of Biomedical Engineering
2023;40(3):482-491
- CountryChina
- Language:Chinese
-
Abstract:
Recently, deep learning has achieved impressive results in medical image tasks. However, this method usually requires large-scale annotated data, and medical images are expensive to annotate, so it is a challenge to learn efficiently from the limited annotated data. Currently, the two commonly used methods are transfer learning and self-supervised learning. However, these two methods have been little studied in multimodal medical images, so this study proposes a contrastive learning method for multimodal medical images. The method takes images of different modalities of the same patient as positive samples, which effectively increases the number of positive samples in the training process and helps the model to fully learn the similarities and differences of lesions on images of different modalities, thus improving the model's understanding of medical images and diagnostic accuracy. The commonly used data augmentation methods are not suitable for multimodal images, so this paper proposes a domain adaptive denormalization method to transform the source domain images with the help of statistical information of the target domain. In this study, the method is validated with two different multimodal medical image classification tasks: in the microvascular infiltration recognition task, the method achieves an accuracy of (74.79 ± 0.74)% and an F1 score of (78.37 ± 1.94)%, which are improved as compared with other conventional learning methods; for the brain tumor pathology grading task, the method also achieves significant improvements. The results show that the method achieves good results on multimodal medical images and can provide a reference solution for pre-training multimodal medical images.