Search Results

1.Cross-modal hash retrieval of medical images based on Transformer semantic alignment.

Qianlin WU ; Lun TANG ; Qinghai LIU ; Liming XU ; Qianbin CHEN

Journal of Biomedical Engineering 2025;42(1):156-163

Medical cross-modal retrieval aims to achieve semantic similarity search between different modalities of medical cases, such as quickly locating relevant ultrasound images through ultrasound reports, or using ultrasound images to retrieve matching reports. However, existing medical cross-modal hash retrieval methods face significant challenges, including semantic and visual differences between modalities and the scalability issues of hash algorithms in handling large-scale data. To address these challenges, this paper proposes a Medical image Semantic Alignment Cross-modal Hashing based on Transformer (MSACH). The algorithm employed a segmented training strategy, combining modality feature extraction and hash function learning, effectively extracting low-dimensional features containing important semantic information. A Transformer encoder was used for cross-modal semantic learning. By introducing manifold similarity constraints, balance constraints, and a linear classification network constraint, the algorithm enhanced the discriminability of the hash codes. Experimental results demonstrated that the MSACH algorithm improved the mean average precision (MAP) by 11.8% and 12.8% on two datasets compared to traditional methods. The algorithm exhibits outstanding performance in enhancing retrieval accuracy and handling large-scale medical data, showing promising potential for practical applications.
Algorithms ; Semantics ; Humans ; Ultrasonography ; Information Storage and Retrieval/methods* ; Image Processing, Computer-Assisted/methods*
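
The retrieval and evaluation steps named in the abstract above (binary hash codes, mean average precision) can be illustrated with a minimal sketch. It is not the MSACH algorithm: it assumes the hash codes have already been learned, ranks database items by Hamming distance to a query code, and computes average precision for one query. The function names and the toy codes are hypothetical.

```python
from typing import List, Sequence


def hamming_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Count the bit positions where two equal-length binary codes differ."""
    if len(a) != len(b):
        raise ValueError("hash codes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def rank_by_hamming(query: Sequence[int], database: List[Sequence[int]]) -> List[int]:
    """Return database indices sorted by increasing Hamming distance to the query.

    Python's sort is stable, so ties keep their original database order.
    """
    return sorted(range(len(database)),
                  key=lambda i: hamming_distance(query, database[i]))


def average_precision(ranked_relevance: Sequence[bool]) -> float:
    """Average precision for one query, given relevance flags in ranked order."""
    hits = 0
    precision_sum = 0.0
    for rank, relevant in enumerate(ranked_relevance, start=1):
        if relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this relevant hit
    return precision_sum / hits if hits else 0.0


# Toy data (assumed, for illustration only): a 4-bit "report" query code
# and four "image" codes; items 0 and 3 share the query's label.
query_code = [1, 0, 1, 1]
image_codes = [[1, 0, 1, 1], [0, 1, 0, 0], [1, 0, 1, 0], [0, 0, 1, 1]]
relevant_items = {0, 3}

ranking = rank_by_hamming(query_code, image_codes)
ap = average_precision([i in relevant_items for i in ranking])
```

MAP is then the mean of `average_precision` over all queries; how the papers truncate the ranked list (for example, MAP at a fixed cutoff) is not stated in the abstract.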

2.Medical text classification model integrating medical entity label semantics.

Li WEI ; Dechun ZHAO ; Lu QIN ; Yanghuazi LIU ; Yuchen SHEN ; Changrong YE

Journal of Biomedical Engineering 2025;42(2):326-333

Automatic classification of medical questions is of great significance in improving the quality and efficiency of online medical services, and belongs to the task of intent recognition. Joint entity recognition and intent recognition models perform better than single-task models. Currently, most publicly available medical text intent recognition datasets lack entity annotation, and manual annotation of these entities requires a lot of time and manpower. To solve this problem, this paper proposes a medical text classification model, bidirectional encoder representation based on transformer-recurrent convolutional neural network-entity-label-semantics (BRELS), which integrates medical entity label semantics. The model first uses an adaptive fusion mechanism to absorb prior knowledge of medical entity labels, achieving local feature enhancement. Then, in global feature extraction, a lightweight recurrent convolutional neural network (LRCNN) is used to suppress parameter growth while preserving the original semantics of the text. Ablation and comparison experiments are conducted on three public medical text intent recognition datasets to validate the performance of the model. The F1 score reaches 87.34%, 81.71%, and 77.74% on the three datasets, respectively. These results indicate that the BRELS model can effectively identify and understand medical terminology and thereby recognize users' intentions, which can improve the quality and efficiency of online medical services.
Semantics ; Neural Networks, Computer ; Humans ; Natural Language Processing

3.Cross modal medical image online hash retrieval based on online semantic similarity.

Qinghai LIU ; Lun TANG ; Qianlin WU ; Liming XU ; Qianbin CHEN

Journal of Biomedical Engineering 2025;42(2):343-350

Online hashing methods are receiving increasing attention in cross-modal medical image retrieval research. However, existing online methods often lack the learning ability to maintain semantic correlation between new and existing data. To this end, we proposed an online semantic similarity cross-modal hashing (OSCMH) learning framework to incrementally learn compact binary hash codes of medical stream data. Within it, a sparse representation of existing data based on online anchor datasets was designed to avoid semantic forgetting of the data and adaptively update hash codes, which effectively maintained semantic correlation between existing and arriving data, reduced information loss, and improved training efficiency. In addition, an online discrete optimization method was proposed to solve the binary optimization problem of the hash codes by incrementally updating the hash function and optimizing the hash codes on medical stream data. Compared with existing online or offline hashing methods, the proposed algorithm achieved average retrieval accuracy improvements of 12.5% and 14.3% on two datasets, respectively, effectively enhancing the retrieval efficiency in the field of medical images.
Semantics ; Humans ; Algorithms ; Information Storage and Retrieval/methods* ; Diagnostic Imaging ; Image Processing, Computer-Assisted/methods*

4.A heterogeneous graph method integrating multi-layer semantics and topological information for improving drug-target interaction prediction.

Zihao CHEN ; Yanbu GUO ; Shengli SONG ; Quanming GUO ; Dongming ZHOU

Journal of Southern Medical University 2025;45(11):2394-2404

OBJECTIVES: To develop a heterogeneous graph prediction method based on the fusion of multi-layer semantics and topological information for addressing the challenges in drug-target interaction prediction, including insufficient modeling of high-order semantic dependencies, lack of adaptive fusion of semantic paths, and over-smoothing of node features. METHODS: A heterogeneous graph network with multiple types of entities such as drugs, proteins, side effects, and diseases was constructed, and graph embedding techniques were used to obtain low-dimensional feature representations. An adaptive metapath search module was introduced to automatically discover semantic path combinations for guiding the propagation of high-order semantic information. A semantic aggregation mechanism integrating multi-head attention was designed to automatically learn the importance of each semantic path based on contextual information and achieve differentiated aggregation and dynamic fusion among paths. A structure-aware gated graph convolutional module was then incorporated to regulate the feature propagation intensity for suppressing redundant information and reducing over-smoothing. Finally, the potential interactions between drugs and targets were predicted through an inner product operation. RESULTS: Compared with existing drug-target interaction prediction methods, the proposed method achieved an average improvement of 3.4% and 2.4%, 3.0% and 3.8% in terms of the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (AUPRC) on public datasets, respectively. CONCLUSIONS: The drug-target interaction prediction method developed in this study can effectively extract complex high-order semantic and topological information from heterogeneous biological networks, thereby improving the accuracy and stability of drug-target interaction prediction. This method provides technical support and theoretical foundation for precise drug target discovery and targeted treatment of complex diseases.
Semantics ; Humans ; Drug Interactions ; Neural Networks, Computer ; Algorithms
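
The final prediction step described in the abstract above, scoring a drug-target pair through an inner product of their learned embeddings, can be sketched as follows. The abstract does not say how the inner product is turned into a prediction; squashing it with a sigmoid is an assumption made here for illustration, and the function names and embedding values are hypothetical.

```python
import math
from typing import Sequence


def inner_product(u: Sequence[float], v: Sequence[float]) -> float:
    """Dot product of two equal-length embedding vectors."""
    if len(u) != len(v):
        raise ValueError("embeddings must have the same dimension")
    return sum(a * b for a, b in zip(u, v))


def interaction_score(drug_emb: Sequence[float], target_emb: Sequence[float]) -> float:
    """Map the inner product to (0, 1) with a sigmoid (an assumption, not
    stated in the abstract). The two branches avoid overflow in math.exp."""
    z = inner_product(drug_emb, target_emb)
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


# Hypothetical 3-dimensional embeddings for one drug and two candidate targets.
drug = [0.5, -1.0, 2.0]
target_a = [1.0, 0.0, 1.0]   # inner product  2.5 -> score above 0.5
target_b = [-1.0, 1.0, 0.0]  # inner product -1.5 -> score below 0.5

score_a = interaction_score(drug, target_a)
score_b = interaction_score(drug, target_b)
```

In the method itself the embeddings would come from the metapath aggregation and gated graph convolution stages; here they are fixed numbers so that only the scoring step is shown.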

5.Colorectal polyp segmentation method based on fusion of transformer and cross-level phase awareness.

Liming LIANG ; Anjun HE ; Chenkun ZHU ; Xiaoqi SHENG

Journal of Biomedical Engineering 2023;40(2):234-243

In order to address the issues of spatial inductive bias and lack of effective representation of global contextual information in colon polyp image segmentation, which lead to the loss of edge details and mis-segmentation of lesion areas, a colon polyp segmentation method that combines Transformer and cross-level phase-awareness is proposed. First, starting from the perspective of global feature transformation, the method used a hierarchical Transformer encoder to extract semantic information and spatial details of lesion areas layer by layer. Secondly, a phase-aware fusion module (PAFM) was designed to capture cross-level interaction information and effectively aggregate multi-scale contextual information. Thirdly, a position oriented functional module (POF) was designed to effectively integrate global and local feature information, fill in semantic gaps, and suppress background noise. Fourthly, a residual axis reverse attention module (RA-IA) was used to improve the network's ability to recognize edge pixels. The proposed method was experimentally tested on public datasets CVC-ClinicDB, Kvasir, CVC-ColonDB, and ETIS, with Dice similarity coefficients of 94.04%, 92.04%, 80.78%, and 76.80%, respectively, and mean intersection over union of 89.31%, 86.81%, 73.55%, and 69.10%, respectively. The simulation experimental results show that the proposed method can effectively segment colon polyp images, providing a new window for the diagnosis of colon polyps.
Humans ; Colonic Polyps/diagnostic imaging* ; Computer Simulation ; Electric Power Supplies ; Semantics ; Image Processing, Computer-Assisted
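
The two segmentation metrics reported in the abstract above can be written down directly. The following is a minimal sketch for a single pair of binary masks, using the standard definitions Dice = 2|P∩T| / (|P| + |T|) and IoU = |P∩T| / |P∪T|. How the paper averages across images to obtain its "mean" values is not stated in the abstract, and the function name, the toy masks, and the convention of returning 1.0 for two empty masks are assumptions.

```python
from typing import List, Sequence, Tuple


def dice_and_iou(pred: Sequence[Sequence[int]],
                 target: Sequence[Sequence[int]]) -> Tuple[float, float]:
    """Dice coefficient and intersection over union for two binary 2-D masks."""
    flat_pred: List[int] = [int(bool(v)) for row in pred for v in row]
    flat_target: List[int] = [int(bool(v)) for row in target for v in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must have the same number of pixels")

    intersection = sum(p & t for p, t in zip(flat_pred, flat_target))
    pred_area = sum(flat_pred)
    target_area = sum(flat_target)
    union = pred_area + target_area - intersection

    # Assumed convention: two empty masks count as a perfect match.
    dice = 2 * intersection / (pred_area + target_area) if pred_area + target_area else 1.0
    iou = intersection / union if union else 1.0
    return dice, iou


# Toy 2x3 masks: 3 predicted pixels, 3 ground-truth pixels, 2 shared.
predicted_mask = [[1, 1, 0],
                  [0, 1, 0]]
ground_truth = [[1, 0, 0],
                [0, 1, 1]]

dice, iou = dice_and_iou(predicted_mask, ground_truth)
```

For a single pair of masks the two metrics are linked by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU, consistent with the ordering of the figures reported above.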

6.Exploration of the construction of semantic framework of meridians and acupoints based on top-level ontology.

Lu FU ; Bao-Jin LI ; Ke-Yu YAO ; Yan ZHU

Chinese Acupuncture & Moxibustion 2022;42(9):1064-1072

Based on the top-level ontology and the existing ontology methodology, the related concepts of meridians and acupoints were discriminated, defined and classified, and the relationships among core concepts, e.g. meridians, acupoints and zangfu, were established. It was attempted to build an ontological semantic framework of meridians and acupoints. Through the investigation on the classification mode of the top-level ontology, it is proposed that the meridians and acupoints, as the unique concepts of traditional Chinese medicine, exist in the form of "emptiness" and belong to "immaterial entity". Meridians refer to the three-dimensional channels in the human body, and acupoints are divided into ontological acupoints and body surface ones. Ontological acupoints are regarded as a three-dimensional structure within the human body, whereas body surface ones are the optimal sites for acupuncture needle insertion on the body surface, that is, zero-dimensional points on the body surface. The main relationships between meridians and acupoints include is-a, exterior-interior, located-in, correspondent-to, mapping, etc. The exploration of the semantic framework of meridians and acupoints is conducive to understanding the connotation of meridians, acupoints and their relationship.
Acupuncture ; Acupuncture Points ; Acupuncture Therapy/methods* ; Humans ; Meridians ; Semantics

7.Traditional Chinese Medicine (TCM) Domain Ontology: Current Status and Rethinking for the Future Development.

Yan ZHU ; Ke-Yu YAO ; Su-Yuan PENG ; Xiao-Lin YANG

Chinese Medical Sciences Journal 2022;37(3):228-233

8.Multimodal high-grade glioma semantic segmentation network with multi-scale and multi-attention fusion mechanism.

Yuchao WU ; Lan LIN ; Shuicai WU

Journal of Biomedical Engineering 2022;39(3):433-440

Glioma is a primary brain tumor with a high incidence rate. High-grade gliomas (HGG) are those with the highest degree of malignancy and the lowest survival rate. Surgical resection and postoperative adjuvant chemoradiotherapy are often used in clinical treatment, so accurate segmentation of tumor-related areas is of great significance for the treatment of patients. In order to improve the segmentation accuracy of HGG, this paper proposes a multi-modal glioma semantic segmentation network with multi-scale feature extraction and a multi-attention fusion mechanism. The main contributions are: (1) multi-scale residual structures were used to extract features from multi-modal glioma magnetic resonance imaging (MRI); (2) two types of attention modules were used for feature aggregation along the channel and spatial dimensions; (3) in order to improve the segmentation performance of the whole network, a branch classifier was constructed using an ensemble learning strategy to adjust and correct the classification results of the backbone classifier. The experimental results showed that the Dice coefficient values of the proposed segmentation method were 0.9097, 0.8773 and 0.8396 for whole tumor, tumor core and enhanced tumor, respectively, and the segmentation results had good boundary continuity in the three-dimensional direction. Therefore, the proposed semantic segmentation network has good segmentation performance for high-grade glioma lesions.
Attention ; Glioma/diagnostic imaging* ; Humans ; Magnetic Resonance Imaging/methods* ; Semantics

9.Cross-modal retrieval method for thyroid ultrasound image and text based on generative adversarial network.

Feng XU ; Xiaoping MA ; Libo LIU

Journal of Biomedical Engineering 2020;37(4):641-651

Ultrasonic examination is a common method in thyroid examination, and the results are mainly composed of thyroid ultrasound images and text reports. A cross-modal retrieval method for images and text reports can provide great convenience for doctors and patients, but currently there is no retrieval method to correlate thyroid ultrasound images with text reports. This paper proposes a cross-modal method based on deep learning and an improved cross-modal generative adversarial network: ① the weight sharing constraints between the fully connected layers used to construct the public representation space in the original network are changed to cosine similarity constraints, so that the network can better learn the common representation of different modal data; ② a fully connected layer is added before the cross-modal discriminator to merge the fully connected layers of image and text in the original network with weight sharing. Semantic regularization is realized on the basis of inheriting the advantages of the original network weight sharing. The experimental results show that the mean average precision of the cross-modal retrieval method for thyroid ultrasound images and text reports in this paper can reach 0.508, which is significantly higher than that of the traditional cross-modal method, providing a new method for cross-modal retrieval of thyroid ultrasound images and text reports.
Humans ; Image Processing, Computer-Assisted ; Semantics ; Thyroid Gland

10.Clustering and Switching Patterns in Semantic Fluency and Their Relationship to Working Memory in Mild Cognitive Impairment

Se Jin OH ; Jee Eun SUNG ; Su Jin CHOI ; Jee Hyang JEONG

Dementia and Neurocognitive Disorders 2019;18(2):47-61

BACKGROUND AND PURPOSE: Semantic verbal fluency test is a neuropsychological assessment that can sensitively detect neuropathological changes. Considering its multifactorial features tapping various cognitive domains such as semantic memory, executive function, and working memory, it is necessary to examine verbal fluency performance in association with underlying cognitive functions. The objective of the current study was to investigate semantic fluency patterns of people with mild cognitive impairment (MCI) based on clustering and switching and their relationship with working memory. METHODS: Twenty-six individuals with MCI and 23 normal elderly adults participated in this study. A semantic verbal fluency test (animal version) was administered and the performance was analyzed using the following measures: number of correct words, cluster size, and number of switches. Scores of digit forward (DF) and backward span tasks were employed as working memory measures. RESULTS: Analyses of variance revealed significant group differences in the numbers of correct words and switches. Multivariate logistic regression and receiver-operating characteristic analyses showed that the number of switches more sensitively distinguished MCI existence than the number of correct words. Stepwise linear regression analysis showed that DF task and age significantly predicted the number of correct words while only the DF task significantly predicted the number of switches. CONCLUSIONS: Decrement in semantic verbal fluency in MCI seems to be associated with impaired switching abilities. Working memory capacity might serve as the underlying cognitive factor related to decreased verbal fluency in MCI.
Adult ; Aged ; Cluster Analysis ; Cognition ; Executive Function ; Humans ; Linear Models ; Logistic Models ; Memory ; Memory, Short-Term ; Mild Cognitive Impairment ; Semantics