1.A review of deep learning methods for non-contact heart rate measurement based on facial videos.
Shuyue GUAN ; Yimou LYU ; Yongchun LI ; Chengzhi XIA ; Lin QI ; Lisheng XU
Journal of Biomedical Engineering 2025;42(1):197-204
Heart rate is a crucial indicator of human health with significant physiological importance. Traditional contact methods for measuring heart rate, such as electrocardiograph or wristbands, may not always meet the need for convenient health monitoring. Remote photoplethysmography (rPPG) provides a non-contact method for measuring heart rate and other physiological indicators by analyzing blood volume pulse signals. This approach is non-invasive, does not require direct contact, and allows for long-term healthcare monitoring. Deep learning has emerged as a powerful tool for processing complex image and video data, and has been increasingly employed to extract heart rate signals remotely. This article reviewed the latest research advancements in rPPG-based heart rate measurement using deep learning, summarized available public datasets, and explored future research directions and potential advancements in non-contact heart rate measurement.
Humans
;
Deep Learning
;
Heart Rate/physiology*
;
Photoplethysmography/methods*
;
Video Recording
;
Face
;
Monitoring, Physiologic/methods*
;
Signal Processing, Computer-Assisted
2.Small bowel video keyframe retrieval based on multi-modal contrastive learning.
Xing WU ; Guoyin YANG ; Jingwen LI ; Jian ZHANG ; Qun SUN ; Xianhua HAN ; Quan QIAN ; Yanwei CHEN
Journal of Biomedical Engineering 2025;42(2):334-342
Retrieving keyframes most relevant to text from small intestine videos with given labels can efficiently and accurately locate pathological regions. However, training directly on raw video data is extremely slow, while learning visual representations from image-text datasets leads to computational inconsistency. To tackle this challenge, a small bowel video keyframe retrieval based on multi-modal contrastive learning (KRCL) is proposed. This framework fully utilizes textual information from video category labels to learn video features closely related to text, while modeling temporal information within a pretrained image-text model. It transfers knowledge learned from image-text multimodal models to the video domain, enabling interaction among medical videos, images, and text data. Experimental results on the hyper-spectral and Kvasir dataset for gastrointestinal disease detection (Hyper-Kvasir) and the Microsoft Research video-to-text (MSR-VTT) retrieval dataset demonstrate the effectiveness and robustness of KRCL, with the proposed method achieving state-of-the-art performance across nearly all evaluation metrics.
Humans
;
Video Recording
;
Intestine, Small/diagnostic imaging*
;
Machine Learning
;
Image Processing, Computer-Assisted/methods*
;
Algorithms
3.A multi-feature fusion-based model for fetal orientation classification from intrapartum ultrasound videos.
Ziyu ZHENG ; Xiaying YANG ; Shengjie WU ; Shijie ZHANG ; Guorong LYU ; Peizhong LIU ; Jun WANG ; Shaozheng HE
Journal of Southern Medical University 2025;45(7):1563-1570
OBJECTIVES:
To construct an intelligent analysis model for classifying fetal orientation during intrapartum ultrasound videos based on multi-feature fusion.
METHODS:
The proposed model consists of the Input, Backbone Network and Classification Head modules. The Input module carries out data augmentation to improve the sample quality and generalization ability of the model. The Backbone Network was responsible for feature extraction based on Yolov8 combined with CBAM, ECA, PSA attention mechanism and AIFI feature interaction module. The Classification Head consists of a convolutional layer and a softmax function to output the final probability value of each class. The images of the key structures (the eyes, face, head, thalamus, and spine) were annotated with frames by physicians for model training to improve the classification accuracy of the anterior occipital, posterior occipital, and transverse occipital orientations.
RESULTS:
The experimental results showed that the proposed model had excellent performance in the tire orientation classification task with the classification accuracy reaching 0.984, an area under the PR curve (average accuracy) of 0.993, and area under the ROC curve of 0.984, and a kappa consistency test score of 0.974. The prediction results by the deep learning model were highly consistent with the actual classification results.
CONCLUSIONS
The multi-feature fusion model proposed in this study can efficiently and accurately classify fetal orientation in intrapartum ultrasound videos.
Humans
;
Female
;
Ultrasonography, Prenatal/methods*
;
Pregnancy
;
Fetus/diagnostic imaging*
;
Neural Networks, Computer
;
Video Recording
4.A Method for Detecting Depression in Adolescence Based on an Affective Brain-Computer Interface and Resting-State Electroencephalogram Signals.
Zijing GUAN ; Xiaofei ZHANG ; Weichen HUANG ; Kendi LI ; Di CHEN ; Weiming LI ; Jiaqi SUN ; Lei CHEN ; Yimiao MAO ; Huijun SUN ; Xiongzi TANG ; Liping CAO ; Yuanqing LI
Neuroscience Bulletin 2025;41(3):434-448
Depression is increasingly prevalent among adolescents and can profoundly impact their lives. However, the early detection of depression is often hindered by the time-consuming diagnostic process and the absence of objective biomarkers. In this study, we propose a novel approach for depression detection based on an affective brain-computer interface (aBCI) and the resting-state electroencephalogram (EEG). By fusing EEG features associated with both emotional and resting states, our method captures comprehensive depression-related information. The final depression detection model, derived through decision fusion with multiple independent models, further enhances detection efficacy. Our experiments involved 40 adolescents with depression and 40 matched controls. The proposed model achieved an accuracy of 86.54% on cross-validation and 88.20% on the independent test set, demonstrating the efficiency of multimodal fusion. In addition, further analysis revealed distinct brain activity patterns between the two groups across different modalities. These findings hold promise for new directions in depression detection and intervention.
Humans
;
Male
;
Female
;
Adolescent
;
Case-Control Studies
;
Depression/diagnosis*
;
Early Diagnosis
;
Rest
;
Electroencephalography/methods*
;
Brain-Computer Interfaces
;
Models, Psychological
;
Reproducibility of Results
;
Affect/physiology*
;
Photic Stimulation/methods*
;
Video Recording
;
Brain/physiopathology*
5.Analysis of vocal fold movement and voice onset behavior in patients with laryngopharyngeal reflux based on high speed laryngeal high-speed videoendoscopy.
Xinlin XU ; Xueqiong HUANG ; Xiangping LI ; Peiyun ZHUANG
Journal of Clinical Otorhinolaryngology Head and Neck Surgery 2024;38(11):1031-1037
Objective:Patients with Laryngopharyngeal Reflux(LPR) have chronic inflammation of the laryngeal mucosa leading to a high response state in the larynx, which may make the vocal fold movement too fast. This paper discusses the characteristics of vocal fold movement and voice onset by analyzing laryngeal high-speed videoendoscopy in patients with LPR. Methods:Forty patients with LPR were enrolled as LPR group. The diagnostic criteria of LPR included positive reflux symptom index(RSI) and reflux syndrome score(RFS) to identify suspected LPR, objective oropharyngeal DX pH monitoring was carried out, and positive Ryan index indicated reflux. According to age and sex matching, 40 healthy volunteers were selected as the normal group. Laryngeal high-speed videoendoscopy, and the vocal fold motion and vibration parameters, including vocal fold adduction time, vocal fold abduction time, vocal fold vibration onset mode(vocal onset time and mode) and the opening quotient of vocal fold vibration cycle. Statistical analysis was performed using SPSS 25.0. Results:The time of vocal fold adduction in LPR group(mean 225.81ms) was less than that in normal group(mean 277.01 ms), and the difference was statistically significant(P<0.05). There was no significant difference in adduction time between LPR group and normal group(P>0.05). The vocal onset time in LPR group was significantly longer than that in normal group(P<0.05). High speed video endoscope showed that there were 17 patients with hard onset in LPR group and 8 patients with hard onset in normal group, the difference was statistically significant(P<0.05). There was no significant difference in the open quotient of vocal fold vibration between LPR group and normal group(P>0.05). The vocal fold abduction time in LPR group(mean 372.92 ms) was less than that in normal group(mean 426.98ms), but the difference was not statistically significant(P>0.05). The time difference of bilateral abduction of vocal fold in LPR group was significantly higher than that in normal group(P<0.05). Conclusion:The larynx of LPR patients is in a high response state, the vocal fold moves faster, and it is more likely to have a hard vocal onset. These may result in voice dysfunction.
Humans
;
Vocal Cords/physiopathology*
;
Laryngopharyngeal Reflux/diagnosis*
;
Laryngoscopy/methods*
;
Male
;
Video Recording
;
Female
;
Middle Aged
;
Adult
;
Voice/physiology*
;
Case-Control Studies
;
Vibration
6.Utilization and Application of Modified Action Camera in Otorhinolaryngoloic Surgery.
Ho Young BAE ; Hantai KIM ; Jun Young AN ; Jung Jun LEE ; Dong Young KIM ; Do Yang PARK ; Hyun Jun KIM
Korean Journal of Otolaryngology - Head and Neck Surgery 2019;62(1):36-42
BACKGROUND AND OBJECTIVES: In recent years, surgical imaging has become important for legal and educational purposes. Significant improvements can be made from the surgeon's point of view in recording surgical procedures, particularly with respect to the action camera with high-definition video recordings. For otolaryngologic surgery, the surgical view is narrow, and there is a limit to proper imaging using the existing lens of the action camera. Therefore, we aimed to find out if we could obtain surgical images through simple modification of action camera. MATERIALS AND METHOD: The action camera was modified to match the surgical field. We selected a suitable lens for otolaryngology surgery using a calculation formula. The action camera was simply modified according to the design. The modified action camera can be mounted on the surgeon's head or the surgical light. We compared the images taken with the modified action camera and the images taken with the existing camcorder. The modified action camera was able to capture a narrow surgical field for otolaryngologic surgery. RESULTS: Unlike the existing method, we were able to obtain high-quality images using a modified action camera at the first person's viewpoint without auxiliary manpower. The action camera was considerably cost effective compared to other methods of recording surgery. CONCLUSION: The modified action camera allows for high-definition, cost-effective, and firstperson viewpoint for otolaryngologic surgery. The modified action camera allows for detailed videography that can enhance surgical teaching, presentation and patient education materials.
Education
;
Head
;
Methods
;
Otolaryngology
;
Patient Education as Topic
;
Video Recording
7.Efficacy of different tooth-brushing methods for removing dental plaque
Jeongmin KO ; Seon Jip KIM ; Hyun Jae CHO
Journal of Korean Academy of Oral Health 2019;43(3):111-117
OBJECTIVES: This study aimed to measure the efficacy of different tooth-brushing methods for removing plaque in Korea. METHODS: This study was conducted with the approval of the Institutional Review Board (IRB) of the Seoul National University School of Dentistry (S-D20180021). Thirty participants aged between 19 and 30 years, who did not have periodontal disease, were enrolled in this observational study. Participants were given the same type of toothbrush and toothpaste and asked to brush their teeth as they usually would. During brushing, participants were recorded with a camcorder that was attached to a mirror. Participants were aware they were being recorded. After they had finished brushing their teeth, a dental plaque staining and oral plaque index (PI) examination was performed. The PI score was measured using the Turesky modified Quigley Hein Index. Brushing methods were classified as rolling, horizontal, vertical, circling, and oblique. Skipped surfaces were recorded separately. Following this, statistical analysis was performed using SPSS software. RESULTS: Most surfaces of the mouth were skipped. The most commonly used brushing method was the circling method, followed by the vertical, horizontal, rolling, and oblique methods. The most frequently used method on the vestibular surface was circling, with 52.92% of the oral surface skipped. The oblique brushing method had the lowest mean PI score with a mean±SD of 1.73±0.82. The mean PI score of the skipped surfaces was the highest with a mean±SD of 2.52±0.81. We also analyzed the linear mixed model considering the different lengths of time spent brushing. Both the brushing method used and the time spent brushing had a significant effect on the PI score, but no interactions between these were observed. In areas where a horizontal brushing method had been used, the PI score was significantly decreased. CONCLUSIONS: This study suggests that the horizontal brushing method is an efficient tooth-brushing method compared to the other methods. Additionally, tooth-brushing for more than 10 seconds on 3 to 4 teeth area was effective in removing dental biofilm.
Biofilms
;
Dental Plaque Index
;
Dental Plaque
;
Dentistry
;
Ethics Committees, Research
;
Humans
;
Korea
;
Methods
;
Mouth
;
Observational Study
;
Oral Hygiene
;
Periodontal Diseases
;
Seoul
;
Tooth
;
Toothpastes
;
Video Recording
8.Machine-Learning Based Automatic and Real-time Detection of Mouse Scratching Behaviors
Ingyu PARK ; Kyeongho LEE ; Kausik BISHAYEE ; Hong Jin JEON ; Hyosang LEE ; Unjoo LEE
Experimental Neurobiology 2019;28(1):54-61
Scratching is a main behavioral response accompanied by acute and chronic itch conditions, and has been quantified as an objective correlate to assess itch in studies using laboratory animals. Scratching has been counted mostly by human annotators, which is a time-consuming and laborious process. It has been attempted to develop automated scoring methods using various strategies, but they often require specialized equipment, costly software, or implantation of device which may disturb animal behaviors. To complement limitations of those methods, we have adapted machine learning-based strategy to develop a novel automated and real-time method detecting mouse scratching from experimental movies captured using monochrome cameras such as a webcam. Scratching is identified by characteristic changes in pixels, body position, and body size by frame as well as the size of body. To build a training model, a novel two-step J48 decision tree-inducing algorithm along with a C4.5 post-pruning algorithm was applied to three 30-min video recordings in which a mouse exhibits scratching following an intradermal injection of a pruritogen, and the resultant frames were then used for the next round of training. The trained method exhibited, on average, a sensitivity and specificity of 95.19% and 92.96%, respectively, in a performance test with five new recordings. This result suggests that it can be used as a non-invasive, automated and objective tool to measure mouse scratching from video recordings captured in general experimental settings, permitting rapid and accurate analysis of scratching for preclinical studies and high throughput drug screening.
Animals
;
Animals, Laboratory
;
Behavior, Animal
;
Body Size
;
Complement System Proteins
;
Decision Trees
;
Drug Evaluation, Preclinical
;
Humans
;
Injections, Intradermal
;
Machine Learning
;
Methods
;
Mice
;
Motion Pictures as Topic
;
Pruritus
;
Research Design
;
Sensitivity and Specificity
;
Video Recording
9.Utilizing Video vs Simulation Practice for Handoff Education of Nursing Students in Pediatric Nursing
Child Health Nursing Research 2018;24(1):27-36
PURPOSE: The purpose of this study was to develop a model for handoff education for nursing students based on simulation using video and to identify educational effects of a simulated situation in pediatric care units. METHODS: Data were collected from May 1 to 30, 2016. Participants were 84 senior nursing students in Seoul(video group: 43, simulation group: 41). Both groups were given a lecture and pre-briefing on handoff education. The simulation group had nursing practice on resolving health issues for respiratory distress using a high-fidelity baby simulator. The video group watched a video recording of a scenario based simulation, and used a summarized handoff situation to practice patient handoff to another student. RESULTS: There was no significant difference between the two groups for handoff self-confidence, problem solving ability, handoff competence(self-assessment of students), or learning satisfaction. Self-confidence increased significantly in both groups. Handoff competency evaluated by the instructor was higher in the video group compared to the simulation group(t=2.33, p=.022). CONCLUSION: Nursing student education for handoff practice utilizing a video in the pediatric unit was more cost effective. Therefore, it could be a useful educational method for students in learning patient handoff practices and helpful for related research.
Clinical Competence
;
Education
;
Humans
;
Learning
;
Methods
;
Nursing
;
Patient Handoff
;
Patient Simulation
;
Pediatric Nursing
;
Problem Solving
;
Students, Nursing
;
Video Recording
10.Development and Feasibility of a Video Recording-Based Standardized Handoff Program for Ward Nurses: A pilot study
Eun Koung SEO ; Sun Hee KIM ; Youn Jung SON
Journal of Korean Critical Care Nursing 2018;11(3):71-84
PURPOSE: This study develop a Video Recording-Based Standardized Handoff (VRSH) program at shift change for ward nurses.METHOD: The study was conducted in five medical, three surgical, and one comprehensive nursing care service wards affiliated with a secondary general hospital. In this methodological study, the VRSH program was developed between April and December, 2017. It is noted that 65 nurses who were involved in the VRSH program participated in this study.RESULTS: In line with the modified Situation, Background, Assessment, Recommendation tool, the VRSH program consisted of three phases. In the VRSH program, the average time for handoff duration was 3–5 minutes per patient. More than 90% of the ward nurses were satisfied with the VRSH program since it benefited them by reducing overtime work and improving the performance, as well as effective communication, of nurses. The content analysis of nurses' VRSH program experience, revealed three categories and eight sub-categories.CONCLUSION: This study provides evidence that the VRSH program improves effective nursing performance and, the handoff communication and relationships between nurses. Future studies on large sample sizes and multiple settings are required to substantially evaluate the impact of the VRSH program on clinical outcomes.
Feasibility Studies
;
Hospitals, General
;
Humans
;
Methods
;
Nursing
;
Nursing Care
;
Patient Handoff
;
Pilot Projects
;
Sample Size
;
Video Recording

Result Analysis
Print
Save
E-mail