1
|
Arii H, Sakai T. Hyoid Bone Velocity and Distance during the Forward Phase Correlate with Pyriform Sinus Residue: A Retrospective Case Series. Prog Rehabil Med 2025; 10:20250009. [PMID: 40206814 PMCID: PMC11976461 DOI: 10.2490/prm.20250009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Accepted: 03/31/2025] [Indexed: 04/11/2025] Open
Abstract
Objectives This study investigated the relationship between the three phases of hyoid bone movement and pharyngeal residue using the videofluoroscopic swallowing study (VFSS). Methods We retrospectively analyzed the data from 66 patients who underwent VFSS between April 2019 and December 2019. Hyoid bone movement was classified into three phases: upward, forward, and downward. We measured the velocity and distance of hyoid bone movement in each phase, as well as the pharyngeal residue after swallowing. The correlation between hyoid bone movement and the amount of pharyngeal residue was analyzed using Spearman's rank correlation coefficient. A receiver operating characteristic (ROC) analysis was performed to evaluate the presence of pyriform sinus residue. Results Hyoid bone velocity and distance during the forward phase correlated with the amount of pyriform sinus residue (velocity: r=0.311, P=0.011; distance: r=0.255, P=0.0389). ROC analysis revealed that the cutoff value for hyoid bone velocity during the forward phase was 26.1 mm/s (0.846 sensitivity, 0.604 specificity) with an area under the curve of 0.717. Conclusions The velocity and distance of the hyoid bone during the forward phase were significantly related to the amount of pyriform sinus residue. In VFSS assessment, it is important to classify hyoid bone movement into three phases-upward, forward, and downward-and to calculate its velocity and distance.
Collapse
Affiliation(s)
- Hironori Arii
- Division of Rehabilitation, Gunma University Hospital,
Maebashi, Japan
| | - Tetsuro Sakai
- Department of Speech-Language-Hearing Therapy, Gunma Paz
University, Takasaki, Japan
- Division of Rehabilitation, Fujioka General Hospital,
Fujioka, Japan
| |
Collapse
|
2
|
Cubero L, Tessier C, Castelli J, Robert K, de Crevoisier R, Jégoux F, Pascau J, Acosta O. Automated dysphagia characterization in head and neck cancer patients using videofluoroscopic swallowing studies. Comput Biol Med 2025; 187:109759. [PMID: 39914196 DOI: 10.1016/j.compbiomed.2025.109759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Revised: 01/24/2025] [Accepted: 01/27/2025] [Indexed: 02/21/2025]
Abstract
BACKGROUND Dysphagia is one of the most common toxicities following head and neck cancer (HNC) radiotherapy (RT). Videofluoroscopic Swallowing Studies (VFSS) are the gold standard for diagnosing and assessing dysphagia, but current evaluation methods are manual, subjective, and time-consuming. This study introduces a novel framework for the automated analysis of VFSS to characterize dysphagia in HNC patients. METHOD The proposed methodology integrates three key steps: (i) a deep learning-based labeling framework, trained iteratively to identify ten regions of interest; (ii) extraction of 23 swallowing dynamic parameters, followed by comparison across diverse cohorts; and (iii) machine learning (ML) classification of the extracted parameters into four dysphagia-related impairments. RESULTS The labeling framework achieved high accuracy, with a mean error of 1.6 pixels across the ten regions of interest in an independent test dataset. Analysis of the extracted parameters revealed significant differences in swallowing dynamics between healthy individuals, HNC patients before and after RT, and patients with non-HNC-related dysphagia. The ML classifiers achieved accuracies ranging from 0.60 to 0.87 for the four dysphagia-related impairments. CONCLUSIONS Despite challenges related to dataset size and VFSS variability, our framework demonstrates substantial potential for automatically identifying ten regions of interest and four dysphagia-related impairments from VFSS. This work sets the foundation for future research aimed at refining dysphagia analysis and characterization using VFSS, particularly in the context of HNC RT.
Collapse
Affiliation(s)
- Lucía Cubero
- Université Rennes, CLCC Eugène Marquis, Inserm, LTSI - UMR 1099, F-35000, Rennes, France; Departamento de Bioingeniería, Universidad Carlos III de Madrid, Madrid, Spain.
| | - Christophe Tessier
- Service d'ORL et Chirurgie Maxillo-Faciale, CHU Pontchaillou, Université Rennes, 35033, Rennes, France
| | - Joël Castelli
- Université Rennes, CLCC Eugène Marquis, Inserm, LTSI - UMR 1099, F-35000, Rennes, France
| | - Kilian Robert
- Service d'ORL et Chirurgie Maxillo-Faciale, CHU Pontchaillou, Université Rennes, 35033, Rennes, France
| | - Renaud de Crevoisier
- Université Rennes, CLCC Eugène Marquis, Inserm, LTSI - UMR 1099, F-35000, Rennes, France
| | - Franck Jégoux
- Service d'ORL et Chirurgie Maxillo-Faciale, CHU Pontchaillou, Université Rennes, 35033, Rennes, France
| | - Javier Pascau
- Departamento de Bioingeniería, Universidad Carlos III de Madrid, Madrid, Spain; Instituto de Investigación Sanitaria Gregorio Marañón, Madrid, Spain.
| | - Oscar Acosta
- Université Rennes, CLCC Eugène Marquis, Inserm, LTSI - UMR 1099, F-35000, Rennes, France
| |
Collapse
|
3
|
Shu K, Mao S, Zhang Z, Coyle JL, Sejdić E. Recent advancements and future directions in automatic swallowing analysis via videofluoroscopy: A review. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 259:108505. [PMID: 39579458 DOI: 10.1016/j.cmpb.2024.108505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 11/06/2024] [Accepted: 11/06/2024] [Indexed: 11/25/2024]
Abstract
Videofluoroscopic swallowing studies (VFSS) capture the complex anatomy and physiology contributing to bolus transport and airway protection during swallowing. While clinical assessment of VFSS can be affected by evaluators subjectivity and variability in evaluation protocols, many efforts have been dedicated to developing methods to ensure consistent measures and reliable analyses of swallowing physiology using advanced computer-assisted methods. Latest advances in computer vision, pattern recognition, and deep learning technologies provide new paradigms to explore and extract information from VFSS recordings. The literature search was conducted on four bibliographic databases with exclusive focus on automatic videofluoroscopic analyses. We identified 46 studies that employ state-of-the-art image processing techniques to solve VFSS analytical tasks including anatomical structure detection, bolus contrast segmentation, and kinematic event recognition. Advanced computer vision and deep learning techniques have enabled fully automatic swallowing analysis and abnormality detection, resulting in improved accuracy and unprecedented efficiency in swallowing assessment. By establishing this review of image processing techniques applied to automatic swallowing analysis, we intend to demonstrate the current challenges in VFSS analyses and provide insight into future directions in developing more accurate and clinically explainable algorithms.
Collapse
Affiliation(s)
- Kechen Shu
- School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
| | - Shitong Mao
- Department of Head and Neck Surgery, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Zhenwei Zhang
- Center for Advanced Analytics, Baptist Health South Florida, Miami, FL, USA
| | - James L Coyle
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, Pittsburgh, PA, USA; Department of Otolaryngology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA; Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Ervin Sejdić
- Edward S. Rogers Department of Electrical and Computer Engineering, Faculty of Applied Science and Engineering, University of Toronto, Toronto, ON, Canada; North York General Hospital, Toronto, ON, Canada.
| |
Collapse
|
4
|
Li W, Mao S, Mahoney AS, Coyle JL, Sejdić E. Automatic Tracking of Hyoid Bone Displacement and Rotation Relative to Cervical Vertebrae in Videofluoroscopic Swallow Studies Using Deep Learning. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:1922-1932. [PMID: 38383805 PMCID: PMC11300761 DOI: 10.1007/s10278-024-01039-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 01/17/2024] [Accepted: 02/01/2024] [Indexed: 02/23/2024]
Abstract
The hyoid bone displacement and rotation are critical kinematic events of the swallowing process in the assessment of videofluoroscopic swallow studies (VFSS). However, the quantitative analysis of such events requires frame-by-frame manual annotation, which is labor-intensive and time-consuming. Our work aims to develop a method of automatically tracking hyoid bone displacement and rotation in VFSS. We proposed a full high-resolution network, a deep learning architecture, to detect the anterior and posterior of the hyoid bone to identify its location and rotation. Meanwhile, the anterior-inferior corners of the C2 and C4 vertebrae were detected simultaneously to automatically establish a new coordinate system and eliminate the effect of posture change. The proposed model was developed by 59,468 VFSS frames collected from 1488 swallowing samples, and it achieved an average landmark localization error of 2.38 pixels (around 0.5% of the image with 448 × 448 pixels) and an average angle prediction error of 0.065 radians in predicting C2-C4 and hyoid bone angles. In addition, the displacement of the hyoid bone center was automatically tracked on a frame-by-frame analysis, achieving an average mean absolute error of 2.22 pixels and 2.78 pixels in the x-axis and y-axis, respectively. The results of this study support the effectiveness and accuracy of the proposed method in detecting hyoid bone displacement and rotation. Our study provided an automatic method of analyzing hyoid bone kinematics during VFSS, which could contribute to early diagnosis and effective disease management.
Collapse
Affiliation(s)
- Wuqi Li
- Edward S. Rogers Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
| | - Shitong Mao
- Department of Head and Neck Surgery, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Amanda S Mahoney
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - James L Coyle
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Ervin Sejdić
- Edward S. Rogers Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada.
- North York General Hospital, Toronto, ON, Canada.
| |
Collapse
|
5
|
Nerella S, Bandyopadhyay S, Zhang J, Contreras M, Siegel S, Bumin A, Silva B, Sena J, Shickel B, Bihorac A, Khezeli K, Rashidi P. Transformers and large language models in healthcare: A review. Artif Intell Med 2024; 154:102900. [PMID: 38878555 PMCID: PMC11638972 DOI: 10.1016/j.artmed.2024.102900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 05/28/2024] [Accepted: 05/30/2024] [Indexed: 08/09/2024]
Abstract
With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformers neural network architecture is rapidly changing many applications. Transformer is a type of deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, including healthcare. In this survey paper, we provide an overview of how this architecture has been adopted to analyze various forms of healthcare data, including clinical NLP, medical imaging, structured Electronic Health Records (EHR), social media, bio-physiological signals, biomolecular sequences. Furthermore, which have also include the articles that used the transformer architecture for generating surgical instructions and predicting adverse outcomes after surgeries under the umbrella of critical care. Under diverse settings, these models have been used for clinical diagnosis, report generation, data reconstruction, and drug/protein synthesis. Finally, we also discuss the benefits and limitations of using transformers in healthcare and examine issues such as computational cost, model interpretability, fairness, alignment with human values, ethical implications, and environmental impact.
Collapse
Affiliation(s)
- Subhash Nerella
- Department of Biomedical Engineering, University of Florida, Gainesville, United States
| | | | - Jiaqing Zhang
- Department of Electrical and Computer Engineering, University of Florida, Gainesville, United States
| | - Miguel Contreras
- Department of Biomedical Engineering, University of Florida, Gainesville, United States
| | - Scott Siegel
- Department of Biomedical Engineering, University of Florida, Gainesville, United States
| | - Aysegul Bumin
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, United States
| | - Brandon Silva
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, United States
| | - Jessica Sena
- Department Of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Benjamin Shickel
- Department of Medicine, University of Florida, Gainesville, United States
| | - Azra Bihorac
- Department of Medicine, University of Florida, Gainesville, United States
| | - Kia Khezeli
- Department of Biomedical Engineering, University of Florida, Gainesville, United States
| | - Parisa Rashidi
- Department of Biomedical Engineering, University of Florida, Gainesville, United States.
| |
Collapse
|
6
|
Mao S, Naser MA, Buoy SN, Brock KK, Hutcheson KA. Optimizing Modified Barium Swallow Exam Workflow: Automating Pre-Analysis Video Sorting in Swallowing Function Assessment. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2024; 2024:1-4. [PMID: 40039880 PMCID: PMC11883172 DOI: 10.1109/embc53108.2024.10782457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
Modified Barium Swallow (MBS) exams, performed using video-fluoroscopy, an X-ray imaging technique, are essential for assessing swallowing function. They visualize the barium bolus (contrast agent) during the swallowing process in the head and neck area, thereby providing crucial insights into the dynamics of swallowing. Typically, these exams include both diagnostic anteroposterior (AP) and lateral planes, in addition to non-diagnostic "scout" films. This study introduces a deep learning solution aimed at streamlining the pre-analysis process of MBS exams by automating the identification of video orientations and scout video clips. Our methods are trained and tested on a comprehensive dataset comprising 2,315 video clips from 172 MBS exams and 106 patients. To distinguish AP videos from lateral views, our model achieved more than 99% accuracy at the frame level and 100% at the video level. In differentiating scout from bolus swallowing tasks, the model attained a maximum accuracy of 86% at the video level. We further merged these two tasks into a multi-task learning approach further enhanced the accuracy to 91% for scout/bolus differentiation. These advancements allow clinicians to allocate more efforts to focus primarily on lateral view videos for clinically relevant measurements such as the Penetration-Aspiration Scale (PAS) and Dynamic Imaging Grade of Swallowing Toxicity (DIGEST). This image sorting is also a pre-requisite step necessary to apply deep learning solutions to full image analysis.
Collapse
|
7
|
Park D, Kim Y, Kang H, Lee J, Choi J, Kim T, Lee S, Son S, Kim M, Kim I. PECI-Net: Bolus segmentation from video fluoroscopic swallowing study images using preprocessing ensemble and cascaded inference. Comput Biol Med 2024; 172:108241. [PMID: 38489987 DOI: 10.1016/j.compbiomed.2024.108241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/30/2024] [Accepted: 02/27/2024] [Indexed: 03/17/2024]
Abstract
Bolus segmentation is crucial for the automated detection of swallowing disorders in videofluoroscopic swallowing studies (VFSS). However, it is difficult for the model to accurately segment a bolus region in a VFSS image because VFSS images are translucent, have low contrast and unclear region boundaries, and lack color information. To overcome these challenges, we propose PECI-Net, a network architecture for VFSS image analysis that combines two novel techniques: the preprocessing ensemble network (PEN) and the cascaded inference network (CIN). PEN enhances the sharpness and contrast of the VFSS image by combining multiple preprocessing algorithms in a learnable way. CIN reduces ambiguity in bolus segmentation by using context from other regions through cascaded inference. Moreover, CIN prevents undesirable side effects from unreliably segmented regions by referring to the context in an asymmetric way. In experiments, PECI-Net exhibited higher performance than four recently developed baseline models, outperforming TernausNet, the best among the baseline models, by 4.54% and the widely used UNet by 10.83%. The results of the ablation studies confirm that CIN and PEN are effective in improving bolus segmentation performance.
Collapse
Affiliation(s)
- Dougho Park
- Pohang Stroke and Spine Hospital, Pohang, Republic of Korea; School of Convergence Science and Technology, Pohang University of Science and Technology, Pohang, Republic of Korea
| | - Younghun Kim
- School of CSEE, Handong Global University, Pohang, Republic of Korea
| | - Harim Kang
- School of CSEE, Handong Global University, Pohang, Republic of Korea
| | - Junmyeoung Lee
- School of CSEE, Handong Global University, Pohang, Republic of Korea
| | - Jinyoung Choi
- School of CSEE, Handong Global University, Pohang, Republic of Korea
| | - Taeyeon Kim
- Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
| | - Sangeok Lee
- Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
| | - Seokil Son
- Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
| | - Minsol Kim
- Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
| | - Injung Kim
- School of CSEE, Handong Global University, Pohang, Republic of Korea.
| |
Collapse
|
8
|
Ryu YH, Kim JH, Kim D, Kim SY, Lee SJ. Diagnostic value of a deep learning-based hyoid bone tracking model for aspiration in patients with post-stroke dysphagia. Digit Health 2024; 10:20552076241271778. [PMID: 39130520 PMCID: PMC11311153 DOI: 10.1177/20552076241271778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 06/20/2024] [Indexed: 08/13/2024] Open
Abstract
Objective Hyoid bone movement is potentially related to aspiration risk in post-stroke dysphagia (PSD) patients but is difficult to assess quantitatively. This study aimed to measure the distance of hyoid bone movement more efficiently and accurately using a deep learning model and determine the clinical usefulness of the model in PSD patients. Methods This study included 85 patients with PSD within 6 months from onset. Patients were grouped into an aspiration group (n = 35) and a non-aspiration group (n = 50) according to the results of a videofluoroscopic swallowing study. Hyoid bone movement was tracked using a deep learning model constructed with the BiFPN-U-Net(T) architecture. The maximum distance of hyoid bone movement was measured horizontally (H max), vertically (V max), and diagonally (D max). Results Compared with the non-aspiration group, the aspiration group showed significant decreases in hyoid bone movement in all directions. The area under the curve of V max was highest at 0.715 with a sensitivity of 0.680 and specificity of 0.743. The V max cutoff value for predicting aspiration risk was 1.61 cm. The success of oral feeding at the time of discharge was significantly more frequent when hyoid movement was equal to or larger than the cutoff value although no significant relationship was found between hyoid movement and other clinical characteristics. Conclusion Hyoid bone movement of PSD patients can be measured quantitatively and efficiently using a deep learning model. Deep learning model-based analysis of hyoid bone movement seems to be useful for predicting aspiration risk and the possibility of resuming oral feeding.
Collapse
Affiliation(s)
- Yeong Hwan Ryu
- Department of Rehabilitation Medicine, College of Medicine, Dankook University, Cheonan, Republic of Korea
| | - Ji Hyun Kim
- Department of Rehabilitation Medicine, College of Medicine, Dankook University, Cheonan, Republic of Korea
| | - Dohhyung Kim
- Department of Internal Medicine, College of Medicine, Dankook University, Cheonan, Republic of Korea
| | - Seo Young Kim
- Department of Rehabilitation Medicine, College of Medicine, Dankook University, Cheonan, Republic of Korea
| | - Seong Jae Lee
- Department of Rehabilitation Medicine, College of Medicine, Dankook University, Cheonan, Republic of Korea
| |
Collapse
|
9
|
Ruan X, Dai M, Chen Z, You Z, Zhang Y, Li Y, Dou Z, Tan M. Temporal Micro-Action Localization for Videofluoroscopic Swallowing Study. IEEE J Biomed Health Inform 2023; 27:5904-5913. [PMID: 37682645 DOI: 10.1109/jbhi.2023.3313255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/10/2023]
Abstract
Videofluoroscopic swallowing study (VFSS) visualizes the swallowing movement by using X-ray fluoroscopy, which is the most widely used method for dysphagia examination. To better facilitate swallowing assessment, the temporal parameter is one of the most important indicators. However, most information of that acquire is hand-crafted and elaborated, which is time-consuming and difficult to ensure objectivity and accuracy. In this article, we propose to formulate this task as a temporal action localization task and solve it using deep neural networks. However, the action of VFSS has the following characteristics such as small motion targets, small action amplitudes, large sample variances, short duration, and variations in duration. Furthermore, all existing methods often rely on daily behaviors, which makes locating and recognizing micro-actions more challenging. To address the above issues, we first collect and annotate the VFSS micro-action dataset, which includes 847 VFSS data from 71 subjects, due to the lack of benchmarks. We then introduce a coarse-to-fine mechanism to handle the short and repeated nature of micro-actions, which can significantly enhancing micro-action localization accuracy. Moreover, we propose a Variable-Size Window Generator method, which improves the model's characterization performance and addresses the issue of different action timings, leading to further improvements in localization accuracy. The results of our experiments demonstrate the superiority of our method, with significantly improved performance (46.10% vs. 37.70%).
Collapse
|
10
|
Hsiao MY, Weng CH, Wang YC, Cheng SH, Wei KC, Tung PY, Chen JY, Yeh CY, Wang TG. Deep Learning for Automatic Hyoid Tracking in Videofluoroscopic Swallow Studies. Dysphagia 2023; 38:171-180. [PMID: 35482213 DOI: 10.1007/s00455-022-10438-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 12/13/2021] [Indexed: 01/27/2023]
Abstract
The hyoid bone excursion is one of the most important gauges of larynx elevation in swallowing, contributing to airway protection and bolus passage into the esophagus. However, the implications of various parameters of hyoid bone excursion, such as the horizontal or vertical displacement and velocity, remain elusive and raise the need for a tool providing automatic kinematics analysis. Several conventional and deep learning-based models have been applied automatically to track the hyoid bone, but previous methods either require partial manual localization or do not transform the trajectory by anatomic axis. This work describes a convolutional neural network-based algorithm featuring fully automatic hyoid bone localization and tracking and spine axis determination. The algorithm automatically estimates the hyoid bone trajectory and calculates several physical quantities, including the average velocity and displacement in horizontal or vertical anatomic axis. The model was trained in a dataset of 365 videos of videofluoroscopic swallowing from 189 patients in a tertiary medical center and tested using 44 videos from 44 patients with different dysphagia etiologies. The algorithm showed high detection rates for the hyoid bone. The results showed excellent inter-rater reliability for hyoid bone detection, good-to-excellent inter-rater reliability for calculating the maximal displacement and the average velocity of the hyoid bone in horizontal or vertical directions, and moderate-to-good reliability in calculating the average velocity in horizontal direction. The proposed algorithm allows for complete automatic kinematic analysis of hyoid bone excursion, providing a versatile tool with high potential for clinical applications.
Collapse
Affiliation(s)
- Ming-Yen Hsiao
- Department of Physical Medicine and Rehabilitation, College of Medicine, Zhongzheng Dist., National Taiwan University, No. 7, Zhongshan S. Rd., Taipei, 100, Taiwan
- Department of Physical Medicine and Rehabilitation, National Taiwan University Hospital, Taipei, Taiwan
| | | | - Yu-Chen Wang
- Department of Physical Medicine and Rehabilitation, College of Medicine, Zhongzheng Dist., National Taiwan University, No. 7, Zhongshan S. Rd., Taipei, 100, Taiwan
| | - Sheng-Hao Cheng
- Department of Physical Medicine and Rehabilitation, National Taiwan University Hospital, Taipei, Taiwan
| | - Kuo-Chang Wei
- Department of Physical Medicine and Rehabilitation, National Taiwan University Hospital, Taipei, Taiwan
| | - Po-Ya Tung
- The UC Berkeley/ UCSF Master Program in Translational Medicine, University of California, Berkeley, University of California, San Francisco, CA, USA
| | - Jo-Yu Chen
- Department of Medical Imaging, National Taiwan University Hospital, Taipei, Taiwan
| | | | - Tyng-Guey Wang
- Department of Physical Medicine and Rehabilitation, College of Medicine, Zhongzheng Dist., National Taiwan University, No. 7, Zhongshan S. Rd., Taipei, 100, Taiwan.
- Department of Physical Medicine and Rehabilitation, National Taiwan University Hospital, Taipei, Taiwan.
| |
Collapse
|
11
|
Bordoni B, Escher AR. A Missing Voice: The Lingual Complex and Osteopathic Manual Medicine in the Context of Five Osteopathic Models. Cureus 2021; 13:e18658. [PMID: 34659928 PMCID: PMC8503936 DOI: 10.7759/cureus.18658] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/11/2021] [Indexed: 12/19/2022] Open
Abstract
The five osteopathic models recognized by the American Association of Colleges of Osteopathic Medicine guide clinicians in the evaluation and therapeutic choice which must be the most appropriate concerning the patient's needs. Skeletal muscles represent an important interpretation, such as screening and treatment, on which these models are based. A muscle district that is not considered by the usual osteopathic practice is the tongue. The lingual complex has numerous functions, both local and systemic; it can adapt negatively in the presence of pathology, just as it can influence the body system in a non-physiological manner if it is a source of dysfunctions. This paper, the first of its kind in the panorama of scientific literature, briefly reviews the anatomy and neurophysiology of the tongue, trying to highlight the logic and the need to insert this muscle in the context of the five osteopathic models. The clinician's goal is to restore the patient's homeostasis, and we believe that this task is more concrete if the patient is approached after understanding all the contractile districts, including the tongue.
Collapse
Affiliation(s)
- Bruno Bordoni
- Physical Medicine and Rehabilitation, Don Carlo Gnocchi Foundation, Milan, ITA
| | - Allan R Escher
- Anesthesiology/Pain Medicine, H. Lee Moffitt Cancer Center and Research Institute, Tampa, USA
| |
Collapse
|