Similar literature
Found 20 similar articles
1.
A model-based approach is proposed to objectively measure and classify vocal fold vibrations by left-right asymmetries along the anterior-posterior direction, especially in the case of nonstationary phonation. For this purpose, vocal fold dynamics are recorded in real time with a digital high-speed camera during phonation of sustained vowels as well as pitch raises. The dynamics of a multimass model with time-dependent parameters are matched to vocal fold vibrations extracted at dorsal, medial, and ventral positions by an automatic optimization procedure. The block-based optimization accounts for nonstationary vibrations and compares the vocal fold and model dynamics by wavelet coefficients. The optimization is verified with synthetically generated data sets and is applied to 40 clinical high-speed recordings comprising normal and pathological voice subjects. The resulting model parameters allow an intuitive visual assessment of vocal fold instabilities within an asymmetry diagram and are applicable to an objective quantification of asymmetries.

2.
3.

Objectives

The aim of this study was to identify visual subjective and objective parameters of vocal fold dynamics capable of differentiating healthy from pathologic voices in daily clinical practice using endoscopic high-speed digital imaging (HSI).

Study Design and Methods

Four hundred ninety-six datasets containing 80 healthy and 416 pathologic subjects (232 functional dysphonia (FD), 13 bilateral, and 171 unilateral vocal fold nerve paralysis) were analyzed retrospectively. Videos at 4000 Hz (256 × 256 pixel) were recorded during sustained phonation. Subjective parameters were visually evaluated and complemented by an analysis of objective parameters. Visual subjective parameters were mucosal wave, glottal closure type, glottal closure insufficiency (GI), asymmetries of the vocal folds, and phonovibrogram (PVG) symmetry. After image segmentation, objective parameters were computed: closed quotient, perturbation measures (PMs) of glottal area, and left-right asymmetry values.

Results

HSI evaluation made it possible to distinguish healthy from pathologic voices. For visual subjective parameters, GI, symmetrical behavior, and PVG symmetry exhibited statistically significant differences. Objective parameters could be computed for 95% of the data. Among objective parameters, closed quotient, jitter, shimmer, harmonic-to-noise ratio, and signal-to-noise ratio for the glottal area function differentiated normal from pathologic voices with statistical significance. Applying linear discriminant analysis to the combined visual subjective and objective parameters, accurate classifications were made for 63.2% of the female and 87.5% of the male group for the three-class problem (healthy, FD, and unilateral vocal fold nerve paralysis).

Conclusion

PMs currently applied in acoustic analysis can be transferred to clinically beneficial HSI analysis. Combining basic visual subjective and objective parameters succeeds in differentiating pathologic from healthy voices. The presented evaluation can easily be included in everyday clinical practice. However, further research is needed to broaden our understanding of the variability within and across healthy and pathologic vocal fold vibrations for diagnosing voice disorders and therapy control.
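
The objective parameters named in this abstract (closed quotient and the perturbation measures jitter and shimmer) can be illustrated with a minimal sketch. The abstract does not give its formulas, so the definitions below are assumptions: the closed quotient is taken as the fraction of samples in which the segmented glottal area is at or below a threshold, and jitter and shimmer use the common "local" form (mean absolute difference between consecutive cycles divided by the mean). All function names and sample values are hypothetical.

```python
from statistics import mean


def closed_quotient(area, threshold=0.0):
    """Fraction of samples with glottal area at or below `threshold`.

    Assumes `area` covers a whole number of vibration cycles.
    """
    closed = sum(1 for a in area if a <= threshold)
    return closed / len(area)


def local_perturbation(values):
    """Mean absolute difference of consecutive values, relative to their mean.

    Applied to cycle periods this is an (assumed) local jitter;
    applied to cycle peak amplitudes it is an (assumed) local shimmer.
    """
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)


# Hypothetical glottal area samples (pixels) spanning two cycles.
area = [0, 0, 1, 2, 1, 0, 0, 1, 2, 1]
# Hypothetical cycle lengths in frames and peak areas per cycle.
periods = [10, 12, 10, 12]
peaks = [2.0, 2.0, 2.0]

cq = closed_quotient(area)
jitter = local_perturbation(periods)
shimmer = local_perturbation(peaks)
```

At the 4000 Hz frame rate mentioned in the abstract, a period measured in frames converts to seconds by dividing by 4000; the relative measures above are unaffected by that scaling.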

4.
5.
Irregularities in voiced speech are often observed as a consequence of vocal fold lesions, paralyses, and other pathological conditions. Many of these instabilities are related to the intrinsic nonlinearities in the vibrations of the vocal folds. In this paper, bifurcations in voice signals are analyzed using narrow-band spectrograms. We study sustained phonation of patients with laryngeal paralysis and data from an excised larynx experiment. These spectrograms are compared with computer simulations of an asymmetric 2-mass model of the vocal folds. (c) 1995 American Institute of Physics.

6.
Electroglottography (EGG) was used to monitor vocal fold vibration patterns in normal subjects and patients with various laryngeal disorders. In order to evaluate the regularity of vocal fold vibration, frequency and amplitude perturbation of EGG waves during sustained phonation were measured with a laboratory computer. The data were compared to the degree of hoarseness evaluated by auditory perception and by sound spectrographic analysis. Frequency and amplitude perturbation measures showed some overlap between normal and pathological groups. However, there was a close relation between perturbation analysis of EGG waves and degree of hoarseness (Spearman's rank correlation coefficient r_s = 0.73, p < 0.0005). Amplitude perturbation was found to be a more sensitive measure of the irregularity of vocal fold vibration than frequency perturbation.
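
For readers unfamiliar with the statistic reported in this abstract, the following is a minimal sketch of Spearman's rank correlation: both variables are converted to ranks (tied values share their average rank) and the Pearson correlation of the ranks is taken. The data are hypothetical and are not taken from the study.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values receive the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1  # average of positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical example: perturbation values vs. hoarseness grades (0-3).
perturbation = [0.4, 0.9, 1.1, 2.5, 3.0]
hoarseness = [0, 1, 1, 2, 3]
r_s = spearman(perturbation, hoarseness)
```

Because only ranks enter the calculation, the coefficient is suited to ordinal ratings such as a perceptual hoarseness grade.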

7.
Although the problem of vocal fatigue is not uncommon in people with voice disorders, research on objective quantifiable indicators of vocal fatigue is limited. It has been suggested that a speaker's perception of increased phonatory effort associated with periods of prolonged voice use is related to increased lung pressure required to initiate and sustain phonation. The purpose of this study was to examine the relationship between perceived phonatory effort (PPE), which was used as a subjective index of vocal fatigue, and phonation threshold pressure (PTP), a quantifiable measure defined as the minimal lung pressure required to initiate and sustain vocal fold oscillation. PTP and PPE were recorded before, during, and after five adult male and five adult female speakers engaged in a prolonged oral reading task designed to induce vocal fatigue. The results supported a direct, moderately strong relationship between PTP and PPE, particularly when PTP was measured during speech produced at comfortable and low-speaking pitch levels. No gender effects were found. PTP returned to baseline levels within 1 hour after the fatiguing task. PPE returned to baseline within 1 day. The data support the use of PTP as an objective index of vocal fatigue.

8.
A method has been developed for tracking irregularities in the acoustic waveform of a sustained phonation using the adaptive Wiener filter. Irregularities are determined by the technique of correlation cancellation. The algorithm is evaluated using sustained vowels produced by a formant synthesizer and by subjects with and without phonatory disorders. Results indicate that the method is capable of differentiating between normal and abnormal voices. Most significantly, however, it can also track sporadic or nonstationary irregularities in the shape of an individual acoustic wavelet. This method is expected to be a useful tool for the acoustic analysis of voice production.

9.
Speech range profile (SRP) is a graphical display of the frequency-intensity interactions occurring during functional speech activity. A few studies have suggested potential clinical applications of SRP. However, these studies are limited to qualitative case comparisons and vocally healthy participants. The present study aimed to examine the effects of voice disorders on speaking and maximum voice ranges in a group of vocally untrained women. It also aimed to examine whether voice limit measures derived from SRP were as sensitive as those derived from voice range profile (VRP) in distinguishing dysphonic from healthy voices. Ninety dysphonic women with laryngeal pathologies and 35 women with normal voices, who served as controls, participated in this study. Each subject recorded a VRP for her physiological vocal limits. In addition, each subject read aloud the "North Wind and the Sun" passage to record SRP. All the recordings were captured and analyzed by Soundswell's computerized real-time phonetogram Phog 1.0 (Hitech Development AB, Täby, Sweden). The SRPs and the VRPs were compared between the two groups of subjects. Univariate analysis results demonstrated that individual SRP measures were less sensitive than the corresponding VRP measures in discriminating dysphonic from normal voices. However, stepwise logistic regression analyses revealed that the combination of only two SRP measures was almost as effective as a combination of three VRP measures in predicting the presence of dysphonia (overall prediction accuracy: 93.6% for SRP vs 96.0% for VRP). These results suggest that in a busy clinic where quick voice screening results are desirable, SRP can be an acceptable alternate procedure to VRP.

10.
The human voice signal originates from the vibrations of the two vocal folds within the larynx. The interactions of several intrinsic laryngeal muscles adduct and shape the vocal folds to facilitate vibration in response to airflow. Three-dimensional vocal fold dynamics are extracted from in vitro hemilarynx experiments and fitted by a numerical three-dimensional multi-mass model (3DM) using an optimization procedure. In this work, the 3DM dynamics are optimized over 24 experimental data sets to estimate biomechanical vocal fold properties during phonation. Accuracy of the optimization is verified by low normalized error (0.13 ± 0.02), high correlation (83% ± 2%), and reproducible subglottal pressure values. The optimized 3DM parameters yielded biomechanical variations in tissue properties along the vocal fold surface, including variations in both the local mass and stiffness of the vocal folds. That is, both mass and stiffness increased along the superior-to-inferior direction. These variations were statistically analyzed under different experimental conditions (e.g., an increase in tension as a function of vocal fold elongation and an increase in stiffness and a decrease in mass as a function of glottal airflow). The study showed that physiologically relevant vocal fold tissue properties, which cannot be directly measured during in vivo human phonation, can be captured using this 3D-modeling technique.

11.
The purpose of this exploratory study was to determine if laryngeal transillumination in combination with stroboscopy (strobophotoglottography; SPGG) is useful for (1) the visualization of vocal fold vibration (VFV) opening patterns, (2) the localization of initial vocal fold opening in horizontal glottal thirds (anterior, midmembranous, and posterior), (3) determination of the temporal correspondence of the so-called electroglottography (EGG)-knee and initial vocal fold separation, and, finally, (4) automatized quantitative measurements of glottal area function within endoscopic images. With stroboscopic transillumination, initial inferior vocal fold separation was detectable during the "closed" phase, where the vocal folds were still closed in the upper portion and therefore initial inferior vocal fold separation could not be visualized with usual laryngoscopy techniques. In the horizontal plane within similar fundamental frequencies in modal voice registers in two male subjects, localization of initial glottal opening depended on the voice types used (soft, normal, or pressed phonation). We found zipperlike posterior-to-anterior openings, initial midmembranous openings, initial anterior openings, as well as simultaneous initial opening of all three portions in the two healthy male adults examined. This technique proved to add temporal and spatial information to vocal fold opening patterns and extends our examination techniques to the very beginning of vocal fold opening at the inferior portion. Simultaneous electroglottogram tracking and comparison with bidirectionally illuminated stroboscopic images revealed a time-locked correspondence of the EGG-knee with the aforementioned initial inferior vocal fold separation. Bidirectional illumination combined with digital color extraction techniques allowed for image separation of subglottally and supraglottally illuminated structures. This facilitated vocal fold contour detection and automatized image processing, for example, for determination of glottal area function, and is considered to be a further step to objective automatized quantitative measurements within endoscopic images.

12.
Journal of Voice, 2019, 33(6): 851-859

Purpose

The pitch-shift reflex (PSR) is the adaptation of the fundamental frequency during phonation and speech and describes the auditory feedback control. Speakers without voice and speech disorders mostly compensate for a pitch change in the auditory feedback by adapting their fundamental frequency in the opposite direction. Dysphonic patients often display problems with the auditory perception and control of their voice during therapy. Our study focuses on the auditory and kinesthetic control mechanisms of patients with muscle tension dysphonia (MTD) and speakers without voice and speech problems. The main purpose of the study is to compare the functionality of the control mechanisms within phonation and speech between patients with MTD and normal speakers.

Method

Sixty-one healthy subjects (17 male, 44 female) and 22 patients with MTD (7 male, 15 female) participated following two paradigms including a sustained phonation (vowel /a/) and speech ([‘mama]). Within both paradigms the fundamental frequency of the auditory feedback was increased synthetically. For the analysis of the PSR the electroencephalogram, electroglottography, the voice signal, and the high-speed endoscopy data were recorded simultaneously. The PSR in the electroencephalogram was detected via the N100 and the mismatch negativity. Statistical tests were applied for the detection of the PSR in the physiological response within the electroglottography, voice, and high-speed endoscopy signals. The results were compared between both groups.

Results

No differences were found between the controls and patients with MTD regarding latency and magnitude of the perception of the pitch shift in both paradigms, but differences were found for the magnitude of the behavioral response. Differences were also found for both groups between the “no pitch” and “pitch” conditions of the two paradigms regarding vocal fold dynamics and voice quality. Patients with MTD showed more vibrational irregularities during the PSR than the controls, especially regarding the symmetry of vocal fold dynamics.

Conclusion

Patients with MTD seem to have a disturbed interaction between the auditory and kinesthetic feedback, inducing the execution of an overriding behavioral response.

13.
Currently, early phonatory changes in amyotrophic lateral sclerosis (ALS) are not well understood. The aim of this study was to compare acoustic parameters of voice in ALS subjects who demonstrated perceptually normal vocal quality on sustained phonation with a control group. We hypothesized that objective analysis of voice would reveal significant differences on specific acoustic parameters of voice compared to the control group. Results revealed statistically significant differences between the two groups on measures related to frequency range and phonatory stability. The findings suggest that early bulbar signs affecting the laryngeal system may be present in patients with ALS before the occurrence of perceptually aberrant vocal characteristics.

14.
Posterior closure insufficiency of the glottis is often mentioned in connection with permanent voice disorders. Recently published studies have revealed that an incomplete closure of the glottis can be found also in normal-speaking voices, especially in women. However, the effect of glottal closure configuration on vocal efficacy is not sufficiently clarified. The purpose of this study was to determine the effect of glottal closure configuration on singing and speaking voice characteristics. Overall, 520 young female normal-speaking subjects were examined by videostroboscopy for different phonation conditions in the combination of soft, loud, low, and/or high phonation and by voice range profile measurements. According to the videostroboscopic analysis, the subjects were subdivided into four groups: complete closure of the vocal folds already in soft phonation (group 1), closure of the vocal fold with increasing intensity (group 2), persistent closure insufficiencies despite increasing intensity (group 3), and hourglass-shaped closure in subjects with vocal nodules (group 4). Subjects in which the glottal closure could not be evaluated sufficiently were subclassified into group 5 (missing values).

Selected criteria of the singing and speaking voice were evaluated and statistically processed according to the mentioned subclassification. Group 1 reached the highest sound pressure levels (SPLmax), by a significant margin, for the singing voice as well as for the shouting voice. Group 3 showed a limited capacity to increase the intensity of the singing and speaking voice. The results gathered in this study objectify the relationship between insufficient glottal closure and reduced vocal capabilities. As long as no conclusive data on long-term consequences of insufficient glottal closure are available, a prophylactic improvement of the laryngeal situation by voice therapy should be recommended, especially in female professional voice users.


15.
Vocal fold hemorrhage often results in a sudden change in voice quality. Traumatic use of the voice (phonation or singing) is generally thought to be the cause of the vocal fold hemorrhage. The current report reviews three cases in which the traumatic event was crying. In one case, the patient's voice was only used for crying. All three patients were female and all were professional singers. The treatment of these individuals consisted of voice rest and subsequent phonomicrosurgery for lesions associated with the vocal fold hemorrhage. These case studies suggest that crying as a traumatic vocal behavior may result in vocal fold hemorrhage.

16.
Thyroplasty type I is one of several surgical treatments whose ultimate objective is to improve the voice of patients with unilateral vocal fold paralysis. The goal of the surgery is the medialization of the paralyzed vocal fold. The purpose of this study is to evaluate the effectiveness of thyroplasty type I through acoustical analysis, aerodynamic measures, and quantitative videostroboscopic measurements. We report on 20 patients with unilateral vocal cord paralysis who underwent thyroplasty type I. We performed preoperative and postoperative video image analysis (normalized glottal gap area) and computer-assisted voice analysis (fundamental frequency, jitter, shimmer, noise-to-harmonic ratio, mean phonation time, mean flow rate, mean subglottic pressure) in all patients. The glottal gap was significantly reduced after thyroplasty type I. Postoperative voice quality was characterized by improved pitch and amplitude perturbation (jitter and shimmer), phonation time (mean phonation time), and subglottic pressure (mean subglottic pressure). Thyroplasty type I is an effective method for regaining glottal closure and vocal function.

17.
A hypophonic voice, characterized perceptually as weak and breathy, is associated with voice disorders such as vocal fold atrophy and unilateral vocal fold paralysis. Although voice therapy programs for hypophonia typically address the vocal folds or the sound source, twang voice quality was examined in this study as an alternative technique for increasing vocal power by altering the epilarynx or the sound filter. OBJECTIVE: This study investigated the effect of twang production on physiologic, acoustic, and perceived voice handicap measures in speakers with hypophonia. DESIGN/METHODS: This prospective pilot study compared the vocal outcomes of six participants with hypophonia at pre- and posttreatment time points. Outcome measures included mean airflow rate, intensity in dB sound pressure level (SPL), maximum phonation time, and self-report of voice handicap. RESULTS: All subjects improved in at least three of the four vocal outcome measures. Wilcoxon signed-rank test of paired differences revealed significant differences between pre- and posttherapy group means for airflow rate, SPL, and Voice Handicap Index scores. CONCLUSION: The twang voice quality as a manipulation of the sound filter offers a clinical complement to traditional voice therapies that primarily address the sound source.

18.
This study documents the vocal characteristics of an actor before and after a series of eight performances involving extended voice use. The hypothesis was that this type of extended voice use would result in symptoms of vocal abuse and that damage to the actor's voice would be evident in measures made after the performance series. Three pre-performance and three post-performance speech samples were gathered and analyzed using the CSL and Visipitch II. Measurements taken included maximum phonational range; maximum sustained phonation; fundamental frequency during reading; maximum intensity levels; sound pressure levels for soft, moderate, and loud productions of sustained /a/; and perturbation including jitter, shimmer, harmonics-to-noise ratio, and an s/z ratio. Pre- and post-performance samples of the “Rainbow passage” and sustained vowel phonation were rated by a group of blinded listeners that included professional voice trainers and speech pathologists. In addition, sample lines from the performance were played for the listeners to judge whether this technique would result in symptoms of vocal abuse. Eleven out of 12 professional voice trainers rated that this technique would result in symptoms of vocal abuse. The data revealed post-performance improvement in phonational range, maximum intensity levels, perturbation measures, and s/z ratio. Measures of maximum sustained phonation, fundamental frequency, and sound pressure levels remained stable. Videoendoscopy revealed normal function of the larynx and vocal folds.

19.
For years, otolaryngologists and voice therapists have warned voice patients that whispering causes more trauma to the larynx than normal speech. However, no large series of patients has ever been examined fiberoptically during whispering to test this hypothesis. As part of our routine examination, patients are asked to count from 1 to 10 in a normal voice and in a whispered voice. We reviewed recorded fiberoptic examinations of 100 patients who had voice complaints. We compared supraglottic hyperfunction and vocal fold closure during the normal and whispered phonation of each patient. Sixty-nine percent of the patients demonstrated increased supraglottic hyperfunction with whispered voice. Eighteen percent had no change, and 13% had less severe hyperfunction. The most common glottal configuration during whisper was an inverted Y, which resulted from compression of the anterior and middle thirds of the true vocal folds. However, 12 patients had no true vocal fold contact during whispered voice, despite having adequate glottic closure with normal voice. Although whispering involves more severe hyperfunction in most patients, it does not seem to do so in all patients. In some patients, it may be less traumatic than normal voice.

20.
Aerodynamic measurements of patients with Parkinson's disease   Total citations: 2 (self-citations: 0; citations by others: 2)
Patients with Parkinson's disease commonly complain of voice dysfunction. Most of these complaints can be attributed to the known muscular control disorders that occur with Parkinson's disease. However, the manifestations of Parkinson's disease muscular dysfunction on parameters of phonation such as airflow, laryngeal resistance, and subglottal pressure necessary to sustain phonation have not been reported. The purpose of this study was to examine the aerodynamic characteristics of flow, laryngeal resistance, and phonation threshold pressure in a heterogeneous population of patients with Parkinson's disease who had varying voice complaints and to compare the data to similar studies for human subjects who have no voice complaints. The studies used a noninvasive method of detecting flow and acoustic signal from the lips, oral cavity and nose during phonation and used an external flow interruption technique to estimate subglottal pressure and phonation threshold pressure. About one third of the patients could not produce phonation at regular and loud intensities that were comfortable for normal subjects. The mean subglottal pressure (SGP) of patients with Parkinson's disease who could produce 3 levels of intensity comparable to normal subjects was significantly higher than the mean SGP for normal subjects for the same intensities of vocal production. The mean flow rates measured from patients with Parkinson's disease at the same 3 intensities of phonation were not significantly greater than in normal subjects. This indicated that the mean laryngeal resistance calculated for patients with Parkinson's disease was notably and significantly greater than mean laryngeal resistance calculated for normal subjects at the same intensity levels. The mean vocal efficiency (VE) for normal subjects was not significantly different than the mean VE for patients with Parkinson's disease, because greater pressure was used to generate similar flow and acoustic energy. These findings correlate with the perception of patients with Parkinson's disease that they are working harder to produce phonation. The observation of notably greater laryngeal resistance and phonation threshold pressure in patients with Parkinson's disease suggests that further studies of the glottic aperture in patients with Parkinson's disease may be useful for understanding how this common motor disorder disturbs phonation.
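
The aerodynamic quantities discussed in this abstract can be related with a short sketch. The abstract does not state its formulas, so the conventional definitions are assumed here: laryngeal resistance as subglottal pressure divided by mean airflow, aerodynamic power as pressure times flow, and vocal efficiency as radiated acoustic power divided by aerodynamic power. Function names and all numeric values are hypothetical and are not data from the study.

```python
CMH2O_TO_PA = 98.0665        # 1 cmH2O expressed in pascals
LITRE_TO_M3 = 1e-3           # 1 litre expressed in cubic metres


def laryngeal_resistance(pressure_cmh2o, flow_l_per_s):
    """Assumed definition: subglottal pressure / mean flow, in cmH2O/(L/s)."""
    return pressure_cmh2o / flow_l_per_s


def aerodynamic_power_w(pressure_cmh2o, flow_l_per_s):
    """Aerodynamic power in watts: pressure (Pa) times flow (m^3/s)."""
    return (pressure_cmh2o * CMH2O_TO_PA) * (flow_l_per_s * LITRE_TO_M3)


def vocal_efficiency(acoustic_power_w, pressure_cmh2o, flow_l_per_s):
    """Assumed definition: acoustic power / aerodynamic power (dimensionless)."""
    return acoustic_power_w / aerodynamic_power_w(pressure_cmh2o, flow_l_per_s)


# Hypothetical comparison at equal flow: a higher subglottal pressure
# gives a proportionally higher laryngeal resistance.
r_control = laryngeal_resistance(8.0, 0.2)    # 8 cmH2O at 0.2 L/s
r_patient = laryngeal_resistance(12.0, 0.2)   # 12 cmH2O at 0.2 L/s
```

Under these assumed definitions, similar flow combined with higher pressure necessarily yields higher resistance, which matches the direction of the resistance finding reported above.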


Copyright © Beijing Qinyun Technology Development Co., Ltd. (北京勤云科技发展有限公司)  京ICP备09084417号