首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Public engagement with science, technology, and engineering is seen as being increasingly important as the numbers of school leavers choosing to read for degrees in these areas is typically dropping. Engagement with pupils during their school years is seen as being a key element in influencing their choices of career for which seeds are sown from the primary years. Acoustics is an excellent vehicle for public engagement since the demonstrations can be appreciated directly by the sense of hearing and the underlying principles also apply in many branches of physics and engineering. This paper describes a number of demonstrations that have been employed during science engagement events for schools and the general public in the context of the principles of acoustics and human speech production. The apparatus used, which in some cases has been purpose-built, is described along with the activities themselves. In addition, a way to quantify the success of the process is proposed that involves a single button press on entry to and exit from an event.  相似文献   

2.
3.
To determine whether a correlation exists between the Grade, Roughness, Breathiness, Aesthenia, Strain (GRBAS) scale (a subjective measure of voice) and the Multi-Dimensional Voice Program (MDVP) scale (an objective measure of voice). A retrospective review of 37 voice patients (12 male/25 female) was conducted. Each voice was perceptually evaluated using the GRBAS scale by an experienced speech pathologist and acoustically analyzed using the MDVP scale. Statistical analysis using a multivariate regression model identified a significant correlation between the noise-related parameters of MDVP and the components of the GRBAS scale. Grade correlated with voice turbulence index (VTI), noise harmonic ratio (NHR), and soft phonation index (SPI). Roughness correlated with NHR only. Breathiness correlated with SPI only. Aesthenia also correlated with SPI only. Of the 19 acoustic variables measured by the MDVP system, only three noise parameters significantly correlated with the GRBAS perceptual voice analysis. Perhaps "noise" is the perceived acoustical quality of the dysphonic voice. A voice quantifying measure such as a "voice index score" could be proposed using the GRBAS scoring and the three clinically relevant MDVP values following further studies.  相似文献   

4.
Traditional measurements performed on the acoustic signals of normal speech are frequently used to quantify the acoustic characteristics of disordered speech as well. This letter demonstrates how important aspects of speech production deficits in motor speech disorders may be overlooked if stringent quantification procedures are employed, especially in the stage of exploratory data analysis. It is suggested that qualitative procedures, wherein phenomena are inferred from visual examination of certain acoustic displays, are useful to supplement traditional measurements, and moreover, that they be used to point to the types of measurements that should be made in the finer-grained stages of quantitative analysis.  相似文献   

5.
An important clinical issue concerns the efficacy of current voice therapy approaches in treating voice disorders, such as vocal nodules. Much research focuses on finding reliable methods for documentation of treatment results. In this second treatment study of ten patients with vocal nodules, who participated in a behaviorally based voice therapy program, 11 aerodynamic (transglottal air pressure and glottal waveform) and acoustic (spl, f0, and spectrum slope) measures were used. Three pretherapy baseline assessments were carried out, followed by one assessment after each of five therapy phases. Measurements were made of two types of speech materials: Strings of repeated /pae/ syllables and sustained /ae/ phonations in two loudness conditions: comfortable loudness and loud voice. The data were normalized using z-scores, which were based on data from 22 normal subjects. The results showed that the aerodynamic measures reflected the presence of vocal pathology to a higher degree than did the acoustic spectral measures, and they should be useful in studies comparing nodule and normal voice production. Large individual session-to-session variation was found for all measures across pretherapy baseline recordings, which contributed to nonsignificant differences between baseline and therapy data.  相似文献   

6.
This paper examines how breathing differs in the upright and supine body positions. Passive and active forces and associated chest wall motions are described for resting tidal breathing and speech breathing performed in the two positions. Clinical implications are offered regarding evaluation and treatment of breathing behavior in clients with speech and voice disorders.  相似文献   

7.
8.
The signals of running speech and sustained vowels of normals and subjects suffering from dysphonia were analyzed statistically with respect to the signal-to-noise ratio (SNR). The distribution of the SNR measured in multiple overlapping frames in the speech signal was described by a linear combination of the distribution frequencies for SNR = 0 dB, 0 dB less than SNR less than 15 dB, and SNR greater than or equal to 15 dB. The values of the linear combination, the SNR of the vowels, and clinical assignment of the voices to normal and pathologic populations based on laryngoscopic and stroboscopic investigation parameters were used to compare the different evaluations of the voices. The SNR distribution in speech remained stable over signal lengths of more than 30 s. The correlation coefficient between the SNR measure for running speech and the SNR of sustained vowels amounted to only 0.63. The error rate in the discrimination between normal and dysphonic voices amounted to 22.6% in application to sustained vowels and 5.6% when the SNR distribution was used. Possible reasons for the observed discrepancies are discussed, and the results are compared to those of other studies.  相似文献   

9.
Schlieren photography has been used to analyse quantitatively the acoustic field of an electromagnetic acoustic transducer (EMAT). By measuring the angle through which the rays are refracted it is possible to compute the refractive index gradient and thus determine both the absolute and complex pressure related structures of the images. Using this method, planar and focused shock transients generated by the EMAT have been evaluated and compared with transducer derived pressure measurements. The peak pressure in the unfocused shock was found to be 3.2 MPa and 4.6 MPa for the Schlieren and piezoelectric transducer measurements respectively. Corresponding values for the focused shock-wave agreed to within experimental error at about 19 MPa.  相似文献   

10.
Twenty-four normal adult women read part of the Rainbow Passage and sustained vowels three trials each. Utterances were assessed for selected parameters measured by Visi-Pitch (average and SD of fundamental frequency (F0), average and SD of dBA, perturbation, and percent voiced/unvoiced/pause). Assessment of each parameter included measures of central tendency, dispersion, and distribution characteristics (skewness and kurtosis) of the data and of the ranges of values that would include 95% of the scores (95% fiduciary limits). Generally, differences for the group between the three trials were not significant. Intersubject variability for only a few parameters was less than 20% of the parameter's mean. For vowels, variability of jitter was 30–48% of the mean. Eight subjects provided performances 2 months later to obtain an estimate of intrasubject variability over time. There were desirable intrasubject correlations between performances for mean F0, jitter in reading and on vowels /i/ and /a/, and percent of voicing. Inter- and intrasubject variability seems restricted and the data appear to resemble a normally distributed function for mean F0 on reading, jitter on /i/, and percent of voicing. Thus, these parameters may have statistical merit for use in vocal testing.  相似文献   

11.
Parkinson's disease is a neurological disorder associated with the disfunction of dopaminergic pathways of the basal ganglia, mainly resulting in a progressive alteration in the execution of voluntary movements. We present a functional magnetic resonance imaging (fMRI) study on cortical activations during simple motor task performance, in six early–stage hemiparkinsonian patients and seven healthy volunteers. We acquired data in three sessions, during which subjects performed the task with right or left hand, or bimanually. We observed consistent bilateral activations in cingulate cortex and dorsolateral prefrontal cortex of Parkinsonian subjects during the execution of the task with the affected hand. In addition, patients showed both larger and stronger activations in motor cortex of the affected hemisphere with respect to the healthy hemisphere. Compared with the control group, patients showed a hyperactivation of the dorsolateral prefrontal cortex of the affected hemisphere. We concluded that a presymptomatic reorganization of the motor system is likely to occur in Parkinson's disease at earlier stages than previously hypothesized. Moreover, our results support fMRI as a sensitive technique for revealing the initial involvement of motor cortex areas at the debut of this degenerative disorder.  相似文献   

12.
The goal of cross-language voice conversion is to preserve the speech characteristics of one speaker when that speaker's speech is translated and used to synthesize speech in another language. In this paper, two preliminary studies, i.e., a statistical analysis of spectrum differences in different languages and the first attempt at a cross-language voice conversion, are reported. Speech uttered by a bilingual speaker is analyzed to examine spectrum difference between English and Japanese. Experimental results are (1) the codebook size for mixed speech from English and Japanese should be almost twice the codebook size of either English or Japanese; (2) although many code vectors occurred in both English and Japanese, some have a tendency to predominate in one language or the other; (3) code vectors that predominantly occurred in English are contained in the phonemes /r/, /ae/, /f/, /s/, and code vectors that predominantly occurred in Japanese are contained in /i/, /u/, /N/; and (4) judged from listening tests, listeners cannot reliably indicate the distinction between English speech decoded by a Japanese codebook and English speech decoded by an English codebook. A voice conversion algorithm based on codebook mapping was applied to cross-language voice conversion, and its performance was somewhat less effective than for voice conversion in the same language.  相似文献   

13.
Laryngeal aerodynamic and acoustic characteristics of African American voice production were examined from vowel samples produced by ten adult female and ten adult male speakers. The data were compared with that for a control group consisting of ten adult female and ten adult male White speakers, matched for age, height, and weight. All measures were analyzed using Cspeech 4.0. Aerodynamic measurements, extracted from a glottal airflow waveform, included maximum flow declination rate, alternating glottal airflow, minimum glottal airflow, and airflow open quotient. Acoustic measures included fundamental frequency and sound pressure level. No significant mean differences between the African American and White speakers were found, except for maximum-flow declination rate. The White speakers produced significantly higher declination rates than the African American speakers. The factor of sex for the African American speakers was statistically significant for the measures of maximum-flow declination rate, alternating glottal airflow, open quotient, and fundamental frequency, consistent with the functioning of the White speakers. The results suggest that during vowel production, where the vocal tract is in a fairly static position, acoustic and aerodynamic characteristics for African American and White Speakers are comparable.  相似文献   

14.
This paper describes acoustic cues for classification of consonant voicing in a distinctive feature-based speech recognition system. Initial acoustic cues are selected by studying consonant production mechanisms. Spectral representations, band-limited energies, and correlation values, along with Mel-frequency cepstral coefficients features (MFCCs) are also examined. Analysis of variance is performed to assess relative significance of features. Overall, 82.2%, 80.6%, and 78.4% classification rates are obtained on the TIMIT database for stops, fricatives, and affricates, respectively. Combining acoustic parameters with MFCCs shows performance improvement in all cases. Also, performance in the NTIMIT telephone channel speech shows that acoustic parameters are more robust than MFCCs.  相似文献   

15.
16.
Accented speech recognition is more challenging than standard speech recognition due to the effects of phonetic and acoustic confusions. Phonetic confusion in accented speech occurs when an expected phone is pronounced as a different one, which leads to erroneous recognition. Acoustic confusion occurs when the pronounced phone is found to lie acoustically between two baseform models and can be equally recognized as either one. We propose that it is necessary to analyze and model these confusions separately in order to improve accented speech recognition without degrading standard speech recognition. Since low phonetic confusion units in accented speech do not give rise to automatic speech recognition errors, we focus on analyzing and reducing phonetic and acoustic confusability under high phonetic confusion conditions. We propose using likelihood ratio test to measure phonetic confusion, and asymmetric acoustic distance to measure acoustic confusion. Only accent-specific phonetic units with low acoustic confusion are used in an augmented pronunciation dictionary, while phonetic units with high acoustic confusion are reconstructed using decision tree merging. Experimental results show that our approach is effective and superior to methods modeling phonetic confusion or acoustic confusion alone in accented speech, with a significant 5.7% absolute WER reduction, without degrading standard speech recognition.  相似文献   

17.
18.
19.
Spectral estimation based on acoustic backscatter from a motionless stochastic medium is described for characterization of aberration in ultrasonic imaging. The underlying assumptions for the estimation are: The correlation length of the medium is short compared to the length of the transmitted acoustic pulse, an isoplanatic region of sufficient size exists around the focal point, and the backscatter can be modeled as an ergodic stochastic process. The motivation for this work is ultrasonic imaging with aberration correction. Measurements were performed using a two-dimensional array system with 80 x 80 transducer elements and an element pitch of 0.6 mm. The f number for the measurements was 1.2 and the center frequency was 3.0 MHz with a 53% bandwidth. Relative phase of aberration was extracted from estimated cross spectra using a robust least-mean-square-error method based on an orthogonal expansion of the phase differences of neighboring wave forms as a function of frequency. Estimates of cross-spectrum phase from measurements of random scattering through a tissue-mimicking aberrator have confidence bands approximately +/- 5 degrees wide. Both phase and magnitude are in good agreement with a reference characterization obtained from a point scatterer.  相似文献   

20.
Although laser surgery has been widely advocated for use in the treatment of vocal fold papilloma because it does not incur bleeding, it has been questioned for use in treating Reinke's edema due to the possibility of heat dispersion to normal surrounding tissue and of scarring. We present a series of 8 cases in which laser surgery was the method of treatment for bilateral Reinke's edema. In each case, voice therapy was selected as the initial treatment; laser surgery was performed following voice therapy. Prior to and following surgery, videostroboscopic examinations were performed on the subjects. Only 4 subjects were available for assessment at the 1-month postoperative period. From the audio track of the videotape, the speaking fundamental frequency, perturbation measures for the vowel /i/, and noise-to-harmonic ratio of a completely voiced sentence were obtained. From the videostroboscopic recordings, the symmetry of the vocal folds, the presence or absence of the mucosal wave and the glottic closure pattern, prior to and after surgery, were judged independently by 3 examiners. The fundamental frequencies approximated the normal male and female ranges for those subjects seen 1 month after surgery. In addition, the noise-to-harmonic ratio and the relative average perturbation improved. Stroboscopy revealed irregularities in the symmetry of vocal folds, mucosal wave, and glottic closure 1 month after surgery.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号