Similar literature
 Found 20 similar documents (search time: 15 ms)
1.
Four acoustic measures were taken from voice recordings of 121 children between the ages of 64 and 134 months. Acoustic parameters were measured on a sustained neutral /a/ vowel, produced imitatively at normal pitch and loudness. Samples were analyzed for fundamental frequency, jitter, shimmer, and signal-to-noise ratio (SNR). Data are presented to characterize effects of age, sex, height, and weight on these acoustic parameters. Results indicate statistically significant relationships between frequency and sex, with higher frequencies for girls. Also significant is a positive relationship between shimmer and height and a negative relationship between SNR and height.
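
The perturbation measures named in this abstract can be illustrated with a short Python sketch. It assumes the common "local" definition of jitter and shimmer (mean absolute difference between consecutive cycles, divided by the mean value, in percent). The abstract does not say which definition or analysis software the study used, so the function names below are hypothetical, and extraction of cycle periods and peak amplitudes from the waveform is assumed to have been done already.

```python
def local_perturbation(values):
    """Mean absolute cycle-to-cycle difference relative to the mean value, in percent."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


def jitter_percent(periods):
    """Local jitter from a list of cycle periods (assumed definition)."""
    return local_perturbation(periods)


def shimmer_percent(amplitudes):
    """Local shimmer from a list of per-cycle peak amplitudes (assumed definition)."""
    return local_perturbation(amplitudes)
```

A perfectly periodic signal gives 0 for both measures; values grow as consecutive cycles differ more in length or amplitude.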

2.
The effects of practice on jitter and shimmer were assessed in two groups of normally speaking women. Subjects in both groups sustained trials of /a/ as steadily as possible during a baseline session, two practice sessions, and a transfer session. Subjects in one group received visual and verbal feedback during the practice sessions. Subjects in the other group received no feedback. Shimmer means remained essentially stable over the four sessions for both groups, and no differences were apparent between the groups. Jitter values were significantly different between sessions for both groups, and between the two groups for the practice sessions. These results are consistent with findings from manual performance and retention tasks. The present findings also support a recently developed neurologic model of jitter.

3.
The singing power ratio (SPR) is an objective means of quantifying the singer's formant. SPR has been shown to differentiate trained singers from nonsingers and sung from spoken tones. This study was designed to evaluate SPR and acoustic parameters in singing students to determine whether the singer-in-training has an identifiable difference between sung and spoken voices. Digital audio recordings were made of both sung and spoken vowel sounds in 55 singing students for acoustic analysis. SPR values were not significantly different between the sung and spoken samples. Shimmer and noise-to-harmonic ratio were significantly higher in spoken samples. SPR analysis may provide an objective tool for monitoring the student's progress.

4.
The purpose of this study was to examine the influence of noise on voice profile statistics from female samples. Six young adult females served as subjects. Five had normal voices; one had a pathological voice with accompanying bilateral vocal nodules. Each female subject was required to match a generated 235 Hz tone (+/- 2 Hz) while maintaining a constant output level of 70 dB SPL (+/- 5 dB). Data collected from a previous study involving a normal male subject were included for comparative purposes. Noise was generated from a personal computer fan which had a strong center frequency component at 235 Hz. Six different A-weighted signal-to-noise [S/N(A)] conditions were created, ranging in 5 dB increments from 25 to 0 dB. Results revealed that fundamental frequency was reasonably resistant to the effects of noise and to the effects of the noisy (pathological) voice signal. Jitter and shimmer estimates generally increased as noise floors elevated. The greatest amount of measurement error was found for the pathological female voice when captured in the presence of environmental noise. Findings are discussed relative to clinical issues surrounding measurement error.
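
The six signal-to-noise conditions described in this abstract (25 to 0 dB in 5 dB steps) can be sketched digitally as follows. This is only an illustration under stated assumptions: the study mixed acoustic fan noise with live voice and used A-weighted levels, whereas the sketch scales a noise sequence against a signal using plain, unweighted RMS. The names `scale_noise_to_snr`, `snr_db`, and `SNR_CONDITIONS_DB` are hypothetical.

```python
import math

# The six conditions from the abstract: 25, 20, 15, 10, 5, 0 dB.
SNR_CONDITIONS_DB = list(range(25, -1, -5))


def rms(samples):
    """Root-mean-square level of a sample sequence."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def snr_db(signal, noise):
    """Unweighted SNR in dB (A-weighting is deliberately omitted in this sketch)."""
    return 20.0 * math.log10(rms(signal) / rms(noise))


def scale_noise_to_snr(signal, noise, target_db):
    """Return a scaled copy of `noise` so that snr_db(signal, result) equals target_db."""
    noise_rms = rms(noise)
    if noise_rms == 0.0:
        raise ValueError("noise must not be silent")
    target_noise_rms = rms(signal) / (10.0 ** (target_db / 20.0))
    gain = target_noise_rms / noise_rms
    return [gain * v for v in noise]
```

Adding the scaled noise to the signal sample by sample would then give one noisy recording per condition, on which the perturbation measures could be recomputed.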

5.
Recent advances in the diagnosis and treatment of voice disorders call for accurate and reliable objective voice measurements. There are many instruments commonly used to analyze voice data. Many, if not most, of these instruments have not been adequately tested for reliability or consistency. This study evaluates the intrasubject variability of the objective voice measurements from two commonly used voice analysis instruments. The study also presents data correlating subjective mood states, room temperatures, sleep times of the subject, time since last meal, and hydration levels to the various acoustic measures. Several weak but significant correlations were obtained and are discussed. Guidelines for the appropriate use of these instruments are described.

6.
The need for standardization of procedures in approaches to voice measurement has been recently emphasized. The purpose of this study was to determine the extent to which the acoustic perturbation measurements from three different analysis systems agree when standardized recording and analysis procedures are used. High-quality acoustic voice recordings from 20 patients were analyzed. The results showed that, although fundamental frequency measurements were in strong agreement among the three systems tested, frequency and amplitude perturbation measurements were not in agreement. The underlying approaches to perturbation measurement appeared to be sufficiently different to produce different results. An argument is made for a standardized set of acoustic signals representing normal, dysphonic, and synthesized voices with known characteristics to facilitate testing of new acoustic analysis systems and confirm measurement accuracy and sensitivity.

7.
Head extension with protruded tongue is the position for video-laryngoscopy and simultaneous glottographic recordings including photoglottographic signals. This study investigated the effect of head extension and tongue protrusion on the measures of fundamental frequency, frequency perturbation (jitter), and amplitude perturbation (shimmer). Acoustic signals recorded during sustained vowels were obtained from 49 women and 66 men with no speech or voice disorders in different head-tongue positions. Head extension was associated with increased fundamental frequency and decreased shimmer. In men, head extension did not appear to affect jitter. When the tongue was protruded, head extension tended to lower jitter. For both genders, tongue protrusion was associated with decreased fundamental frequency with head extension. In the men, tongue protrusion tended to increase shimmer when the head was in the neutral position. In the women, tongue protrusion was associated with increased jitter and increased shimmer and was most evident in the head-neutral position. These findings supported a physical linkage hypothesis of the relationship between vocal tract configuration and vocal fold vibration, suggesting that head-tongue position must be taken into account when comparing voice measures.

8.
Vocal perturbation, harmonics-to-noise, and intensity measures were obtained for 10 subjects during three experimental tasks: (a) prolonged /a/, (b) /pa/ with vowel prolonged, and (c) same as (b) with subjects wearing a pneumotachographic mask and oral pressure tube inserted between the lips. There were no statistically significant differences among the experimental conditions for any of the measures. The findings suggest that a single task may be used to obtain airflow, oral pressure, and acoustic measures of vocal performance. Observed differences in jitter and harmonics-to-noise means for the male and female speakers are discussed.

9.
Lidar is widely used to monitor meteorological parameters and atmospheric constituents. Applications include meteorology, environmental pollution, atmospheric dynamics and global climate change. Signal processing for lidar applications involves highly nonlinear models and consequently nonlinear filtering. In this paper, we apply a new method, empirical mode decomposition, to lidar signal processing. Denoising is done by removing the appropriate intrinsic mode functions. Data from simulation and measurements are analyzed to evaluate this method in comparison with the traditional low-pass filter and multi-pulse averaging. Results show that it is effective and superior to the band-pass filter and the averaging method. The denoising method also allows fewer laser shots to be averaged, which is important for real-time monitoring and for low-cost laser transmitters.
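
For orientation, the two baseline techniques this abstract compares against can be sketched in a few lines of Python. This is not the empirical mode decomposition method itself, which the abstract does not describe in enough detail to reproduce. The moving average below is only a stand-in for the unspecified low-pass filter, and both function names are hypothetical. For independent zero-mean noise, averaging N shots reduces the noise standard deviation by a factor of sqrt(N), which is why a method needing fewer shots matters for real-time monitoring.

```python
def average_shots(shots):
    """Multi-pulse averaging: point-by-point mean over equally long lidar returns."""
    if not shots:
        raise ValueError("need at least one shot")
    length = len(shots[0])
    if any(len(shot) != length for shot in shots):
        raise ValueError("all shots must have the same length")
    return [sum(column) / len(shots) for column in zip(*shots)]


def moving_average(profile, window):
    """Simple low-pass stand-in: mean over a sliding window along one return."""
    if window < 1 or window > len(profile):
        raise ValueError("window must be between 1 and len(profile)")
    return [
        sum(profile[i:i + window]) / window
        for i in range(len(profile) - window + 1)
    ]
```

The first function trades time resolution for noise reduction, the second trades range resolution; a decomposition-based method aims to avoid part of both costs.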

10.
11.
Accuracy of acoustic voice analysis is influenced by the quality of recording. Lately, articles have suggested that soundcards perform equivalently to specialized professional-grade data acquisition (DA) systems. The purpose of this study was to investigate the influence of DA environment (DA system and microphone) on acoustic voice quality measurement (VQM) while balancing for gender, age, intersubject and intrasubject variability, and analysis software. More specifically, the relative performance of different hardware environments and the relationship between their technical characteristics and VQM performance was investigated. The discretization error and the effective dynamic range of the different DA environments were measured. We used 3 software systems to record and measure separately 2000 acoustic samples of sustained phonation for fundamental frequency, jitter, and shimmer. Analyses of variance (ANOVA) were performed with these parameters as the dependent variables. The results of the study suggested that professional-grade DA hardware is strongly recommended to provide accurate and valid voice assessment. The fundamental frequency measurement differences across DA environments were highly correlated to the discretization error (r=1.00), whereas jitter and shimmer were highly correlated to the effective dynamic range of the DA environments (r=-0.68 and r=-0.86, respectively).

12.
Alteration of the harmonic structure in voice source spectra, taken over at least two periods of the waveform, may occur due to the presence of fundamental frequency (f0) perturbation, amplitude perturbation, additive noise, or changes within the glottal source signal itself. In order to make accurate inferences regarding glottal-flow dynamics or perceptual evaluations based on spectral measurements taken from the acoustic speech waveform, investigation of the spectral features of each aperiodic component is required. Based on a heuristic development involving a consideration of the partial sum of the Fourier series taken for two periods of a jittered, shimmered, and (additive, random) noise-contaminated signal, the corresponding spectral characteristics are hypothesized. Subsequent to this, the Fourier series coefficients are calculated for the two periods in order to test the hypotheses. Definite spectral differences are found for each aperiodic component; based on these findings differential quantitative spectral measurements are suggested. Further supportive evidence is obtained through use of Fourier transform and periodogram-averaged calculations. The analysis is carried out on synthetically generated glottal-pulse waveforms and on radiated speech waveforms. A discussion of the results is given in terms of voice aperiodicity in general and in terms of their implication for future studies involving human voice signals.

13.
The effect of field and medium asymmetry on the improvement of signal-to-noise ratio (SNR) with the homodyne detection technique is analysed theoretically. The analysis is applied to a sample of bulk GaAs irradiated by an off-resonant nanosecond pulsed laser. The dependence of SNR on electric field amplitude and local oscillator (LO) phase is examined. It is found that SNR is maximum for the combination of asymmetric field and asymmetric medium.

14.
Previous studies have shown that trained listeners are highly reliable in making perceptual judgments of several parameters of normal and pathologic voices. This study investigated objective measures of acoustic characteristics of high and low preference voices as determined by a previous perceptual study. Four acoustic parameters were measured: harmonics-to-noise ratio, autocorrelation function, average jitter, and the standard deviation of the fundamental frequency. Useful correlations between perceptual and measured results were identified. Normal voices differ from pathologic voices in terms of the acoustic-perceptual relationships.

15.
We examine the calculated signal-to-noise ratio (SNR) achievable with different MRI detection modalities in precession fields ranging from 10 μT to 1.5 T. In particular, we compare traditional Faraday detectors with both tuned and untuned detectors based on superconducting quantum interference devices (SQUIDs). We derive general expressions for the magnetic field noise due to the samples and the detectors, and then calculate the SNR achievable for a specific geometry with each modality with and without prepolarization. We show that each of the three modalities is superior in one of the three field ranges. SQUID-based detection is superior to conventional Faraday detection for MRI in precession fields below 250 mT for a 65 mm diameter surface coil placed a distance of 25 mm from the voxel of interest embedded in a cylinder of tissue 50 mm tall and of radius 50 mm. This crossover field, however, is sensitive to the geometry.

16.
A procedure is developed to quantify and improve the signal-to-noise ratio (SNR) of magnetic resonance images. The image SNR is quantified using the correlation function of two independent acquisitions of an image. To test the performance of the quantification, SNR measurement data are fitted to theoretically expected curves. The proposed correlation technique is also used to improve the SNR by estimating the amplitude of the signal spectrum. The technique is applied to a set of MR images, and its performance in terms of gain in SNR, contrast-to-noise ratio (CNR), and resolution loss is compared to that of classical noise filters. The SNR as well as the CNR is improved significantly with minor loss of resolution. Finally, it is shown that the correlation technique can be implemented in a highly efficient way in almost any acquisition procedure of a magnetic resonance imaging system.
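
The idea of estimating SNR from two independent acquisitions can be sketched with a textbook estimator; the abstract does not give the authors' exact formula, so this is an assumption rather than their procedure. If both images contain the same signal plus independent, zero-mean noise of equal variance, their Pearson correlation is rho = S / (S + N), where S and N are the signal and noise variances, so the power SNR is rho / (1 - rho). The function names are hypothetical and images are passed as flat lists of pixel values.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equally long pixel lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def snr_from_two_acquisitions(image1, image2):
    """Power SNR estimate rho / (1 - rho), assuming equal, independent noise."""
    rho = pearson(image1, image2)
    if not 0.0 < rho < 1.0:
        raise ValueError("correlation must lie strictly between 0 and 1")
    return rho / (1.0 - rho)
```

Because the estimate needs only a second acquisition and a correlation, it requires no separate noise-only region in the image.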

17.
A stratified random sample of 20 males and 20 females matched for physiological factors and cultural-linguistic markers was examined to determine differences in fundamental frequency and spectral characteristics during prolongation of three vowels: [a], [i], and [u]. The ethnic-gender breakdown included four sets of five male and five female subjects comprising Caucasian and African-American speakers of standard American English, native Hindi Indian speakers, and native Mandarin Chinese speakers. Acoustic measures were analyzed using the Multidimensional Voice Program (Kay Elemetrics, Lincoln Park, NJ) (Model 4305) from which fundamental frequency and associated acoustic spectra were extracted from a 200-ms sample of each vowel token. Statistically significant group differences for the main effects of culture, race, and gender were found. The acoustic differences found are attributed to biomechanical, physiological, cultural, and linguistic factors.

18.
Michihito Ueda. Physica A, 2010, 389(10): 1978-2862
Stochastic resonance (SR) has become a well-known phenomenon that can enhance weak periodic signals with the help of noise. SR is an interesting phenomenon when applied to signal processing. Although it has been proven that SR does not always improve the signal-to-noise ratio (SNR), in a strongly nonlinear system such as a simple threshold system, SR does in fact improve SNR for noisy pulsed signals at appropriate noise strength. However, even in such cases, when noise is weak, the SNR is degraded. Since the noise strength cannot be known in advance, it is difficult to apply SR to real signal processing. In this paper, we focused on the shape of the threshold at which SR did not degrade the SNR when noise was weak. To achieve output change when noise was weak, we numerically analyzed a sigmoid function threshold system. When the slope around the threshold was appropriate, SNR did not degrade when noise was weak and instead was improved at suitable noise strength. We also demonstrated SNR improvement for noisy pulsed voltages using a CMOS inverter, a very common threshold device. The input-output property of a CMOS inverter resembles the sigmoid function. By inputting the noisy signal voltage to a CMOS inverter, we measured the input and output voltages and analyzed the SNRs. The results showed that SNR was effectively improved over a wide range of noise strengths.
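
The contrast this abstract draws between a simple threshold and a sigmoid threshold can be illustrated with a minimal Python sketch. The parameter values and function names are hypothetical, and the paper's SNR definition and noise model are not reproduced here. The point shown is only the one stated in the abstract: with little or no noise, a sub-threshold pulse produces no output change through a hard threshold, but a finite-slope sigmoid still passes some of it.

```python
import math


def hard_threshold(x, threshold):
    """Simple threshold system: output is 1 at or above the threshold, else 0."""
    return 1.0 if x >= threshold else 0.0


def sigmoid_threshold(x, threshold, slope):
    """Sigmoid threshold system; `slope` sets the steepness around the threshold."""
    z = slope * (x - threshold)
    # Two branches keep math.exp from overflowing for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)
```

For a pulse that rises from 0 to 0.3 with the threshold at 0.5, `hard_threshold` returns 0 at both levels, while `sigmoid_threshold` with a slope of 10 changes its output by roughly 0.11. As the slope grows, the sigmoid approaches the hard threshold and that weak-noise response vanishes.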

19.
Fractal geometry, a relatively young branch of mathematics, offers new ways of evaluating the irregularity of the physiologic and acoustic aspects of speech. The validity and reliability of a box-counting method for estimating the fractal dimension (DF) of the period and amplitude of vocal signals were demonstrated, and the method was applied to sustained vowels produced by four men and four women. Mean DF of the fundamental period was 1.46; DF of the amplitude records averaged a slightly, but significantly, higher 1.54. The potential of DF as a research and clinical tool is considered.
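
A box-counting estimate of the kind this abstract mentions can be sketched as follows. The abstract gives no algorithmic details, so the normalization to a unit square, the dyadic grid sizes, and the name `box_counting_dimension` are all assumptions made for illustration. The record (for example, successive periods or amplitudes) is treated as a polyline, the number of grid boxes it touches is counted at several scales, and the dimension is the least-squares slope of log(count) against log(1/box size).

```python
import math


def box_counting_dimension(series, max_level=6):
    """Estimate the box-counting dimension of a sampled curve (illustrative sketch)."""
    n = len(series)
    lo, hi = min(series), max(series)
    if n < 2 or hi == lo:
        raise ValueError("series needs at least two distinct values")
    # Normalize the curve into the unit square.
    pts = [(i / (n - 1), (v - lo) / (hi - lo)) for i, v in enumerate(series)]
    log_scale, log_count = [], []
    for level in range(1, max_level + 1):
        grid = 2 ** level  # boxes per side; box size is 1 / grid
        boxes = set()
        for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
            # Sample each segment densely enough not to skip a box.
            steps = int(2 * grid * max(abs(x1 - x0), abs(y1 - y0))) + 1
            for s in range(steps + 1):
                t = s / steps
                bx = min(int((x0 + t * (x1 - x0)) * grid), grid - 1)
                by = min(int((y0 + t * (y1 - y0)) * grid), grid - 1)
                boxes.add((bx, by))
        log_scale.append(math.log(grid))
        log_count.append(math.log(len(boxes)))
    # Least-squares slope of log(count) versus log(1 / box size).
    mx = sum(log_scale) / len(log_scale)
    my = sum(log_count) / len(log_count)
    num = sum((x - mx) * (y - my) for x, y in zip(log_scale, log_count))
    den = sum((x - mx) ** 2 for x in log_scale)
    return num / den
```

A smooth ramp yields a value near 1 and a record that fills the plane yields a value near 2, so the reported means of 1.46 and 1.54 sit between a smooth line and a space-filling trace.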

20.