共查询到20条相似文献,搜索用时 15 毫秒
1.
Inga M. Arkhipova Angelo Luongo Alexander P. Seyranian 《Journal of sound and vibration》2012,331(2):457-469
The phenomenon of stabilization by parametric excitation of an unstable, elastically restrained double inverted pendulum under its own weight is addressed. The solution is pursued by the Multiple Scale Method, as a perturbation of a critical Hamiltonian system, possessing a zero- and a real frequency. Several asymptotic expansions are carried out, which are able to capture the long-term behavior of the system, for generic (non-resonant) values of the excitation frequency, and some special (resonant) values of excitation-to-natural frequency ratio. It is shown that a proper ordering of the control parameters must be performed, and proper use of integer or fractional power expansions must be made, according to the resonance under study. In particular, a non-standard application of the Multiple Scale Method is illustrated for the 1:1 resonant case, requiring fractional powers and accounting for the ‘arbitrary constants’, generally omitted in regular cases. A comprehensive scenario of the stabilization regions is given in which lower-bound as well as upper-bound curves are evaluated, thus integrating results that recently appeared in the literature. 相似文献
2.
Latika Singh 《Applied Acoustics》2007,68(3):260-269
It has been suggested that pauses between words could act as indices of processes such as selection, retrieval or planning that are required before an utterance is articulated. For normal meaningful phrase utterances, there is hardly any information regarding the relationship between articulation and pause duration and their subsequent relation to the final phrase duration. Such associations could provide insights into the mechanisms underlying the planning and execution of a vocal utterance. To execute a fluent vocal utterance, children might adopt different strategies in development. We investigate this hypothesis by examining the roles of articulation time and pause duration in meaningful phrase utterances in 46 children between the ages of 4 and 8 years, learning English as a second language.Our results indicate a significant reduction in phrase, word and interword pause duration with increasing age. A comparison of pause, word and phrase duration for individual subjects belonging to different age groups indicates a changing relationship between pause and word duration for the production of fluent speech. For the youngest children, a strong correlation between pause and word duration indicates local planning at word level for speech production and thus greater dependence of pause on immediate word utterance. In contrast for the oldest children we find a significant drop in correlation between word and pause indicating the emergence of articulation and pause planning as two independent processes directed at producing a fluent utterance. Strong correlations between other temporal parameters indicate a more holistic approach being adopted by the older children for language production. 相似文献
3.
N Tye-Murray 《The Journal of the Acoustical Society of America》1986,79(4):1169-1171
The question of whether visual information can affect ongoing speech production arises from numerous studies demonstrating an interaction between auditory and visual information during speech perception. In a preliminary study, the effect of delayed visual feedback on speech production was examined. Two of the 13 subjects demonstrated speech errors that were directly related to the delayed visual signal. However, in the main experiment, providing immediate visual feedback of the articulators did not diminish the effects of delayed auditory feedback for 11 speakers. 相似文献
4.
Speech intelligibility is known to be relatively unaffected by certain deformations of the acoustic spectrum. These include translations, stretching or contracting dilations, and shearing of the spectrum (represented along the logarithmic frequency axis). It is argued here that such robustness reflects a synergy between vocal production and auditory perception. Thus, on the one hand, it is shown that these spectral distortions are produced by common and unavoidable variations among different speakers pertaining to the length, cross-sectional profile, and losses of their vocal tracts. On the other hand, it is argued that these spectral changes leave the auditory cortical representation of the spectrum largely unchanged except for translations along one of its representational axes. These assertions are supported by analyses of production and perception models. On the production side, a simplified sinusoidal model of the vocal tract is developed which analytically relates a few "articulatory" parameters, such as the extent and location of the vocal tract constriction, to the spectral peaks of the acoustic spectra synthesized from it. The model is evaluated by comparing the identification of synthesized sustained vowels to labeled natural vowels extracted from the TIMIT corpus. On the perception side a "multiscale" model of sound processing is utilized to elucidate the effects of the deformations on the representation of the acoustic spectrum in the primary auditory cortex. Finally, the implications of these results for the perception of generally identifiable classes of sound sources beyond the specific case of speech and the vocal tract are discussed. 相似文献
5.
Speech production knowledge in automatic speech recognition 总被引:1,自引:0,他引:1
King S Frankel J Livescu K McDermott E Richmond K Wester M 《The Journal of the Acoustical Society of America》2007,121(2):723-742
6.
7.
Pressure measurements during speech production using semiconductor miniature pressure transducers: impact on models for speech production 总被引:1,自引:0,他引:1
It appears that temperature instabilities are a major obstacle hindering the use of semiconductor strain gauge pressure transducers in speech research, especially when absolute pressure data are mandatory. In this paper a simple and reliable method for an in vivo calibration of this kind of transducer is described. The most important error source, the drift of the zero pressure level due to temperature changes, is discussed, and an estimation of the measurement accuracy which can be obtained is given. Moreover, some registrations of subglottal, supraglottal, and transglottal pressure are presented. It is shown that the pressure recordings allow us to obtain estimates of the volume flow in the trachea and pharynx. Analysis of those waveforms appears to lead to new insights into the physical processes underlying voice production. Specifically, an independent glottal contribution to the skewing of the glottal flow pulses is identified. 相似文献
8.
Modeling the peripheral speech motor system can advance the understanding of speech motor control and audiovisual speech perception. A 3-D physical model of the human face is presented. The model represents the soft tissue biomechanics with a multilayer deformable mesh. The mesh is controlled by a set of modeled facial muscles which uses a standard Hill-type representation of muscle dynamics. In a test of the model, recorded intramuscular electromyography (EMG) was used to activate the modeled muscles and the kinematics of the mesh was compared with 3-D kinematics recorded with OPTOTRAK. Overall, there was a good match between the recorded data and the model's movements. Animations of the model are provided as MPEG movies. 相似文献
9.
W V Summers D B Pisoni R H Bernacki R I Pedlow M A Stokes 《The Journal of the Acoustical Society of America》1988,84(3):917-928
Acoustical analyses were carried out on a set of utterances produced by two male speakers talking in quiet and in 80, 90, and 100 dB SPL of masking noise. In addition to replicating previous studies demonstrating increases in amplitude, duration, and vocal pitch while talking in noise, these analyses also found reliable differences in the formant frequencies and short-term spectra of vowels. Perceptual experiments were also conducted to assess the intelligibility of utterances produced in quiet and in noise when they were presented at equal S/N ratios for identification. In each experiment, utterances originally produced in noise were found to be more intelligible than utterances produced in the quiet. The results of the acoustic analyses showed clear and consistent differences in the acoustic-phonetic characteristics of speech produced in quiet versus noisy environments. Moreover, these accounts differences produced reliable effects on intelligibility. The findings are discussed in terms of: (1) the nature of the acoustic changes that taken place when speakers produce speech under adverse conditions such as noise, psychological stress, or high cognitive load: (2) the role of training and feedback in controlling and modifying a talker's speech to improve performance of current speech recognizers; and (3) the development of robust algorithms for recognition of speech in noise. 相似文献
10.
Howard DM 《The Journal of the Acoustical Society of America》2012,131(3):2405-2412
Public engagement with science, technology, and engineering is seen as being increasingly important as the numbers of school leavers choosing to read for degrees in these areas is typically dropping. Engagement with pupils during their school years is seen as being a key element in influencing their choices of career for which seeds are sown from the primary years. Acoustics is an excellent vehicle for public engagement since the demonstrations can be appreciated directly by the sense of hearing and the underlying principles also apply in many branches of physics and engineering. This paper describes a number of demonstrations that have been employed during science engagement events for schools and the general public in the context of the principles of acoustics and human speech production. The apparatus used, which in some cases has been purpose-built, is described along with the activities themselves. In addition, a way to quantify the success of the process is proposed that involves a single button press on entry to and exit from an event. 相似文献
11.
Monica A. McHenry Samuel T. Kuna John T. Minton Christi R. Vanoye 《Journal of voice》1996,10(3):236-244
Direct measures of subglottal pressure obtained through a tracheal puncture were used to calculate laryngeal airway resistance. Six subjects completed tasks including syllable trains and more natural speech samples produced at three loudness levels. Direct calculations of natural speech resistance values were compared with indirect estimates obtained during syllable train production. The degree of correspondence between direct and indirect calculations varied by subject. Overall, the smallest relative errors among calculations occurred for syllable trains, with higher relative errors for the monologue and sentence. For loudness conditions, the smallest and largest relative errors occurred for soft and loud productions, respectively. The clinical utility of indirect estimation is questioned and suggestions for improving its validity are provided. 相似文献
12.
Newman RS 《The Journal of the Acoustical Society of America》2003,113(5):2850-2860
This paper examines whether correlations between speech perception and speech production exist, and, if so, whether they might provide a way of evaluating different acoustic metrics. The cues listeners use for many phonemic distinctions are not known, often because many different acoustic cues are highly correlated with one another, making it difficult to distinguish among them. Perception-production correlations may provide a new means of doing so. In the present paper, correlations were examined between acoustic measures taken on listeners' perceptual prototypes for a given speech category and on their average production of members of that category. Significant correlations were found for VOT among stop consonants, and for spectral peaks (but not centroids or skewness) for voiceless fricatives. These results suggest that correlations between speech perception and production may provide a methodology for evaluating different proposed acoustic metrics. 相似文献
13.
Japanese 5- to 13-yr-olds who used cochlear implants (CIs) and a comparison group of normally hearing (NH) Japanese children were tested on their perception and production of speech prosody. For the perception task, they were required to judge whether semantically neutral utterances that were normalized for amplitude were spoken in a happy, sad, or angry manner. The performance of NH children was error-free. By contrast, child CI users performed well below ceiling but above chance levels on happy- and sad-sounding utterances but not on angry-sounding utterances. For the production task, children were required to imitate stereotyped Japanese utterances expressing disappointment and surprise as well as culturally typically representations of crow and cat sounds. NH 5- and 6-year-olds produced significantly poorer imitations than older hearing children, but age was unrelated to the imitation quality of child CI users. Overall, child CI user's imitations were significantly poorer than those of NH children, but they did not differ significantly from the imitations of the youngest NH group. Moreover, there was a robust correlation between the performance of child CI users on the perception and production tasks; this implies that difficulties with prosodic perception underlie their difficulties with prosodic imitation. 相似文献
14.
A model is presented which predicts the movements of flesh points on the tongue, lips, and jaw during speech production, from time-aligned phonetic strings. Starting from a database of x-ray articulator trajectories, means and variances of articulator positions and curvatures at the midpoints of phonemes are extracted from the data set. During prediction, the amount of articulatory effort required in a particular phonetic context is estimated from the relative local curvature of the articulator trajectory concerned. Correlations between position and curvature are used to directly predict variations from mean articulator positions due to coarticulatory effects. Use of the explicit coarticulation model yields a significant increase in articulatory modeling accuracy with respect to x-ray traces, as compared with the use of mean articulator positions alone. 相似文献
15.
Effects of speaking rate on tongue position and velocity of movement in vowel production 总被引:1,自引:0,他引:1
J E Flege 《The Journal of the Acoustical Society of America》1988,84(3):901-916
This study used glossometry to examine the position of the tongue and the velocity of its movements in vowels spoken normally and at a self-selected fast rate. The subject in experiment 1 showed lingual undershoot for stressed vowels in "a big again" and "a bob again." The tongue was lower for /I/ and higher for /a/ at the fast rate than at the normal rate. The stressed vowels exerted an affect on unstressed vowels: The tongue was lower in the schwas that preceded and followed /a/ than /I/. Only one of the three subjects in experiment 2 showed no lingual undershoot for fast-rate /I/. The tongue was higher at the fast rate than at the normal rate in the schwas flanking /I/ so that the displacement was less at the fast rate than at the normal rate. Another talker increased the peak velocity of tongue movements at the fast rate and showed no undershoot for /a/. Multiple regression analyses showed that the timing of movements for successive phonetic segments accounted well for undershoot in only one of the three subjects. The results suggest that in order to model the effects of speaking rate on the tongue movements used in forming stressed vowels, it will be necessary to take into account: (1) how much vowels are shortened at a fast rate: (2) how much the peak velocity of tongue movements is increased, if at all; and (3) the position of the tongue before and after the stressed vowels. All three factors are likely to be influenced by how clearly the talker wishes to speak. 相似文献
16.
Narayanan S Nayak K Lee S Sethy A Byrd D 《The Journal of the Acoustical Society of America》2004,115(4):1771-1776
Magnetic resonance imaging (MRI) has served as a valuable tool for studying static postures in speech production. Now, recent improvements in temporal resolution are making it possible to examine the dynamics of vocal-tract shaping during fluent speech using MRI. The present study uses spiral k-space acquisitions with a low flip-angle gradient echo pulse sequence on a conventional GE Signa 1.5-T CV/i scanner. This strategy allows for acquisition rates of 8-9 images per second and reconstruction rates of 20-24 images per second, making veridical movies of speech production now possible. Segmental durations, positions, and interarticulator timing can all be quantitatively evaluated. Data show clear real-time movements of the lips, tongue, and velum. Sample movies and data analysis strategies are presented. 相似文献
17.
Jeannette D. Hoit 《Journal of voice》1995,9(4):341-347
This paper examines how breathing differs in the upright and supine body positions. Passive and active forces and associated chest wall motions are described for resting tidal breathing and speech breathing performed in the two positions. Clinical implications are offered regarding evaluation and treatment of breathing behavior in clients with speech and voice disorders. 相似文献
18.
Liu H Zhang Q Xu Y Larson CR 《The Journal of the Acoustical Society of America》2007,122(4):2405-2412
Previous studies have demonstrated that perturbations in voice pitch or loudness feedback lead to compensatory changes in voice F(0) or amplitude during production of sustained vowels. Responses to pitch-shifted auditory feedback have also been observed during English and Mandarin speech. The present study investigated whether Mandarin speakers would respond to amplitude-shifted feedback during meaningful speech production. Native speakers of Mandarin produced two-syllable utterances with focus on the first syllable, the second syllable, or none of the syllables, as prompted by corresponding questions. Their acoustic speech signal was fed back to them with loudness shifted by +/-3 dB for 200 ms durations. The responses to the feedback perturbations had mean latencies of approximately 142 ms and magnitudes of approximately 0.86 dB. Response magnitudes were greater and latencies were longer when emphasis was placed on the first syllable than when there was no emphasis. Since amplitude is not known for being highly effective in encoding linguistic contrasts, the fact that subjects reacted to amplitude perturbation just as fast as they reacted to F(0) perturbations in previous studies provides clear evidence that a highly automatic feedback mechanism is active in controlling both F(0) and amplitude of speech production. 相似文献
19.
《Physics letters. A》1987,125(8):394-398
The strongest magnetic fields (|B|max≈1015G) being accessible to experimental investigations are created transiently in collisions of very heavy ions. The possible interplay of these collisional magnetic fields with detected peak structures in spectra of emitted electron-position pairs is elucidated. As basis for a dynamical treatment we computed the adiabatic phase correlation diagram which exhibits no structures to cause the striking peaks. Relativistic two-centre continuum states for non-axial symmetric vector potentials are determined. 相似文献
20.
This paper presents a decision analysis method for detection, position and classification of moving objects on automatic production line. The decision is based on the coordinates of moving objects in the image frames, and the displacement information provided by servo motor control synchronized with a conveyor belt. Multiple objects to identify duplication or omission can be avoided. The method for smart packing robot can provide reliable location information of measured objects. 相似文献