首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
Functional imaging studies of speech perception in the human brain have identified a key role for auditory association areas in the temporal lobes (bilateral superior temporal gyri and sulci) in the perceptual processing of the speech signal. This is extended to suggest some functional specialization within this bilateral system, with a particular role for the left anterior superior temporal sulcus (STS) in processing intelligible speech. In the current study, noise-vocoded speech was used to vary the intelligibility of speech parametrically. This replicated the finding of a selective response to intelligibility in speech in the left anterior superior temporal sulcus, in contrast to the posterior superior temporal sulcus, which showed a response profile insensitive to the degree of intelligibility. These results are related to theories of functional organization in the human auditory system, which have indicated that there are separate processing streams, with different functional roles, running anterior and posterior to primary auditory cortex. Specifically, it is suggested that an anterior stream processing intelligibility can be distinguished from a posterior stream associated with transient representations, important in spoken repetition and working memory.  相似文献   

2.
This positron emission tomography study used a correlational design to investigate neural activity during speech perception in six normal subjects and two aphasic patients. The normal subjects listened either to speech or to signal-correlated noise equivalents; the latter were nonspeech stimuli, similar to speech in complexity but not perceived as speechlike. Regions common to the auditory processing of both types of stimuli were dissociated from those specific to spoken words. Increasing rates of presentation of both speech and nonspeech correlated with cerebral activity in bilateral transverse gyri and adjacent superior temporal cortex. Correlations specific to speech stimuli were located more anteriorly in both superior temporal sulci. The only asymmetry in normal subjects was a left lateralized response to speech in the posterior superior temporal sulcus, corresponding closely to structural asymmetry on the subjects' magnetic resonance images. Two patients, who had left temporal infarction but performed well on single word comprehension tasks, were also scanned while listening to speech. These cases showed right superior temporal activity correlating with increasing rates of hearing speech, but no significant left temporal activation. These findings together suggest that the dorsolateral temporal cortex of both hemispheres can be involved in prelexical processing of speech.  相似文献   

3.
ABSTRACT: BACKGROUND: Earlier functional imaging studies on visually induced self-motion perception (vection) disclosed a bilateral network of activations within primary and secondary visual cortex areas which was combined with signal decreases, i.e., deactivations, in multisensory vestibular cortex areas. This finding led to the concept of a reciprocal inhibitory interaction between the visual and vestibular systems. In order to define areas involved in special aspects of selfmotion perception such as intensity and duration of the perceived circular vection (CV) or the amount of head tilt, correlation analyses of the regional cerebral glucose metabolism, rCGM (measured by fluorodeoxyglucose positron-emission tomography, FDG-PET) and these perceptual covariates were performed in 14 healthy volunteers. For analyses of the visualvestibular interaction, the CV data were compared to a random dot motion stimulation condition (not inducing vection) and a control group at rest (no stimulation at all). RESULTS: Group subtraction analyses showed that the visual-vestibular interaction was modified during CV, i.e., the activations within the cerebellar vermis and parieto-occipital areas were enhanced. The correlation analysis between the rCGM and the intensity of visually induced vection, experienced as body tilt, showed a relationship for areas of the multisensory vestibular cortical network (inferior parietal lobule bilaterally, anterior cingulate gyrus), the medial parieto-occipital cortex, the frontal eye fields and the cerebellar vermis. The "earlier" multisensory vestibular areas like the parieto-insular vestibular cortex and the superior temporal gyrus did not appear in the latter analysis. The duration of perceived vection after stimulus stop was positively correlated with rCGM in medial temporal lobe areas bilaterally, which included the (para-)hippocampus, known to be involved in various aspects of memory processing. The amount of head tilt was found to be positively correlated with the rCGM of bilateral basal ganglia regions responsible for the control of motor function of the head. CONCLUSIONS: Our data gave further insights into subfunctions within the complex cortical network involved in the processing of visual-vestibular interaction during CV. Specific areas of this cortical network could be attributed to the ventral stream ("what" pathway) responsible for the duration after stimulus stop and to the dorsal stream ("where/how" pathway) responsible for intensity aspects.  相似文献   

4.
Positron emission tomography (PET) was used to investigate the neural basis of the comprehension of speech in unmodulated noise ("energetic" masking, dominated by effects at the auditory periphery), and when presented with another speaker ("informational" masking, dominated by more central effects). Each type of signal was presented at four different signal-to-noise ratios (SNRs) (+3, 0, -3, -6 dB for the speech-in-speech, +6, +3, 0, -3 dB for the speech-in-noise), with listeners instructed to listen for meaning to the target speaker. Consistent with behavioral studies, there was SNR-dependent activation associated with the comprehension of speech in noise, with no SNR-dependent activity for the comprehension of speech-in-speech (at low or negative SNRs). There was, in addition, activation in bilateral superior temporal gyri which was associated with the informational masking condition. The extent to which this activation of classical "speech" areas of the temporal lobes might delineate the neural basis of the informational masking is considered, as is the relationship of these findings to the interfering effects of unattended speech and sound on more explicit working memory tasks. This study is a novel demonstration of candidate neural systems involved in the perception of speech in noisy environments, and of the processing of multiple speakers in the dorso-lateral temporal lobes.  相似文献   

5.
大脑皮质内部的联系神经束,对于大脑皮质之间的信息传递担任非常重要的角色.传统的语言模型理论提出人类的2个主要语言中枢分别位于大脑皮质的左侧额下回的布罗卡区域(Broca’s area,BA44andBA45)以及颞上回处的维尼基区域(Wernicke’s area,BA22),而联系这2个区域的纤维束,也就是弓状束(arcuate fasciculus).另外,近期研究也发现下顶叶(inferior parietal cortex,BA39and BA40)在语音处理历程具重要性.扩散磁振造影(Diffu-sionMRI)可以提供大脑白质精细的组织结构,配合神经径路追踪(tractography)便能撷取出复杂的神经纤维连结路径.该研究利用扩散磁振影像中的高夹角分辨率扩散磁振造影(high angular resolution diffusion imaging)与神经径路追踪技术,呈现与语言相关的大脑机率神经连结路径(probabilistic language pathway).  相似文献   

6.
How are laminar circuits of neocortex organized to generate conscious speech and language percepts? How does the brain restore information that is occluded by noise, or absent from an acoustic signal, by integrating contextual information over many milliseconds to disambiguate noise-occluded acoustical signals? How are speech and language heard in the correct temporal order, despite the influence of contexts that may occur many milliseconds before or after each perceived word? A neural model describes key mechanisms in forming conscious speech percepts, and quantitatively simulates a critical example of contextual disambiguation of speech and language; namely, phonemic restoration. Here, a phoneme deleted from a speech stream is perceptually restored when it is replaced by broadband noise, even when the disambiguating context occurs after the phoneme was presented. The model describes how the laminar circuits within a hierarchy of cortical processing stages may interact to generate a conscious speech percept that is embodied by a resonant wave of activation that occurs between acoustic features, acoustic item chunks, and list chunks. Chunk-mediated gating allows speech to be heard in the correct temporal order, even when what is heard depends upon future context.  相似文献   

7.
In order to obtain a precise diagnosis and treatment for temporomandibular dysfunctions, it is necessary to have morphological and functional knowledge of the temporomandibular joint. Anatomic components are important to the understanding of the complexity of temporomandibular joint. Nonetheless, just as important are the anatomic relationships that this joint has with the neighboring structures. Thus, the aim of this study was to present the anatomic relationships of the temporomandibular joint in its various surfaces: external or lateral, anterior, posterior, medial, superior, and posterosuperior, considering the morphological and histological aspects. Nine human heads, fixed in formalin (10%) underwent sagittal medial section and were subsequently dissected, evidencing the anatomic components of all surfaces to be analyzed. Components of the external surface were: skin, subcutaneous tissue, lymphatic ganglia, parotid gland, superficial temporal artery, transverse facial artery, zygomatic-orbital artery, superficial temporal vein, facial and auriculotemporal nerves, masseter muscle, and pre-auricular lymphonodus. The anterior surface comprised the masseter and lateral pterygoid muscles (upper and inferior heads), pterygoid venous plexus, mandibular notch, posterior deep temporal artery, masseteric nerve, and deep posterior temporal branches. Medial surface components were: internal maxillary artery, of which middle meningeal artery was one of the closest branches to the TMJ, anterior tympanic artery, inferior alveolar, lingual, auriculotemporal, and chorda tympani nerves, which belonged to the surface posterior to the anterior wall of the auditory duct; auricolotemporal nerve, parotid gland; and petrotympanic fissure. The cerebral fossa (meninges and encephalon) belonged to the superior surface and the ear belonged to the posterosuperior surface. Histologically, the temporomandibular joint is composed by different tissues that compound the mandibular head, mandibular fossa, and articular eminence of the temporomandibular joint. It was concluded that the temporomandibular joint is a complex structure and the clinician must have the ability to formulate the diagnosis based on the understanding of morphological aspects of the structures that compose the stomatognatic system.  相似文献   

8.
张全 《应用声学》2002,21(1):35-39
本文涉及语言声学的语音学研究、文语转换技术、语音识别技术及自然语言处理等方面,简要介绍了有关汉语在前三方面的进展和发展方向;重点介绍了面向整个自然语言理解处理的新理论一概念层次网络(HNC)理论的主要内容及其进展,试图在理论层面上给出HNC理论的基本概貌。  相似文献   

9.

Background

Despite strong evidence that the pathophysiology of Tourette syndrome (TS) involves structural and functional disturbances of the basal ganglia and cortical frontal areas, findings from in vivo imaging studies have provided conflicting results. In this study we used whole brain diffusion tensor imaging (DTI) to investigate the microstructural integrity of white matter pathways and brain tissue in 19 unmedicated, adult, male patients with TS “only” (without comorbid psychiatric disorders) and 20 age- and sex-matched control subjects.

Results

Compared to normal controls, TS patients showed a decrease in the fractional anisotropy index (FA) bilaterally in the medial frontal gyrus, the pars opercularis of the left inferior frontal gyrus, the middle occipital gyrus, the right cingulate gyrus, and the medial premotor cortex. Increased apparent diffusion coefficient (ADC) maps were detected in the left cingulate gyrus, prefrontal areas, left precentral gyrus, and left putamen. There was a negative correlation between tic severity and FA values in the left superior frontal gyrus, medial frontal gyrus bilaterally, cingulate gyrus bilaterally, and ventral posterior lateral nucleus of the right thalamus, and a positive correlation in the body of the corpus callosum, left thalamus, right superior temporal gyrus, and left parahippocampal gyrus. There was also a positive correlation between regional ADC values and tic severity in the left cingulate gyrus, putamen bilaterally, medial frontal gyrus bilaterally, left precentral gyrus, and ventral anterior nucleus of the left thalamus.

Conclusions

Our results confirm prior studies suggesting that tics are caused by alterations in prefrontal areas, thalamus and putamen, while changes in the cingulate gyrus seem to reflect secondary compensatory mechanisms. Due to the study design, influences from comorbidities, gender, medication and age can be excluded.  相似文献   

10.
It is well known that auditory system of human beings has excellent performance which automatic speech recognition(ASR) systems can’t match,and fractional Fourier transform (FrFT) has unique advantages in non-stationary signal processing.In this paper,the Gammatone filterbank is applied to speech signals for front-end temporal filtering,and then acoustic features of the output subband signals are extracted based on fractional Fourier transform. Considering the critical effect of transform order for FrFT,an order adaptation method based on the instantaneous frequency is proposed,and its performance is compared with the method based on ambiguity function.ASR experiments are conducted on clean and noisy Putonghua digits,and the results show that the proposed features achieve significantly higher recognition rate than the MFCC baseline,and the order adaptation method based on instantaneous frequency has much lower complexity than that based on ambiguity function.Further more,the FrFT-based features achieve the highest recognition rate using the proposed order adaptation method.  相似文献   

11.
12.
The aged population is at risk for impaired speech communication due to the increased likelihood of deterioration of central nervous system (CNS) processes that underlie cognition, language, and/or speech motor control. Vocal reaction time (RT) may provide a means of quantifying the efficiency of CNS processes that underlie speech production. The present study used a simple RT paradigm to investigate effects of the complexity of the required task on vocal RT in normal young and aged speakers. Task complexity was represented by two levels: a single word and a short sentence. Only the aged subjects showed a significant task complexity effect on vocal RT. Furthermore, the between-group RT difference increased as a function of task complexity. Specific causes for the increase in vocal RT for the aged subjects are presently unknown, but likely reside in altered respiratory biomechanics and reduced efficiency of CNS motor processing.  相似文献   

13.
By contrast to the adult literature, in which a consistent parietofrontal network for number processing has been identified, the data from studies of number processing in children have been less consistent, probably due to differences in study design and control conditions. Number processing was examined using functional magnetic resonance imaging in 18 right-handed children (8–12 years) from the Cape Coloured community in Cape Town, South Africa, using Proximity Judgment and Exact Addition (EA) tasks. The findings were consistent with the hypothesis that, as in adults, the anterior horizontal intraparietal sulcus (HIPS) plays a major role in the representation and manipulation of quantity in children. The posterior medial frontal cortex, believed to be involved in performance monitoring in more complex arithmetic manipulations in adults, was extensively activated even for relatively simple symbolic number processing in the children. Other areas activated to a greater degree in the children included the left precentral sulcus, which may mediate number knowledge and, for EA, the head of the caudate nucleus, which is part of a fronto-subcortical circuit involved in the behavioral execution of sequences. Two regions that have been linked to number processing in adults — the angular gyrus and posterior superior parietal lobule — were not activated in the children. The data are consistent with the inference that although the functional specialization of the anterior HIPS may increase as symbolic number processing becomes increasingly automatic, this region and other elements of the parietofrontal network identified in adults are already reliably and robustly activated by middle childhood.  相似文献   

14.
丁尚文  钱志余  李韪韬  陶玲  胡光霞 《光学学报》2012,32(5):517001-185
研究光诱发和静息两种状态下的脑功能网络的信息传输枢纽、网络聚合能力和信息传输的最小路径的差异性。采用小世界网络理论对脑功能网络进行建模,通过对脑功能网络连接度、簇系数和最小路径进行分析,得出光诱发状态下的信息传输重要枢纽为岛叶、后扣带回功能区;丘脑、海马两处功能网络有较大聚合能力。光诱发过程从额上回经颞中回传输到枕中回。静息状态下的信息传输重要枢纽为楔叶、舌回;中央旁小叶、颞上回脑功能网络有较大聚合能力。静息状态下的左半区最佳信息传输路径为左额上回、左颞中回、右楔叶最后到左枕中回;右脑半区的为右额上回、右前扣带回、左枕下回最后到右枕中回。光诱发状态与静息状态的最佳传输路径有明显的区别。  相似文献   

15.
Many hearing-impaired listeners suffer from distorted auditory processing capabilities. This study examines which aspects of auditory coding (i.e., intensity, time, or frequency) are distorted and how this affects speech perception. The distortion-sensitivity model is used: The effect of distorted auditory coding of a speech signal is simulated by an artificial distortion, and the sensitivity of speech intelligibility to this artificial distortion is compared for normal-hearing and hearing-impaired listeners. Stimuli (speech plus noise) are wavelet coded using a complex sinusoidal carrier with a Gaussian envelope (1/4 octave bandwidth). Intensity information is distorted by multiplying the modulus of each wavelet coefficient by a random factor. Temporal and spectral information are distorted by randomly shifting the wavelet positions along the temporal or spectral axis, respectively. Measured were (1) detection thresholds for each type of distortion, and (2) speech-reception thresholds for various degrees of distortion. For spectral distortion, hearing-impaired listeners showed increased detection thresholds and were also less sensitive to the distortion with respect to speech perception. For intensity and temporal distortion, this was not observed. Results indicate that a distorted coding of spectral information may be an important factor underlying reduced speech intelligibility for the hearing impaired.  相似文献   

16.
Natural spoken language processing includes not only speech recognition but also identification of the speaker's gender, age, emotional, and social status. Our purpose in this study is to evaluate whether temporal cues are sufficient to support both speech and speaker recognition. Ten cochlear-implant and six normal-hearing subjects were presented with vowel tokens spoken by three men, three women, two boys, and two girls. In one condition, the subject was asked to recognize the vowel. In the other condition, the subject was asked to identify the speaker. Extensive training was provided for the speaker recognition task. Normal-hearing subjects achieved nearly perfect performance in both tasks. Cochlear-implant subjects achieved good performance in vowel recognition but poor performance in speaker recognition. The level of the cochlear implant performance was functionally equivalent to normal performance with eight spectral bands for vowel recognition but only to one band for speaker recognition. These results show a disassociation between speech and speaker recognition with primarily temporal cues, highlighting the limitation of current speech processing strategies in cochlear implants. Several methods, including explicit encoding of fundamental frequency and frequency modulation, are proposed to improve speaker recognition for current cochlear implant users.  相似文献   

17.
Three experiments were conducted to study relative contributions of speaking rate, temporal envelope, and temporal fine structure to clear speech perception. Experiment I used uniform time scaling to match the speaking rate between clear and conversational speech. Experiment II decreased the speaking rate in conversational speech without processing artifacts by increasing silent gaps between phonetic segments. Experiment III created "auditory chimeras" by mixing the temporal envelope of clear speech with the fine structure of conversational speech, and vice versa. Speech intelligibility in normal-hearing listeners was measured over a wide range of signal-to-noise ratios to derive speech reception thresholds (SRT). The results showed that processing artifacts in uniform time scaling, particularly time compression, reduced speech intelligibility. Inserting gaps in conversational speech improved the SRT by 1.3 dB, but this improvement might be a result of increased short-term signal-to-noise ratios during level normalization. Data from auditory chimeras indicated that the temporal envelope cue contributed more to the clear speech advantage at high signal-to-noise ratios, whereas the temporal fine structure cue contributed more at low signal-to-noise ratios. Taken together, these results suggest that acoustic cues for the clear speech advantage are multiple and distributed.  相似文献   

18.
Behavioral experiments with infants, adults, and nonhuman animals converge with neurophysiological findings to suggest that there is a discontinuity in auditory processing of stimulus components differing in onset time by about 20 ms. This discontinuity has been implicated as a basis for boundaries between speech categories distinguished by voice onset time (VOT). Here, it is investigated how this discontinuity interacts with the learning of novel perceptual categories. Adult listeners were trained to categorize nonspeech stimuli that mimicked certain temporal properties of VOT stimuli. One group of listeners learned categories with a boundary coincident with the perceptual discontinuity. Another group learned categories defined such that the perceptual discontinuity fell within a category. Listeners in the latter group required significantly more experience to reach criterion categorization performance. Evidence of interactions between the perceptual discontinuity and the learned categories extended to generalization tests as well. It has been hypothesized that languages make use of perceptual discontinuities to promote distinctiveness among sounds within a language inventory. The present data suggest that discontinuities interact with category learning. As such, "learnability" may play a predictive role in selection of language sound inventories.  相似文献   

19.
颜永红 《应用声学》2012,31(1):35-41
本文对语言声学与内容理解研究的最新进展进行综述。首先介绍人类的言语的产生、感知以及声学分析方面的进展,接着分别介绍采用计算机来对语音中的各种信息进行抽取(包括语音、说话人和语种识别)和内容分析与理解(包括文档内容分析和理解与对话)的最新成果,最后对语言声学与内容理解的研究进行了总结和展望。  相似文献   

20.
Psychomotor experience can be based on what people predict they will experience, rather than on sensory inputs. It has been argued that disconnects between human experience and sensory inputs can be addressed better through further development of predictive processing theory. In this paper, the scope of predictive processing theory is extended through three developments. First, by going beyond previous studies that have encompassed embodied cognition but have not addressed some fundamental aspects of psychomotor functioning. Second, by proposing a scientific basis for explaining predictive processing that spans objective neuroscience and subjective experience. Third, by providing an explanation of predictive processing that can be incorporated into the planning and operation of systems involving robots and other new technologies. This is necessary because such systems are becoming increasingly common and move us farther away from the hunter-gatherer lifestyles within which our psychomotor functioning evolved. For example, beliefs that workplace robots are threatening can generate anxiety, while wearing hardware, such as augmented reality headsets and exoskeletons, can impede the natural functioning of psychomotor systems. The primary contribution of the paper is the introduction of a new formulation of hierarchical predictive processing that is focused on psychomotor functioning.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号