智能系统学报
CAAI TRANSACTIONS ON INTELLIGENT SYSTEMS
2013
Issue 4
pp. 305-311 (7 pages)
DIVA model; phoneme; speech sound-target cells; speech acquisition and production
To address the imbalance in the Directions Into Velocities of Articulators (DIVA) model between the development of perceptual ability and that of speech production skills, this paper proposes a method for automatically acquiring speech sound-target cells. The method models the human ear as a bank of parallel band-pass filters with different bandwidths, associated respectively with the model's 21-dimensional auditory storage space. For the different auditory responses, it accounts for the frequency-band masking effect and for the relationship between perceived loudness and frequency. While reading the speech input signal, the model obtains a good initial auditory representation, in a manner broadly consistent with infant babbling. Simulation experiments show that, through boundary definition, similarity comparison, and search-and-update steps, the method performs self-organized matching of the initial input patterns well and ultimately gives the DIVA model more natural speech-acquisition behavior.
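The filter-bank front end and the similarity-comparison and update steps mentioned in the abstract can be illustrated with a rough sketch. Nothing below is taken from the paper itself. The Bark-scale band spacing, the 50-7000 Hz range, the use of one band per auditory dimension, the log-energy features, the Euclidean distance, and the averaging update are all assumptions chosen for illustration, and the function names are hypothetical. Masking and loudness weighting are left out.

```python
import numpy as np

N_BANDS = 21  # assumed: one band per dimension of the 21-dimensional auditory space


def hz_to_bark(f_hz):
    """Zwicker-Terhardt approximation of the Bark scale (an assumed choice)."""
    f = np.asarray(f_hz, dtype=float)
    return 13.0 * np.arctan(0.00076 * f) + 3.5 * np.arctan((f / 7500.0) ** 2)


def band_edges(f_min=50.0, f_max=7000.0, n_bands=N_BANDS):
    """Return n_bands + 1 edge frequencies in Hz, equally spaced in Bark."""
    grid = np.linspace(f_min, f_max, 4096)
    bark = hz_to_bark(grid)
    targets = np.linspace(bark[0], bark[-1], n_bands + 1)
    # The Bark mapping is monotone, so it can be inverted by interpolation.
    return np.interp(targets, bark, grid)


def auditory_vector(frame, sample_rate, edges):
    """Log energy per band for one frame: one value per band-pass channel."""
    windowed = frame * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    energies = np.array([
        power[(freqs >= lo) & (freqs < hi)].sum()
        for lo, hi in zip(edges[:-1], edges[1:])
    ])
    return np.log10(energies + 1e-12)  # small offset avoids log(0)


def match_or_add(prototypes, vector, threshold):
    """Find the nearest stored pattern; update it if close enough, else add a new one.

    Returns the index of the matched or newly added pattern.
    """
    if prototypes:
        dists = [np.linalg.norm(p - vector) for p in prototypes]
        best = int(np.argmin(dists))
        if dists[best] <= threshold:
            prototypes[best] = 0.5 * (prototypes[best] + vector)  # simple averaging update
            return best
    prototypes.append(vector.copy())
    return len(prototypes) - 1


# Usage: a 1 kHz tone should excite the band whose edges contain 1 kHz.
sample_rate = 16000
t = np.arange(1024) / sample_rate
tone = np.sin(2 * np.pi * 1000.0 * t)
edges = band_edges()
vec = auditory_vector(tone, sample_rate, edges)
patterns = []
first_index = match_or_add(patterns, vec, threshold=1.0)
```

Spacing the bands equally on a Bark-like scale makes the bandwidths grow with frequency, which is one common way to obtain "different bandwidths" in an auditory filter bank. Whether the paper uses this spacing is not stated in the abstract.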