Method for prosody generation by unit selection from an imitation speech database
    1.
    发明授权
    Method for prosody generation by unit selection from an imitation speech database 有权
    通过模仿语音数据库的单位选择产生韵律的方法

    公开(公告)号:US06829581B2

    公开(公告)日:2004-12-07

    申请号:US09918595

    申请日:2001-07-31

    Applicant: Joram Meron

    Inventor: Joram Meron

    CPC classification number: G10L13/06 G10L13/04

    Abstract: A method is provided for prosody generation by unit selection from an imitation speech database. A rule based method of text to speech conversion is used to produce a set of intonation events by selecting syllables on which there would be either a pitch peak or dip (or a combination), and produces the parameters to generate a pitch curve of the event. The synthetic pitch curve shape generated by the rule based method is then utilized to select the best matching units from an imitation speech database of a speaker's prosody, which are then concatenated to reduce the final prosody.

    Abstract translation: 提供了通过来自模仿语音数据库的单元选择来产生韵律的方法。 基于规则的文本到语音转换的方法被用于通过选择音调峰值或倾角(或组合)上的音节来产生一组语调事件,并且产生用于生成事件的音调曲线的参数 。 然后利用基于规则的方法生成的合成音调曲线形状,从扬声器韵律的模仿语音数据库中选出最佳匹配单元,然后连接起来,以减少最终的韵律。

    INTERACTIVE LANGUAGE PRONUNCIATION TEACHING
    2.
    发明申请
    INTERACTIVE LANGUAGE PRONUNCIATION TEACHING 审中-公开
    互动语言授权教学

    公开(公告)号:US20090004633A1

    公开(公告)日:2009-01-01

    申请号:US12165258

    申请日:2008-06-30

    CPC classification number: G09B19/04

    Abstract: Techniques for language instruction and teaching are described. Methods focus on the sound distinctions that learners have trouble discriminating. Learners practice discriminating these sounds. A learning system is developed using databases of speech from people discriminating these sounds. An embodiment of a method according to the present disclosure can utilize sets of words that differ by only a single syllable containing a sound that is difficult to pronounce, as a way to teach the pronunciation of a word. The sets of similar words can be of a desired number or have a desired number of constituent members. Embodiments of systems can include user interfaces and a automated speech recognition system, including suitable automated speech recognition software, that can interact with a user, e.g., in a pedagogical setting. Related software products including computer-readable instructions resident in a computer-readable medium are described. HMM and DTW algorithms may be used for the embodiments.

    Abstract translation: 描述语言教学和教学技术。 方法侧重于学习者难以辨别的良好区别。 学习者练习区分这些声音。 使用识别这些声音的人的言语数据库开发学习系统。 根据本公开的方法的实施例可以利用仅包含难以发音的声音的单个音节不同的单词集合,作为教导单词发音的方式。 类似的词组可以是期望的数目或具有期望数量的组成成员。 系统的实施例可以包括用户界面和包括合适的自动语音识别软件的自动语音识别系统,其可以与用户交互,例如在教学设置中。 描述包括驻留在计算机可读介质中的计算机可读指令的相关软件产品。 HMM和DTW算法可用于实施例。

    Prosody generation for text-to-speech synthesis based on micro-prosodic data
    3.
    发明申请
    Prosody generation for text-to-speech synthesis based on micro-prosodic data 审中-公开
    基于微韵律数据的文本到语音合成的韵律生成

    公开(公告)号:US20060074678A1

    公开(公告)日:2006-04-06

    申请号:US10953878

    申请日:2004-09-29

    CPC classification number: G10L13/10

    Abstract: A prosody modification system for use in text-to-speech includes an input receiving a sequence of prosodic data vectors Pn, measured at time Tn, which samples a sound waveform. A prosody data warping module directly derives new prosodic data vectors Qn from the original data vectors Pn using a function, which is controlled by warping parameters A0, . . . Ak, which avoids round-off errors in deriving quantized values, which has derivatives with respect to A0, . . . Ak, Pn, and Tn that are continuous, and which has sufficiently high complexity to model intentional prosody of the sound waveform, and sufficiently low complexity to avoid modeling micro-prosody of the sound waveform. The smoothness and simplicity of the function ensure that micro-prosodic perturbations and errors in measurement of Tn are transferred directly to the output Qn. The errors are thus reversed during re-synthesis and therefore eliminated, resulting in micro-prosodic perturbations being preserved during re-synthesis.

    Abstract translation: 用于文本到语音的韵律修改系统包括接收在时间Tn测量的韵律数据向量Pn序列的输入,其对声音波形进行采样。 韵律数据扭曲模块使用由翘曲参数A0控制的函数直接从原始数据向量Pn导出新的韵律数据向量Qn。 。 。 Ak,其避免导出量化值的舍入误差,其具有关于A0的导数。 。 。 Ak,Pn和Tn是连续的,并且具有足够高的复杂性来模拟声波形的有意韵律,以及足够低的复杂度以避免对声波形的微韵律建模。 功能的平滑性和简单性确保了Tn的微韵律扰动和测量误差直接转移到输出Qn。 因此,在重新合成期间,错误被反转,因此消除,导致在重新合成期间保留微韵律扰动。

Patent Agency Ranking