Keyword detector and keyword detection method

    公开(公告)号:US10008197B2

    公开(公告)日:2018-06-26

    申请号:US15332000

    申请日:2016-10-24

    申请人: FUJITSU LIMITED

    发明人: Shoji Hayakawa

    摘要: A keyword detector includes a processor configured to calculate a feature vector for each frame from a speech signal, input the feature vector for each frame to a DNN to calculate a first output probability for each triphone according to a sequence of phonemes contained in a predetermined keyword and a second output probability for each monophone, for each of at least one state of an HMM, calculate a first likelihood representing the probability that the predetermined keyword is uttered in the speech signal by applying the first output probability to the HMM, calculate a second likelihood for the most probable phoneme string in the speech signal by applying the second output probability to the HMM, and determine whether the keyword is to be detected on the basis of the first likelihood and the second likelihood.

    MEDIA GENERATING AND EDITING SYSTEM
    57.
    发明申请

    公开(公告)号:US20180053510A1

    公开(公告)日:2018-02-22

    申请号:US15557897

    申请日:2016-03-11

    申请人: TRINT LIMITED

    摘要: A media generating and editing system that generates audio playback in alignment with text that has been automatically transcribed from the audio. A transcript data file that includes a plurality of text words transcribed from audio words included in the audio data is stored. Timing data is paired with the text words indicating locations in the audio data of the corresponding audio words from which the text words are transcribed. The audio data is provided for playback at a user device. The text words are displayed on a display screen at a user device and a visual marker is displayed on the display screen to indicate the text words on the display screen in time alignment with the audio playback of the corresponding audio words at the user device. The text words in the transcript data file are amended in response to inputs from the user device.

    Behavior adjustment using speech recognition system

    公开(公告)号:US09899024B1

    公开(公告)日:2018-02-20

    申请号:US15393770

    申请日:2016-12-29

    申请人: Google Inc.

    摘要: Methods, systems, and apparatus are described for inducing a user of a speech recognition system to adjust their own behavior. For example, in one implementation, a speech recognition system that allows children to control electronic devices can improve the child's speech development, by encouraging the child to speak more clearly. To do so, the speech recognition system can generate a phonetic representation of a term spoken by the child, and can determine whether the phonetic representation matches a particular canonical pronunciation of the particular term that is deemed age-appropriate for the child. Upon determining that the particular canonical pronunciation that matches the phonetic representation of the term spoken by the child is not age-appropriate, the speech recognition system can select and implement a variety of remediation strategies for inducing the child to repeat the term using a pronunciation that is considered age-appropriate.