SYSTEM AND METHOD OF AUTOMATED EVALUATION OF TRANSCRIPTION QUALITY

    Publication No.: US20180068651A1

    Publication date: 2018-03-08

    Application No.: US15676306

    Filing date: 2017-08-14

    Inventors: Oana Sidi, Ron Wein

    CPC classification number: G10L15/01 G10L15/04 G10L15/12 G10L15/26

    Abstract: Systems and methods automatically evaluate transcription quality. Audio data is obtained. The audio data is segmented into a plurality of utterances with a voice activity detector operating on a computer processor. The plurality of utterances are transcribed into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor. A minimum Bayes risk decoder is applied to the at least one word lattice to create at least one confusion network. At least one conformity ratio is calculated from the at least one confusion network.

    Blind diarization of recorded calls with arbitrary number of speakers
    Type: granted invention patent (status: in force)

    Publication No.: US09460722B2

    Publication date: 2016-10-04

    Application No.: US14319860

    Filing date: 2014-06-30

    Inventors: Oana Sidi, Ron Wein

    Abstract: In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
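The final decoding step named in the abstract can be illustrated with a minimal Viterbi sketch. It is not the patented method: it assumes each utterance has already been scored against each speaker model, and it assumes a single uniform speaker-switch probability for the HMM transitions, which the abstract does not specify. The function name `viterbi_speakers` and the parameter `switch_prob` are hypothetical.

```python
import math


def viterbi_speakers(log_likelihoods, switch_prob=0.1):
    """Return the most likely speaker index for each utterance.

    log_likelihoods[t][s] is the log-likelihood of utterance t under
    speaker model s (assumed to be computed elsewhere).
    switch_prob is an assumed probability of changing speaker between
    consecutive utterances, shared evenly among the other speakers.
    """
    n_speakers = len(log_likelihoods[0])
    if n_speakers == 1:
        return [0] * len(log_likelihoods)

    log_stay = math.log(1.0 - switch_prob)
    log_switch = math.log(switch_prob / (n_speakers - 1))

    def transition(prev, cur):
        return log_stay if prev == cur else log_switch

    # Uniform initial distribution: the constant term is dropped.
    scores = list(log_likelihoods[0])
    backpointers = []

    for frame in log_likelihoods[1:]:
        new_scores, pointers = [], []
        for cur in range(n_speakers):
            # Best predecessor state for the current speaker.
            best_prev = max(
                range(n_speakers),
                key=lambda prev: scores[prev] + transition(prev, cur),
            )
            new_scores.append(
                scores[best_prev] + transition(best_prev, cur) + frame[cur]
            )
            pointers.append(best_prev)
        scores = new_scores
        backpointers.append(pointers)

    # Trace the best path backwards from the best final state.
    state = max(range(n_speakers), key=lambda s: scores[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path
```

The stay/switch transition penalty is what keeps a single ambiguous utterance from flipping the speaker label; a real system would likely estimate transitions from data rather than fix them by hand.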


    Ontology expansion using entity-association rules and abstract relations

    Publication No.: US11030406B2

    Publication date: 2021-06-08

    Application No.: US15007703

    Filing date: 2016-01-27

    Abstract: A method for expanding an initial ontology via processing of communication data, wherein the initial ontology is a structural representation of language elements comprising a set of entities, a set of terms, a set of term-entity associations, a set of entity-association rules, a set of abstract relations, and a set of relation instances. A method for extracting a set of significant phrases and a set of significant phrase co-occurrences from an input set of documents further includes utilizing the terms to identify relations within the training set of communication data, wherein a relation is a pair of terms that appear in proximity to one another.

    Blind Diarization of Recorded Calls with Arbitrary Number of Speakers
    Type: invention patent application (status: in force)

    Publication No.: US20150025887A1

    Publication date: 2015-01-22

    Application No.: US14319860

    Filing date: 2014-06-30

    Inventors: Oana Sidi, Ron Wein

    Abstract: In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.

