SYSTEM AND METHOD OF TEXT ZONING
    31.
    发明申请

    公开(公告)号:US20200090660A1

    公开(公告)日:2020-03-19

    申请号:US16553451

    申请日:2019-08-28

    Abstract: A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.

    DIARIZATION USING ACOUSTIC LABELING
    32.
    发明申请

    公开(公告)号:US20200043501A1

    公开(公告)日:2020-02-06

    申请号:US16594764

    申请日:2019-10-07

    Abstract: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.

    Diarization using linguistic labeling

    公开(公告)号:US10522153B2

    公开(公告)日:2019-12-31

    申请号:US16170289

    申请日:2018-10-25

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    AUTOMATED ONTOLOGY DEVELOPMENT
    34.
    发明申请

    公开(公告)号:US20190325324A1

    公开(公告)日:2019-10-24

    申请号:US16458482

    申请日:2019-07-01

    Abstract: Systems and methods of automated ontology development include a corpus of communication data. The corpus of communication data includes communication data from a plurality of interactions and is processed. A plurality of terms are extracted from the corpus. Each term of the plurality is a plurality of words that identify a single concept within the corpus. An ontology is automatedly generated from the extracted terms.

    Diarization using textual and audio speaker labeling

    公开(公告)号:US10446156B2

    公开(公告)日:2019-10-15

    申请号:US16170297

    申请日:2018-10-25

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    DIARIZATION USING LINGUISTIC LABELING
    38.
    发明申请

    公开(公告)号:US20190066692A1

    公开(公告)日:2019-02-28

    申请号:US16170297

    申请日:2018-10-25

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Diarization using acoustic labeling

    公开(公告)号:US10134400B2

    公开(公告)日:2018-11-20

    申请号:US14084974

    申请日:2013-11-20

    Abstract: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.

    System and Method of Text Zoning
    40.
    发明申请
    System and Method of Text Zoning 审中-公开
    文本分区系统与方法

    公开(公告)号:US20150066506A1

    公开(公告)日:2015-03-05

    申请号:US14467783

    申请日:2014-08-25

    CPC classification number: G10L15/26 G10L15/04 G10L15/18 G10L15/1822

    Abstract: A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.

    Abstract translation: 对音频数据的转录进行分区的方法包括将音频数据的转录分离成多个话语。 一个话语中的每个单词都是一个意义单位边界的计算。 在最大计算概率的工作中,话语分为两个新的话语。 两个新语句中的至少一个短于最大话语阈值被识别为意义单元。

Patent Agency Ranking