METHODS AND SYSTEMS FOR COMPUTER-GENERATED VISUALIZATION OF SPEECH

    公开(公告)号:US20240087591A1

    公开(公告)日:2024-03-14

    申请号:US18451736

    申请日:2023-08-17

    Applicant: SomniQ, Inc.

    CPC classification number: G10L21/12 G10L21/10 G10L21/14 G10L25/93

    Abstract: Methods, systems and apparatuses for computer-generated visualization of speech are described herein. An example method of computer-generated visualization of speech including at least one segment includes: generating a graphical representation of an object corresponding to a segment of the speech; and displaying the graphical representation of the object on a screen of a computing device. Generating the graphical representation includes: representing a duration of the respective segment by a length of the object and representing intensity of the respective segment by a width of the object; and placing, in the graphical representation, a space between adjacent objects.

    Methods and systems for computer-generated visualization of speech

    公开(公告)号:US11735204B2

    公开(公告)日:2023-08-22

    申请号:US17404873

    申请日:2021-08-17

    Applicant: SomniQ, Inc.

    CPC classification number: G10L21/12 G10L21/10 G10L21/14 G10L25/93

    Abstract: Methods, systems and apparatuses for computer-generated visualization of speech are described herein. An example method of computer-generated visualization of speech including at least one segment includes: generating a graphical representation of an object corresponding to a segment of the speech; and displaying the graphical representation of the object on a screen of a computing device. Generating the graphical representation includes: representing a duration of the respective segment by a length of the object and representing intensity of the respective segment by a width of the object; and placing, in the graphical representation, a space between adjacent objects.

    Multi-Party Conversation Analyzer and Logger
    8.
    发明申请
    Multi-Party Conversation Analyzer and Logger 审中-公开
    多方对话分析器和记录仪

    公开(公告)号:US20160217807A1

    公开(公告)日:2016-07-28

    申请号:US15082959

    申请日:2016-03-28

    Abstract: A multi-party conversation analyzer and logger uses a variety of techniques including spectrographic voice analysis, absolute loudness measurements, directional microphones, and telephonic directional separation to determine the number of parties who take part in a conversation, and segment the conversation by speaking party. In one aspect, the invention monitors telephone conversations in real time to detect conditions of interest (for instance, calls to non-allowed parties or calls of a prohibited nature from prison inmates). In another aspect, automated prosody measurement algorithms are used in conjunction with speaker segmentation to extract emotional content of the speech of participants within a particular conversation, and speaker interactions and emotions are displayed in graphical form. A conversation database is generated which contains conversation recordings, and derived data such as transcription text, derived emotions, alert conditions, and correctness probabilities associated with derived data. Investigative tools allow flexible queries of the conversation database.

    Abstract translation: 多方会话分析器和记录器使用各种技术,包括光谱语音分析,绝对响度测量,定向麦克风和电话方向分离,以确定参与对话的各方数量,并通过会话分割对话。 一方面,本发明实时监控电话对话以检测感兴趣的情况(例如,对不允许的当事人的呼叫或被监禁的囚犯的被禁止的呼叫)。 在另一方面,自动化韵律测量算法与说话者分割结合使用以提取特定对话内的参与者的言语的情感内容,并且以图形形式显示说话人的交互和情绪。 生成会话数据库,其中包含会话记录,以及衍生数据,如转录文本,派生情绪,警报条件以及与派生数据相关联的正确性概率。 调查工具允许会话数据库的灵活查询。

    Multi-party conversation analyzer & logger
    9.
    发明申请
    Multi-party conversation analyzer & logger 有权
    多方会话分析器和记录器

    公开(公告)号:US20070071206A1

    公开(公告)日:2007-03-29

    申请号:US11475541

    申请日:2006-06-26

    Abstract: A multi-party conversation analyzer and logger uses a variety of techniques including spectrographic voice analysis, absolute loudness measurements, directional microphones, and telephonic directional separation to determine the number of parties who take part in a conversation, and segment the conversation by speaking party. In one aspect, the invention monitors telephone conversations in real time to detect conditions of interest (for instance, calls to non-allowed parties or calls of a prohibited nature from prison inmates). In another aspect, automated prosody measurement algorithms are used in conjunction with speaker segmentation to extract emotional content of the speech of participants within a particular conversation, and speaker interactions and emotions are displayed in graphical form. A conversation database is generated which contains conversation recordings, and derived data such as transcription text, derived emotions, alert conditions, and correctness probabilities associated with derived data. Investigative tools allow flexible queries of the conversation database.

    Abstract translation: 多方会话分析器和记录器使用各种技术,包括光谱语音分析,绝对响度测量,定向麦克风和电话方向分离,以确定参与对话的各方数量,并通过会话分割对话。 一方面,本发明实时监控电话对话以检测感兴趣的情况(例如,对不允许的当事人的呼叫或被监禁的囚犯的被禁止的呼叫)。 在另一方面,自动化韵律测量算法与说话者分割结合使用以提取特定对话内的参与者的言语的情感内容,并且以图形形式显示说话人的交互和情绪。 生成会话数据库,其中包含会话记录,以及衍生数据,如转录文本,派生情绪,警报条件以及与派生数据相关联的正确性概率。 调查工具允许会话数据库的灵活查询。

    MULTI-SPEAKER SPEECH RECOGNITION CORRECTION SYSTEM

    公开(公告)号:US20180182396A1

    公开(公告)日:2018-06-28

    申请号:US15823937

    申请日:2017-11-28

    Inventor: Munhak AN

    Abstract: The present invention relates to a multi-speaker speech recognition correction system for determining a speaker of an utterance with a simple method and easily correcting speech-recognized text during speech recognition for a plurality of speakers. According to the present invention, when speech signals are input to a multi-speaker speech recognition system from a plurality of microphones which are each provided to a corresponding one of a plurality of speakers, the multi-speaker speech recognition correction system may detect a speech session from a time point at which input of each of the speech signals is started to a time point at which the input of the speech signal is stopped, and a speech recognizer may convert only the detected speech sessions into text so that a speaker of an utterance can be identified by a simple method and speech recognition can be carried out at a low cost.

Patent Agency Ranking