Patent search ap:("Cisco Technology Page Inc.") AND inv:"Ananth Sankar"

1.

发明授权
Crowd sourcing audio transcription via re-speaking 有权
Title translation: 人群通过重演来录音

公开(公告)号：US09418660B2

公开(公告)日：2016-08-16

申请号：US14156032

申请日：2014-01-15

Applicant: Cisco Technology, Inc.

Inventor： Matthias Paulik , Vivek Halder , Ananth Sankar

IPC: G10L15/26 , G06Q10/06 , G10L15/04 , G10L25/87 , G10L15/32 , G10L15/07

CPC classification number: G10L15/26 , G06Q10/06311 , G10L15/04 , G10L15/07 , G10L15/32 , G10L25/87

Abstract: Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.

Abstract translation: 接收用于转录为文本形式的语音音频。接收的语音音频被分成第一语音段。识别出多个扬声器。扬声器被配置为以口头形式重复说话者已经听过的第一语音段。确定扬声器的子集用于发送每个第一语音段。每个第一语音段被发送到为特定的第一语音段确定的扬声器的子集。从扬声器接收第二语音段。第二语音片段是已由扬声器通过以口头形式重复第一语音片段而产生的第一语音片段的重新说明版本。处理第二语音段以产生部分转录。组合部分抄本以产生完整抄本，其是对应于接收的语音音频的文本表示。

2.

发明授权
System and method for question detection based video segmentation, search and collaboration in a video processing environment 有权
Title translation: 用于基于问题检测的视频分割，视频处理环境中的搜索和协作的系统和方法

公开(公告)号：US08886011B2

公开(公告)日：2014-11-11

申请号：US13708717

申请日：2012-12-07

Applicant: Cisco Technology, Inc.

Inventor： Jim Chen Chou , Ananth Sankar , Sachin Kajarekar

IPC: H04N5/92 , H04N9/79

CPC classification number: H04N9/79 , G11B27/11 , G11B27/28 , H04N5/91 , H04N9/8205 , H04N9/8227 , H04N21/233 , H04N21/234336 , H04N21/251 , H04N21/25891 , H04N21/47217 , H04N21/4782 , H04N21/4788 , H04N21/4826 , H04N21/4828 , H04N21/6125 , H04N21/6175 , H04N21/84 , H04N21/8405 , H04N21/8455 , H04N21/8456

Abstract: An example method is provided and includes receiving a video bitstream in a network environment; detecting a question in a decoded audio portion of a video bitstream; and marking a segment of the video bitstream with a tag. The tag may correspond to a location of the question in the video bitstream, and can facilitate consumption of the video bitstream. The method can further include detecting keywords in the question, and combining the keywords to determine a content of the question. In specific embodiments, the method can also include receiving the question and a corresponding answer from a user interaction, crowdsourcing the question by a plurality of users, counting a number of questions in the video bitstream and other features.

Abstract translation: 提供了一种示例性方法，包括在网络环境中接收视频比特流; 检测视频比特流的解码音频部分中的问题; 并用标签标记视频比特流的片段。标签可以对应于视频比特流中的问题的位置，并且可以促进视频比特流的消费。该方法还可以包括检测问题中的关键词，并组合关键词以确定问题的内容。在具体实施例中，该方法还可以包括从用户交互接收问题和相应的答案，由多个用户众包众多的用户，对视频比特流中的问题的数量进行计数和其他特征。

3.

发明申请
SYSTEM AND METHOD FOR QUESTION DETECTION BASED VIDEO SEGMENTATION, SEARCH AND COLLABORATION IN A VIDEO PROCESSING ENVIRONMENT 有权
Title translation: 视频处理环境中基于问题检测的视频分段，搜索和协作的系统和方法

公开(公告)号：US20140161416A1

公开(公告)日：2014-06-12

申请号：US13708717

申请日：2012-12-07

Applicant: CISCO TECHNOLOGY, INC.

Inventor： Jim Chen Chou , Ananth Sankar , Sachin Kajarekar

IPC: H04N9/79

CPC classification number: H04N9/79 , G11B27/11 , G11B27/28 , H04N5/91 , H04N9/8205 , H04N9/8227 , H04N21/233 , H04N21/234336 , H04N21/251 , H04N21/25891 , H04N21/47217 , H04N21/4782 , H04N21/4788 , H04N21/4826 , H04N21/4828 , H04N21/6125 , H04N21/6175 , H04N21/84 , H04N21/8405 , H04N21/8455 , H04N21/8456

Abstract: An example method is provided and includes receiving a video bitstream in a network environment; detecting a question in a decoded audio portion of a video bitstream; and marking a segment of the video bitstream with a tag. The tag may correspond to a location of the question in the video bitstream, and can facilitate consumption of the video bitstream. The method can further include detecting keywords in the question, and combining the keywords to determine a content of the question. In specific embodiments, the method can also include receiving the question and a corresponding answer from a user interaction, crowdsourcing the question by a plurality of users, counting a number of questions in the video bitstream and other features.

Abstract translation: 提供了一种示例性方法，包括在网络环境中接收视频比特流; 检测视频比特流的解码音频部分中的问题; 并用标签标记视频比特流的片段。标签可以对应于视频比特流中的问题的位置，并且可以促进视频比特流的消费。该方法还可以包括检测问题中的关键词，并组合关键词以确定问题的内容。在具体实施例中，该方法还可以包括从用户交互接收问题和相应的答案，由多个用户众包众多的用户，对视频比特流中的问题的数量进行计数和其他特征。

4.

发明申请
Crowd Sourcing Audio Transcription Via Re-Speaking 有权
Title translation: 人群采购音频转录通过重新说话

公开(公告)号：US20150199966A1

公开(公告)日：2015-07-16

申请号：US14156032

申请日：2014-01-15

Applicant: Cisco Technology, Inc.

Inventor： Matthias Paulik , Vivek Halder , Ananth Sankar

IPC: G10L15/26

CPC classification number: G10L15/26 , G06Q10/06311 , G10L15/04 , G10L15/07 , G10L15/32 , G10L25/87

Abstract: Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.

Abstract translation: 接收用于转录为文本形式的语音音频。接收的语音音频被分成第一语音段。识别出多个扬声器。扬声器被配置为以口头形式重复说话者已经听过的第一语音段。确定扬声器的子集用于发送每个第一语音段。每个第一语音段被发送到为特定的第一语音段确定的扬声器的子集。从扬声器接收第二语音段。第二语音片段是已由扬声器通过以口头形式重复第一语音片段而产生的第一语音片段的重新说明版本。处理第二语音段以产生部分转录。组合部分抄本以产生完整抄本，其是对应于接收的语音音频的文本表示。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification