SYSTEM AND METHOD FOR LOCALIZING A TALKER USING AUDIO AND VIDEO INFORMATION
    51.
    发明申请
    SYSTEM AND METHOD FOR LOCALIZING A TALKER USING AUDIO AND VIDEO INFORMATION 有权
    使用音频和视频信息定位标签的系统和方法

    公开(公告)号:US20160140396A1

    公开(公告)日:2016-05-19

    申请号:US14943667

    申请日:2015-11-17

    Applicant: Polycom, Inc.

    Inventor: Jinwei Feng

    Abstract: A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The audio information is evaluated automatically to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session.

    Abstract translation: 视频会议端点包括至少一个处理器,多个麦克风和至少一个相机。 端点可以在电话会议期间接收音频信息和视觉运动信息。 音频信息包括从电话会议会话的位置相对于麦克风的一个或多个角度。 自动评估音频信息以确定与主动讲话者的可能位置相对应的至少一个候选角度。 可以相对于运动信息进一步分析候选角度,以确定候选角度是否正确对应于在电话会议期间正在说话的人。

    Speech-selective audio mixing for conference
    52.
    发明授权
    Speech-selective audio mixing for conference 有权
    语音选择性音频混合会议

    公开(公告)号:US09237238B2

    公开(公告)日:2016-01-12

    申请号:US14339244

    申请日:2014-07-23

    Applicant: Polycom, Inc.

    CPC classification number: H04M3/568 H04M3/56 H04M2201/14

    Abstract: A conference apparatus reduces or eliminates noise in audio for endpoints in a conference. Endpoints in the conference are designated as a primary talker and as secondary talkers. Audio for the endpoints is processed with speech detectors to characterize the audio as speech or not and to determine energy levels of the audio. As the audio is written to buffers and then read from the buffers, decisions for the gain settings of faders for read audio of the endpoints being combined in the speech selective mix. In addition, the conference apparatus can mitigate the effects of a possible speech collision that may occur during the conference between endpoints.

    Abstract translation: 会议设备减少或消除会议中端点的音频噪声。 会议的终点被指定为主要讲话者,也被指定为次要讲话者。 使用语音检测器处理端点的音频,以将音频表征为语音,并且确定音频的能级。 当音频被写入缓冲器然后从缓冲器读取时,用于在语音选择性混合中组合端点的读取音频的推子的增益设置的决定。 此外,会议装置可以减轻可能在端点之间的会议期间可能出现的语音冲突的影响。

    SYSTEM AND METHOD FOR A HYBRID TOPOLOGY MEDIA CONFERENCING SYSTEM
    53.
    发明申请
    SYSTEM AND METHOD FOR A HYBRID TOPOLOGY MEDIA CONFERENCING SYSTEM 有权
    混合拓扑媒体会议系统的系统和方法

    公开(公告)号:US20150281648A1

    公开(公告)日:2015-10-01

    申请号:US14674662

    申请日:2015-03-31

    Applicant: Polycom, Inc.

    Inventor: Eran Decker

    Abstract: Examples hybrid topologies of a conferencing system are disclosed. An example of a hybrid topology may comprise a plurality of endpoints and a central entity. Each of said plurality of endpoints may provide its primary video stream and audio stream to said centralized entity. The centralized entity provides the primary speaker stream and the mixed audio stream to each of said plurality of endpoint participants. In addition, some of plurality of endpoint establishes low bandwidth/low resolution media streams with other of said plurality of endpoint participants for non-speaker video.

    Abstract translation: 公开了会议系统的示例混合拓扑。 混合拓扑的示例可以包括多个端点和中心实体。 所述多个端点中的每一个可以将其主要视频流和音频流提供给所述集中式实体。 集中式实体将主扬声器流和混合音频流提供给所述多个终端参与者中的每一个。 此外,多个端点中的一些端点与用于非扬声器视频的所述多个终端参与者中的其他终端建立低带宽/低分辨率媒体流。

    Method and Systems for Optimizing Bandwidth Utilization in a Multi-Participant Full Mesh Peer-to-Peer Video Session
    54.
    发明申请
    Method and Systems for Optimizing Bandwidth Utilization in a Multi-Participant Full Mesh Peer-to-Peer Video Session 有权
    用于优化多参与者全网格对等视频会话中带宽利用率的方法和系统

    公开(公告)号:US20150281645A1

    公开(公告)日:2015-10-01

    申请号:US14674587

    申请日:2015-03-31

    Applicant: Polycom, Inc.

    Inventor: Deep Subhash Pai

    Abstract: An endpoint optimizes bandwidth by initiating a peer-to-peer conference with a plurality of remote devices, generating a first quality list comprising a first device of the plurality of remote devices from which to receive a first data stream at a first quality level, transmit a request to the first device to receive the first data stream at the first quality level, determining that a second device of the plurality of remote devices is not a member of the first quality list, and in response to determining that the second device of the plurality of remote devices is not a member of the first quality list, transmitting a request to the second device to receive a second data stream at a second quality level.

    Abstract translation: 端点通过与多个远程设备发起对等会议来优化带宽,生成包括多个远程设备中的第一设备的第一质量列表,从第一设备以第一质量级别接收第一数据流,传送 向所述第一设备请求以第一质量级别接收所述第一数据流,确定所述多个远程设备中的第二设备不是所述第一质量列表的成员,并且响应于确定所述第一设备的所述第二设备 多个远程设备不是第一质量列表的成员,将请求发送到第二设备以在第二质量水平接收第二数据流。

    Pairing Devices in Conference Using Ultrasonic Beacon amd Subsequent Control Thereof
    55.
    发明申请
    Pairing Devices in Conference Using Ultrasonic Beacon amd Subsequent Control Thereof 有权
    配对设备在会议中使用超声波信标及其后续控制

    公开(公告)号:US20150208033A1

    公开(公告)日:2015-07-23

    申请号:US14673477

    申请日:2015-03-30

    Applicant: Polycom, Inc.

    CPC classification number: H04N7/15 G06F3/165 H04M3/568 H04N7/142

    Abstract: A videoconferencing system has a videoconferencing unit that use portable devices as peripherals for the system. The portable devices obtain near-end audio and send the audio to the videoconferencing unit via a wireless connection. In turn, the videoconferencing unit sends the near-end audio from the loudest portable device along with near-end video to the far-end. The portable devices can control the videoconferencing unit and can initially establish the videoconference by connecting with the far-end and then transferring operations to the videoconferencing unit. To deal with acoustic coupling between the unit's loudspeaker and the portable device's microphone, the unit uses an echo canceller that is compensated for differences in the clocks used in the ND and D/A converters of the loudspeaker and microphone.

    Abstract translation: 视频会议系统具有视频会议单元,其使用便携式设备作为系统的外围设备。 便携式设备获得近端音频,并通过无线连接将音频发送到视频会议单元。 反过来,视频会议单元将最接近便携式设备的近端音频和近端视频一起发送到远端。 便携式设备可以控制视频会议单元,并且可以通过与远端连接然后将操作传送到视频会议单元来最初建立视频会议。 为了处理单元的扬声器和便携式设备的麦克风之间的声耦合,该单元使用回波消除器来补偿扬声器和麦克风的ND和D / A转换器中使用的时钟的差异。

    Providing direct eye contact videoconferencing
    56.
    发明授权
    Providing direct eye contact videoconferencing 有权
    提供直接的眼神接触视频会议

    公开(公告)号:US09088693B2

    公开(公告)日:2015-07-21

    申请号:US14041677

    申请日:2013-09-30

    Applicant: Polycom, Inc.

    CPC classification number: H04N7/15 H04N7/141 H04N7/142 H04N7/144

    Abstract: A videoconferencing unit comprises a display screen configured to display a video data stream comprising images of a far end participant. A processor is adapted to decode the video data stream and generate a modified region of the video data stream. The modified region of the video data stream is displayed on the display screen at a location where images of eyes of the far end participant are displayed on the display screen. A camera is configured with a lens to capture images of a near end participant through the modified region of the video data stream, with at least a portion of the lens positioned within the modified region of the video data stream.

    Abstract translation: 视频会议单元包括被配置为显示包括远端参与者的图像的视频数据流的显示屏幕。 处理器适于解码视频数据流并生成视频数据流的修改区域。 视频数据流的修改区域在显示屏上显示在远端参与者的眼睛的图像的位置的显示屏幕上。 相机配置有透镜,以通过视频数据流的修改区域捕获近端参与者的图像,透镜的至少一部分位于视频数据流的修改区域内。

    Method and system for adapting a CP layout according to interaction between conferees
    57.
    发明授权
    Method and system for adapting a CP layout according to interaction between conferees 有权
    根据与会者之间的交互调整CP布局的方法和系统

    公开(公告)号:US09041767B2

    公开(公告)日:2015-05-26

    申请号:US14014146

    申请日:2013-08-29

    Applicant: Polycom, Inc.

    CPC classification number: H04N7/15 G06T11/60 H04N7/152

    Abstract: A system and method is disclosed for adapting a continuous presence videoconferencing layout according to interactions between conferees. Using regions of interest found in video images, the arrangement of images of conferees may be dynamically arranged as displayed by endpoints. Arrangements may be responsive to various metrics, including the position of conferees in a room and dominant conferees in the videoconference. Video images may be manipulated as part of the arrangement, including cropping and mirroring the video image. As interactions between conferees change, the layout may be automatically rearranged responsive to the changed interactions.

    Abstract translation: 公开了一种根据与会者之间的交互来适应连续存在的视频会议布局的系统和方法。 使用视频图像中的兴趣区域,与会者的图像的布置可以被动态地排列,如端点所示。 安排可以对各种指标做出回应,包括与会者在房间中的地位以及视频会议中的主要参与者。 视频图像可以作为安排的一部分进行操作,包括裁剪和镜像视频图像。 随着与会者之间的交互变化,布局可能会根据变化的互动情况自动重新排列。

    METHOD AND SYSTEM FOR CONDUCTING VIDEO CONFERENCES OF DIVERSE PARTICIPATING DEVICES
    58.
    发明申请
    METHOD AND SYSTEM FOR CONDUCTING VIDEO CONFERENCES OF DIVERSE PARTICIPATING DEVICES 审中-公开
    用于引导多媒体参与设备视频会议的方法和系统

    公开(公告)号:US20140028788A1

    公开(公告)日:2014-01-30

    申请号:US13869781

    申请日:2013-04-24

    Applicant: POLYCOM, INC.

    Inventor: Avishay Halavy

    Abstract: A novel universal bridge (UB) can handle and conduct multimedia multipoint conferences between a plurality of MREs and LEPs without using an MRM, an MCU and a gateway. Further, a UB can be configured to allocate and release resources dynamically according to the current needs of each conferee and the session.

    Abstract translation: 一种新型通用网桥(UB)可以在不使用MRM,MCU和网关的情况下处理和进行多个MRE和LEP之间的多媒体多点会议。 此外,UB可以被配置为根据每个会议和会话的当前需要动态地分配和释放资源。

    Multipoint multimedia/audio conference using IP trunking
    59.
    发明申请
    Multipoint multimedia/audio conference using IP trunking 有权
    使用IP中继的多点多媒体/音频会议

    公开(公告)号:US20040047342A1

    公开(公告)日:2004-03-11

    申请号:US10462118

    申请日:2003-06-13

    Applicant: Polycom, Inc.

    Abstract: A multipoint communication system uses Internet protocol trunking to facilitate communication between media control units (for sending and receiving multipoint communication signals between end-point devices), a media gateway (for translating between non-Internet protocol multipoint communication signals and Internet protocol communication signals), and a controller (for establishing and controlling a multipoint communication session between the end-point devices). In addition, a multimedia gateway (for use in a multipoint communication system) is described that incorporates an interactive voice response unit through which users of non-Internet protocol devices (connected to the multimedia gateway) interact to establish a communication session with a multipoint communication system.

    Abstract translation: 多点通信系统使用因特网协议集群来促进媒体控制单元(用于在终点设备之间发送和接收多点通信信号)之间的通信,媒体网关(用于在非互联网协议多点通信信号和互联网协议通信信号之间进行转换) 以及控制器(用于建立和控制端点设备之间的多点通信会话)。 另外,描述了多媒体网关(用于多点通信系统),该多媒体网关包含交互式语音响应单元,非互联网协议设备(连接到多媒体网关)的用户通过该交互式语音响应单元进行交互,以建立与多点通信的通信会话 系统。

    System and method for computing a location of an acoustic source
    60.
    发明申请
    System and method for computing a location of an acoustic source 有权
    用于计算声源位置的系统和方法

    公开(公告)号:US20040032796A1

    公开(公告)日:2004-02-19

    申请号:US10414421

    申请日:2003-04-15

    Applicant: Polycom, Inc.

    Abstract: In accordance with the present invention, a system and method for computing a location of an acoustic source is disclosed. The method includes steps of processing a plurality of microphone signals in frequency space to search a plurality of candidate acoustic source locations for a maximum normalized signal energy. The method uses phase-delay look-up tables to efficiently determine phase delays for a given frequency bin number k based upon a candidate source location and a microphone location, thereby reducing system memory requirements. Furthermore, the method compares a maximum signal energy for each frequency bin number k with a threshold energy Et(k) to improve accuracy in locating the acoustic source.

    Abstract translation: 根据本发明,公开了一种用于计算声源的位置的系统和方法。 该方法包括在频率空间中处理多个麦克风信号以搜索多个候选声源位置以获得最大归一化信号能量的步骤。 该方法使用相位延迟查找表来有效地确定基于候选源位置和麦克风位置的给定频率仓数k的相位延迟,从而减少系统存储器要求。 此外,该方法将每个频率仓数k的最大信号能量与阈值能量Et(k)进行比较,以提高定位声源的精度。

Patent Agency Ranking