Automatic Switching Between Dynamic and Preset Camera Views in a Video Conference Endpoint
    1.
    发明申请
    Automatic Switching Between Dynamic and Preset Camera Views in a Video Conference Endpoint 有权
    自动切换视频会议端点的动态和预设摄像机视图

    公开(公告)号:US20160134838A1

    公开(公告)日:2016-05-12

    申请号:US14534557

    申请日:2014-11-06

    CPC classification number: H04N7/152 H04N5/23219 H04N7/142 H04N7/147

    Abstract: A video conference endpoint includes one or more cameras to capture video of different views and a microphone array to sense audio. One or more preset views are defined. The endpoint detects faces in the captured video and active audio sources from the sensed audio. The endpoint detects any active talker detected faces that coincide positionally with detected active audio sources, and also detects whether any active talker is in one of the preset views. Based on whether an active talker is detected in any of the preset views, the endpoint switches between capturing video of one of the preset views, and capturing video of a dynamic view.

    Abstract translation: 视频会议端点包括用于捕获不同视图的视频的一个或多个摄像机和用于感测音频的麦克风阵列。 定义一个或多个预设视图。 端点从感测到的音频中检测拍摄视频中的人脸和活动音频源。 端点检测与检测到的活动音频源在位置上重合的任何有效的说话者检测到的面部,并且还检测是否有任何活跃的讲话者处于预设视图之一。 基于在任何预设视图中是否检测到主动讲话者,端点在捕获一个预设视图的视频和捕获动态视图的视频之间切换。

    AUTOMATIC SWITCHING BETWEEN DYNAMIC AND PRESET CAMERA VIEWS IN A VIDEO CONFERENCE ENDPOINT

    公开(公告)号:US20170099462A1

    公开(公告)日:2017-04-06

    申请号:US15383231

    申请日:2016-12-19

    CPC classification number: H04N7/152 H04N5/23219 H04N7/142 H04N7/147

    Abstract: A video conference endpoint includes a camera to capture video and a microphone array to sense audio. One or more preset views are defined. Images in the captured video are processed with a face detection algorithm to detect faces. Active talkers are detected from the sensed audio. The camera is controlled to capture video from the preset views, and from dynamic views created without user input and which include a dynamic overview and a dynamic close-up view. The camera is controlled to dynamically adjust each of the dynamic views to track changing positions of detected faces over time, and dynamically switch the camera between the preset views, the dynamic overview, and the dynamic close-up view over time based on positions of the detected faces and the detected active talkers relative to the preset views and the dynamic views.

    Group and conversational framing for speaker tracking in a video conference system

    公开(公告)号:US10708544B2

    公开(公告)日:2020-07-07

    申请号:US16287191

    申请日:2019-02-27

    Abstract: In one embodiment, a method is provided to intelligently frame groups of participants in a meeting. This gives a more pleasing experience with fewer switches, better contextual understanding, and more natural framing, as would be seen in a video production made by a human director. Furthermore, in accordance with another embodiment, conversational framing techniques are provided. During speaker tracking, when two local participants are addressing each other, a method is provided to show a close-up framing showing both participants. By evaluating the direction participants are looking and a speaker history, it is determined if there is a local discussion going on, and an appropriate framing is selected to give far-end participants the most contextually rich experience.

    Group and conversational framing for speaker tracking in a video conference system

    公开(公告)号:US10257465B2

    公开(公告)日:2019-04-09

    申请号:US15908984

    申请日:2018-03-01

    Abstract: In one embodiment, a method is provided to intelligently frame groups of participants in a meeting. This gives a more pleasing experience with fewer switches, better contextual understanding, and more natural framing, as would be seen in a video production made by a human director. Furthermore, in accordance with another embodiment, conversational framing techniques are provided. During speaker tracking, when two local participants are addressing each other, a method is provided to show a close-up framing showing both participants. By evaluating the direction participants are looking and a speaker history, it is determined if there is a local discussion going on, and an appropriate framing is selected to give far-end participants the most contextually rich experience.

    Automatic switching between dynamic and preset camera views in a video conference endpoint

    公开(公告)号:US09883143B2

    公开(公告)日:2018-01-30

    申请号:US15383231

    申请日:2016-12-19

    CPC classification number: H04N7/152 H04N5/23219 H04N7/142 H04N7/147

    Abstract: A video conference endpoint includes a camera to capture video and a microphone array to sense audio. One or more preset views are defined. Images in the captured video are processed with a face detection algorithm to detect faces. Active talkers are detected from the sensed audio. The camera is controlled to capture video from the preset views, and from dynamic views created without user input and which include a dynamic overview and a dynamic close-up view. The camera is controlled to dynamically adjust each of the dynamic views to track changing positions of detected faces over time, and dynamically switch the camera between the preset views, the dynamic overview, and the dynamic close-up view over time based on positions of the detected faces and the detected active talkers relative to the preset views and the dynamic views.

    Automatic switching between dynamic and preset camera views in a video conference endpoint
    9.
    发明授权
    Automatic switching between dynamic and preset camera views in a video conference endpoint 有权
    在视频会议终端中自动切换动态和预设摄像机视图

    公开(公告)号:US09584763B2

    公开(公告)日:2017-02-28

    申请号:US14534557

    申请日:2014-11-06

    CPC classification number: H04N7/152 H04N5/23219 H04N7/142 H04N7/147

    Abstract: A video conference endpoint includes one or more cameras to capture video of different views and a microphone array to sense audio. One or more preset views are defined. The endpoint detects faces in the captured video and active audio sources from the sensed audio. The endpoint detects any active talker detected faces that coincide positionally with detected active audio sources, and also detects whether any active talker is in one of the preset views. Based on whether an active talker is detected in any of the preset views, the endpoint switches between capturing video of one of the preset views, and capturing video of a dynamic view.

    Abstract translation: 视频会议端点包括用于捕获不同视图的视频的一个或多个摄像机和用于感测音频的麦克风阵列。 定义一个或多个预设视图。 端点从感测到的音频中检测拍摄视频中的人脸和活动音频源。 端点检测与检测到的活动音频源在位置上重合的任何有效的说话者检测到的面部,并且还检测是否有任何活跃的讲话者处于预设视图之一。 基于在任何预设视图中是否检测到主动讲话者,端点在捕获一个预设视图的视频和捕获动态视图的视频之间切换。

Patent Agency Ranking