-
公开(公告)号:US20230274544A1
公开(公告)日:2023-08-31
申请号:US17681045
申请日:2022-02-25
Applicant: Cisco Technology, Inc.
Inventor: Hui-Ling Lu , Raul Alejandro Casas
CPC classification number: G06V20/41 , G06V10/235 , G06V10/82 , G06V10/225 , G06T7/20 , G06T3/40 , H04L65/403
Abstract: Presented herein are systems and methods for generating a multi-view focused video stream. The methods involve obtaining at least one video stream of a first participant in an online video communication session between at least the first participant at a first endpoint and a second participant at a second endpoint; determining a bounding region of at least one element in the video stream; generating a focused video stream from the video stream that includes a focused view of the at least one element within the bounding region; and presenting the focused video stream on the second endpoint to at least the second participant.
-
公开(公告)号:US20250131940A1
公开(公告)日:2025-04-24
申请号:US18539764
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Rafal Pilarczyk , Amir Salah Abdelsamie Abdelwahed , Hui-Ling Lu , Ivana Balic , Yusuf Ziya Isik , David Guoqing Zhang , Xuehong Mao , Samer Lutfi Hijazi
IPC: G10L21/043 , G10L19/00
Abstract: A data-driven audio codec system that involves producing multiple compressed streams comprising encoded information (e.g., codeword indices) at different time scales (time intervals or frequency). This may allow for separation of different properties of speech, such as content and aspects of style (prosody), into the different compressed streams without explicitly enforcing it, i.e., in an unsupervised manner. Speech audio is encoded to produce a plurality of encoded streams comprising encoded information for the speech audio at different time scales. The plurality of encoded streams are decoded to generate output audio.
-
公开(公告)号:US11900677B2
公开(公告)日:2024-02-13
申请号:US17681045
申请日:2022-02-25
Applicant: Cisco Technology, Inc.
Inventor: Hui-Ling Lu , Raul Alejandro Casas
CPC classification number: G06V20/41 , G06T3/40 , G06T7/20 , G06V10/225 , G06V10/235 , G06V10/82 , G06T2207/10016 , H04L65/403
Abstract: Presented herein are systems and methods for generating a multi-view focused video stream. The methods involve obtaining at least one video stream of a first participant in an online video communication session between at least the first participant at a first endpoint and a second participant at a second endpoint; determining a bounding region of at least one element in the video stream; generating a focused video stream from the video stream that includes a focused view of the at least one element within the bounding region; and presenting the focused video stream on the second endpoint to at least the second participant.
-
-