Patent search ap:("Cisco Technology Page Inc.") AND inv:"Hui-Ling Lu"

1.

发明公开
USER-SELECTED MULTI-VIEW VIDEOCONFERENCING 审中-公开

公开(公告)号：US20230274544A1

公开(公告)日：2023-08-31

申请号：US17681045

申请日：2022-02-25

Applicant: Cisco Technology, Inc.

Inventor： Hui-Ling Lu , Raul Alejandro Casas

IPC: G06V20/40 , G06V10/22 , G06V10/82 , G06T7/20 , G06T3/40

CPC classification number: G06V20/41 , G06V10/235 , G06V10/82 , G06V10/225 , G06T7/20 , G06T3/40 , H04L65/403

Abstract: Presented herein are systems and methods for generating a multi-view focused video stream. The methods involve obtaining at least one video stream of a first participant in an online video communication session between at least the first participant at a first endpoint and a second participant at a second endpoint; determining a bounding region of at least one element in the video stream; generating a focused video stream from the video stream that includes a focused view of the at least one element within the bounding region; and presenting the focused video stream on the second endpoint to at least the second participant.

2.

发明申请
MULTI-TIME-SCALE NEURAL AUDIO CODEC STREAMS 有权

公开(公告)号：US20250131940A1

公开(公告)日：2025-04-24

申请号：US18539764

申请日：2023-12-14

Applicant: Cisco Technology, Inc.

Inventor： Rafal Pilarczyk , Amir Salah Abdelsamie Abdelwahed , Hui-Ling Lu , Ivana Balic , Yusuf Ziya Isik , David Guoqing Zhang , Xuehong Mao , Samer Lutfi Hijazi

IPC: G10L21/043 , G10L19/00

Abstract: A data-driven audio codec system that involves producing multiple compressed streams comprising encoded information (e.g., codeword indices) at different time scales (time intervals or frequency). This may allow for separation of different properties of speech, such as content and aspects of style (prosody), into the different compressed streams without explicitly enforcing it, i.e., in an unsupervised manner. Speech audio is encoded to produce a plurality of encoded streams comprising encoded information for the speech audio at different time scales. The plurality of encoded streams are decoded to generate output audio.

3.

发明授权
User-selected multi-view videoconferencing 有权

公开(公告)号：US11900677B2

公开(公告)日：2024-02-13

申请号：US17681045

申请日：2022-02-25

Applicant: Cisco Technology, Inc.

Inventor： Hui-Ling Lu , Raul Alejandro Casas

IPC: G06V20/10 , G06V20/40 , G06V10/22 , G06V10/82 , G06T7/20 , G06T3/40 , H04L65/403

CPC classification number: G06V20/41 , G06T3/40 , G06T7/20 , G06V10/225 , G06V10/235 , G06V10/82 , G06T2207/10016 , H04L65/403

Abstract: Presented herein are systems and methods for generating a multi-view focused video stream. The methods involve obtaining at least one video stream of a first participant in an online video communication session between at least the first participant at a first endpoint and a second participant at a second endpoint; determining a bounding region of at least one element in the video stream; generating a focused video stream from the video stream that includes a focused view of the at least one element within the bounding region; and presenting the focused video stream on the second endpoint to at least the second participant.

Patent Agency Ranking