Patent search ap:("Cisco Technology Page Inc.") AND inv:"Samir Ouelha"

1.

Invention patent application
DISCONTINUOUS NOISE REMOVAL IN AN AUDIO PROCESSING PIPELINE (Active)

Publication number: US20250037733A1

Publication date: 2025-01-30

Application number: US18360936

Filing date: 2023-07-28

Applicant: Cisco Technology, Inc.

Inventor: Chamran MoradiAshour, Samir Ouelha

IPC: G10L25/21, G10L25/84, H04M9/08

Abstract: A method comprises: detecting audio to produce audio frames; detecting whether voice is continuously present across multiple consecutive ones of the audio frames based on voice activity detection performed on the audio frames; computing a signal-to-noise ratio (SNR) of an audio frame of the audio frames; determining whether to bypass or not bypass background noise removal (BNR) on the audio frame based on whether the voice is continuously present and the SNR; upon determining to bypass the BNR, bypassing the BNR on the audio frame, and first encoding the audio frame to produce a first encoded audio frame; upon determining to not bypass the BNR, performing the BNR on the audio frame to produce a reduced-noise audio frame, and second encoding the reduced-noise audio frame to produce a second encoded audio frame; and transmitting the first encoded audio frame or the second encoded audio frame.
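The bypass decision in this abstract combines two inputs: whether voice has been continuously present over recent frames, and the SNR of the current frame. A minimal sketch of such a decision follows. The abstract does not state thresholds, window lengths, or the direction of the rule, so `min_voiced_frames`, `snr_threshold_db`, and the choice to bypass BNR only when voice is continuous and the SNR is high are all assumptions made for illustration. The class and function names are hypothetical.

```python
import math
from collections import deque


def frame_snr_db(signal_power: float, noise_power: float, eps: float = 1e-12) -> float:
    """Return the SNR of one frame in dB from signal and noise power estimates."""
    return 10.0 * math.log10((signal_power + eps) / (noise_power + eps))


class BnrBypassDecider:
    """Hypothetical decision helper; not the patented method itself."""

    def __init__(self, min_voiced_frames: int = 5, snr_threshold_db: float = 20.0):
        # Sliding window of the most recent VAD flags (assumed window length).
        self.history = deque(maxlen=min_voiced_frames)
        # Assumed SNR threshold above which noise removal is skipped.
        self.snr_threshold_db = snr_threshold_db

    def should_bypass(self, vad_flag: bool, snr_db: float) -> bool:
        """Record the VAD flag for this frame and decide whether to skip BNR."""
        self.history.append(bool(vad_flag))
        window_full = len(self.history) == self.history.maxlen
        voice_continuous = window_full and all(self.history)
        # Assumed rule: bypass only for continuous voice at high SNR.
        return voice_continuous and snr_db >= self.snr_threshold_db
```

In a pipeline, a frame for which `should_bypass` returns `True` would go straight to the encoder, and any other frame would pass through noise removal first.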

2.

Invention patent application
PACKET LOSS CONCEALMENT IN AN AUDIO DECODER (Active)

Publication number: US20250131933A1

Publication date: 2025-04-24

Application number: US18539804

Filing date: 2023-12-14

Applicant: Cisco Technology, Inc.

Inventor: Amir Salah Abdelsamie Abdelwahed, Yusuf Ziya Isik, Xuehong Mao, Samir Ouelha, Samer Lutfi Hijazi

IPC: G10L19/08, G10L19/00

Abstract: A method of performing packet loss concealment in a neural audio encoder/decoder (codec) system. The method includes receiving an indication of a lost audio packet at a receive side of a neural network audio codec system that includes an audio encoder and an audio decoder, wherein the lost audio packet comprises an index of a codeword that is representative of a portion of speech audio presented to the audio encoder, predicting the index of the codeword in the lost packet to obtain a predicted index, deriving a predicted embedding vector from the predicted index, and decoding, by the audio decoder, the embedding vector to generate an audio output.
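The concealment flow in this abstract (predict the missing codeword index, look up its embedding vector, decode it) can be sketched as below. The abstract does not say how the index is predicted, so `predict_index` is a placeholder that simply repeats the last index; a real system would presumably use a learned predictor. The codebook layout, the `decode` callable, and all function names are hypothetical.

```python
from typing import Callable, List, Optional, Sequence

Embedding = Sequence[float]


def predict_index(history: Sequence[int]) -> int:
    """Placeholder predictor (assumption): repeat the most recent index, or 0."""
    return history[-1] if history else 0


def conceal_stream(
    received: Sequence[Optional[int]],
    codebook: Sequence[Embedding],
    decode: Callable[[Embedding], float],
    predictor: Callable[[Sequence[int]], int] = predict_index,
) -> List[float]:
    """Decode a stream of codeword indices, where None marks a lost packet."""
    history: List[int] = []
    output: List[float] = []
    for idx in received:
        if idx is None:
            # Lost packet: substitute a predicted index for the missing one.
            idx = predictor(history)
        # Derive the embedding vector from the (received or predicted) index.
        embedding = codebook[idx]
        output.append(decode(embedding))
        history.append(idx)
    return output
```

Because the predictor is passed in as a callable, the repeat-last stand-in can be swapped for a model-based predictor without changing the decode loop.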

3.

Invention patent application
GENERATIVE SPEECH MODEL FOR COMPACT DATA-DRIVEN SPEECH VECTORS FOR VERSATILE SPEECH APPLICATIONS (Active)

Publication number: US20250131919A1

Publication date: 2025-04-24

Application number: US18539791

Filing date: 2023-12-14

Applicant: Cisco Technology, Inc.

Inventor: Xuehong Mao, Samer Lutfi Hijazi, Christopher Rowen, Mathew Shaji Kavalekalam, Ivana Balic, Mengjun Leng, Yusuf Ziya Isik, Adam Ali Sabra, Amir Salah Abdelsamie Abdelwahed, Samir Ouelha, Mihailo Kolundzija

IPC: G10L15/16, G10L15/06, G10L15/18

Abstract: A neural network audio codec system and related methods are provided. In one example, a method is provided comprising: obtaining speech audio to be encoded; applying the speech audio to an audio encoder that is part of a neural network audio codec system that includes the audio encoder and an audio decoder. The audio encoder and the audio decoder have been trained in an end-to-end manner. The speech audio is encoded with the audio encoder to generate embedding vectors that represent a snapshot of speech audio attributes over successive timeframes of the raw speech audio, and from the embedding vectors, codeword indices are generated to entries in a codebook. The codeword indices are then transmitted or stored for later retrieval and processing by the audio decoder.
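The step from embedding vectors to codeword indices described in this abstract resembles vector quantization. A minimal sketch follows, assuming nearest-neighbor lookup by squared Euclidean distance; the abstract does not specify the matching rule, the codebook size, or the vector dimension, and the function names are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def nearest_codeword_index(embedding: Vector, codebook: Sequence[Vector]) -> int:
    """Return the index of the codebook entry closest to the embedding."""
    best_index, best_dist = 0, float("inf")
    for i, codeword in enumerate(codebook):
        # Squared Euclidean distance (assumed matching criterion).
        dist = sum((e - c) ** 2 for e, c in zip(embedding, codeword))
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index


def quantize(embeddings: Sequence[Vector], codebook: Sequence[Vector]) -> List[int]:
    """Map each embedding vector to a codeword index for transmission or storage."""
    return [nearest_codeword_index(e, codebook) for e in embeddings]


def dequantize(indices: Sequence[int], codebook: Sequence[Vector]) -> List[List[float]]:
    """Recover the codebook vectors that a decoder would receive as input."""
    return [list(codebook[i]) for i in indices]
```

Only the integer indices need to be sent or stored; the receiving side, holding the same codebook, can call `dequantize` before running the decoder.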
