-
公开(公告)号:US20250037733A1
公开(公告)日:2025-01-30
申请号:US18360936
申请日:2023-07-28
Applicant: Cisco Technology, Inc.
Inventor: Chamran MoradiAshour , Samir Ouelha
Abstract: A method comprises: detecting audio to produce audio frames; detecting whether voice is continuously present across multiple consecutive ones of the audio frames based on voice activity detection performed on the audio frames; computing a signal-to-noise ratio (SNR) of an audio frame of the audio frames; determining whether to bypass or not bypass background noise removal (BNR) on the audio frame based on whether the voice is continuously present and the SNR; upon determining to bypass the BNR, bypassing the BNR on the audio frame, and first encoding the audio frame to produce a first encoded audio frame; upon determining to not bypass the BNR, performing the BNR on the audio frame to produce a reduced-noise audio frame, and second encoding the reduced-noise audio frame to produce a second encoded audio frame; and transmitting the first encoded audio frame or the second encoded audio frame.
-
公开(公告)号:US20250131933A1
公开(公告)日:2025-04-24
申请号:US18539804
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Amir Salah Abdelsamie Abdelwahed , Yusuf Ziya Isik , Xuehong Mao , Samir Ouelha , Samer Lutfi Hijazi
Abstract: A method of performing packet loss concealment in a neural audio encoder/decoder (codec) system. The method includes receiving an indication of a lost audio packet at a receive side of a neural network audio codec system that includes an audio encoder and an audio decoder, wherein the lost audio packet comprises an index of a codeword that is representative of a portion of speech audio presented to the audio encoder, predicting the index of the codeword in the lost packet to obtain a predicted index, deriving a predicted embedding vector from the predicted index, and decoding, by the audio decoder, the embedding vector to generate an audio output.
-
3.
公开(公告)号:US20250131919A1
公开(公告)日:2025-04-24
申请号:US18539791
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Xuehong Mao , Samer Lutfi Hijazi , Christopher Rowen , Mathew Shaji Kavalekalam , Ivana Balic , Mengjun Leng , Yusuf Ziya Isik , Adam Ali Sabra , Amir Salah Abdelsamie Abdelwahed , Samir Ouelha , Mihailo Kolundzija
Abstract: A neural network audio codec system and related methods are provided. In one example, a method is provided comprising: obtaining speech audio to be encoded; applying the speech audio to an audio encoder that is part of a neural network audio codec system that includes the audio encoder and an audio decoder. The audio encoder and the audio decoder have been trained in an end-to-end manner. The speech audio is encoded with the audio encoder to generate embedding vectors that represent a snapshot of speech audio attributes over successive timeframes of the raw speech audio, and from the embedding vectors, codeword indices are generated to entries in a codebook. The codeword indices are then transmitted or stored for later retrieval and processing by the audio decoder.
-
-