-
公开(公告)号:US20250131940A1
公开(公告)日:2025-04-24
申请号:US18539764
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Rafal Pilarczyk , Amir Salah Abdelsamie Abdelwahed , Hui-Ling Lu , Ivana Balic , Yusuf Ziya Isik , David Guoqing Zhang , Xuehong Mao , Samer Lutfi Hijazi
IPC: G10L21/043 , G10L19/00
Abstract: A data-driven audio codec system that involves producing multiple compressed streams comprising encoded information (e.g., codeword indices) at different time scales (time intervals or frequency). This may allow for separation of different properties of speech, such as content and aspects of style (prosody), into the different compressed streams without explicitly enforcing it, i.e., in an unsupervised manner. Speech audio is encoded to produce a plurality of encoded streams comprising encoded information for the speech audio at different time scales. The plurality of encoded streams are decoded to generate output audio.
-
公开(公告)号:US20240322942A1
公开(公告)日:2024-09-26
申请号:US18680660
申请日:2024-05-31
Applicant: Cisco Technology, Inc.
Inventor: Amir Salah Abdelsamie Abdelwahed , Ivana Balic , Yusuf Ziya Isik , Xuehong Mao , Samer Lutfi Hijazi
CPC classification number: H04L1/0041 , H04L1/0002 , H04L1/0045 , H04L69/22
Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.
-
公开(公告)号:US12040894B1
公开(公告)日:2024-07-16
申请号:US18151616
申请日:2023-01-09
Applicant: Cisco Technology, Inc.
Inventor: Amir Salah Abdelsamie Abdelwahed , Ivana Balic , Yusuf Ziya Isik , Xuehong Mao , Samer Lutfi Hijazi
CPC classification number: H04L1/0041 , H04L1/0002 , H04L1/0045 , H04L69/22
Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.
-
4.
公开(公告)号:US20250131919A1
公开(公告)日:2025-04-24
申请号:US18539791
申请日:2023-12-14
Applicant: Cisco Technology, Inc.
Inventor: Xuehong Mao , Samer Lutfi Hijazi , Christopher Rowen , Mathew Shaji Kavalekalam , Ivana Balic , Mengjun Leng , Yusuf Ziya Isik , Adam Ali Sabra , Amir Salah Abdelsamie Abdelwahed , Samir Ouelha , Mihailo Kolundzija
Abstract: A neural network audio codec system and related methods are provided. In one example, a method is provided comprising: obtaining speech audio to be encoded; applying the speech audio to an audio encoder that is part of a neural network audio codec system that includes the audio encoder and an audio decoder. The audio encoder and the audio decoder have been trained in an end-to-end manner. The speech audio is encoded with the audio encoder to generate embedding vectors that represent a snapshot of speech audio attributes over successive timeframes of the raw speech audio, and from the embedding vectors, codeword indices are generated to entries in a codebook. The codeword indices are then transmitted or stored for later retrieval and processing by the audio decoder.
-
公开(公告)号:US20240235727A1
公开(公告)日:2024-07-11
申请号:US18151616
申请日:2023-01-09
Applicant: Cisco Technology, Inc.
Inventor: Amir Salah Abdelsamie Abdelwahed , Ivana Balic , Yusuf Ziya Isik , Xuehong Mao , Samer Lutfi Hijazi
CPC classification number: H04L1/0041 , H04L1/0002 , H04L1/0045 , H04L69/22
Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.
-
-
-
-