-
公开(公告)号:US12057129B2
公开(公告)日:2024-08-06
申请号:US17697455
申请日:2022-03-17
Applicant: Huawei Technologies Co., Ltd.
IPC: G10L19/005 , G06N3/08 , G10L19/008 , G10L19/16 , H04R3/12
CPC classification number: G10L19/005 , G06N3/08 , G10L19/008 , G10L19/167 , H04R3/12 , H04R2420/07
Abstract: An audio coding method and apparatus are provided. The audio coding method includes: obtaining first audio data; obtaining a target bit rate and a Bluetooth packet type, where the target bit rate and the Bluetooth packet type correspond to a current status of a Bluetooth channel; obtaining one or more of a bit pool parameter set, a psychoacoustic parameter set, and a spectrum bandwidth parameter set by using a neural network obtained through pre-training based on the first audio data, the target bit rate, and the Bluetooth packet type; and coding the first audio data based on one or more of the bit pool parameter set, the psychoacoustic parameter set, and the spectrum bandwidth parameter set to obtain a to-be-sent bit stream. The status of the Bluetooth channel can be adaptively matched, and continuous audio listening experience is provided when audio quality is maximally ensured.
-
2.
公开(公告)号:US20250166640A1
公开(公告)日:2025-05-22
申请号:US19026344
申请日:2025-01-16
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhuo Wang , Bin Feng , Chunhui Du , Fan Fan
IPC: G10L19/008 , G10L19/00 , G10L19/02 , G10L19/032
Abstract: This disclosure discloses an audio encoding/decoding method and apparatus, a storage medium, and a computer program product, and belongs to the audio encoding/decoding field. In this solution, when an audio signal is a dual-channel signal, even if a bitstream includes a left-channel bitstream and a right-channel bitstream, based on a channel decoding mode, the left-channel bitstream is decoded but the right-channel bitstream is not decoded, or the right-channel bitstream is decoded but the left-channel bitstream is not decoded in a decoding process. Therefore, power consumption at a decoder side is reduced when resources at the decoder side are limited. Correspondingly, an encoder side can also sequentially encode left-channel data and right-channel data based on a condition met by the audio signal, rather than necessarily performing encoding in a dual-channel interleaving encoding scheme or a dual-channel deinterleaving encoding scheme.
-
3.
公开(公告)号:US20250157476A1
公开(公告)日:2025-05-15
申请号:US19026327
申请日:2025-01-16
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhuo Wang , Bin Feng , Chunhui Du , Fan Fan
IPC: G10L19/02
Abstract: An audio signal processing method and apparatus, a storage medium, and a computer program product are provided. An optimal sub-band division manner is selected from a plurality of sub-band division manners based on a characteristic of an audio signal. In other words, the sub-band division manner has a signal adaptation characteristic, and can adapt to an encoding bit rate of the audio signal. Specifically, a total scale value corresponding to each sub-band division manner is determined based on spectral values of the audio signal in sub-bands obtained through division, a bandwidth of each sub-band, and the encoding bit rate of the audio signal, and an optimal target sub-band division manner is selected based on the total scale value, to obtain an optimal sub-band set. Subsequently, spectral envelope shaping is performed based on a scale factor of each sub-band in the optimal sub-band set, to improve coding effect and compression efficiency.
-
公开(公告)号:US11941323B2
公开(公告)日:2024-03-26
申请号:US17836212
申请日:2022-06-09
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Meng Wang , Zhuo Wang , Fan Fan , Lelin Wang
CPC classification number: G06F3/167 , G06F3/0482 , G06F18/22 , G06T11/203 , G10L15/08 , G10L15/22 , G10L19/00 , G10L25/63 , G06T2200/24 , G10L2015/088 , G10L2015/223
Abstract: A meme creation method and apparatus are provided, and relate to the terminal field, to enrich forms and content of memes, and improve user experience. The method includes: displaying a first interface, where the first interface includes a speech input button; receiving, in response to an operation of triggering the speech input button by a user, a speech input by the user; recognizing the speech in a preset manner, where recognition in the preset manner includes at least content recognition, and if the speech includes a target keyword, recommending a first image meme set to the user; obtaining, in response to an operation of selecting one image meme from the first image meme set by the user, a target meme based on the image meme selected by the user and the speech or semantics corresponding to the speech; and sending the target meme.
-
-
-