Patent search ap:("Huawei Technologies Co. Page Ltd.") AND inv:"Zhuo Wang"

1.

发明授权
Audio coding method and apparatus 有权

公开(公告)号：US12057129B2

公开(公告)日：2024-08-06

申请号：US17697455

申请日：2022-03-17

Applicant: Huawei Technologies Co., Ltd.

Inventor： Zhuo Wang , Meng Wang , Fan Fan

IPC: G10L19/005 , G06N3/08 , G10L19/008 , G10L19/16 , H04R3/12

CPC classification number: G10L19/005 , G06N3/08 , G10L19/008 , G10L19/167 , H04R3/12 , H04R2420/07

Abstract: An audio coding method and apparatus are provided. The audio coding method includes: obtaining first audio data; obtaining a target bit rate and a Bluetooth packet type, where the target bit rate and the Bluetooth packet type correspond to a current status of a Bluetooth channel; obtaining one or more of a bit pool parameter set, a psychoacoustic parameter set, and a spectrum bandwidth parameter set by using a neural network obtained through pre-training based on the first audio data, the target bit rate, and the Bluetooth packet type; and coding the first audio data based on one or more of the bit pool parameter set, the psychoacoustic parameter set, and the spectrum bandwidth parameter set to obtain a to-be-sent bit stream. The status of the Bluetooth channel can be adaptively matched, and continuous audio listening experience is provided when audio quality is maximally ensured.

2.

发明申请
AUDIO ENCODING/DECODING METHOD AND APPARATUS, STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT 有权

公开(公告)号：US20250166640A1

公开(公告)日：2025-05-22

申请号：US19026344

申请日：2025-01-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Zhuo Wang , Bin Feng , Chunhui Du , Fan Fan

IPC: G10L19/008 , G10L19/00 , G10L19/02 , G10L19/032

Abstract: This disclosure discloses an audio encoding/decoding method and apparatus, a storage medium, and a computer program product, and belongs to the audio encoding/decoding field. In this solution, when an audio signal is a dual-channel signal, even if a bitstream includes a left-channel bitstream and a right-channel bitstream, based on a channel decoding mode, the left-channel bitstream is decoded but the right-channel bitstream is not decoded, or the right-channel bitstream is decoded but the left-channel bitstream is not decoded in a decoding process. Therefore, power consumption at a decoder side is reduced when resources at the decoder side are limited. Correspondingly, an encoder side can also sequentially encode left-channel data and right-channel data based on a condition met by the audio signal, rather than necessarily performing encoding in a dual-channel interleaving encoding scheme or a dual-channel deinterleaving encoding scheme.

3.

发明申请
AUDIO SIGNAL PROCESSING METHOD AND APPARATUS, STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT 有权

公开(公告)号：US20250157476A1

公开(公告)日：2025-05-15

申请号：US19026327

申请日：2025-01-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Zhuo Wang , Bin Feng , Chunhui Du , Fan Fan

IPC: G10L19/02

Abstract: An audio signal processing method and apparatus, a storage medium, and a computer program product are provided. An optimal sub-band division manner is selected from a plurality of sub-band division manners based on a characteristic of an audio signal. In other words, the sub-band division manner has a signal adaptation characteristic, and can adapt to an encoding bit rate of the audio signal. Specifically, a total scale value corresponding to each sub-band division manner is determined based on spectral values of the audio signal in sub-bands obtained through division, a bandwidth of each sub-band, and the encoding bit rate of the audio signal, and an optimal target sub-band division manner is selected based on the total scale value, to obtain an optimal sub-band set. Subsequently, spectral envelope shaping is performed based on a scale factor of each sub-band in the optimal sub-band set, to improve coding effect and compression efficiency.

4.

发明授权
Meme creation method and apparatus 有权

公开(公告)号：US11941323B2

公开(公告)日：2024-03-26

申请号：US17836212

申请日：2022-06-09

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Meng Wang , Zhuo Wang , Fan Fan , Lelin Wang

IPC: G10L15/08 , G06F3/0482 , G06F3/16 , G06F18/22 , G06T11/20 , G10L15/22 , G10L19/00 , G10L25/63

CPC classification number: G06F3/167 , G06F3/0482 , G06F18/22 , G06T11/203 , G10L15/08 , G10L15/22 , G10L19/00 , G10L25/63 , G06T2200/24 , G10L2015/088 , G10L2015/223

Abstract: A meme creation method and apparatus are provided, and relate to the terminal field, to enrich forms and content of memes, and improve user experience. The method includes: displaying a first interface, where the first interface includes a speech input button; receiving, in response to an operation of triggering the speech input button by a user, a speech input by the user; recognizing the speech in a preset manner, where recognition in the preset manner includes at least content recognition, and if the speech includes a target keyword, recommending a first image meme set to the user; obtaining, in response to an operation of selecting one image meme from the first image meme set by the user, a target meme based on the image meme selected by the user and the speech or semantics corresponding to the speech; and sending the target meme.

Patent Agency Ranking