Patent search ap:("Tencent Technology (Shenzhen) Company Limited") AND inv:"Weiwei Li" Page 1

1.

发明授权
Audio recognition method, method, apparatus for positioning target audio, and device 有权

公开(公告)号：US11967316B2

公开(公告)日：2024-04-23

申请号：US17183209

申请日：2021-02-23

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jimeng Zheng , Ian Ernan Liu , Yi Gao , Weiwei Li

IPC: G10L15/22 , G01S3/80 , G01S3/802 , G10L15/08 , G10L15/20 , G10L21/0224 , G10L21/0232 , G10L25/51 , G10L21/0208 , G10L21/0216

CPC classification number: G10L15/20 , G01S3/8006 , G01S3/802 , G10L15/08 , G10L15/22 , G10L21/0224 , G10L21/0232 , G10L25/51 , G10L2015/088 , G10L2021/02082 , G10L2021/02166

Abstract: Embodiments of this application disclose method and apparatus for positioning a target audio signal by an audio interaction device, and an audio interaction device The method includes: obtaining audio signals in a plurality of directions in a space, and performing echo cancellation on the audio signal, the audio signal including a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the audio signals, a weight of each time-frequency point indicating, at the time-frequency point, a relative proportion of the target-audio direct signal in the audio signals; weighting time-frequency components of the audio signal at the plurality of time-frequency points separately for each of the plurality of directions by using the weights of the plurality of time-frequency points, to obtain a weighted audio signal energy distribution; and obtaining a sound source azimuth corresponding to the target-audio direct signal in the audio signals accordingly.

2.

发明授权
Multi-register-based speech detection method and related apparatus, and storage medium 有权

公开(公告)号：US12051441B2

公开(公告)日：2024-07-30

申请号：US17944067

申请日：2022-09-13

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jimeng Zheng , Lianwu Chen , Weiwei Li , Zhiyi Duan , Meng Yu , Dan Su , Kaiyu Jiang

IPC: G10L25/84 , G06T7/20 , G10L17/02 , G10L17/22 , G10L21/028 , G10L25/21

CPC classification number: G10L25/84 , G06T7/20 , G10L17/02 , G10L17/22 , G10L21/028 , G10L25/21 , G06T2207/30201

Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to N sound areas including multiple users speaking simultaneously; generating a control signal corresponding to each target detection sound area according to user information corresponding to the target detection sound area; processing multi-user speech input signals by using the control signals, to obtain a speech output signal corresponding to each target detection sound area; generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area; and selecting, among the multiple users, a main speaker based on the user information, the speech output signals and speech detection results of multiple users in the N sound areas.

3.

发明申请
MULTI-REGISTER-BASED SPEECH DETECTION METHOD AND RELATED APPARATUS, AND STORAGE MEDIUM 有权

公开(公告)号：US20230013740A1

公开(公告)日：2023-01-19

申请号：US17944067

申请日：2022-09-13

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jimeng ZHENG , Lianwu CHEN , Weiwei Li , Zhiyi Duan , Meng YU , Dan Su , Kaiyu Jiang

IPC: G10L25/84 , G10L17/22 , G10L21/028 , G10L17/02 , G06T7/20 , G10L25/21

Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area. Speech signals in different directions are processed in parallel based on a plurality of sound areas, so that in a multi-sound source scenario, the speech signals in different directions may be retained or suppressed by a control signal, to separate and enhance speech of a target detection user in real time, thereby improving the accuracy of speech detection.

4.

发明授权
Object rendering method and apparatus, storage medium, and electronic device using a simulated pre-integration map 有权

公开(公告)号：US11276227B2

公开(公告)日：2022-03-15

申请号：US17165876

申请日：2021-02-02

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Dian Liu , Yucheng Qu , Chaoyu Hua , Weiwei Li , Jianeng Lu

IPC: G06T15/50 , G06T15/04 , G06T15/80 , A63F13/525 , G06T15/20 , A63F13/52 , A63F13/56

Abstract: This application discloses an object rendering method and apparatus, a storage medium, and an electronic device. The method includes obtaining a target pixel point to be processed in a diffuse map and a normal map of a to-be-rendered object; determining a first rendering color of the target pixel point according to a pre-integration simulation module and a normal direction parameter corresponding to the target pixel point, the pre-integration simulation module being configured to simulate a pre-integration map, the pre-integration map representing a correspondence between curvature and a color band, and the normal direction parameter representing a normal direction of the target pixel point in a world space coordinate system; determining a target rendering color of the target pixel point according to the first rendering color; and rendering the target pixel point by using the target rendering color.

5.

发明申请
SOUND ACQUISITION COMPONENT ARRAY AND SOUND ACQUISITION DEVICE 有权

公开(公告)号：US20210266664A1

公开(公告)日：2021-08-26

申请号：US17319024

申请日：2021-05-12

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jimeng Zheng , Yi Gao , Xuan Ji , Weiwei Li , Meng Yu , Kai Xia , Jun Feng , Zhu Chen , Hongyang Chen , Wenbin Yang , Yu Wang , Yong Liu

IPC: H04R3/00

Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.

6.

发明公开
AUDIO RECOGNITION METHOD, METHOD, APPARATUS FOR POSITIONING TARGET AUDIO, AND DEVICE 审中-公开

公开(公告)号：US20240233719A1

公开(公告)日：2024-07-11

申请号：US18611585

申请日：2024-03-20

Applicant: Tencent Technology ( Shenzhen ) Company Limited

Inventor： Jimeng ZHENG , Ian Ernan Liu , Yi Gao , Weiwei Li

IPC: G10L15/20 , G01S3/80 , G01S3/802 , G10L15/08 , G10L15/22 , G10L21/0208 , G10L21/0216 , G10L21/0224 , G10L21/0232 , G10L25/51

CPC classification number: G10L15/20 , G01S3/8006 , G01S3/802 , G10L15/08 , G10L15/22 , G10L21/0224 , G10L21/0232 , G10L25/51 , G10L2015/088 , G10L2021/02082 , G10L2021/02166

Abstract: This application discloses a method for positioning a target audio signal by a computer device. The method includes: performing echo cancellation on the audio signals collected in a plurality of directions in a space, the audio signals comprising a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the echo-canceled audio signals, a weight of each time-frequency point indicating a relative proportion of the target-audio direct signal in the echo-canceled audio signals at the time-frequency point; obtaining a weighted audio signal energy distribution of the audio signals in the plurality of directions by using the weights of the plurality of time-frequency points in the echo-canceled audio signals; and obtaining a sound source azimuth corresponding to the target-audio direct signal in the audio signals by using the weighted audio signal energy distribution of the audio signals in the plurality of directions.

7.

发明授权
Audio signal processing method, apparatus and device, and storage medium 有权

公开(公告)号：US12009006B2

公开(公告)日：2024-06-11

申请号：US17741285

申请日：2022-05-10

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Rilin Chen , Kaiyu Jiang , Weiwei Li

IPC: G10L21/02 , G10L21/003 , G10L21/0364 , H04R1/40 , H04R3/00

CPC classification number: G10L21/0364 , G10L21/003 , H04R1/406 , H04R3/005 , H04R2430/20

Abstract: An electronic device obtains audio signals collected by different microphones in a microphone array. The device filters the audio signals using a first filter to obtain a first target beam. The first filter is configured to suppress an interference speech in the audio signals and enhance a target speech in the audio signals. The device filters the audio signals using a second filter to obtain a first interference beam. The second filter is configured to suppress the target speech and enhance the interference speech. The device a second interference beam of the first interference beam using a third filter. The device determines a difference between the first target beam and the second interference beam as a first audio processing output. The device adaptively updates at least one of the second filter and the third filter, and updates the first filter according to the updated second filter and/or third filter.

8.

发明授权
Sound acquisition component array and sound acquisition device 有权

公开(公告)号：US11856376B2

公开(公告)日：2023-12-26

申请号：US17319024

申请日：2021-05-12

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jimeng Zheng , Yi Gao , Xuan Ji , Weiwei Li , Meng Yu , Kai Xia , Jun Feng , Zhu Chen , Hongyang Chen , Wenbin Yang , Yu Wang , Yong Liu

IPC: H04R3/00

CPC classification number: H04R3/005

Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.

9.

发明申请
AUDIO SIGNAL PROCESSING METHOD, APPARATUS AND DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20220270631A1

公开(公告)日：2022-08-25

申请号：US17741285

申请日：2022-05-10

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Rilin CHEN , Kaiyu Jiang , Weiwei Li

IPC: G10L21/0364 , H04R3/00 , H04R1/40 , G10L21/003

Abstract: An electronic device obtains audio signals collected by different microphones in a microphone array. The device filters the audio signals using a first filter to obtain a first target beam. The first filter is configured to suppress an interference speech in the audio signals and enhance a target speech in the audio signals. The device filters the audio signals using a second filter to obtain a first interference beam. The second filter is configured to suppress the target speech and enhance the interference speech. The device a second interference beam of the first interference beam using a third filter. The device determines a difference between the first target beam and the second interference beam as a first audio processing output. The device adaptively updates at least one of the second filter and the third filter, and updates the first filter according to the updated second filter and/or third filter.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification