-
Publication No.: US11967316B2
Publication date: 2024-04-23
Application No.: US17183209
Application date: 2021-02-23
Inventor: Jimeng Zheng , Ian Ernan Liu , Yi Gao , Weiwei Li
IPC: G10L15/22 , G01S3/80 , G01S3/802 , G10L15/08 , G10L15/20 , G10L21/0224 , G10L21/0232 , G10L25/51 , G10L21/0208 , G10L21/0216
CPC classification number: G10L15/20 , G01S3/8006 , G01S3/802 , G10L15/08 , G10L15/22 , G10L21/0224 , G10L21/0232 , G10L25/51 , G10L2015/088 , G10L2021/02082 , G10L2021/02166
Abstract: Embodiments of this application disclose a method and an apparatus for positioning a target audio signal by an audio interaction device, and an audio interaction device. The method includes: obtaining audio signals in a plurality of directions in a space, and performing echo cancellation on the audio signals, the audio signals including a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the audio signals, a weight of each time-frequency point indicating, at the time-frequency point, a relative proportion of the target-audio direct signal in the audio signals; weighting time-frequency components of the audio signals at the plurality of time-frequency points separately for each of the plurality of directions by using the weights of the plurality of time-frequency points, to obtain a weighted audio signal energy distribution; and obtaining, from the weighted audio signal energy distribution, a sound source azimuth corresponding to the target-audio direct signal in the audio signals.
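Illustrative note: the weighting step in this abstract can be pictured with a small sketch. It is not the patented implementation. The function name `weighted_azimuth`, the nested-list layout, and the toy numbers are assumptions made for illustration, and the sketch takes the per-direction energies and the weights as given rather than estimating them.

```python
def weighted_azimuth(energy, weights, azimuths):
    """Pick the candidate azimuth with the largest weighted energy.

    energy[d][t][f]: energy seen from direction d at time-frequency point (t, f).
    weights[t][f]:   assumed proportion of target direct sound at (t, f), in [0, 1].
    azimuths[d]:     candidate azimuth (degrees) for direction d.
    """
    totals = []
    for per_direction in energy:
        total = 0.0
        for t, frame in enumerate(per_direction):
            for f, value in enumerate(frame):
                total += weights[t][f] * value
        totals.append(total)
    best = max(range(len(totals)), key=totals.__getitem__)
    return azimuths[best], totals


# Toy data: two directions, two frames, two frequency bins.
energy = [
    [[1.0, 9.0], [1.0, 9.0]],  # 0 degrees: loud in bin 1 (treated as interference)
    [[4.0, 1.0], [4.0, 1.0]],  # 90 degrees: energy concentrated in bin 0
]
weights = [[0.9, 0.1], [0.9, 0.1]]  # bin 0 is assumed to be mostly target direct sound

azimuth, totals = weighted_azimuth(energy, weights, [0.0, 90.0])
unweighted = [sum(sum(frame) for frame in d) for d in energy]
```

In this toy case the unweighted energy is larger at 0 degrees, while the weighted energy is larger at 90 degrees, which is the effect the weights are meant to have.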
-
Publication No.: US12051441B2
Publication date: 2024-07-30
Application No.: US17944067
Application date: 2022-09-13
Inventor: Jimeng Zheng , Lianwu Chen , Weiwei Li , Zhiyi Duan , Meng Yu , Dan Su , Kaiyu Jiang
CPC classification number: G10L25/84 , G06T7/20 , G10L17/02 , G10L17/22 , G10L21/028 , G10L25/21 , G06T2207/30201
Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to N sound areas including multiple users speaking simultaneously; generating a control signal corresponding to each target detection sound area according to user information corresponding to the target detection sound area; processing multi-user speech input signals by using the control signals, to obtain a speech output signal corresponding to each target detection sound area; generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area; and selecting, among the multiple users, a main speaker based on the user information, the speech output signals and speech detection results of multiple users in the N sound areas.
-
Publication No.: US20230013740A1
Publication date: 2023-01-19
Application No.: US17944067
Application date: 2022-09-13
Inventor: Jimeng Zheng , Lianwu Chen , Weiwei Li , Zhiyi Duan , Meng Yu , Dan Su , Kaiyu Jiang
Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area. Speech signals in different directions are processed in parallel based on a plurality of sound areas, so that in a multi-sound source scenario, the speech signals in different directions may be retained or suppressed by a control signal, to separate and enhance speech of a target detection user in real time, thereby improving the accuracy of speech detection.
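Illustrative note: the retain-or-suppress role of the control signal can be sketched as follows. This is a simplified reading of the abstract, not the claimed method. The binary gain, the mean-energy threshold, and the names `detect_sound_areas`, `has_user`, and `threshold` are all assumptions introduced here.

```python
def detect_sound_areas(area_inputs, has_user, threshold=0.01):
    """Return a per-area speech detection flag.

    area_inputs: dict mapping a sound-area name to its speech input samples.
    has_user:    dict mapping a sound-area name to True if a user is in it
                 (a stand-in for the "sound area information").
    threshold:   assumed mean-energy level above which speech is reported.
    """
    results = {}
    for area, samples in area_inputs.items():
        # Control signal (assumed binary): 1.0 retains the area, 0.0 suppresses it.
        control = 1.0 if has_user.get(area, False) else 0.0
        output = [control * x for x in samples]
        energy = sum(x * x for x in output) / len(output) if output else 0.0
        results[area] = energy > threshold
    return results


detections = detect_sound_areas(
    {
        "front": [0.5, -0.4, 0.6],   # user present and speaking
        "rear": [0.5, 0.5],          # loud, but no user: suppressed
        "left": [0.001, -0.001],     # user present but silent
    },
    {"front": True, "rear": False, "left": True},
)
```

Each area is handled independently, so the loop could run in parallel, in line with the parallel processing the abstract mentions.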
-
Publication No.: US11276227B2
Publication date: 2022-03-15
Application No.: US17165876
Application date: 2021-02-02
Inventor: Dian Liu , Yucheng Qu , Chaoyu Hua , Weiwei Li , Jianeng Lu
Abstract: This application discloses an object rendering method and apparatus, a storage medium, and an electronic device. The method includes obtaining a target pixel point to be processed in a diffuse map and a normal map of a to-be-rendered object; determining a first rendering color of the target pixel point according to a pre-integration simulation module and a normal direction parameter corresponding to the target pixel point, the pre-integration simulation module being configured to simulate a pre-integration map, the pre-integration map representing a correspondence between curvature and a color band, and the normal direction parameter representing a normal direction of the target pixel point in a world space coordinate system; determining a target rendering color of the target pixel point according to the first rendering color; and rendering the target pixel point by using the target rendering color.
-
Publication No.: US20210266664A1
Publication date: 2021-08-26
Application No.: US17319024
Application date: 2021-05-12
Inventor: Jimeng Zheng , Yi Gao , Xuan Ji , Weiwei Li , Meng Yu , Kai Xia , Jun Feng , Zhu Chen , Hongyang Chen , Wenbin Yang , Yu Wang , Yong Liu
IPC: H04R3/00
Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.
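Illustrative note: the geometric constraints in this abstract can be checked with a few lines of code. The coordinate convention (connecting line on the x-axis, perpendicular bisector on the y-axis), the function name `check_array`, and the example dimensions in metres are assumptions chosen for illustration and do not come from the patent.

```python
def check_array(first, second, third, tol=1e-9):
    """Check the layout constraints for three pairs of (x, y) positions.

    Coordinates assume the line connecting the first pair is the x-axis and
    its perpendicular bisector is the y-axis.
    """
    a, b = first
    # The first pair lies on the connecting line, centered on the bisector.
    first_ok = abs(a[1]) < tol and abs(b[1]) < tol and abs(a[0] + b[0]) < tol

    def symmetric(pair):
        p, q = pair
        return abs(p[0] + q[0]) < tol and abs(p[1] - q[1]) < tol

    # Second pair on one side of the line, third pair on the opposite side.
    sides_ok = all(p[1] > 0 for p in second) and all(p[1] < 0 for p in third)

    # Pair spacings measured along the connecting line must all differ.
    spans = [abs(p[0] - q[0]) for p, q in (first, second, third)]
    distinct = len({round(s, 9) for s in spans}) == 3

    return first_ok and symmetric(second) and symmetric(third) and sides_ok and distinct


valid = check_array(
    ((-0.04, 0.0), (0.04, 0.0)),
    ((-0.02, 0.03), (0.02, 0.03)),
    ((-0.03, -0.03), (0.03, -0.03)),
)
# Same layout, but the third pair repeats the second pair's spacing.
invalid = check_array(
    ((-0.04, 0.0), (0.04, 0.0)),
    ((-0.02, 0.03), (0.02, 0.03)),
    ((-0.02, -0.03), (0.02, -0.03)),
)
```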
-
Publication No.: US20240233719A1
Publication date: 2024-07-11
Application No.: US18611585
Application date: 2024-03-20
Inventor: Jimeng Zheng , Ian Ernan Liu , Yi Gao , Weiwei Li
IPC: G10L15/20 , G01S3/80 , G01S3/802 , G10L15/08 , G10L15/22 , G10L21/0208 , G10L21/0216 , G10L21/0224 , G10L21/0232 , G10L25/51
CPC classification number: G10L15/20 , G01S3/8006 , G01S3/802 , G10L15/08 , G10L15/22 , G10L21/0224 , G10L21/0232 , G10L25/51 , G10L2015/088 , G10L2021/02082 , G10L2021/02166
Abstract: This application discloses a method for positioning a target audio signal by a computer device. The method includes: performing echo cancellation on the audio signals collected in a plurality of directions in a space, the audio signals comprising a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the echo-canceled audio signals, a weight of each time-frequency point indicating a relative proportion of the target-audio direct signal in the echo-canceled audio signals at the time-frequency point; obtaining a weighted audio signal energy distribution of the audio signals in the plurality of directions by using the weights of the plurality of time-frequency points in the echo-canceled audio signals; and obtaining a sound source azimuth corresponding to the target-audio direct signal in the audio signals by using the weighted audio signal energy distribution of the audio signals in the plurality of directions.
-
Publication No.: US12009006B2
Publication date: 2024-06-11
Application No.: US17741285
Application date: 2022-05-10
Inventor: Rilin Chen , Kaiyu Jiang , Weiwei Li
IPC: G10L21/02 , G10L21/003 , G10L21/0364 , H04R1/40 , H04R3/00
CPC classification number: G10L21/0364 , G10L21/003 , H04R1/406 , H04R3/005 , H04R2430/20
Abstract: An electronic device obtains audio signals collected by different microphones in a microphone array. The device filters the audio signals using a first filter to obtain a first target beam. The first filter is configured to suppress an interference speech in the audio signals and enhance a target speech in the audio signals. The device filters the audio signals using a second filter to obtain a first interference beam. The second filter is configured to suppress the target speech and enhance the interference speech. The device obtains a second interference beam of the first interference beam using a third filter. The device determines a difference between the first target beam and the second interference beam as a first audio processing output. The device adaptively updates at least one of the second filter and the third filter, and updates the first filter according to the updated second filter and/or third filter.
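Illustrative note: the signal flow in this abstract (target beam, interference beam, third filter, difference) can be sketched for two microphones. This is a heavily simplified stand-in, not the patented design. The fixed sum and difference filters, the single adaptive tap, the NLMS update rule, and the name `process` are all assumptions, and the sketch does not update the first filter as the abstract describes.

```python
import math


def process(mic_a, mic_b, step=0.5, eps=1e-8):
    """Return (output samples, final adaptive weight) for two microphone signals."""
    w = 0.0  # third filter: one adaptive tap (assumption)
    out = []
    for a, b in zip(mic_a, mic_b):
        target_beam = 0.5 * (a + b)    # first filter: fixed sum, keeps in-phase target
        interference_beam = a - b      # second filter: fixed difference, cancels in-phase target
        second_interference = w * interference_beam  # third filter applied to the interference beam
        y = target_beam - second_interference        # audio processing output
        # NLMS step (assumed update rule): reduce output energy correlated with the interference beam.
        w += step * y * interference_beam / (interference_beam ** 2 + eps)
        out.append(y)
    return out, w


# Toy input: interference only, reaching microphone B at half the level of microphone A.
interference = [math.sin(0.3 * n) for n in range(200)]
mic_a = interference
mic_b = [0.5 * v for v in interference]
output, weight = process(mic_a, mic_b)
```

With this toy input the interference appears as 0.75 v in the target beam and 0.5 v in the interference beam, so the adaptive tap settles near 1.5 and the output residual decays toward zero.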
-
Publication No.: US11856376B2
Publication date: 2023-12-26
Application No.: US17319024
Application date: 2021-05-12
Inventor: Jimeng Zheng , Yi Gao , Xuan Ji , Weiwei Li , Meng Yu , Kai Xia , Jun Feng , Zhu Chen , Hongyang Chen , Wenbin Yang , Yu Wang , Yong Liu
IPC: H04R3/00
CPC classification number: H04R3/005
Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.
-
Publication No.: US20220270631A1
Publication date: 2022-08-25
Application No.: US17741285
Application date: 2022-05-10
Inventor: Rilin Chen , Kaiyu Jiang , Weiwei Li
IPC: G10L21/0364 , H04R3/00 , H04R1/40 , G10L21/003
Abstract: An electronic device obtains audio signals collected by different microphones in a microphone array. The device filters the audio signals using a first filter to obtain a first target beam. The first filter is configured to suppress an interference speech in the audio signals and enhance a target speech in the audio signals. The device filters the audio signals using a second filter to obtain a first interference beam. The second filter is configured to suppress the target speech and enhance the interference speech. The device obtains a second interference beam of the first interference beam using a third filter. The device determines a difference between the first target beam and the second interference beam as a first audio processing output. The device adaptively updates at least one of the second filter and the third filter, and updates the first filter according to the updated second filter and/or third filter.