-
公开(公告)号:US20220272447A1
公开(公告)日:2022-08-25
申请号:US17666604
申请日:2022-02-08
Applicant: GN Audio A/S
Inventor: Rasmus Kongsgaard Olsson , Thomas Fuglsang
IPC: H04R3/00 , G10L25/78 , G10L25/30 , H04R1/40 , G10L25/18 , H04L65/1089 , H04L65/403 , H04M3/56 , G06N3/08
Abstract: A conference device and a computer-implemented method for training a neural network are disclosed, the conference device comprising a conference controller; a microphone array comprising a plurality of microphones for provision of audio signals representing audio from one or more sound sources; a direction estimator connected to the conference controller and the microphone array, the direction estimator configured to obtain, from the microphone array, a plurality of audio signals including a first audio signal and a second audio signal; determine direction data based on the plurality of audio signals, the direction data comprising an indication of an estimated probability of voice activity for one or more directions, wherein to determine direction data comprises to apply an offline-trained neural network; and output audio data based on the direction data to the conference controller.
-
公开(公告)号:US11778374B2
公开(公告)日:2023-10-03
申请号:US17666604
申请日:2022-02-08
Applicant: GN Audio A/S
Inventor: Rasmus Kongsgaard Olsson , Thomas Fuglsang
IPC: H04R3/00 , G06N3/08 , G10L25/30 , G10L25/78 , H04L65/403 , H04M3/56 , H04R1/40 , G10L25/18 , H04L65/1089
CPC classification number: H04R3/005 , G06N3/08 , G10L25/18 , G10L25/30 , G10L25/78 , H04L65/1089 , H04L65/403 , H04M3/568 , H04R1/406 , H04M2201/40 , H04R2201/401
Abstract: A conference device and a computer-implemented method for training a neural network are disclosed, the conference device comprising a conference controller; a microphone array comprising a plurality of microphones for provision of audio signals representing audio from one or more sound sources; a direction estimator connected to the conference controller and the microphone array, the direction estimator configured to obtain, from the microphone array, a plurality of audio signals including a first audio signal and a second audio signal; determine direction data based on the plurality of audio signals, the direction data comprising an indication of an estimated probability of voice activity for one or more directions, wherein to determine direction data comprises to apply an offline-trained neural network; and output audio data based on the direction data to the conference controller.
-