Transform ambisonic coefficients using an adaptive network

    公开(公告)号:US11636866B2

    公开(公告)日:2023-04-25

    申请号:US17210357

    申请日:2021-03-23

    Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device also includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are also configured to apply one adaptive network, based on a constraint, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.

    Active self-voice naturalization using a bone conduction sensor

    公开(公告)号:US11606643B2

    公开(公告)日:2023-03-14

    申请号:US17530320

    申请日:2021-11-18

    Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.

    Active self-voice naturalization using a bone conduction sensor

    公开(公告)号:US11259119B1

    公开(公告)日:2022-02-22

    申请号:US17064146

    申请日:2020-10-06

    Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.

    Audio analytics for natural language processing

    公开(公告)号:US11094316B2

    公开(公告)日:2021-08-17

    申请号:US15972011

    申请日:2018-05-04

    Abstract: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.

    Speaker template update with embedding vectors based on distance metric

    公开(公告)号:US11017783B2

    公开(公告)日:2021-05-25

    申请号:US16296733

    申请日:2019-03-08

    Abstract: A device includes a processor configured to determine a feature vector based on an utterance and to determine a first embedding vector by processing the feature vector using a trained embedding network. The processor is configured to determine a first distance metric based on distances between the first embedding vector and each embedding vector of a speaker template. The processor is configured to determine, based on the first distance metric, that the utterance is verified to be from a particular user. The processor is configured to, based on a comparison of a first particular distance metric associated with the first embedding vector to a second distance metric associated with a first test embedding vector of the speaker template, generate an updated speaker template by adding the first embedding vector as a second test embedding vector and removing the first test embedding vector from test embedding vectors of the speaker template.

    AUDIO ANALYTICS FOR NATURAL LANGUAGE PROCESSING

    公开(公告)号:US20190341026A1

    公开(公告)日:2019-11-07

    申请号:US15972011

    申请日:2018-05-04

    Abstract: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.

    ACOUSTIC EVENT ENABLED GEOGRAPHIC MAPPING
    39.
    发明申请

    公开(公告)号:US20180307753A1

    公开(公告)日:2018-10-25

    申请号:US15494379

    申请日:2017-04-21

    Abstract: An electronic device includes a classifier circuit, a ranking circuit, and a data generator circuit. The classifier circuit is configured to determine, based on first data indicating samples of sounds detected at a plurality of geographic locations, a plurality of acoustic event classifications associated with the plurality of geographic locations. The ranking circuit is configured to determine a plurality of index scores associated with the plurality of geographic locations by ranking each of the plurality of geographic locations based on the plurality of acoustic event classifications. The data generator circuit is configured to generate, based on the plurality of index scores, second data indicating a geographic map corresponding to the plurality of geographic locations. The second data further indicates the plurality of index scores and a prompt to enable a search for a particular type of acoustic event.

Patent Agency Ranking