COMPLEMENTARY VIRTUAL AUDIO GENERATION
    11.
    发明申请

    公开(公告)号:US20190320281A1

    公开(公告)日:2019-10-17

    申请号:US15951907

    申请日:2018-04-12

    Abstract: An apparatus includes a processor configured to receive one or more media signals associated with a scene. The processor is also configured to identify a spatial location in the scene for each source of the one or more media signals. The processor is further configured to identify audio content for each media signal of the one or more media signals. The processor is also configured to determine one or more candidate spatial locations in the scene based on the identified spatial locations. The processor is further configured to generate audio to playback as virtual sounds that originate from the one or more candidate spatial locations.

    MULTIPLE MICROPHONE SPEECH GENERATIVE NETWORKS

    公开(公告)号:US20190311728A1

    公开(公告)日:2019-10-10

    申请号:US15948681

    申请日:2018-04-09

    Abstract: Methods, systems, and devices for auditory enhancement are described. A device may receive a respective auditory signal at each of a set of microphones, where each auditory signal includes a respective representation of a target auditory component and one or more noise artifacts. The device may identify a directionality associated with a source of the target auditory component (e.g., based on an arrangement of the multiple microphones). The device may determine a distribution function for the target auditory component based at least in part on the directionality associated with the source and on the received plurality of auditory signals. The device may generate an estimate of the target auditory component based at least in part on the distribution function and output the estimate of the target auditory component.

    DEEP NEURAL NET BASED FILTER PREDICTION FOR AUDIO EVENT CLASSIFICATION AND EXTRACTION
    19.
    发明申请
    DEEP NEURAL NET BASED FILTER PREDICTION FOR AUDIO EVENT CLASSIFICATION AND EXTRACTION 有权
    深度基于神经网络的滤波器预测音频事件分类和提取

    公开(公告)号:US20160284346A1

    公开(公告)日:2016-09-29

    申请号:US14671850

    申请日:2015-03-27

    Abstract: Disclosed is a feature extraction and classification methodology wherein audio data is gathered in a target environment under varying conditions. From this collected data, corresponding features are extracted, labeled with appropriate filters (e.g., audio event descriptions), and used for training deep neural networks (DNNs) to extract underlying target audio events from unlabeled training data. Once trained, these DNNs are used to predict underlying events in noisy audio to extract therefrom features that enable the separation of the underlying audio events from the noisy components thereof.

    Abstract translation: 公开了一种特征提取和分类方法,其中音频数据在不同条件下收集在目标环境中。 从该收集的数据中,提取相应的特征,用适当的滤波器(例如,音频事件描述)标记,并用于训练深层神经网络(DNN)以从未标记的训练数据中提取潜在的目标音频事件。 一旦被训练,这些DNN用于预测嘈杂音频中的底层事件以从其中提取能够将底层音频事件与其噪声分量分开的特征。

    METHOD, SYSTEM AND ARTICLE OF MANUFACTURE FOR PROCESSING SPATIAL AUDIO
    20.
    发明申请
    METHOD, SYSTEM AND ARTICLE OF MANUFACTURE FOR PROCESSING SPATIAL AUDIO 有权
    用于处理空间音频的制造方法,系统和制品

    公开(公告)号:US20160198282A1

    公开(公告)日:2016-07-07

    申请号:US14807760

    申请日:2015-07-23

    Abstract: Techniques for processing directionally-encoded audio to account for spatial characteristics of a listener playback environment are disclosed. The directionally-encoded audio data includes spatial information indicative of one or more directions of sound sources in an audio scene. The audio data is modified based on input data identifying the spatial characteristics of the playback environment. The spatial characteristics may correspond to actual loudspeaker locations in the playback environment. The directionally-encoded audio may also be processed to permit focusing/defocusing on sound sources or particular directions in an audio scene. The disclosed techniques may allow a recorded audio scene to be more accurately reproduced at playback time, regardless of the output loudspeaker setup. Another advantage is that a user may dynamically configure audio data so that it better conforms to the user's particular loudspeaker layouts and/or the user's desired focus on particular subjects or areas in an audio scene.

    Abstract translation: 公开了用于处理定向编码的音频以考虑收听者回放环境的空间特征的技术。 定向编码的音频数据包括指示音频场景中的声源的一个或多个方向的空间信息。 基于识别回放环境的空间特性的输入数据来修改音频数据。 空间特征可以对应于播放环境中的实际扬声器位置。 定向编码的音频也可以被处理以允许对音频场景中的声源或特定方向进行聚焦/散焦。 所公开的技术可以允许在播放时间更准确地再现记录的音频场景,而与输出的扬声器设置无关。 另一个优点是用户可以动态地配置音频数据,使得其更好地符合用户的特定扬声器布局和/或用户对音频场景中的特定主体或区域的期望焦点。

Patent Agency Ranking