Characterizing audio using transchromagrams

    公开(公告)号:US10475426B2

    公开(公告)日:2019-11-12

    申请号:US16203811

    申请日:2018-11-29

    Abstract: Methods, systems and apparatus to characterize audio using transchromagrams are disclosed. An example apparatus includes a transchromagram generator to generate a data structure based on a set of transition matrices corresponding to a plurality of time frames of audio data, the data structure indicative of probabilities that first musical notes will transition to second musical notes, a database controller to prompt a database to store the data structure within the audio data, and a notification manager to generate, based on a comparison between query audio data and the stored data structure of the audio data, a notification identifying at least one characteristic of the query audio data.

    Machine-led mood change
    3.
    发明授权

    公开(公告)号:US10048931B2

    公开(公告)日:2018-08-14

    申请号:US15721161

    申请日:2017-09-29

    Abstract: A machine is configured to identify a media file that, when played to a user, is likely to modify an emotional or physical state of the user to or towards a target emotional or physical state. The machine accesses play counts that quantify playbacks of media files for the user. The playbacks may be locally performed or detected by the machine from ambient sound. The machine accesses arousal scores of the media files and determines a distribution of the play counts over the arousal scores. The machine uses one or more relative maxima in the distribution in selecting a target arousal score for the user based on contextual data that describes an activity of the user. The machine selects one or more media files based on the target arousal score. The machine may then cause the selected media file to be played to the user.

    Generating a video presentation to accompany audio

    公开(公告)号:US11915722B2

    公开(公告)日:2024-02-27

    申请号:US15474305

    申请日:2017-03-30

    CPC classification number: G11B27/031 G06F16/438 G06F16/73 G06F16/7834

    Abstract: Example methods and systems for generating a video presentation to accompany audio are described. The video presentation to accompany the audio track is generated from one or more video sequences. In some example embodiments, the video sequences are divided into video segments that correspond to discontinuities between frames. Video segments are concatenated to form a video presentation to which the audio track is added. In some example embodiments, only video segments having a duration equal to an integral number of beats of music in the audio track are used to form the video presentation. In these example embodiments, transitions between video segments in the video presentation that accompanies the audio track are aligned with the beats of the music.

    Model-based media classification service using sensed media noise characteristics

    公开(公告)号:US10635701B2

    公开(公告)日:2020-04-28

    申请号:US15185654

    申请日:2016-06-17

    Abstract: A neural network-based classifier system can receive a query including a media signal and, in response, provide an indication that the query corresponds to a specified media type or media class. The neural network-based classifier system can select and apply various models to facilitate media classification. In an example embodiment, a query can be analyzed for various characteristics, such as a noise profile, before it is input to the network-based classifier. If the query has greater than a specified threshold noise characteristic, then a successful classification can be unlikely and a classification process based on the query can be terminated before computational resources are expended. Query signals that meet or exceed a threshold condition can be provided to the network-based classifier for media classification. In an example embodiment, a remote device or a central media classifier circuit can determine a noise profile for a query.

    METHODS AND APPARATUS FOR DYNAMIC VOLUME ADJUSTMENT VIA AUDIO CLASSIFICATION

    公开(公告)号:US20200081683A1

    公开(公告)日:2020-03-12

    申请号:US16563717

    申请日:2019-09-06

    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for dynamic volume adjustment via audio classification. Examples methods include analyzing, with a neural network trained model, a parameter of an audio signal associated with a first volume level to determine a classification group associated with the audio signal, determining an input volume of the audio signal, the selection based on the classification group associated with the audio signal, applying a gain value to the audio signal, the gain value based on the classification group and the input volume, the gain value to modify the first volume level to a second volume level, and applying a compression value to the audio signal, the compression value to modify the second volume level to a third volume level that satisfies a target volume threshold.

    MACHINE-LED MOOD CHANGE
    8.
    发明申请

    公开(公告)号:US20180024810A1

    公开(公告)日:2018-01-25

    申请号:US15721161

    申请日:2017-09-29

    CPC classification number: G06F3/165 G06F17/30772 H04L67/22

    Abstract: A machine is configured to identify a media file that, when played to a user, is likely to modify an emotional or physical state of the user to or towards a target emotional or physical state. The machine accesses play counts that quantify playbacks of media files for the user. The playbacks may be locally performed or detected by the machine from ambient sound. The machine accesses arousal scores of the media files and determines a distribution of the play counts over the arousal scores. The machine uses one or more relative maxima in the distribution in selecting a target arousal score for the user based on contextual data that describes an activity of the user. The machine selects one or more media files based on the target arousal score. The machine may then cause the selected media file to be played to the user.

    Machine-led mood change
    9.
    发明授权

    公开(公告)号:US09792084B2

    公开(公告)日:2017-10-17

    申请号:US14980650

    申请日:2015-12-28

    CPC classification number: G06F3/165 G06F17/30772 H04L67/22

    Abstract: A machine is configured to identify a media file that, when played to a user, is likely to modify an emotional or physical state of the user to or towards a target emotional or physical state. The machine accesses play counts that quantify playbacks of media files for the user. The playbacks may be locally performed or detected by the machine from ambient sound. The machine accesses arousal scores of the media files and determines a distribution of the play counts over the arousal scores. The machine uses one or more relative maxima in the distribution in selecting a target arousal score for the user based on contextual data that describes an activity of the user. The machine selects one or more media files based on the target arousal score. The machine may then cause the selected media file to be played to the user.

    MODEL-BASED MEDIA CLASSIFICATION SERVICE USING SENSED MEDIA NOISE CHARACTERISTICS

    公开(公告)号:US20170193097A1

    公开(公告)日:2017-07-06

    申请号:US15185654

    申请日:2016-06-17

    Abstract: A neural network-based classifier system can receive a query including a media signal and, in response, provide an indication that the query corresponds to a specified media type or media class. The neural network-based classifier system can select and apply various models to facilitate media classification. In an example embodiment, a query can be analyzed for various characteristics, such as a noise profile, before it is input to the network-based classifier. If the query has greater than a specified threshold noise characteristic, then a successful classification can be unlikely and a classification process based on the query can be terminated before computational resources are expended. Query signals that meet or exceed a threshold condition can be provided to the network-based classifier for media classification. In an example embodiment, a remote device or a central media classifier circuit can determine a noise profile for a query.

Patent Agency Ranking