Automatic voiceover correction system

    公开(公告)号:US10453475B2

    公开(公告)日:2019-10-22

    申请号:US15432249

    申请日:2017-02-14

    Applicant: Adobe Inc.

    Abstract: In some aspects, errors are replaced within an audio file by receiving a first audio sequence and a second audio sequence. The first audio sequence includes an erroneous subsequence and the second audio sequence includes a corrected subsequence for inclusion in the first audio sequence to replace the erroneous subsequence. The location of the erroneous subsequence in the first audio sequence is determined by applying a suitable matching operation (e.g., dynamic time warping). One or more matching subsequences of the first audio sequence located proximate to the erroneous subsequence in the first audio sequence and matching corresponding subsequences of the second audio sequence are located proximate to the corrected subsequence. A corrected first audio sequence is generated by replacing the erroneous subsequence and a matching subsequence of the first audio sequence with the corrected subsequence and the matching corresponding subsequence of the second audio sequence.

    Sound rate modification
    2.
    发明授权

    公开(公告)号:US10249321B2

    公开(公告)日:2019-04-02

    申请号:US13681643

    申请日:2012-11-20

    Applicant: Adobe Inc.

    Abstract: Sound rate modification techniques are described. In one or more implementations, an indication is received of an amount that a rate of output of sound data is to be modified. One or more sound rate rules are applied to the sound data that, along with the received indication, are usable to calculate different rates at which different portions of the sound data are to be modified, respectively. The sound data is then output such that the calculated rates are applied.

    Deep encoder for performing audio processing

    公开(公告)号:US11900902B2

    公开(公告)日:2024-02-13

    申请号:US17228357

    申请日:2021-04-12

    Applicant: Adobe Inc.

    Abstract: Embodiments are disclosed for determining an answer to a query associated with a graphical representation of data. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an unprocessed audio sequence and a request to perform an audio signal processing effect on the unprocessed audio sequence. The one or more embodiments further include analyzing, by a deep encoder, the unprocessed audio sequence to determine parameters for processing the unprocessed audio sequence. The one or more embodiments further include sending the unprocessed audio sequence and the parameters to one or more audio signal processing effects plugins to perform the requested audio signal processing effect using the parameters and outputting a processed audio sequence after processing of the unprocessed audio sequence using the parameters of the one or more audio signal processing effects plugins.

    Audio production assistant for style transfers of audio recordings using one-shot parametric predictions

    公开(公告)号:US11082789B1

    公开(公告)日:2021-08-03

    申请号:US15931505

    申请日:2020-05-13

    Applicant: Adobe Inc.

    Abstract: One example method involves operations for receiving input to transform audio to a target style. Operations further include providing the audio to a predictive model trained to transform the audio into produced audio. Training the predictive model includes accessing representations of audios and unpaired audios. Further, training includes generating feature embeddings by extracting features from representations of an audio and an unpaired audio. The unpaired audio includes a reference production style, and the feature embeddings correspond to their representations. Training further includes generating a feature vector by comparing the feature embeddings using a comparison model. Further, training includes computing prediction parameters using a learned function. The prediction parameters can transform the feature vector into the reference style. Training further includes updating the predictive model with the prediction parameters. In addition, operations include generating the produced audio by modifying audio effects of the audio into the target style.

    Variable sound decomposition masks

    公开(公告)号:US10262680B2

    公开(公告)日:2019-04-16

    申请号:US13931450

    申请日:2013-06-28

    Applicant: Adobe Inc.

    Abstract: Variable sound decomposition masking techniques are described. In one or more implementations, a mask is generated that incorporates a user input as part of the mask, the user input is usable at least in part to define a threshold that is variable based on the user input and configured for use in performing a sound decomposition process. The sound decomposition process is performed using the mask to assign portions of sound data to respective ones of a plurality of sources of the sound data.

Patent Agency Ranking