-
公开(公告)号:US10453475B2
公开(公告)日:2019-10-22
申请号:US15432249
申请日:2017-02-14
Applicant: Adobe Inc.
Inventor: Shrikant Venkataramani , Paris Smaragdis , Gautham Mysore
Abstract: In some aspects, errors are replaced within an audio file by receiving a first audio sequence and a second audio sequence. The first audio sequence includes an erroneous subsequence and the second audio sequence includes a corrected subsequence for inclusion in the first audio sequence to replace the erroneous subsequence. The location of the erroneous subsequence in the first audio sequence is determined by applying a suitable matching operation (e.g., dynamic time warping). One or more matching subsequences of the first audio sequence located proximate to the erroneous subsequence in the first audio sequence and matching corresponding subsequences of the second audio sequence are located proximate to the corrected subsequence. A corrected first audio sequence is generated by replacing the erroneous subsequence and a matching subsequence of the first audio sequence with the corrected subsequence and the matching corresponding subsequence of the second audio sequence.
-
公开(公告)号:US10249321B2
公开(公告)日:2019-04-02
申请号:US13681643
申请日:2012-11-20
Applicant: Adobe Inc.
Inventor: Brian John King , Gautham J. Mysore , Paris Smaragdis
IPC: G10L21/00 , G10L21/043
Abstract: Sound rate modification techniques are described. In one or more implementations, an indication is received of an amount that a rate of output of sound data is to be modified. One or more sound rate rules are applied to the sound data that, along with the received indication, are usable to calculate different rates at which different portions of the sound data are to be modified, respectively. The sound data is then output such that the calculated rates are applied.
-
公开(公告)号:US11900902B2
公开(公告)日:2024-02-13
申请号:US17228357
申请日:2021-04-12
Applicant: Adobe Inc.
Inventor: Marco Antonio Martinez Ramirez , Nicholas J. Bryan , Oliver Wang , Paris Smaragdis
CPC classification number: G10H1/0008 , G06N3/084 , H03G3/32 , H03G5/025 , H04R5/04 , G10H2250/165
Abstract: Embodiments are disclosed for determining an answer to a query associated with a graphical representation of data. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an unprocessed audio sequence and a request to perform an audio signal processing effect on the unprocessed audio sequence. The one or more embodiments further include analyzing, by a deep encoder, the unprocessed audio sequence to determine parameters for processing the unprocessed audio sequence. The one or more embodiments further include sending the unprocessed audio sequence and the parameters to one or more audio signal processing effects plugins to perform the requested audio signal processing effect using the parameters and outputting a processed audio sequence after processing of the unprocessed audio sequence using the parameters of the one or more audio signal processing effects plugins.
-
公开(公告)号:US11082789B1
公开(公告)日:2021-08-03
申请号:US15931505
申请日:2020-05-13
Applicant: Adobe Inc.
Inventor: Stylianos Ioannis Mimilakis , Paris Smaragdis , Nicholas Bryan
Abstract: One example method involves operations for receiving input to transform audio to a target style. Operations further include providing the audio to a predictive model trained to transform the audio into produced audio. Training the predictive model includes accessing representations of audios and unpaired audios. Further, training includes generating feature embeddings by extracting features from representations of an audio and an unpaired audio. The unpaired audio includes a reference production style, and the feature embeddings correspond to their representations. Training further includes generating a feature vector by comparing the feature embeddings using a comparison model. Further, training includes computing prediction parameters using a learned function. The prediction parameters can transform the feature vector into the reference style. Training further includes updating the predictive model with the prediction parameters. In addition, operations include generating the produced audio by modifying audio effects of the audio into the target style.
-
公开(公告)号:US10262680B2
公开(公告)日:2019-04-16
申请号:US13931450
申请日:2013-06-28
Applicant: Adobe Inc.
Inventor: Gautham J. Mysore , Paris Smaragdis
IPC: G10L21/0364 , G10L15/20 , G10L21/0208 , G10L25/84 , G10L25/51
Abstract: Variable sound decomposition masking techniques are described. In one or more implementations, a mask is generated that incorporates a user input as part of the mask, the user input is usable at least in part to define a threshold that is variable based on the user input and configured for use in performing a sound decomposition process. The sound decomposition process is performed using the mask to assign portions of sound data to respective ones of a plurality of sources of the sound data.
-
-
-
-