-
公开(公告)号:US11082789B1
公开(公告)日:2021-08-03
申请号:US15931505
申请日:2020-05-13
Applicant: Adobe Inc.
Inventor: Stylianos Ioannis Mimilakis , Paris Smaragdis , Nicholas Bryan
Abstract: One example method involves operations for receiving input to transform audio to a target style. Operations further include providing the audio to a predictive model trained to transform the audio into produced audio. Training the predictive model includes accessing representations of audios and unpaired audios. Further, training includes generating feature embeddings by extracting features from representations of an audio and an unpaired audio. The unpaired audio includes a reference production style, and the feature embeddings correspond to their representations. Training further includes generating a feature vector by comparing the feature embeddings using a comparison model. Further, training includes computing prediction parameters using a learned function. The prediction parameters can transform the feature vector into the reference style. Training further includes updating the predictive model with the prediction parameters. In addition, operations include generating the produced audio by modifying audio effects of the audio into the target style.