Abstract:
Systems and techniques for removing a sound recording from an audio recording (e.g., an audio recording embedded in a media file) are presented. The system can include an identification component, a first subtraction component, and a second subtraction component. The identification component identifies a sound recording in a mixed audio recording. The first subtraction component determines a local linear transformation of the sound recording and subtracts the local linear transformation of the sound recording from the mixed audio recording to generate a new mixed audio recording. The second subtraction component compares one or more segments of the sound recording with one or more corresponding segments of the new mixed audio recording and reduces a power level of the new mixed audio recording based at least in part on the correlation between the segments of the sound recording and the corresponding segments of the new mixed audio recording.
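The two subtraction stages described above can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the abstract: the "local linear transformation" is modeled as a single least-squares gain per fixed-length segment, the power reduction is modeled as scaling each segment by one minus the absolute normalized correlation, and the function names subtract_local_gain and attenuate_by_correlation are hypothetical. Signals are plain lists of samples that are assumed to be already time-aligned.

```python
import math


def subtract_local_gain(mixed, reference, seg_len):
    """Stage 1 (assumed form): per segment, fit a scalar gain by least
    squares and subtract gain * reference from the mixed signal."""
    out = []
    for start in range(0, len(mixed), seg_len):
        m = mixed[start:start + seg_len]
        r = reference[start:start + seg_len]
        energy = sum(b * b for b in r)
        # Least-squares gain that best maps the reference onto the mix.
        gain = sum(a * b for a, b in zip(m, r)) / energy if energy > 0 else 0.0
        out.extend(a - gain * b for a, b in zip(m, r))
    return out


def attenuate_by_correlation(residual, reference, seg_len):
    """Stage 2 (assumed form): scale each segment of the residual by
    1 - |normalized correlation| with the matching reference segment."""
    out = []
    for start in range(0, len(residual), seg_len):
        x = residual[start:start + seg_len]
        r = reference[start:start + seg_len]
        denom = math.sqrt(sum(a * a for a in x) * sum(b * b for b in r))
        corr = abs(sum(a * b for a, b in zip(x, r))) / denom if denom > 0 else 0.0
        out.extend(a * (1.0 - corr) for a in x)
    return out


# Hypothetical toy signals: the reference is mixed in at a different gain
# in each two-sample segment on top of a constant background.
reference = [1.0, -1.0, 2.0, -2.0]
mixed = [3.5, -2.5, 1.5, -0.5]
cleaned = subtract_local_gain(mixed, reference, seg_len=2)
```

In this sketch the second stage would typically be run with a different segment length than the first, because a residual produced by a least-squares fit is already uncorrelated with the reference over the segment it was fitted on.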
Abstract:
The subject matter of this specification can be implemented in, among other things, a computer-implemented method including detecting positions of objects of a specific type within an ordered sequence of images. The method includes estimating one or more intermediate positions of one or more intermediate instances of an object in one or more intermediate images within the ordered sequence of images between an initial image and a subsequent image based on an initial position of an initial instance of the object in the initial image and a subsequent position of a subsequent instance of the object in the subsequent image. The method includes providing a list of the objects for presentation. The method includes receiving a selection of the object from the list. The method includes performing an operation on the initial instance, the intermediate instances, and the subsequent instance of the object.
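The estimation of intermediate positions can be sketched as follows. The abstract does not say how the estimate is computed, so this example assumes plain linear interpolation between the initial and subsequent positions; the function name interpolate_positions, the use of integer frame indices, and the representation of a position as a coordinate tuple are all hypothetical choices made for illustration.

```python
def interpolate_positions(start_frame, start_pos, end_frame, end_pos):
    """Estimate positions for the frames strictly between start_frame and
    end_frame by linear interpolation (an assumed estimation rule).

    Positions are tuples of coordinates, e.g. (x, y). Returns a dict that
    maps each intermediate frame index to its estimated position.
    """
    if end_frame <= start_frame:
        raise ValueError("end_frame must come after start_frame")
    span = end_frame - start_frame
    estimates = {}
    for frame in range(start_frame + 1, end_frame):
        t = (frame - start_frame) / span  # fraction of the way to end_frame
        estimates[frame] = tuple(
            a + t * (b - a) for a, b in zip(start_pos, end_pos)
        )
    return estimates


# Hypothetical example: an object detected at (0, 0) in frame 0 and at
# (8, 4) in frame 4; frames 1 to 3 receive estimated positions.
estimates = interpolate_positions(0, (0.0, 0.0), 4, (8.0, 4.0))
```

An operation selected for the object could then be applied to the detected initial and subsequent instances together with every estimated intermediate instance in the returned mapping.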