Machine learning system for matching groups of related media files

    公开(公告)号:US10832692B1

    公开(公告)日:2020-11-10

    申请号:US16049369

    申请日:2018-07-30

    Abstract: Techniques are described for verifying that an audio file corresponds to an instance of media content. An audio file is divided into a plurality of audio segments, and a digital fingerprint is generated for each of the plurality of audio segments. A digital signature is generated for the audio file by aggregating the digital fingerprints. The generated digital signature and at least one other digital signature corresponding to an instance of media content are processed as inputs to a linear regression machine learning model, to determine a measure of similarity between the generated digital signature and the at least one other digital signature. The linear regression machine learning model can be trained using a supervised learning approach and a set of training data. Embodiments determine whether the audio file corresponds to the instance of media content, based on the measure of similarity.

Patent Agency Ranking