METHODS, SYSTEMS, AND MEDIA FOR LANGUAGE IDENTIFICATION OF A MEDIA CONTENT ITEM BASED ON COMMENTS

    公开(公告)号:US20170300976A1

    公开(公告)日:2017-10-19

    申请号:US15174668

    申请日:2016-06-06

    Applicant: Google Inc.

    CPC classification number: G06Q30/0269 G06F17/275 G06Q30/0241 G06Q50/01

    Abstract: Methods, systems, and media for language identification of a media content item based on comments are provided. In some embodiments, the method comprises: obtaining a plurality of comments associated with a media content item; selecting a subset of the plurality of comments based on one or more criteria; assigning, for each comment in the subset of the plurality of comments, a vector of language probabilities, wherein each component of the vector is assigned a language probability that indicates the likelihood that the comment includes content in a language from a plurality of languages; combining the vector of language probabilities for each comment in the subset of the plurality of comments to generate a combined language vector; identifying a language associated with the media content item based on the combined language vector; and performing an action based on the identified language.

Patent Agency Ranking