-
1.
公开(公告)号:US20170300976A1
公开(公告)日:2017-10-19
申请号:US15174668
申请日:2016-06-06
Applicant: Google Inc.
Inventor: Ayse Seza Dogruöz , Natalia Ponomareva , Christoph Urs Oehler , Dimitri Kanevsky
IPC: G06Q30/02
CPC classification number: G06Q30/0269 , G06F17/275 , G06Q30/0241 , G06Q50/01
Abstract: Methods, systems, and media for language identification of a media content item based on comments are provided. In some embodiments, the method comprises: obtaining a plurality of comments associated with a media content item; selecting a subset of the plurality of comments based on one or more criteria; assigning, for each comment in the subset of the plurality of comments, a vector of language probabilities, wherein each component of the vector is assigned a language probability that indicates the likelihood that the comment includes content in a language from a plurality of languages; combining the vector of language probabilities for each comment in the subset of the plurality of comments to generate a combined language vector; identifying a language associated with the media content item based on the combined language vector; and performing an action based on the identified language.