Identifying longform articles
    1.
    发明授权

    公开(公告)号:US09773166B1

    公开(公告)日:2017-09-26

    申请号:US14931576

    申请日:2015-11-03

    Applicant: Google Inc.

    CPC classification number: G06F17/271 G06F17/30705 G06N99/005

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for classifying documents. One of the methods includes obtaining a collection of training documents, the training documents including positive documents identified as being longform documents and negative documents identified as not being longform documents; extracting one or more features from the training documents, wherein the features represent lexical or textual content of the training documents; and generating a longform document classifier trained using feature instances extracted from the training documents, wherein the generated longform document classifier is trained such that input documents are classified as being longform documents or classified as not being longform documents.

    SURFACING IN-DEPTH ARTICLES IN SEARCH RESULTS
    2.
    发明申请
    SURFACING IN-DEPTH ARTICLES IN SEARCH RESULTS 有权
    在搜索结果中表达深入文章

    公开(公告)号:US20150379140A1

    公开(公告)日:2015-12-31

    申请号:US14751774

    申请日:2015-06-26

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing actions of determining that one or more in-depth article search results are to be provided in response to a query, obtaining a topicality score for each in-depth article of a plurality of in-depth articles, each topicality score indicating a degree of relevance of a respective in-depth article to the query, obtaining a document score for each in-depth article of the plurality of in-depth article, each document score being based on a respective topicality score and a respective in-depth article score, selecting one or more in-depth articles from the plurality of in-depth articles based on respective document scores, and providing the one or more in-depth article search results for display, each in-depth article search result representing an in-depth article of the one or more in-depth articles.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于执行确定要响应于查询的一个或多个深入文章搜索结果的动作, 多个深度文章的深度文章,每个主题分数表示相应深度的文章与查询的相关程度,获得多个深入文章中每个深入文章的文档分数,每个 文档分数基于相应的话题评分和相应的深度文章分数,基于相应的文档分数从多个深入的文章中选择一个或多个深入的文章,以及提供一个或多个深入的文章 搜索结果进行显示,每个深入文章搜索结果代表一个或多个深入文章的深入文章。

Patent Agency Ranking