GENERATING VECTOR REPRESENTATIONS OF DOCUMENTS

    Publication No.: US20200293873A1

    Publication Date: 2020-09-17

    Application No.: US15262959

    Application Date: 2016-09-12

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating document vector representations. One of the methods includes obtaining a new document; selecting a plurality of new document word sets; and determining a vector representation for the new document using a trained neural network system, wherein the trained neural network system comprises: a document embedding layer and a classifier, and wherein determining the vector representation for the new document using the trained neural network system comprises iteratively providing each of the plurality of new document word sets to the trained neural network system to determine the vector representation for the new document using gradient descent.
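
The inference step described in the abstract can be illustrated with a minimal NumPy sketch. It assumes several details the abstract does not state: word sets are taken as sliding windows of context words followed by the next word, the document vector is averaged with the context word vectors, and the classifier is a single softmax layer. All names (select_word_sets, infer_document_vector, mean_loss, word_emb, clf_w, clf_b) are hypothetical and chosen only for illustration. The trained parameters stay fixed; only the new document's vector is updated by gradient descent.

```python
import numpy as np

def softmax(z):
    z = z - z.max()                      # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def select_word_sets(token_ids, context_size):
    """Assumed scheme: each word set is (context word ids, next word id)."""
    return [
        (token_ids[i:i + context_size], token_ids[i + context_size])
        for i in range(len(token_ids) - context_size)
    ]

def combine(doc_vec, context, word_emb):
    """Assumed combination: average the document vector with the context word vectors."""
    return (doc_vec + word_emb[context].sum(axis=0)) / (len(context) + 1)

def infer_document_vector(token_ids, word_emb, clf_w, clf_b,
                          context_size=2, lr=0.1, epochs=50, seed=0):
    """Fit a vector for a new document while the trained parameters stay frozen."""
    rng = np.random.default_rng(seed)
    doc_vec = rng.normal(scale=0.01, size=word_emb.shape[1])
    word_sets = select_word_sets(token_ids, context_size)
    for _ in range(epochs):
        for context, target in word_sets:
            h = combine(doc_vec, context, word_emb)
            probs = softmax(clf_w @ h + clf_b)
            grad_logits = probs.copy()
            grad_logits[target] -= 1.0   # gradient of cross-entropy w.r.t. the logits
            grad_h = clf_w.T @ grad_logits
            # Chain rule through the average; only doc_vec is updated.
            doc_vec -= lr * grad_h / (len(context) + 1)
    return doc_vec

def mean_loss(doc_vec, token_ids, word_emb, clf_w, clf_b, context_size=2):
    """Average cross-entropy of the next-word predictions for a document."""
    losses = []
    for context, target in select_word_sets(token_ids, context_size):
        probs = softmax(clf_w @ combine(doc_vec, context, word_emb) + clf_b)
        losses.append(-np.log(probs[target]))
    return float(np.mean(losses))
```

In this sketch, word_emb and clf_w stand in for parameters learned beforehand on a training corpus; a call such as infer_document_vector([1, 3, 5, 7, 2], word_emb, clf_w, clf_b) returns a vector with the same dimensionality as the word embeddings.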
