-
Publication No.: US09336186B1
Publication date: 2016-05-10
Application No.: US14050863
Filing date: 2013-10-10
Applicant: Google Inc.
Inventor: Ekaterina Filippova, Yasemin Altun
CPC classification number: G06F17/2264
Abstract: Methods and apparatus related to sentence compression. Some implementations are generally directed toward generating a corpus of extractive compressions and associated sentences based on a set of headline, sentence pairs from documents. Some implementations are generally directed toward utilizing a corpus of sentences and associated sentence compressions in training a supervised compression system. Some implementations are generally directed toward determining a compression of a sentence based on edge weights for edges of the sentence that are determined based on weights of features associated with the edges.
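The last step described in the abstract above (edge weights derived from feature weights, then a compression chosen from those edges) can be illustrated with a small sketch. Everything here is an assumption made for illustration: the feature names, the weight values, the toy dependency tree, and in particular the greedy rule of keeping every edge with a positive weight that is reachable from the root. The abstract does not say how the compression is selected from the edge weights, so this pruning rule is a stand-in, not the patented method.

```python
from collections import defaultdict


def edge_weight(features, feature_weights):
    """Score an edge as the sum of the weights of its active features."""
    return sum(feature_weights.get(f, 0.0) for f in features)


def compress(tokens, edges, feature_weights, root=0):
    """Keep tokens reachable from the root through positively weighted edges.

    tokens: list of words; index 0 is an artificial root (hypothetical layout).
    edges:  list of (head_index, dependent_index, feature_list) tuples.
    """
    children = defaultdict(list)
    for head, dep, feats in edges:
        children[head].append((dep, edge_weight(feats, feature_weights)))

    kept = set()
    stack = [root]
    while stack:
        node = stack.pop()
        for dep, weight in children[node]:
            if weight > 0:  # assumed selection rule: drop non-positive edges
                kept.add(dep)
                stack.append(dep)
    # Emit surviving tokens in their original order (extractive compression).
    return " ".join(tokens[i] for i in sorted(kept))


# Hypothetical feature weights, as if learned by a supervised system.
feature_weights = {
    "label=root": 1.0,
    "label=nsubj": 2.0,
    "label=det": 0.5,
    "label=amod": -1.0,
    "label=advmod": -0.5,
}

tokens = ["<root>", "The", "old", "bridge", "reopened", "yesterday"]
edges = [
    (0, 4, ["label=root"]),
    (4, 3, ["label=nsubj"]),
    (3, 1, ["label=det"]),
    (3, 2, ["label=amod"]),
    (4, 5, ["label=advmod"]),
]

print(compress(tokens, edges, feature_weights))  # The bridge reopened
```

A real system would likely replace the greedy traversal with a constrained search, for example one that enforces a maximum compression length, but the scoring of each edge from its feature weights would stay the same.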
-
Publication No.: US09881077B1
Publication date: 2018-01-30
Application No.: US13962705
Filing date: 2013-08-08
Applicant: Google Inc.
Inventor: Enrique Alfonseca, Yasemin Altun, Massimiliano Ciaramita, Jean-Yves Delort, Ekaterina Filippova, Thomas Hofmann, Evangelos Kanoulas, Ioannis Tsochantaridis
IPC: G06F17/30
CPC classification number: G06F17/30705, G06F17/3089
Abstract: News documents from one or more sources are aggregated. The news documents are grouped into a plurality of news collections. Each of the news collections includes a subset of the news documents having related content. Objects described by the news collections are determined. The objects collectively form a set of objects. A relevance of each of the news collections is measured with respect to the objects respectively described by the news collections, and one or more news collections are determined from the plurality of news collections to be associated with a first object included in the set of objects, based on the relevance of the one or more news collections to the first object.
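The association step in the abstract above (measure each collection's relevance to an object, then pick the collections to associate with that object) can be sketched as follows. The abstract does not define the relevance measure, so this example assumes a simple one: the fraction of documents in a collection that mention the object. The threshold, the collection names, and the object names are likewise hypothetical.

```python
def relevance(collection, obj):
    """Assumed relevance measure: share of documents mentioning the object.

    collection: list of sets, one set of object names per document.
    """
    if not collection:
        return 0.0
    return sum(1 for doc_objects in collection if obj in doc_objects) / len(collection)


def collections_for_object(collections, obj, threshold=0.5):
    """Return names of collections relevant to obj, most relevant first."""
    scored = [(relevance(docs, obj), name) for name, docs in collections.items()]
    ranked = sorted(scored, key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in ranked if score >= threshold]


# Hypothetical news collections; each document is reduced to the objects it describes.
collections = {
    "merger": [{"Acme", "Globex"}, {"Acme"}, {"Globex", "Acme"}],
    "recall": [{"Acme"}, {"Initech"}, {"Initech"}],
    "election": [{"Springfield"}],
}

print(collections_for_object(collections, "Acme"))                 # ['merger']
print(collections_for_object(collections, "Acme", threshold=0.3))  # ['merger', 'recall']
```

Lowering the threshold trades precision for coverage: with 0.3, the "recall" collection is also associated with "Acme" even though only one of its three documents mentions it.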
-