-
公开(公告)号:US09336186B1
公开(公告)日:2016-05-10
申请号:US14050863
申请日:2013-10-10
Applicant: Google Inc.
Inventor: Ekaterina Filippova , Yasemin Altun
CPC classification number: G06F17/2264
Abstract: Methods and apparatus related to sentence compression. Some implementations are generally directed toward generating a corpus of extractive compressions and associated sentences based on a set of headline, sentence pairs from documents. Some implementations are generally directed toward utilizing a corpus of sentences and associated sentence compressions in training a supervised compression system. Some implementations are generally directed toward determining a compression of a sentence based on edge weights for edges of the sentence that are determined based on weights of features associated with the edges.
Abstract translation: 与句子压缩相关的方法和设备。 一些实现通常针对基于文档中的一组标题语句对来生成提取压缩和相关句子的语料库。 一些实施方式通常旨在在训练监督的压缩系统中使用句子语料库和相关的句子压缩。 一些实施方式通常针对基于根据与边缘相关联的特征的权重确定的句子边缘的边缘权重来确定句子的压缩。
-
公开(公告)号:US09881077B1
公开(公告)日:2018-01-30
申请号:US13962705
申请日:2013-08-08
Applicant: Google Inc.
Inventor: Enrique Alfonseca , Yasemin Altun , Massimiliano Ciaramita , Jean-Yves Delort , Ekaterina Filippova , Thomas Hofmann , Evangelos Kanoulas , Ioannis Tsochantaridis
IPC: G06F17/30
CPC classification number: G06F17/30705 , G06F17/3089
Abstract: News documents from one or more sources are aggregated. The news documents are grouped into a plurality of news collections. Each of the news collections includes a sub-set of the news documents having related content. Objects described by the news collections are determined. The objects collectively form a set of objects. A relevance of each of the news collections is measured with respect to the objects respectively described by the news collections and one or more news collections are determined from the plurality of news collections to be associated with a first object included in the set of objects based on the relevance of the one or more news collections to the first object.
-
公开(公告)号:US09619450B2
公开(公告)日:2017-04-11
申请号:US14060562
申请日:2013-10-22
Applicant: Google Inc.
Inventor: Enrique Alfonseca , Daniele Pighin , Guillermo Garrido Yuste , Ekaterina Filippova
CPC classification number: G06F17/24 , G06F17/274 , G06F17/2745 , G06F17/2881
Abstract: Sets of equivalent syntactic patterns are learned from a corpus of documents. A set of one or more input documents is received. The set of one or more input documents is processed for one or more expressions that match a set of equivalent syntactic patterns from among the sets of equivalent syntactic patterns. A syntactic pattern from among the set of equivalent syntactic patterns is selected for a headline. The syntactic pattern reflects a main event described by the set of one or more input documents. The headline is generated using the syntactic pattern.
-
公开(公告)号:US20150006512A1
公开(公告)日:2015-01-01
申请号:US14060562
申请日:2013-10-22
Applicant: Google Inc.
Inventor: Enrique Alfonseca , Daniele Pighin , Guillermo Garrido Yuste , Ekaterina Filippova
IPC: G06F17/24
CPC classification number: G06F17/24 , G06F17/274 , G06F17/2745 , G06F17/2881
Abstract: Sets of equivalent syntactic patterns are learned from a corpus of documents. A set of one or more input documents is received. The set of one or more input documents is processed for one or more expressions that match a set of equivalent syntactic patterns from among the sets of equivalent syntactic patterns. A syntactic pattern from among the set of equivalent syntactic patterns is selected for a headline. The syntactic pattern reflects a main event described by the set of one or more input documents. The headline is generated using the syntactic pattern.
Abstract translation: 从文档语料库中学习了等效句法模式的集合。 接收一组一个或多个输入文档。 处理一组或多个输入文档的一个或多个表达式,其匹配一组等效句法模式中的等效句法模式。 选择一组等效句法模式中的句法模式作为标题。 语法模式反映了一组或多个输入文档描述的主要事件。 标题是使用句法模式生成的。
-
-
-