-
Publication No.: US09881077B1
Publication date: 2018-01-30
Application No.: US13962705
Application date: 2013-08-08
Applicant: Google Inc.
Inventor: Enrique Alfonseca, Yasemin Altun, Massimiliano Ciaramita, Jean-Yves Delort, Ekaterina Filippova, Thomas Hofmann, Evangelos Kanoulas, Ioannis Tsochantaridis
IPC: G06F17/30
CPC classification number: G06F17/30705, G06F17/3089
Abstract: News documents from one or more sources are aggregated. The news documents are grouped into a plurality of news collections. Each of the news collections includes a sub-set of the news documents having related content. Objects described by the news collections are determined. The objects collectively form a set of objects. A relevance of each of the news collections is measured with respect to the objects respectively described by the news collections and one or more news collections are determined from the plurality of news collections to be associated with a first object included in the set of objects based on the relevance of the one or more news collections to the first object.
-
Publication No.: US08983898B1
Publication date: 2015-03-17
Application No.: US14027586
Application date: 2013-09-16
Applicant: Google Inc.
Inventor: Enrique Alfonseca, Marius Pasca, Enrique Robledo-Arnuncio
IPC: G06F17/30
CPC classification number: G06F17/30707, G06F17/30705, G06F17/30734
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for extracting instance attributes from text are described. In one aspect, a method exploits weakly-supervised and unsupervised instance relatedness data, available in the form of labeled classes of instances and distributionally similar instances. The method organizes the data into a graph containing instances, class labels, and attributes. The method propagates attributes among related instances, through random walks over the graph.
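The propagation step in this abstract can be illustrated with a small sketch. This is not the patented method: the function name `propagate_attributes`, the uniform transition probabilities, the fixed number of steps, and the toy graph are all assumptions chosen for illustration. It computes the walk distribution exactly instead of sampling walks, and accumulates the attribute weight a walker would encounter when starting from each node.

```python
from collections import defaultdict


def propagate_attributes(edges, seed_attributes, steps=2):
    """Accumulate attribute weight seen by a short random walk from each node.

    edges: iterable of (node_a, node_b) undirected links (hypothetical input).
    seed_attributes: dict mapping node -> {attribute: weight}.
    Returns node -> {attribute: accumulated weight}.
    """
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)

    result = {}
    for start in list(neighbours):
        dist = {start: 1.0}  # probability of the walker being at each node
        scores = defaultdict(float)
        for _ in range(steps + 1):
            # Collect attributes at every node the walker may occupy now.
            for node, p in dist.items():
                for attr, weight in seed_attributes.get(node, {}).items():
                    scores[attr] += p * weight
            # Take one step, choosing a neighbour uniformly at random.
            nxt = defaultdict(float)
            for node, p in dist.items():
                for n in neighbours[node]:
                    nxt[n] += p / len(neighbours[node])
            dist = nxt
        result[start] = dict(scores)
    return result


# Toy graph: two instances linked through a shared class label.
edges = [("aspirin", "class:drug"), ("ibuprofen", "class:drug")]
seeds = {"aspirin": {"side effects": 1.0}}
propagated = propagate_attributes(edges, seeds, steps=2)
print(propagated["ibuprofen"])
```

In this toy graph, "ibuprofen" has no attributes of its own but picks up "side effects" from "aspirin" because both hang off the same class label node.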
-
Publication No.: US08825571B1
Publication date: 2014-09-02
Application No.: US13909715
Application date: 2013-06-04
Applicant: Google Inc.
Inventor: Enrique Alfonseca, Massimiliano Ciaramita, Keith B. Hall
IPC: G06N5/02
CPC classification number: G06N5/022, G06F17/3064, G06F17/30864
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining query suggestions from multiple correlation measures. In one aspect, a method includes receiving a first query and second queries, each of the first and second queries including one or more terms; for each second query and a linear model, receiving correlation scores measuring the correlation between the first query and the respective second query, each correlation score received from a respective correlation process, and each respective correlation process being different from the other respective correlation processes, and applying the linear model to the plurality of correlation scores to determine a combined correlation score that quantifies a combined correlation between the first query and the respective second query based on the plurality of correlation scores. The second queries are ranked in an order according to their respective combined correlation scores.
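The scoring and ranking described in this abstract can be sketched as a weighted sum followed by a sort. The function name `rank_suggestions`, the weights, the bias term, and the example scores are hypothetical; the abstract does not say how the linear model is parameterized or trained.

```python
def rank_suggestions(candidate_scores, weights, bias=0.0):
    """Rank candidate queries by a linear combination of correlation scores.

    candidate_scores: dict mapping each second query to a list of scores,
        one per correlation process, all measured against the first query.
    weights: one coefficient per correlation process (assumed given).
    Returns (query, combined_score) pairs, highest combined score first.
    """
    combined = {
        query: bias + sum(w * s for w, s in zip(weights, scores))
        for query, scores in candidate_scores.items()
    }
    return sorted(combined.items(), key=lambda item: item[1], reverse=True)


# Two hypothetical correlation processes score each candidate suggestion.
candidates = {
    "cheap flights": [0.9, 0.2],
    "flight tracker": [0.4, 0.8],
}
ranked = rank_suggestions(candidates, weights=[0.5, 0.5])
print([query for query, _ in ranked])
```

With equal weights, "flight tracker" ranks first because its two scores sum higher; changing the weights changes which correlation process dominates the ranking.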
-
Publication No.: US09619450B2
Publication date: 2017-04-11
Application No.: US14060562
Application date: 2013-10-22
Applicant: Google Inc.
Inventor: Enrique Alfonseca, Daniele Pighin, Guillermo Garrido Yuste, Ekaterina Filippova
CPC classification number: G06F17/24, G06F17/274, G06F17/2745, G06F17/2881
Abstract: Sets of equivalent syntactic patterns are learned from a corpus of documents. A set of one or more input documents is received. The set of one or more input documents is processed for one or more expressions that match a set of equivalent syntactic patterns from among the sets of equivalent syntactic patterns. A syntactic pattern from among the set of equivalent syntactic patterns is selected for a headline. The syntactic pattern reflects a main event described by the set of one or more input documents. The headline is generated using the syntactic pattern.
-
Publication No.: US20150006512A1
Publication date: 2015-01-01
Application No.: US14060562
Application date: 2013-10-22
Applicant: Google Inc.
Inventor: Enrique Alfonseca, Daniele Pighin, Guillermo Garrido Yuste, Ekaterina Filippova
IPC: G06F17/24
CPC classification number: G06F17/24, G06F17/274, G06F17/2745, G06F17/2881
Abstract: Sets of equivalent syntactic patterns are learned from a corpus of documents. A set of one or more input documents is received. The set of one or more input documents is processed for one or more expressions that match a set of equivalent syntactic patterns from among the sets of equivalent syntactic patterns. A syntactic pattern from among the set of equivalent syntactic patterns is selected for a headline. The syntactic pattern reflects a main event described by the set of one or more input documents. The headline is generated using the syntactic pattern.