-
公开(公告)号:US08412727B1
公开(公告)日:2013-04-02
申请号:US12794069
申请日:2010-06-04
申请人: Abhinandan S. Das , Anwis Das
发明人: Abhinandan S. Das , Anwis Das
IPC分类号: G06F17/30
CPC分类号: G06F17/30522 , G06F17/30064 , G06F17/30864 , G06F17/3097
摘要: Methods, systems, and apparatus, including computer program products, for generating query refinements from user preference data. A group of query pairs are obtained. Each query pair includes a first query and a second query. A quality score is determined for each query pair from user preference data for documents responsive to both the first and the second query. A diversity score is determined for each query pair having a quality score satisfying a quality threshold, the diversity score determined from user preference data for documents responsive to the second, but not the first, query. For each query pair having a quality score satisfying the quality threshold and a diversity score satisfying a diversity threshold, the second query of the query pair is associated with the first query of the query pair as a candidate refinement for the first query.
摘要翻译: 用于从用户偏好数据生成查询改进的方法,系统和装置,包括计算机程序产品。 获得一组查询对。 每个查询对包括第一个查询和第二个查询。 对于响应于第一和第二查询的文档的用户偏好数据,确定每个查询对的质量得分。 确定具有质量分数满足质量阈值的每个查询对的分集得分,根据用户偏好数据确定的分集得分,所述文献响应于第二次但不是第一个查询。 对于具有满足质量阈值的质量分数和满足分集阈值的分集分数的每个查询对,查询对的第二查询与作为第一查询的候选细化的查询对的第一查询相关联。
-
公开(公告)号:US08407219B1
公开(公告)日:2013-03-26
申请号:US13347377
申请日:2012-01-10
申请人: Abhinandan S. Das , Ashutosh Garg , Mayur Datar
发明人: Abhinandan S. Das , Ashutosh Garg , Mayur Datar
CPC分类号: G06F17/30699 , G06F17/30598 , G06F17/30979
摘要: Systems, methods, and apparatus, including computer program products, for collaborative filtering are provided. A method is provided. The method includes clustering a plurality of entities with respect to one or more latent variables in a probability distribution model of a relationship between a set of entities and a set of items, the probability distribution model comprising a probability distribution of the set of items with respect to the latent variables. The method also includes, as new items are added to the set of items, updating the probability distribution of the set of the items with respect to the latent variables, and generating an updated relationship score for an entity with respect to the set of items based on the entity's fractional membership in the clustering with respect to the latent variables and based on the updated probability distribution of the set of the items with respect to the latent variables.
-
公开(公告)号:US08364709B1
公开(公告)日:2013-01-29
申请号:US12951529
申请日:2010-11-22
申请人: Abhinandan S. Das , Harry S. Fung
发明人: Abhinandan S. Das , Harry S. Fung
IPC分类号: G06F7/00
CPC分类号: G06F17/3097 , G06F17/30542 , G06F17/30967 , G06F17/30979
摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining word boundary likelihoods in potentially incomplete text. In one aspect, a method includes selecting query sequences from the query, each query sequence being at least a portion of a word n-gram, the word n-gram being a subsequence of up to n words selected from the second sequence of words of the query, and for each query sequence: determining one or more query sequence keys for the query sequence; determining at least one of a word boundary count and a non-word boundary count for each query sequence key, each word-boundary count and non-word boundary count being dependent on the context of the query sequence; and associating, in a data storage device, the at least one word boundary count and non-word boundary counts with each query sequence key.
-
公开(公告)号:US09031970B1
公开(公告)日:2015-05-12
申请号:US13186930
申请日:2011-07-20
CPC分类号: G06F17/3064 , G06F17/276
摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining query completions. In one aspect, a method includes receiving a query input in a search engine query input field in a user interface. The method also includes submitting the query input as a first query stem to an autocompletion module. The method also includes receiving a first response from the autocompletion module, the first response providing no first query autocompletions. The method also includes submitting a second query stem to the autocompletion module, the second query stem being the first query stem with a first prefix removed. The method also includes receiving a second response from the autocompletion module including one or more second autocompletions satisfying a second quality test. The method also includes providing second autocompletions for presentation on the user interface.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于获得查询完成。 一方面,一种方法包括在用户界面中的搜索引擎查询输入字段中接收查询输入。 该方法还包括将查询输入作为第一查询句柄提交到自动完成模块。 该方法还包括从自动完成模块接收第一响应,第一响应不提供第一查询自动填充。 所述方法还包括向所述自动完成模块提交第二查询句柄,所述第二查询句柄是删除了第一前缀的第一查询句柄。 该方法还包括从自动完成模块接收包括满足第二质量测试的一个或多个第二自动完成的第二响应。 该方法还包括提供用于在用户界面上呈现的第二自动填充。
-
公开(公告)号:US20140214840A1
公开(公告)日:2014-07-31
申请号:US12955253
申请日:2010-11-29
申请人: Nitin Gupta , Abhinandan S. Das
发明人: Nitin Gupta , Abhinandan S. Das
IPC分类号: G06F17/30
CPC分类号: G06F17/3064
摘要: Methods, systems and apparatus, including computer programs encoded on a computer storage medium, for disambiguating names in a document corpus. In an aspect, a method includes generating context term lists for a person name, each context term list being a list of context terms from a resource for the person name; clustering the context term lists into a plurality of clusters, each of the clusters of context term lists including context term lists that are most similar to the cluster relative to other clusters; for each of the clusters, selecting a representative term for the cluster; receiving the person name as a search query; and generating a plurality of query suggestions from the search query and the representative terms for the clusters, each query suggesting being a combination of the person name and one representative term.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于消除文档语料库中的名称。 一方面,一种方法包括为个人名称生成上下文词列表,每个上下文词列表是来自人名的资源的上下文术语列表; 将上下文术语列表聚类成多个集群,每个上下文术语表的集群包括与集群相对于其他集群最相似的上下文术语列表; 对于每个集群,选择集群的代表性术语; 接收人名作为搜索查询; 以及从所述搜索查询和所述群集的代表性条件生成多个查询建议,每个查询建议是所述人名和一个代表词的组合。
-
公开(公告)号:US08374985B1
公开(公告)日:2013-02-12
申请号:US13300987
申请日:2011-11-21
申请人: Abhinandan S. Das , Ashutosh Garg , Mayur Datar
发明人: Abhinandan S. Das , Ashutosh Garg , Mayur Datar
CPC分类号: G06F17/30864
摘要: Methods, systems and apparatus, including computer program products, for providing a diversity of recommendations. According to one method, results are identified so as to increase the likelihood that at least one result will be of interest to a user. Following the identification of a first result, second and later results are identified based on an assumption that the previously identified results are not of interest to the user. The identification of diverse results can be based on formulas that approximate the probability or provide a likelihood score of a user selecting a given result, where a measured similarity between a given object and previously identified results tends to decrease the calculated probability approximation or likelihood score for that object.
摘要翻译: 方法,系统和设备,包括计算机程序产品,用于提供多种建议。 根据一种方法,识别结果以便增加至少一个结果对用户感兴趣的可能性。 在识别出第一结果之后,基于以前认为的结果对用户不感兴趣的假设来识别第二和后续结果。 不同结果的识别可以基于近似概率或提供用户选择给定结果的似然分数的公式,其中给定对象和先前识别的结果之间的测量相似度倾向于降低计算的概率近似或似然分数 那个对象。
-
公开(公告)号:US08244749B1
公开(公告)日:2012-08-14
申请号:US12557425
申请日:2009-09-10
申请人: Anwis Das , Abhinandan S. Das
发明人: Anwis Das , Abhinandan S. Das
IPC分类号: G06F17/30
CPC分类号: G06F17/30522 , G06F17/30064 , G06F17/30864 , G06F17/3097
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying query refinements from sibling queries. In one aspect, a method includes associating each of a plurality of parent queries with a respective group of one or more child queries for the parent query, identifying one or more candidate sibling queries for a particular child query, selecting one or more final sibling queries for the particular child query from the one or more candidate sibling queries, and associating the final sibling queries with the particular child query as query refinements.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于识别来自兄弟姐妹查询的查询改进。 在一个方面,一种方法包括将多个父查询中的每一个与父查询的一个或多个子查询的相应组相关联,识别特定子查询的一个或多个候选兄弟查询,选择一个或多个最终同级查询 对于来自一个或多个候选兄弟查询的特定子查询,以及将最终同胞查询与特定子查询相关联作为查询优化。
-
-
-
-
-
-