-
1.
公开(公告)号:US20100332499A1
公开(公告)日:2010-12-30
申请号:US12493097
申请日:2009-06-26
申请人: Yufei Pan , Hui Li , Justin Sarma , David Soukal , Alessio Signorini , Apostolos Gerasoulis , Tomasz Imielinski
发明人: Yufei Pan , Hui Li , Justin Sarma , David Soukal , Alessio Signorini , Apostolos Gerasoulis , Tomasz Imielinski
IPC分类号: G06F17/30
CPC分类号: G06F17/30864 , G06F17/3053
摘要: In a method for a direct answer for search, a search query is received over a network, one or more answer candidate snippets for the search query are received, with an answer candidate snippet having at least a portion of content available over the network for an answer candidate, one or more answer entities are determined within a selected answer candidate snippet, a frequency of the one or more answer entities found within the one or more answer candidate snippets for the search query is determined, a confidence score is adjusted for the selected answer candidate in accordance with the frequency of the one or more answer entities found within the one or more answer candidate snippets, and at least one answer candidate snippet is sent for a response to the search query.
摘要翻译: 在用于搜索的直接答复的方法中,通过网络接收搜索查询,接收用于搜索查询的一个或多个应答候选片段,其中应答候选片段具有网络上的至少一部分内容可用于 答案候选者中,在选择的答案候选片段内确定一个或多个应答实体,确定在用于搜索查询的一个或多个应答候选片段内找到的一个或多个应答实体的频率,为所选择的候选片段调整置信度分数 根据在一个或多个应答候选片段中发现的一个或多个应答实体的频率来应答候选者,并且发送至少一个应答候选片段用于对搜索查询的响应。
-
2.
公开(公告)号:US09239879B2
公开(公告)日:2016-01-19
申请号:US12493097
申请日:2009-06-26
申请人: Yufei Pan , Hui Li , Justin Sarma , David Soukal , Alessio Signorini , Apostolos Gerasoulis , Tomasz Imielinski
发明人: Yufei Pan , Hui Li , Justin Sarma , David Soukal , Alessio Signorini , Apostolos Gerasoulis , Tomasz Imielinski
CPC分类号: G06F17/30864 , G06F17/3053
摘要: In a method for a direct answer for search, a search query is received over a network, one or more answer candidate snippets for the search query are received, with an answer candidate snippet having at least a portion of content available over the network for an answer candidate, one or more answer entities are determined within a selected answer candidate snippet, a frequency of the one or more answer entities found within the one or more answer candidate snippets for the search query is determined, a confidence score is adjusted for the selected answer candidate in accordance with the frequency of the one or more answer entities found within the one or more answer candidate snippets, and at least one answer candidate snippet is sent for a response to the search query.
摘要翻译: 在用于搜索的直接答复的方法中,通过网络接收搜索查询,接收用于搜索查询的一个或多个应答候选片段,其中应答候选片段具有网络上的至少一部分内容可用于 答案候选者中,在选择的答案候选片段内确定一个或多个应答实体,确定在用于搜索查询的一个或多个应答候选片段内找到的一个或多个应答实体的频率,为所选择的候选片段调整置信度分数 根据在一个或多个应答候选片段中发现的一个或多个应答实体的频率来应答候选者,并且发送至少一个应答候选片段用于对搜索查询的响应。
-
公开(公告)号:US20110208714A1
公开(公告)日:2011-08-25
申请号:US12708541
申请日:2010-02-19
申请人: David Soukal , Fang Yu , Yinglian Xie , Qifa Ke , Zijian Zheng , Frederic H. Behr, JR.
发明人: David Soukal , Fang Yu , Yinglian Xie , Qifa Ke , Zijian Zheng , Frederic H. Behr, JR.
CPC分类号: G06F21/552 , G06F16/951 , H04L63/1408 , H04L63/1425 , H04L63/1458 , H04L2463/144
摘要: A framework may be used for identifying low-rate search bot traffic within query logs by capturing groups of distributed, coordinated search bots. Search log data may be input to a history-based anomaly detection engine to determine if query-click pairs associated with a query are suspicious in view of historical query-click pairs for the query. Users associated with suspicious query-click pairs may be input to a matrix-based bot detection engine to determine correlations between queries submitted by the users. Those users indicating strong correlations may be categorized as bots, whereas those who do not may be categorized as part of flash crowd traffic.
摘要翻译: 可以通过捕获分布式,协调的搜索机器人组来识别查询日志中的低速搜索bot流量的框架。 搜索日志数据可以被输入到基于历史的异常检测引擎,以鉴于查询的历史查询 - 点击对来确定与查询相关联的查询 - 点击对是否是可疑的。 与可疑查询点击对相关联的用户可以输入到基于矩阵的机器人检测引擎,以确定用户提交的查询之间的相关性。 指示强相关性的用户可能被归类为机器人,而不能被分类为闪存人群流量的一部分的那些用户。
-
-