-
Publication No.: US08738467B2
Publication date: 2014-05-27
Application No.: US11377480
Filing date: 2006-03-16
Applicants: Chenxi Lin, Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Benyu Zhang, Jian Wang
Inventors: Chenxi Lin, Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Benyu Zhang, Jian Wang
IPC classification: G06Q30/00
CPC classification: G06F17/30867, G06Q30/0212, G06Q30/0601, G06Q30/0631
Abstract: Methods for determining a predictive rating are disclosed. In an embodiment, an active user is compared to a set of clusters. One or more of the clusters are determined to be most similar to the active user. From the one or more clusters, K users are determined to be most similar to the active user. Prior ratings for an item by the K users may be used to predict a rating for the item for the active user.
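The steps in this abstract (compare the active user to clusters, keep the closest clusters, pick the K most similar users, combine their prior ratings) can be sketched as follows. This is a minimal illustration, not the patented method: the cosine similarity measure, the similarity-weighted average, the data layout, and the names `cosine` and `predict_rating` are all assumptions made for the example.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity over the items that two rating dicts share (assumed measure)."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = sqrt(sum(b[i] ** 2 for i in shared))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def predict_rating(active, clusters, ratings, item, n_clusters=1, k=2):
    """Predict the active user's rating for `item`, or None if no neighbour rated it.

    active:   {item: rating} for the active user
    clusters: {cluster_id: {"centroid": {item: rating}, "members": [user_id, ...]}}
    ratings:  {user_id: {item: rating}}
    """
    # Step 1: rank clusters by similarity between their centroid and the active user.
    ranked = sorted(clusters.values(),
                    key=lambda c: cosine(active, c["centroid"]),
                    reverse=True)
    # Step 2: collect members of the closest clusters who have rated the item.
    candidates = [u for c in ranked[:n_clusters]
                  for u in c["members"] if item in ratings[u]]
    # Step 3: keep the K candidates most similar to the active user.
    neighbours = sorted(candidates,
                        key=lambda u: cosine(active, ratings[u]),
                        reverse=True)[:k]
    # Step 4: similarity-weighted average of the neighbours' prior ratings.
    weights = [cosine(active, ratings[u]) for u in neighbours]
    total = sum(weights)
    if total == 0:
        return None
    return sum(w * ratings[u][item] for w, u in zip(weights, neighbours)) / total


# Hypothetical toy data, used only to exercise the sketch.
ratings = {
    "u1": {"a": 5, "b": 4, "c": 5},
    "u2": {"a": 4, "b": 5, "c": 4},
    "u3": {"a": 1, "b": 5, "c": 1},
}
clusters = {
    "c1": {"centroid": {"a": 4.5, "b": 4.5}, "members": ["u1", "u2"]},
    "c2": {"centroid": {"a": 1, "b": 5}, "members": ["u3"]},
}
active_user = {"a": 5, "b": 4}
prediction = predict_rating(active_user, clusters, ratings, "c")
```

Restricting the neighbour search to the closest clusters is what keeps the K-nearest-user step cheap: only the members of `n_clusters` clusters are scored, not every user.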
-
Publication No.: US08122049B2
Publication date: 2012-02-21
Application No.: US11378323
Filing date: 2006-03-20
Applicants: Li Li, Tarek Najm, Ying Li, Zheng Chen, Hua-Jun Zeng, Ke Tang, Zhifeng Yang, FengPing Zeng, Xianfang Wang, Xiaofeng Dai, Benyu Zhang, Jian Wang
Inventors: Li Li, Tarek Najm, Ying Li, Zheng Chen, Hua-Jun Zeng, Ke Tang, Zhifeng Yang, FengPing Zeng, Xianfang Wang, Xiaofeng Dai, Benyu Zhang, Jian Wang
CPC classification: G06Q30/0241, G06F17/30867, G06Q30/02, G06Q30/0254
Abstract: A system and method are disclosed for providing documents related to a search request. The search request may include a search query of one or more keywords, or the search request may be a demographic search query including one or more demographic attributes. An index can be created that contains data crawled from publishers' websites, demographic information of registered users, and the search history of the registered users. Once a search request is received, the search request can be compared to the information stored in the index, and one or more documents related to the request can be provided.
-
Publication No.: US07870132B2
Publication date: 2011-01-11
Application No.: US12020574
Filing date: 2008-01-28
Applicants: Weizhu Chen, Benyu Zhang, Zheng Chen, Jian Wang, Dou Shen
Inventors: Weizhu Chen, Benyu Zhang, Zheng Chen, Jian Wang, Dou Shen
IPC classification: G06F17/30
CPC classification: G06F17/30864
Abstract: The claimed subject matter is directed to constructing query hierarchies in response to a query request. To construct a query hierarchy, a list of related candidate queries is generated in response to the received query request. The list of related candidate queries is generated by determining the relative coverage of information shared by the candidate queries and the query request. Relationships between the submitted query request and the candidate queries in the list are determined based upon the extent of relative coverage of information shared by the candidate queries and the query request. A query hierarchy is then constructed to reflect the determined relationships between the query request and the candidate queries.
-
Publication No.: US07861149B2
Publication date: 2010-12-28
Application No.: US11372365
Filing date: 2006-03-09
Applicants: Min Wang, Benyu Zhang, Hua-Jun Zeng, Jian Wang, Shiguang Liu, Zheng Chen
Inventors: Min Wang, Benyu Zhang, Hua-Jun Zeng, Jian Wang, Shiguang Liu, Zheng Chen
CPC classification: G06F17/30713, G06F17/272, G06F17/2775, G06F17/30643, Y10S707/92
Abstract: Computer-readable media having computer-executable instructions and apparatuses provide a keyphrase navigation map (KNM) for a document page. Keyphrases are extracted from the document page. Keyphrase clusters are subsequently formed by a measure of relevancy, and a salient keyphrase is determined for each cluster. A thumbnail is formed with tags corresponding to the salient keyphrases. A selected tag is expanded with associated keyphrases. An associated keyphrase may be further selected in order to facilitate the navigation of the document page. The displayed tags on the thumbnail are positioned in accordance with locations of associated keyphrases in the document page.
-
Publication No.: US07693823B2
Publication date: 2010-04-06
Application No.: US11770385
Filing date: 2007-06-28
Applicants: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06Q30/02
Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. The query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as "query events" or "events." The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
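The periodicity-based branch of this abstract can be illustrated with a short sketch. The abstract does not say how periodicity is detected or how the forecast is formed, so the autocorrelation test, the 0.5 threshold, the seasonal-naive forecast, and the function names below are all assumptions chosen for illustration.

```python
def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at the given lag (0.0 for a constant series)."""
    n = len(series)
    mean = sum(series) / n
    var = sum((x - mean) ** 2 for x in series)
    if var == 0:
        return 0.0
    cov = sum((series[t] - mean) * (series[t + lag] - mean) for t in range(n - lag))
    return cov / var


def find_period(series, max_lag, threshold=0.5):
    """Return the lag with the strongest autocorrelation above `threshold`, else None.

    A non-None result is treated here as "time-dependent" (an assumed criterion).
    """
    best_lag, best = None, threshold
    for lag in range(2, max_lag + 1):
        r = autocorrelation(series, lag)
        if r > best:
            best_lag, best = lag, r
    return best_lag


def forecast(series, period, steps):
    """Seasonal-naive forecast: each future value repeats the value one period earlier."""
    out = list(series)
    for _ in range(steps):
        out.append(out[-period])
    return out[len(series):]


# Hypothetical daily counts for a query that peaks every weekend.
weekly = [10, 10, 10, 10, 10, 50, 50] * 8
period = find_period(weekly, max_lag=10)
next_week = forecast(weekly, period, steps=7)
```

A series with no lag above the threshold returns `None` from `find_period`; in the abstract's terms such a query would fall to the time-independent branch, which relies on causal relationships with other queries instead.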
-
Publication No.: US07689622B2
Publication date: 2010-03-30
Application No.: US11770423
Filing date: 2007-06-28
Applicants: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06Q30/02
Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. The query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as "query events" or "events." The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
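The "query events" in this abstract are significant increases in a query's frequency. One way to flag such increases is sketched below; the trailing-window z-score rule, the window length, the threshold `z`, the floor on the standard deviation, and the name `detect_events` are assumptions for illustration, since the abstract does not define what counts as significant.

```python
from statistics import mean, stdev


def detect_events(freqs, window=7, z=3.0):
    """Return the indices where frequency jumps well above its recent history.

    A point is flagged when it exceeds the mean of the preceding `window`
    values by more than `z` standard deviations. The deviation is floored
    at 1.0 so that a perfectly flat history does not flag tiny changes.
    """
    events = []
    for t in range(window, len(freqs)):
        history = freqs[t - window:t]
        mu = mean(history)
        sigma = max(stdev(history), 1.0)
        if freqs[t] > mu + z * sigma:
            events.append(t)
    return events


# Hypothetical daily counts with one spike on day 7.
counts = [10, 11, 9, 10, 12, 10, 11, 60, 12, 10]
events = detect_events(counts)
```

Event indices found this way for several queries could then be compared to see which queries' events tend to precede those of the query being forecast, which is the causal relationship the abstract relies on.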
-
Publication No.: US07624130B2
Publication date: 2009-11-24
Application No.: US11392640
Filing date: 2006-03-30
Applicants: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
Inventors: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
IPC classification: G06F17/30
CPC classification: G06F17/30616, Y10S707/99936, Y10S707/99953, Y10S707/99954
Abstract: Extraction of semantic information and the generation of semantic attributes allow for improved organization and management of data. Semantic attributes are automatically generated and eliminate the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
-
Publication No.: US07571162B2
Publication date: 2009-08-04
Application No.: US11365961
Filing date: 2006-03-01
Applicants: Jian-Tao Sun, Xuanhui Wang, Dou Shen, Hua-Jun Zeng, Jian Wang, Zheng Chen
Inventors: Jian-Tao Sun, Xuanhui Wang, Dou Shen, Hua-Jun Zeng, Jian Wang, Zheng Chen
CPC classification: G06F17/30864, G06F17/3071, Y10S707/99935, Y10S707/99945
Abstract: Methods and systems are provided for performing a comparative search. In one example, the comparative search is performed over a network, such as the web, or a database. In one exemplary implementation, a user transmits a plurality of queries which represent the topics that the user wants to compare, and a computing system can automatically retrieve and rank web pages or documents based on both their relevance to the queries and the comparative contents they contain. In one such example, the comparative pages are displayed in a pair or other form of grouping. In another example, comparative results having similar contents may be clustered into meaningful themes.
-
Publication No.: US07555480B2
Publication date: 2009-06-30
Application No.: US11456753
Filing date: 2006-07-11
Applicants: Benyu Zhang, Chenxi Lin, Hua-Jun Zeng, Jian Wang, Ke Tang, Zheng Chen
Inventors: Benyu Zhang, Chenxi Lin, Hua-Jun Zeng, Jian Wang, Ke Tang, Zheng Chen
CPC classification: G06F17/30864, Y10S707/99935
Abstract: The invention provides a method of interactively crawling data records on a web page. Users may select various data records of interest on a web page to generate templates to search for similar data items on the same web page or on different web pages. A tree matching algorithm may be used to compare and extract data matching the generated template.
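The abstract mentions a tree matching algorithm without naming one. As an illustration only, the sketch below uses Simple Tree Matching, a well-known dynamic-programming algorithm for comparing ordered trees such as DOM fragments; whether the patent uses this particular algorithm is an assumption, and the `(tag, children)` tuple representation and the sample trees are hypothetical.

```python
def simple_tree_match(a, b):
    """Return the size of the largest order-preserving match between two trees.

    Each tree is a (tag, [children]) tuple. Two nodes can match only if
    their tags are equal, and matched children must keep their order.
    """
    if a[0] != b[0]:
        return 0
    kids_a, kids_b = a[1], b[1]
    # m[i][j] = best match between the first i children of a and the first j of b.
    m = [[0] * (len(kids_b) + 1) for _ in range(len(kids_a) + 1)]
    for i in range(1, len(kids_a) + 1):
        for j in range(1, len(kids_b) + 1):
            m[i][j] = max(
                m[i - 1][j],
                m[i][j - 1],
                m[i - 1][j - 1] + simple_tree_match(kids_a[i - 1], kids_b[j - 1]),
            )
    # Add 1 for the matched pair of roots.
    return m[len(kids_a)][len(kids_b)] + 1


# Hypothetical template built from a record the user selected, and two candidates.
template = ("li", [("a", []), ("span", [])])
candidate = ("li", [("a", []), ("b", []), ("span", [])])
unrelated = ("div", [("a", []), ("span", [])])

score = simple_tree_match(template, candidate)
```

A candidate could be accepted as a matching data record when its score, normalized by the template's node count, exceeds some threshold; the choice of threshold is left open here.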
-
Publication No.: US20090132530A1
Publication date: 2009-05-21
Application No.: US11941968
Filing date: 2007-11-19
Applicants: Weizhu Chen, Long Jiang, Ming Zhou, Benyu Zhang, Zheng Chen, Jian Wang
Inventors: Weizhu Chen, Long Jiang, Ming Zhou, Benyu Zhang, Zheng Chen, Jian Wang
IPC classification: G06F17/30
CPC classification: G06F17/30864
Abstract: Described herein is technology for, among other things, mining pair-based data on the web. The technology involves an online pair-based data mining system as well as an offline SVM training system. By subjecting pair-based input data to the systems, one may grow a pool of pair-based data that shares characteristics of the pair-based input data in a more efficient manner.