-
公开(公告)号:US20130097027A1
公开(公告)日:2013-04-18
申请号:US13272844
申请日:2011-10-13
申请人: Taifeng Wang , Tie-Yan Liu , Bin Gao , Tao Qin
发明人: Taifeng Wang , Tie-Yan Liu , Bin Gao , Tao Qin
IPC分类号: G06Q30/02
CPC分类号: G06Q30/02
摘要: A task guidance tool that displays instructional steps and associated advertisements may facilitate the accomplishment of a task by users who are otherwise unfamiliar with the task. The task guidance tool may be developed from input data mined from various sources. The task guidance tool may display a series of step pages in which each step page include instructions for accomplishing a corresponding step of the task. Further, one or more step pages of the task guidance tool may be provided with selected advertisements that are displayed with the step instructions.
摘要翻译: 显示教学步骤和相关联广告的任务指导工具可以促进由不熟悉任务的用户完成任务。 任务指导工具可以从从各种来源挖掘的输入数据中开发。 任务指导工具可以显示一系列步骤页面,其中每个步骤页面包括用于完成任务的相应步骤的指令。 此外,可以为任务指导工具的一个或多个步骤页面提供与步骤指令一起显示的所选择的广告。
-
公开(公告)号:US20120143792A1
公开(公告)日:2012-06-07
申请号:US12959060
申请日:2010-12-02
申请人: Taifeng Wang , Bin Gao , Tie-Yan Liu
发明人: Taifeng Wang , Bin Gao , Tie-Yan Liu
CPC分类号: G06F17/30873 , G06F17/30867
摘要: Some implementations provide techniques for selecting web pages for inclusion in an index. For example, some implementations apply regularization to select a subset of the crawled web pages for indexing based on link relationships between the crawled web pages, features extracted from the crawled web pages, and user behavior information determined for at least some of the crawled web pages. Further, in some implementations, the user behavior information may be used to sort a training set of crawled web pages into a plurality of labeled groups. The labeled groups may be represented in a directed graph that indicates relative priorities for being selected for indexing.
摘要翻译: 一些实现提供用于选择包括在索引中的网页的技术。 例如,一些实现应用正则化来基于被爬网的网页之间的链接关系,从被爬网的网页提取的特征以及为至少一些被爬网的网页确定的用户行为信息来选择用于索引的被爬网网页的子集 。 此外,在一些实现中,可以使用用户行为信息来将爬网网页的训练集合分类成多个标记的组。 标记的组可以在有向图中表示,其指示被选择用于索引的相对优先级。
-
公开(公告)号:US20110295845A1
公开(公告)日:2011-12-01
申请号:US12789278
申请日:2010-05-27
申请人: Bin Gao , Taifeng Wang , Tie-Yan Liu
发明人: Bin Gao , Taifeng Wang , Tie-Yan Liu
IPC分类号: G06F17/30
CPC分类号: G06F16/951
摘要: Importance ranking of web pages is performed by defining a graph-based regularization term based on document features, edge features, and a web graph of a plurality of web pages, and deriving a loss term based on human feedback data. The graph-based regularization term and the loss term are combined to obtain a global objective function. The global objective function is optimized to obtain parameters for the document features and edge features and to produce static rank scores for the plurality of web pages. Further, the plurality of web pages is ordered based on the static rank scores.
摘要翻译: 通过基于文档特征,边缘特征和多个网页的网络图定义基于图形的正则化术语,并且基于人类反馈数据导出丢失项来执行网页的重要性排名。 基于图形的正则化项和损失项被组合以获得全局目标函数。 优化全局目标函数以获得文档特征和边缘特征的参数,并且为多个网页产生静态等级分数。 此外,基于静态等级分数来排序多个网页。
-
公开(公告)号:US20090216868A1
公开(公告)日:2009-08-27
申请号:US12035124
申请日:2008-02-21
申请人: Bin Gao , Tie-Yan Liu , Hang Li , Lei Yang
发明人: Bin Gao , Tie-Yan Liu , Hang Li , Lei Yang
IPC分类号: G06F15/173
CPC分类号: G06F17/30899 , G06F21/50
摘要: An anti-spam tool works with a web browser to detect spam webpages locally on a client machine. The anti-spam tool can be implemented either as a plug-in module or an integral part of the browser, and manifested as a toolbar. The tool can perform an anti-spam action whenever a webpage is accessed through the browser, and does not require direct involvement of a search engine. A spam detection module installed on the computing device determines whether a webpage being accessed or whether a link contained in the webpage being accessed is spam, by comparing the URL of the webpage or the link with a spam list. The spam list can be downloaded from a remote search engine server, stored locally and updated from time to time. A two-level indexing technique is also introduced to improve the efficiency of the anti-spam tool's use of the spam list.
摘要翻译: 反垃圾邮件工具与网络浏览器配合使用,可以在客户机上本地检测垃圾邮件网页。 反垃圾邮件工具可以作为插件模块或浏览器的组成部分来实现,并且表现为工具栏。 每当通过浏览器访问网页时,该工具都可以执行反垃圾邮件操作,并且不需要直接参与搜索引擎。 安装在计算设备上的垃圾邮件检测模块通过将网页或链接的URL与垃圾邮件列表进行比较来确定正在访问的网页是否被访问的网页中包含的链接是垃圾邮件。 垃圾邮件列表可以从远程搜索引擎服务器下载,本地存储和不时更新。 还引入了两级索引技术,以提高反垃圾邮件工具使用垃圾邮件列表的效率。
-
公开(公告)号:US20080313168A1
公开(公告)日:2008-12-18
申请号:US11764554
申请日:2007-06-18
申请人: Tie-Yan Liu , Hang Li , Bin Gao , Lei Yang , Lei Qi
发明人: Tie-Yan Liu , Hang Li , Bin Gao , Lei Yang , Lei Qi
IPC分类号: G06F7/08
CPC分类号: G06F17/30864
摘要: Ranking documents based on a series of web graphs collected over time is provided. A ranking system provides multiple transition probability distributions representing different snapshots or times. Each transition probability distribution represents a probability of transitioning from one document to another document within a collection of documents using a link of the document. The ranking system determines a stationary probability distribution for each snapshot based on the transition probability distributions for that snapshot and the stationary probability distribution of the previous snapshot. The stationary probability distributions represent a ranking of the documents over time.
摘要翻译: 提供了基于随时间收集的一系列网络图表排列文档。 排名系统提供代表不同快照或时间的多个转移概率分布。 每个转移概率分布表示使用文档的链接在一个文档集合内从一个文档转换到另一个文档的概率。 排名系统基于该快照的转移概率分布和先前快照的固定概率分布确定每个快照的固定概率分布。 固定概率分布代表文档随时间的排列。
-
公开(公告)号:US20080256051A1
公开(公告)日:2008-10-16
申请号:US11734336
申请日:2007-04-12
申请人: Tie-Yan Liu , Hang Li , Lei Qi , Bin Gao , Lei Yang
发明人: Tie-Yan Liu , Hang Li , Lei Qi , Bin Gao , Lei Yang
IPC分类号: G06F17/30
CPC分类号: G06F17/30882 , G06F17/30864
摘要: A method and system for determining temporal importance of documents having links between documents based on a temporal analysis of the links is provided. A temporal ranking system collects link information or snapshots indicating the links between documents at various snapshot times. The temporal ranking system calculates a current temporal importance of a document by factoring in the current importance of the document derived from the current snapshot (i.e., with the latest snapshot time) and the historical importance of the document derived from the past snapshots. To calculate the current temporal importance of a web page, the temporal ranking system aggregates the importance of the web page for each snapshot.
摘要翻译: 提供了一种用于基于链接的时间分析来确定具有文档之间的链接的文档的时间重要性的方法和系统。 时间排序系统收集指示各种快照时间的文档之间的链接的链接信息或快照。 时间排序系统通过考虑从当前快照(即,具有最新快照时间)导出的文档的当前重要性以及从过去快照导出的文档的历史重要性来计算文档的当前时间重要性。 为了计算网页的当前时间重要性,时间排序系统聚合每个快照的网页的重要性。
-
公开(公告)号:US08645288B2
公开(公告)日:2014-02-04
申请号:US12959060
申请日:2010-12-02
申请人: Taifeng Wang , Bin Gao , Tie-Yan Liu
发明人: Taifeng Wang , Bin Gao , Tie-Yan Liu
IPC分类号: G06F15/18
CPC分类号: G06F17/30873 , G06F17/30867
摘要: Some implementations provide techniques for selecting web pages for inclusion in an index. For example, some implementations apply regularization to select a subset of the crawled web pages for indexing based on link relationships between the crawled web pages, features extracted from the crawled web pages, and user behavior information determined for at least some of the crawled web pages. Further, in some implementations, the user behavior information may be used to sort a training set of crawled web pages into a plurality of labeled groups. The labeled groups may be represented in a directed graph that indicates relative priorities for being selected for indexing.
摘要翻译: 一些实现提供用于选择包括在索引中的网页的技术。 例如,一些实现应用正则化来基于被爬网的网页之间的链接关系,从被爬网的网页提取的特征以及为至少一些被爬网的网页确定的用户行为信息来选择用于索引的被爬网网页的子集 。 此外,在一些实现中,可以使用用户行为信息来将爬网网页的训练集合分类成多个标记的组。 标记的组可以在有向图中表示,其指示被选择用于索引的相对优先级。
-
公开(公告)号:US20130097011A1
公开(公告)日:2013-04-18
申请号:US13273924
申请日:2011-10-14
申请人: Taifeng Wang , Tie-Yan Liu , Bin Gao , Tao Qin
发明人: Taifeng Wang , Tie-Yan Liu , Bin Gao , Tao Qin
IPC分类号: G06Q30/02
CPC分类号: G06Q30/02
摘要: An advertisement perception predictor may forecast the effectiveness of an online advertisement in a web page by predicting whether the online advertisement may be perceived by a consumer. The advertisement perception predictor may use a perception model that is trained for determining perception probability values of online advertisements. The perception model may be applied to an online advertisement to determine a perception probability value for the online advertisement. The perception probability value may indicate the likelihood that a consumer is likely to view the online advertisement.
摘要翻译: 广告感知预测器可以通过预测在线广告是否可被消费者感知来预测网页中的在线广告的有效性。 广告感知预测器可以使用被训练用于确定在线广告的感知概率值的感知模型。 感知模型可以应用于在线广告以确定在线广告的感知概率值。 感知概率值可以指示消费者可能查看在线广告的可能性。
-
公开(公告)号:US08069167B2
公开(公告)日:2011-11-29
申请号:US12413502
申请日:2009-03-27
申请人: Bin Gao , Tie-Yan Liu
发明人: Bin Gao , Tie-Yan Liu
IPC分类号: G06F17/30
CPC分类号: G06F17/30864
摘要: The page ranking technique described herein employs a Markov Skeleton Mirror Process (MSMP), which is a particular case of Markov Skeleton Processes, to model and calculate page importance scores. Given a web graph and its metadata, the technique builds an MSMP model on the web graph. It first estimates the stationary distribution of a EMC and views it as transition probability. It next computes the mean staying time using the metadata. Finally, it calculates the product of transition probability and mean staying time, which is actually the stationary distribution of MSMP. This is regarded as page importance.
摘要翻译: 本文描述的页面排序技术使用马尔可夫骨架镜像过程(MSMP),其是马可夫骨骼过程的特定情况,用于建模和计算页面重要性分数。 给定一个网络图及其元数据,该技术在网络图上构建一个MSMP模型。 它首先估计了EMC的固定分布,并将其视为转移概率。 接下来使用元数据计算平均停留时间。 最后,计算转移概率和平均停留时间的乘积,实际上是MSMP的固定分布。 这被认为是页面重要性。
-
公开(公告)号:US20100076910A1
公开(公告)日:2010-03-25
申请号:US12237392
申请日:2008-09-25
申请人: Bin Gao , Tie-Yan Liu , Hang Li , Yuting Liu
发明人: Bin Gao , Tie-Yan Liu , Hang Li , Yuting Liu
CPC分类号: G06F17/30864 , G06Q30/02
摘要: Method for determining a webpage importance, including receiving web browsing behavior data of one or more users; creating a model of the web browsing behavior data; calculating a stationary probability distribution of the model; and correlating the stationary probability distribution to the webpage importance.
摘要翻译: 用于确定网页重要性的方法,包括接收一个或多个用户的网页浏览行为数据; 创建网络浏览行为数据的模型; 计算模型的固定概率分布; 并将固定概率分布与网页重要性相关联。
-
-
-
-
-
-
-
-
-