Calculating a webpage importance from a web browsing graph
    1.
    Granted invention patent
    Status: in force

    Publication No.: US08368698B2

    Publication date: 2013-02-05

    Application No.: US12236516

    Filing date: 2008-09-24

    IPC classification: G06T11/20 G06F3/00

    CPC classification: G06F17/30882 G06F17/30864

    Abstract: A method for creating a graph representing web browsing behavior, including receiving web browsing behavior data from one or more web browsers; adding a node on the graph for each web page listed in the web browsing behavior data; adding a first link connecting two or more nodes on the graph, wherein the first link represents a hyperlink for accessing a webpage; calculating an amount of time in which each web page is being accessed; determining a number of units of time in the calculated amount of time; adding one or more virtual nodes to the graph based on the number of units of time; and adding a second link connecting two or more virtual nodes on the graph, wherein the second link represents a virtual hyperlink for accessing a webpage.
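
The abstract does not say how the virtual nodes are wired into the graph, so the following Python sketch is only one possible reading. It assumes that browsing data arrives as sessions of `(url, dwell_seconds)` visits, that one time unit is `unit_seconds` long, and that each page gets a chain of virtual nodes named `url#t1`, `url#t2`, and so on, one for each time unit beyond the first. The function name, the input layout, and the naming scheme are all hypothetical.

```python
import math
from collections import defaultdict


def build_browsing_graph(sessions, unit_seconds=10.0):
    """Build a browsing graph from sessions of (url, dwell_seconds) visits.

    Returns (nodes, links, virtual_links); both link sets hold (src, dst) pairs.
    """
    nodes = set()
    links = set()          # "first links": observed page-to-page transitions
    virtual_links = set()  # "second links": chain through the virtual nodes
    dwell = defaultdict(float)

    for session in sessions:
        for i, (url, seconds) in enumerate(session):
            nodes.add(url)
            dwell[url] += seconds
            if i > 0:
                links.add((session[i - 1][0], url))

    for url, seconds in dwell.items():
        units = math.ceil(seconds / unit_seconds)
        prev = url
        # Assumed layout: one virtual node per time unit beyond the first,
        # chained from the real page.
        for k in range(1, units):
            virtual = f"{url}#t{k}"
            nodes.add(virtual)
            virtual_links.add((prev, virtual))
            prev = virtual

    return nodes, links, virtual_links
```

With a 10-second unit, a page viewed for 25 seconds spans three units and therefore receives two virtual nodes, while a page viewed for 5 seconds receives none. Longer dwell times thus give a page more nodes in the graph.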

    Calculating web page importance based on web behavior model
    2.
    Granted invention patent
    Status: in force

    Publication No.: US08103599B2

    Publication date: 2012-01-24

    Application No.: US12237392

    Filing date: 2008-09-25

    IPC classification: G06F17/00 G06F17/20

    CPC classification: G06F17/30864 G06Q30/02

    Abstract: A method for determining webpage importance, including receiving web browsing behavior data of one or more users; creating a model of the web browsing behavior data; calculating a stationary probability distribution of the model; and correlating the stationary probability distribution to the webpage importance.
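
As a rough illustration of the stationary-distribution step, the Python sketch below treats observed page-to-page moves as a discrete-time Markov chain and approximates its stationary distribution by power iteration. The abstract does not specify the model; the damping factor (borrowed from PageRank-style smoothing so that the iteration converges), the uniform handling of pages with no outgoing moves, and the function name are assumptions made here.

```python
from collections import defaultdict


def stationary_importance(transitions, damping=0.85, iterations=100):
    """Approximate a stationary distribution from (src, dst) browsing moves."""
    counts = defaultdict(lambda: defaultdict(int))
    pages = set()
    for src, dst in transitions:
        counts[src][dst] += 1
        pages.update((src, dst))
    pages = sorted(pages)
    n = len(pages)

    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Assumed smoothing: a small uniform restart probability.
        nxt = {p: (1.0 - damping) / n for p in pages}
        for src in pages:
            out = counts.get(src)
            if out:
                total = sum(out.values())
                for dst, c in out.items():
                    nxt[dst] += damping * rank[src] * c / total
            else:
                # Page with no observed outgoing move: spread its mass evenly.
                for p in pages:
                    nxt[p] += damping * rank[src] / n
        rank = nxt
    return rank
```

The returned values sum to 1 and can be read directly as importance scores: a page that users reach often gets a larger share of the probability mass.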

    CALCULATING A WEBPAGE IMPORTANCE FROM A WEB BROWSING GRAPH
    4.
    Invention patent application
    Status: in force

    Publication No.: US20100073374A1

    Publication date: 2010-03-25

    Application No.: US12236516

    Filing date: 2008-09-24

    IPC classification: G06T11/20

    CPC classification: G06F17/30882 G06F17/30864

    Abstract: A method for creating a graph representing web browsing behavior, including receiving web browsing behavior data from one or more web browsers; adding a node on the graph for each web page listed in the web browsing behavior data; adding a first link connecting two or more nodes on the graph, wherein the first link represents a hyperlink for accessing a webpage; calculating an amount of time in which each web page is being accessed; determining a number of units of time in the calculated amount of time; adding one or more virtual nodes to the graph based on the number of units of time; and adding a second link connecting two or more virtual nodes on the graph, wherein the second link represents a virtual hyperlink for accessing a webpage.

    Anti-spam tool for browser
    5.
    Granted invention patent
    Status: in force

    Publication No.: US07860971B2

    Publication date: 2010-12-28

    Application No.: US12035124

    Filing date: 2008-02-21

    IPC classification: G06F15/16

    CPC classification: G06F17/30899 G06F21/50

    Abstract: An anti-spam tool works with a web browser to detect spam webpages locally on a client machine. The anti-spam tool can be implemented either as a plug-in module or as an integral part of the browser, and manifested as a toolbar. The tool can perform an anti-spam action whenever a webpage is accessed through the browser, and does not require direct involvement of a search engine. A spam detection module installed on the computing device determines whether the webpage being accessed, or a link contained in that webpage, is spam by comparing the URL of the webpage or the link with a spam list. The spam list can be downloaded from a remote search engine server, stored locally, and updated from time to time. A two-level indexing technique is also introduced to improve the efficiency of the anti-spam tool's use of the spam list.
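
The abstract names a two-level indexing technique without describing it. The Python sketch below shows one plausible arrangement, assumed here purely for illustration: the first level is keyed by host name and the second level holds the set of spam paths on that host, so most lookups end after a single dictionary probe. The `SpamList` class, `check_page` helper, and exact-path matching are hypothetical.

```python
from urllib.parse import urlsplit


class SpamList:
    """Locally stored spam list with an assumed host -> paths index."""

    def __init__(self, spam_urls=()):
        self._index = {}  # level 1: host; level 2: set of paths on that host
        for url in spam_urls:
            self.add(url)

    @staticmethod
    def _key(url):
        parts = urlsplit(url)
        host = (parts.hostname or "").lower()
        path = parts.path or "/"
        return host, path

    def add(self, url):
        host, path = self._key(url)
        self._index.setdefault(host, set()).add(path)

    def is_spam(self, url):
        host, path = self._key(url)
        paths = self._index.get(host)  # level-1 probe
        return paths is not None and path in paths  # level-2 probe


def check_page(page_url, link_urls, spam_list):
    """Return (page_is_spam, spam_links) for a page and the links it contains."""
    flagged = [u for u in link_urls if spam_list.is_spam(u)]
    return spam_list.is_spam(page_url), flagged
```

A browser extension could call `check_page` on each navigation and refresh the `SpamList` contents whenever a new list is downloaded. Real deployments would likely need URL normalization beyond the lowercase host used here.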

    Forum Mining for Suspicious Link Spam Sites Detection
    6.
    Invention patent application
    Status: in force

    Publication No.: US20090198673A1

    Publication date: 2009-08-06

    Application No.: US12027259

    Filing date: 2008-02-06

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: An anti-spam technique for protecting search engine ranking is based on mining search engine optimization (SEO) forums. The anti-spam technique collects webpages such as SEO forum posts from a list of suspect spam websites, and extracts suspicious link exchange URLs and corresponding link information from the collected webpages. A search engine ranking penalty is then applied to the suspicious link exchange URLs. The penalty is at least partially determined by the link information associated with the respective suspicious link exchange URL. To detect more suspicious link exchange URLs, the technique may propagate one or more levels from a seed set of suspicious link exchange URLs generated by mining SEO forums.
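
The propagation step can be pictured as a bounded breadth-first walk outward from the seed set. The Python sketch below is an assumption-laden illustration, not the patented scoring: it takes the seed URLs as already extracted, represents the link graph as a dictionary of outgoing links, and halves the penalty at each level. The function name, the `decay` parameter, and the penalty formula are hypothetical.

```python
from collections import deque


def propagate_penalties(seeds, link_graph, max_levels=1,
                        base_penalty=1.0, decay=0.5):
    """Assign ranking penalties to seed URLs and to URLs near them.

    link_graph maps a URL to the URLs it exchanges links with.
    """
    penalties = {url: base_penalty for url in seeds}
    queue = deque((url, 0) for url in seeds)
    while queue:
        url, level = queue.popleft()
        if level == max_levels:
            continue  # do not expand beyond the requested number of levels
        for neighbor in link_graph.get(url, ()):
            if neighbor not in penalties:
                # Assumed rule: the penalty shrinks with distance from the seeds.
                penalties[neighbor] = base_penalty * decay ** (level + 1)
                queue.append((neighbor, level + 1))
    return penalties
```

Because the walk is breadth-first, each URL keeps the penalty of the shortest path from any seed, and URLs further than `max_levels` from every seed are left unpenalized.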

    Calculating global importance of documents based on global hitting times
    7.
    Granted invention patent
    Status: lapsed

    Publication No.: US07930303B2

    Publication date: 2011-04-19

    Application No.: US11742276

    Filing date: 2007-04-30

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30864

    Abstract: A calculate importance system calculates the global importance of a web page based on a “mean hitting time.” Hitting time of a target web page is a measure of the minimum number of transitions needed to land on the target web page. Mean hitting time of a target web page is an average number of such transitions for all possible starting web pages. The calculate importance system calculates a global importance score for a web page based on the reciprocal of a mean hitting time. A search engine may rank web pages of a search result based on a combination of relevance of the web pages to the search request and global importance of the web pages based on a global hitting time.
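
Read literally, the abstract's definition makes hitting time a shortest-path length, so a minimal Python sketch can compute it with a breadth-first search over reversed links. Two choices below are assumptions rather than anything stated in the abstract: starting pages that cannot reach the target are left out of the average, and a target that no other page can reach scores 0.0. The function name and input layout are hypothetical.

```python
from collections import deque


def importance_by_hitting_time(links):
    """Score pages by the reciprocal of their mean hitting time.

    links maps each page to the pages it links to.
    """
    pages = set(links)
    for targets in links.values():
        pages.update(targets)

    # Reverse the links so one search per target finds every starting page.
    reverse = {p: [] for p in pages}
    for src, targets in links.items():
        for dst in targets:
            reverse[dst].append(src)

    scores = {}
    for target in pages:
        dist = {target: 0}
        queue = deque([target])
        while queue:
            page = queue.popleft()
            for prev in reverse[page]:
                if prev not in dist:
                    dist[prev] = dist[page] + 1
                    queue.append(prev)
        hits = [d for p, d in dist.items() if p != target]
        # Reciprocal of the mean is len / sum; unreachable targets score 0.0.
        scores[target] = len(hits) / sum(hits) if hits else 0.0
    return scores
```

A page that every other page links to directly has a mean hitting time of 1 and therefore the maximum score of 1.0; pages that take more clicks to reach score lower.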

    Calculating importance of documents factoring historical importance
    8.
    Granted invention patent
    Status: in force

    Publication No.: US07676520B2

    Publication date: 2010-03-09

    Application No.: US11734336

    Filing date: 2007-04-12

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30882 G06F17/30864

    Abstract: A method and system for determining temporal importance of documents having links between documents based on a temporal analysis of the links is provided. A temporal ranking system collects link information or snapshots indicating the links between documents at various snapshot times. The temporal ranking system calculates a current temporal importance of a document by factoring in the current importance of the document derived from the current snapshot (i.e., with the latest snapshot time) and the historical importance of the document derived from the past snapshots. To calculate the current temporal importance of a web page, the temporal ranking system aggregates the importance of the web page for each snapshot.
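
The abstract says the per-snapshot importance values are aggregated but not how. One simple possibility, assumed here for illustration only, is an exponentially decayed weighted average in which the latest snapshot has weight 1 and each older snapshot counts `decay` times as much as the one after it. The function name and the `decay` parameter are hypothetical.

```python
def temporal_importance(snapshot_scores, decay=0.5):
    """Combine per-snapshot importance scores into one temporal score.

    snapshot_scores is a list of {page: importance} dicts, oldest first.
    """
    latest = len(snapshot_scores) - 1
    totals = {}
    weight_sum = 0.0
    for i, scores in enumerate(snapshot_scores):
        # Assumed weighting: older snapshots count exponentially less.
        weight = decay ** (latest - i)
        weight_sum += weight
        for page, score in scores.items():
            totals[page] = totals.get(page, 0.0) + weight * score
    return {page: total / weight_sum for page, total in totals.items()}
```

With `decay=0.5` and two snapshots, the current importance carries two thirds of the weight and the historical importance one third, so a page that only recently became important still ranks below its current-snapshot score alone.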

    Ranking documents based on a series of document graphs
    9.
    Granted invention patent
    Status: in force

    Publication No.: US08244737B2

    Publication date: 2012-08-14

    Application No.: US11764554

    Filing date: 2007-06-18

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: Ranking documents based on a series of web graphs collected over time is provided. A ranking system provides multiple transition probability distributions representing different snapshots or times. Each transition probability distribution represents a probability of transitioning from one document to another document within a collection of documents using a link of the document. The ranking system determines a stationary probability distribution for each snapshot based on the transition probability distributions for that snapshot and the stationary probability distribution of the previous snapshot. The stationary probability distributions represent a ranking of the documents over time.
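
One way to make each snapshot's distribution depend on the previous one, sketched below in Python, is to use the previous stationary distribution as the restart vector of a damped power iteration. This coupling is an assumption; the abstract only states that both inputs are used. The function name, the `damping` parameter, and the uniform prior for the first snapshot are likewise hypothetical.

```python
def snapshot_rankings(snapshots, damping=0.85, iterations=100):
    """Compute one stationary distribution per snapshot, oldest first.

    Each snapshot maps a page to {next_page: probability}; rows sum to 1.
    """
    pages = set()
    for snap in snapshots:
        for src, row in snap.items():
            pages.add(src)
            pages.update(row)
    pages = sorted(pages)

    prior = {p: 1.0 / len(pages) for p in pages} if pages else {}
    results = []
    for snap in snapshots:
        rank = dict(prior)
        for _ in range(iterations):
            # Assumed coupling: restart according to the previous ranking.
            nxt = {p: (1.0 - damping) * prior[p] for p in pages}
            for src in pages:
                row = snap.get(src)
                if row:
                    for dst, prob in row.items():
                        nxt[dst] += damping * rank[src] * prob
                else:
                    # No outgoing links in this snapshot: fall back to the prior.
                    for p in pages:
                        nxt[p] += damping * rank[src] * prior[p]
            rank = nxt
        results.append(rank)
        prior = rank
    return results
```

Under this reading, a page that was important in earlier snapshots keeps some of that mass through the restart term even if its links disappear in the latest graph.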

    Forum mining for suspicious link spam sites detection
    10.
    Granted invention patent
    Status: in force

    Publication No.: US08219549B2

    Publication date: 2012-07-10

    Application No.: US12027259

    Filing date: 2008-02-06

    IPC classification: G06F7/00

    CPC classification: G06F17/30864

    Abstract: An anti-spam technique for protecting search engine ranking is based on mining search engine optimization (SEO) forums. The anti-spam technique collects webpages such as SEO forum posts from a list of suspect spam websites, and extracts suspicious link exchange URLs and corresponding link information from the collected webpages. A search engine ranking penalty is then applied to the suspicious link exchange URLs. The penalty is at least partially determined by the link information associated with the respective suspicious link exchange URL. To detect more suspicious link exchange URLs, the technique may propagate one or more levels from a seed set of suspicious link exchange URLs generated by mining SEO forums.
