Niche Keyword Recommendation
    11.
    Patent application
    Status: Under examination (published)

    Publication No.: US20130085867A1

    Publication date: 2013-04-04

    Application No.: US13249939

    Filing date: 2011-09-30

    Applicant: Bin Gao, Tie-Yan Liu

    Inventor: Bin Gao, Tie-Yan Liu

    IPC classification: G06Q30/02, G06Q30/08

    Abstract: A computing device is described herein that is configured to select a subset of keywords from a plurality of keywords based at least on measures of competition associated with the keywords and to suggest the selected subset for bidding. The plurality of keywords is relevant to at least one advertising target. The computing device calculates a measure of competition for a respective keyword based on a number of bidders for the respective keyword and on a number of available advertisement slots in search results provided responsive to queries for the respective keyword.
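
    The selection step can be illustrated with a minimal Python sketch. The abstract only says the competition measure depends on the number of bidders and the number of available ad slots; the ratio used here (bidders per slot), the `Keyword` class, and the sample data are assumptions made for illustration, not the formula from the application.

```python
from dataclasses import dataclass


@dataclass
class Keyword:
    text: str
    bidders: int   # advertisers currently bidding on the keyword
    ad_slots: int  # ad slots available in results for the keyword


def competition(kw: Keyword) -> float:
    """Assumed measure: bidders per available slot (higher = more contested)."""
    if kw.ad_slots <= 0:
        return float("inf")  # no slots to win, treat as maximally contested
    return kw.bidders / kw.ad_slots


def suggest_niche_keywords(keywords: list[Keyword], k: int) -> list[str]:
    """Return the k least contested keywords (all assumed already relevant)."""
    ranked = sorted(keywords, key=competition)
    return [kw.text for kw in ranked[:k]]


# Hypothetical candidate keywords for one advertising target.
candidates = [
    Keyword("running shoes", bidders=40, ad_slots=8),
    Keyword("trail running shoes wide fit", bidders=3, ad_slots=6),
    Keyword("marathon shoes", bidders=12, ad_slots=8),
]
suggestions = suggest_niche_keywords(candidates, k=2)
```

    Sorting by the measure and taking a prefix is only one way to choose the subset; a threshold on the measure would fit the abstract equally well.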

    Calculating a webpage importance from a web browsing graph
    12.
    Granted patent
    Status: In force

    Publication No.: US08368698B2

    Publication date: 2013-02-05

    Application No.: US12236516

    Filing date: 2008-09-24

    IPC classification: G06T11/20, G06F3/00

    CPC classification: G06F17/30882, G06F17/30864

    Abstract: Method for creating a graph representing web browsing behavior, including receiving web browsing behavior data from one or more web browsers; adding a node on the graph for each web page listed in the web browsing behavior data; adding a first link connecting two or more nodes on the graph, wherein the first link represents a hyperlink for accessing a webpage; calculating an amount of time in which each web page is being accessed; determining a number of units of time in the calculated amount of time; adding one or more virtual nodes to the graph based on the number of units of time; and adding a second link connecting two or more virtual nodes on the graph, wherein the second link represents a virtual hyperlink for accessing a webpage.
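
    The graph construction steps can be sketched roughly as follows. The input format (sessions as ordered lists of `(url, seconds)` visits), the 10-second time unit, the `#v1`, `#v2` naming of virtual nodes, and the choice to chain a page to its own virtual nodes are all assumptions; the abstract does not fix any of them.

```python
from collections import defaultdict


def build_browsing_graph(sessions, unit_seconds=10):
    """Build page nodes, hyperlink edges, and time-based virtual nodes."""
    nodes = set()
    links = defaultdict(int)          # (src, dst) -> observed hyperlink clicks
    virtual_links = defaultdict(int)  # (src, dst) -> virtual hyperlinks
    for session in sessions:
        prev = None
        for url, seconds in session:
            nodes.add(url)
            if prev is not None:
                links[(prev, url)] += 1  # first link: a real hyperlink
            # Number of whole time units spent on the page.
            units = int(seconds // unit_seconds)
            # One virtual node per unit, chained after the page node.
            chain = [url] + [f"{url}#v{i}" for i in range(1, units + 1)]
            nodes.update(chain)
            for a, b in zip(chain, chain[1:]):
                virtual_links[(a, b)] += 1  # second link: virtual hyperlink
            prev = url
    return nodes, dict(links), dict(virtual_links)


# Hypothetical browsing data: 25 s on the home page, then 5 s on a news page.
sessions = [[("a.example/home", 25), ("a.example/news", 5)]]
nodes, links, virtual_links = build_browsing_graph(sessions)
```

    With a 10-second unit, the 25-second visit yields two virtual nodes and the 5-second visit yields none, so longer dwell time adds more weight to a page in the resulting graph.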

    ACTIVE PREDICTION OF DIVERSE SEARCH INTENT BASED UPON USER BROWSING BEHAVIOR
    13.
    Patent application
    Status: Under examination (published)

    Publication No.: US20110258148A1

    Publication date: 2011-10-20

    Application No.: US12762423

    Filing date: 2010-04-19

    Applicant: Bin Gao, Tie-Yan Liu

    Inventor: Bin Gao, Tie-Yan Liu

    IPC classification: G06F17/30, G06F15/18

    CPC classification: G06F17/30867

    Abstract: Many search engines attempt to understand and predict a user's search intent after the submission of search queries. Predicting search intent allows search engines to tailor search results to particular information needs of the user. Unfortunately, current techniques passively predict search intent after a query is submitted. Accordingly, one or more systems and/or techniques for actively predicting search intent from user browsing behavior data are disclosed herein. For example, search patterns of a user browsing a web page and shortly thereafter performing a query may be extracted from user browsing behavior. Queries within the search patterns may be ranked based upon a search trigger likelihood that content of the web page motivated the user to perform the query. In this way, query suggestions having a high search trigger likelihood and a diverse range of topics may be generated and/or presented to users of the web page.

    Anti-spam tool for browser
    14.
    Granted patent
    Status: In force

    Publication No.: US07860971B2

    Publication date: 2010-12-28

    Application No.: US12035124

    Filing date: 2008-02-21

    IPC classification: G06F15/16

    CPC classification: G06F17/30899, G06F21/50

    Abstract: An anti-spam tool works with a web browser to detect spam webpages locally on a client machine. The anti-spam tool can be implemented either as a plug-in module or an integral part of the browser, and manifested as a toolbar. The tool can perform an anti-spam action whenever a webpage is accessed through the browser, and does not require direct involvement of a search engine. A spam detection module installed on the computing device determines whether a webpage being accessed or whether a link contained in the webpage being accessed is spam, by comparing the URL of the webpage or the link with a spam list. The spam list can be downloaded from a remote search engine server, stored locally and updated from time to time. A two-level indexing technique is also introduced to improve the efficiency of the anti-spam tool's use of the spam list.
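
    A local spam-list lookup with a two-level index might look like the sketch below. The abstract does not say what the two levels are; indexing first by host and then by path is an assumption, and the `SpamIndex` class and example URLs are hypothetical.

```python
from urllib.parse import urlsplit


class SpamIndex:
    """Two-level lookup: host first, then the set of spam paths on that host."""

    def __init__(self, spam_urls):
        self._by_host = {}
        for url in spam_urls:
            host, path = self._split(url)
            self._by_host.setdefault(host, set()).add(path)

    @staticmethod
    def _split(url):
        parts = urlsplit(url)
        # Host names are case-insensitive; an empty path means the root page.
        return parts.netloc.lower(), parts.path or "/"

    def is_spam(self, url):
        host, path = self._split(url)
        paths = self._by_host.get(host)  # level 1: most hosts miss here
        return paths is not None and path in paths  # level 2: path check

    def spam_links(self, links):
        """Filter the links found on a page down to those on the spam list."""
        return [link for link in links if self.is_spam(link)]


# Hypothetical spam list, as it might be stored after a download.
index = SpamIndex(["http://spam.example/buy", "http://spam.example/win"])
flagged = index.spam_links(
    ["http://ok.example/buy", "http://SPAM.example/buy"]
)
```

    The first level rejects most URLs with one dictionary lookup, so the per-path comparison only runs for hosts that already appear on the list.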

    Calculating Web Page Importance
    15.
    Patent application
    Status: In force

    Publication No.: US20100250555A1

    Publication date: 2010-09-30

    Application No.: US12413502

    Filing date: 2009-03-27

    Applicant: Bin Gao, Tie-Yan Liu

    Inventor: Bin Gao, Tie-Yan Liu

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: The page ranking technique described herein employs a Markov Skeleton Mirror Process (MSMP), which is a particular case of Markov Skeleton Processes, to model and calculate page importance scores. Given a web graph and its metadata, the technique builds an MSMP model on the web graph. It first estimates the stationary distribution of an embedded Markov chain (EMC) and views it as transition probability. It next computes the mean staying time using the metadata. Finally, it calculates the product of transition probability and mean staying time, which is actually the stationary distribution of MSMP. This is regarded as page importance.
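
    The three steps in the abstract can be sketched in plain Python. Power iteration is used here as one common way to estimate a stationary distribution; the abstract does not name an estimation method. The final normalization, the toy transition matrix, and the staying times are likewise assumptions for illustration.

```python
def stationary_distribution(P, iterations=200):
    """Estimate the stationary distribution of a row-stochastic matrix P."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iterations):
        # One step of pi <- pi * P.
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi


def page_importance(P, mean_staying_time):
    """Multiply the EMC stationary distribution by mean staying time."""
    pi = stationary_distribution(P)
    raw = [p * t for p, t in zip(pi, mean_staying_time)]
    total = sum(raw)
    # Normalizing so scores sum to 1 is an assumption, not from the abstract.
    return [r / total for r in raw]


# Toy 3-page chain; each page jumps to either other page with probability 0.5.
P = [
    [0.0, 0.5, 0.5],
    [0.5, 0.0, 0.5],
    [0.5, 0.5, 0.0],
]
staying = [10.0, 20.0, 70.0]  # hypothetical mean staying times in seconds
scores = page_importance(P, staying)
```

    Because this toy chain visits all three pages equally often, the scores differ only through staying time, which shows how the time term separates pages that link structure alone cannot distinguish.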

    Forum Mining for Suspicious Link Spam Sites Detection
    16.
    Patent application
    Status: In force

    Publication No.: US20090198673A1

    Publication date: 2009-08-06

    Application No.: US12027259

    Filing date: 2008-02-06

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: An anti-spam technique for protecting search engine ranking is based on mining search engine optimization (SEO) forums. The anti-spam technique collects webpages such as SEO forum posts from a list of suspect spam websites, and extracts suspicious link exchange URLs and corresponding link information from the collected webpages. A search engine ranking penalty is then applied to the suspicious link exchange URLs. The penalty is at least partially determined by the link information associated with the respective suspicious link exchange URL. To detect more suspicious link exchange URLs, the technique may propagate one or more levels from a seed set of suspicious link exchange URLs generated by mining SEO forums.
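
    The propagation step can be pictured as a bounded breadth-first walk from the seed set. In this sketch the seed penalties are taken as given, propagation follows outgoing links, and each level halves the penalty; the link direction, the `damping` factor, and all names are assumptions, since the abstract does not specify how penalties spread.

```python
from collections import deque


def propagate_penalties(seed_penalties, out_links, levels=1, damping=0.5):
    """Spread ranking penalties from seed URLs to pages within `levels` hops."""
    penalties = dict(seed_penalties)
    frontier = deque((url, 0) for url in seed_penalties)
    while frontier:
        url, depth = frontier.popleft()
        if depth == levels:
            continue  # do not expand beyond the configured number of levels
        for neighbor in out_links.get(url, ()):
            if neighbor not in penalties:
                # Assumed rule: each hop away from a seed halves the penalty.
                penalties[neighbor] = penalties[url] * damping
                frontier.append((neighbor, depth + 1))
    return penalties


# Hypothetical seed mined from an SEO forum post, plus a small link graph.
seeds = {"s.example": 1.0}
graph = {"s.example": ["a.example"], "a.example": ["b.example"]}
one_level = propagate_penalties(seeds, graph, levels=1)
two_levels = propagate_penalties(seeds, graph, levels=2)
```

    Capping the number of levels limits the risk of penalizing legitimate sites that are only distantly connected to a seed.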

    USER INTENT STRENGTH AGGREGATING BY DECAY FACTOR
    17.
    Patent application
    Status: Under examination (published)

    Publication No.: US20120253930A1

    Publication date: 2012-10-04

    Application No.: US13078300

    Filing date: 2011-04-01

    IPC classification: G06Q30/00

    CPC classification: G06Q30/0251

    Abstract: This application describes a system and method for estimating user intent towards categories of content. The estimation of user intent may be based at least in part on a score for prior user actions and a decay function that is applied to that score to provide an estimate of current user intent. The estimate represents current user intent for time periods in which user actions towards a category of content are negligible or non-existent.
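
    As an illustration, the sketch below applies an exponential decay to a prior score. The abstract only refers to "a decay function"; the exponential form, the 7-day half-life, and the per-category data layout are assumptions.

```python
def current_intent(prior_score, days_since_last_action, half_life_days=7.0):
    """Decay a prior intent score; it halves every `half_life_days` days."""
    return prior_score * 0.5 ** (days_since_last_action / half_life_days)


def intent_by_category(history, today):
    """history maps category -> (prior score, day index of last action)."""
    return {
        category: current_intent(score, today - last_day)
        for category, (score, last_day) in history.items()
    }


# Hypothetical user: active on "cameras" 14 days ago, on "travel" today.
history = {"cameras": (8.0, 0), "travel": (3.0, 14)}
estimates = intent_by_category(history, today=14)
```

    The decayed value gives an estimate for periods with no new actions: the camera score drops to a quarter of its prior value after two half-lives, while the travel score is unchanged.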

    MULTI-LEVEL COVERAGE FOR CRAWLING SELECTION
    18.
    Patent application
    Status: Under examination (published)

    Publication No.: US20120143844A1

    Publication date: 2012-06-07

    Application No.: US12958611

    Filing date: 2010-12-02

    IPC classification: G06F17/30

    CPC classification: G06F16/951

    Abstract: Some implementations provide techniques for determining which URLs to select for crawling from a pool of URLs. For example, the selection of URLs for crawling may be made based on maintaining a high coverage of the known URLs and/or high discoverability of the World Wide Web. Some implementations provide a multi-level coverage strategy for crawling selection. Further, some implementations provide techniques for discovering unseen URLs.

    Calculating web page importance based on web behavior model
    19.
    Granted patent
    Status: In force

    Publication No.: US08103599B2

    Publication date: 2012-01-24

    Application No.: US12237392

    Filing date: 2008-09-25

    IPC classification: G06F17/00, G06F17/20

    CPC classification: G06F17/30864, G06Q30/02

    Abstract: Method for determining a webpage importance, including receiving web browsing behavior data of one or more users; creating a model of the web browsing behavior data; calculating a stationary probability distribution of the model; and correlating the stationary probability distribution to the webpage importance.

    Calculating global importance of documents based on global hitting times
    20.
    Granted patent
    Status: Lapsed

    Publication No.: US07930303B2

    Publication date: 2011-04-19

    Application No.: US11742276

    Filing date: 2007-04-30

    IPC classification: G06F7/00, G06F17/30

    CPC classification: G06F17/30864

    Abstract: A calculate importance system calculates the global importance of a web page based on a “mean hitting time.” Hitting time of a target web page is a measure of the minimum number of transitions needed to land on the target web page. Mean hitting time of a target web page is an average number of such transitions for all possible starting web pages. The calculate importance system calculates a global importance score for a web page based on the reciprocal of a mean hitting time. A search engine may rank web pages of a search result based on a combination of relevance of the web pages to the search request and global importance of the web pages based on a global hitting time.
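
    Taking the abstract's wording literally, the score can be sketched with a breadth-first search: the hitting time from a start page is the fewest link transitions needed to reach the target, and importance is the reciprocal of the average. In random-walk theory a hitting time is usually the expected step of first arrival rather than a shortest path, so this is a simplified reading. Ignoring start pages that cannot reach the target is a further assumption.

```python
from collections import deque


def mean_hitting_time(graph, target):
    """Average shortest-path length to `target` over pages that can reach it."""
    # Reverse the links so one search from the target covers every start page.
    reverse = {page: [] for page in graph}
    for src, dsts in graph.items():
        for dst in dsts:
            reverse.setdefault(dst, []).append(src)
    dist = {target: 0}
    queue = deque([target])
    while queue:
        page = queue.popleft()
        for prev in reverse.get(page, ()):
            if prev not in dist:
                dist[prev] = dist[page] + 1
                queue.append(prev)
    starts = [d for page, d in dist.items() if page != target]
    if not starts:
        return float("inf")  # no other page can reach the target
    return sum(starts) / len(starts)


def global_importance(graph, target):
    """Importance as the reciprocal of the mean hitting time."""
    return 1.0 / mean_hitting_time(graph, target)


# Hypothetical link graphs: a 3-page cycle and a small hub.
cycle = {"a": ["b"], "b": ["c"], "c": ["a"]}
hub = {"a": ["h"], "b": ["h"], "h": ["a"]}
cycle_score = global_importance(cycle, "c")
hub_score = global_importance(hub, "h")
```

    The hub page scores higher than a page on the cycle because every other page reaches it in a single transition.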