Patent search: ap:("Krishna A. Bharat" OR "Andrei Z. Broder" OR "Steven C. Glassman" OR "Jeffrey Dean" OR "Monika R. Henzinger") AND inv:"Jeffrey Dean" (page 1)

1.

Granted patent
Method and apparatus for finding mirrored hosts by analyzing connectivity and IP addresses [in force]

Publication number: US06487555B1

Publication date: 2002-11-26

Application number: US09307153

Filing date: 1999-05-07

Applicants: Krishna A. Bharat, Andrei Z. Broder, Steven C. Glassman, Jeffrey Dean, Monika R. Henzinger

Inventors: Krishna A. Bharat, Andrei Z. Broder, Steven C. Glassman, Jeffrey Dean, Monika R. Henzinger

IPC classification: G06F17/30

CPC classification: G06F17/30864

Abstract: A method and system that detects mirrored host pairs using information about a large set of pages, including one or more of: URLs, IP addresses, and connectivity information. The identities of the detected mirrored hosts are then saved so that browsers, crawlers, proxy servers, or the like can correctly identify mirrored web sites. The described embodiments of the present invention use one or a combination of techniques to identify mirrors. A first group of techniques involves determining mirrors based on URLs and information about connectivity (i.e., hyperlinks) between pages. A second group of techniques looks at connectivity information at a higher granularity, considering all links from all pages on a host as one group and ignoring the target of each link beyond the host level.

2.

Granted patent
Method and apparatus for finding mirrored hosts by analyzing URLs [in force]

Publication number: US06286006B1

Publication date: 2001-09-04

Application number: US09307320

Filing date: 1999-05-07

Applicants: Krishna A. Bharat, Andrei Broder, Steven C. Glassman, Jeffrey Dean, Monika R. Henzinger

Inventors: Krishna A. Bharat, Andrei Broder, Steven C. Glassman, Jeffrey Dean, Monika R. Henzinger

IPC classification: G06F17/30

CPC classification: G06F17/30902, Y10S707/99935

Abstract: A method and apparatus that detects mirrored host pairs using information about a large set of pages, including URLs. The identities of the detected mirrored hosts are then saved so that browsers, crawlers, proxy servers, or the like can correctly identify mirrored web sites. The described embodiments of the present invention look at the URLs of pages on hosts to determine whether the hosts are potentially mirrored.

3.

Granted patent
Method for identifying related pages in a hyperlinked database [in force]

Publication number: US06665837B1

Publication date: 2003-12-16

Application number: US09131473

Filing date: 1998-08-10

Applicants: Jeffrey Dean, Monika R. Henzinger, Andrei Z. Broder

Inventors: Jeffrey Dean, Monika R. Henzinger, Andrei Z. Broder

IPC classification: G06F15/00

CPC classification: G06F17/30958, G06F17/30864, Y10S707/99932, Y10S707/99934, Y10S707/99935, Y10S707/99936, Y10S707/99937, Y10S707/99945

Abstract: A method is described for identifying related pages among a plurality of pages in a linked database such as the World Wide Web. An initial page is selected from the plurality of pages. Pages linked to the initial page are represented as a graph in a memory. The pages represented in the graph are scored on content, and a set of pages is selected, the selected set of pages having scores greater than a first predetermined threshold. The selected set of pages is scored on connectivity, and a subset of the set of pages that have scores greater than a second predetermined threshold are selected as related pages.
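
The two-threshold selection described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how content or connectivity is scored, so the sketch assumes word-set Jaccard overlap for content and a count of incoming links from other retained pages for connectivity. The names `related_pages`, `text`, `links`, `content_min`, and `connectivity_min` are hypothetical.

```python
def related_pages(initial, text, links, content_min, connectivity_min):
    """Two-stage filter: content threshold first, connectivity threshold second.

    text:  {page: page text}, links: {page: iterable of pages it links to}.
    Both scoring functions below are assumptions made for illustration.
    """
    # Build the neighborhood graph: pages linked to or from the initial page.
    neighbors = set(links.get(initial, ()))
    neighbors |= {p for p, outs in links.items() if initial in outs}
    neighbors.discard(initial)

    # Stage 1: content score (assumed: Jaccard overlap of word sets).
    base = set(text[initial].lower().split())

    def content_score(page):
        words = set(text.get(page, "").lower().split())
        union = base | words
        return len(base & words) / len(union) if union else 0.0

    kept = {p for p in neighbors if content_score(p) > content_min}

    # Stage 2: connectivity score (assumed: links received from the
    # initial page or from other pages that survived stage 1).
    def connectivity_score(page):
        sources = (kept | {initial}) - {page}
        return sum(1 for q in sources if page in links.get(q, ()))

    return sorted(p for p in kept if connectivity_score(p) > connectivity_min)
```

For example, with pages `a` to `d` where only `b` and `d` share vocabulary with `a` and link to each other, calling `related_pages("a", text, links, 0.3, 1)` keeps `b` and `d` and drops the off-topic page.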

4.

Granted patent
Method for identifying near duplicate pages in a hyperlinked database [in force]

Publication number: US6138113A

Publication date: 2000-10-24

Application number: US131469

Filing date: 1998-08-10

Applicants: Jeffrey Dean, Monika R. Henzinger

Inventors: Jeffrey Dean, Monika R. Henzinger

IPC classification: G06F17/30

CPC classification: G06F17/30864, Y10S707/99932

Abstract: A method is described for identifying pages that are near duplicates in a linked database. In the linked database, pages can have incoming links and outgoing links. Two pages are selected, a first page and a second page. For each selected page, the number of outgoing links is determined. The two pages are marked as near duplicates based on the number of common outgoing links for the two pages.
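
As a rough illustration of comparing two pages by their common outgoing links, the following Python sketch marks a pair as near duplicates when the shared links make up a large enough fraction of the larger link set. The abstract only says the decision is based on the number of common outgoing links; the ratio test, the 0.75 default, and the name `near_duplicates` are assumptions.

```python
def near_duplicates(out_links_a, out_links_b, min_overlap=0.75):
    """Return True if two pages share enough outgoing links.

    The overlap ratio and its default threshold are illustrative
    assumptions, not values taken from the patent.
    """
    a, b = set(out_links_a), set(out_links_b)
    if not a or not b:
        # Pages without outgoing links give no evidence either way.
        return False
    common = len(a & b)
    return common / max(len(a), len(b)) >= min_overlap
```

Two pages with five outgoing links each, four of them shared, have an overlap of 0.8 and are marked as near duplicates under the assumed default.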

5.

Granted patent
Method and apparatus for preventing topic drift in queries in hyperlinked environments [in force]

Publication number: US06321220B1

Publication date: 2001-11-20

Application number: US09207215

Filing date: 1998-12-07

Applicants: Jeffrey Dean, Monika R. Henzinger, Krishna Asur Bharat

Inventors: Jeffrey Dean, Monika R. Henzinger, Krishna Asur Bharat

IPC classification: G06F17/30

CPC classification: G06F17/30882, G06F17/30864, Y10S707/99932, Y10S707/99933, Y10S707/99935

Abstract: A method and apparatus for preventing topic drift in queries in hyperlinked environments uses equivalence components for ranking pages containing information that is relevant to the topic of a user query input to a search engine. The method includes the step of providing a query to a search engine, where the query represents a predetermined topic; retrieving at least one page associated with the query; constructing a graph representing the pages in memory; creating at least one equivalence component representing a subset of the graph; processing each equivalence component; eliminating the equivalence component in accordance with whether it matches the predetermined topic; and ranking the remaining pages.

6.

Granted patent
System and method for impromptu shared communication spaces [in force]

Publication number: US09425971B1

Publication date: 2016-08-23

Application number: US13616467

Filing date: 2012-09-14

Applicants: Jeffrey Dean, Georges Harik, Obeka Tallis Brown Bakin

Inventors: Jeffrey Dean, Georges Harik, Obeka Tallis Brown Bakin

IPC classification: G06F17/30, H04L12/18, G06F15/16

CPC classification: H04L12/1818, G06F17/30699, G06F17/30702, G06F17/30867, Y10S707/99933, Y10S707/99939

Abstract: Communications between entities who may share common interests. For entities determined to be sharing common interests (e.g., searching using the same terms or topics, browsing a page, a site or a group of topically related sites), options for communication among the entities are provided. For example, a chat room may be dynamically created for persons who are currently searching or browsing the same or related information. As another example, a "homepage" may be created for each query and contain various types of information related to the query. A permission module controls which entities may participate, what types of information (and from what sources) an entity can (or desires to) receive, and what types of information the entity may (or desires to) share.

7.

Granted patent
System and method for analyzing data records [in force]

Publication number: US09405808B2

Publication date: 2016-08-02

Application number: US13407632

Filing date: 2012-02-28

Applicants: Robert C. Pike, Sean Quinlan, Sean M. Dorward, Jeffrey Dean, Sanjay Ghemawat

Inventors: Robert C. Pike, Sean Quinlan, Sean M. Dorward, Jeffrey Dean, Sanjay Ghemawat

IPC classification: G06F17/30, G06F11/14

CPC classification: G06F17/30501, G06F11/1482, G06F17/30545, G06F17/30598, Y10S707/99933, Y10S707/99937

Abstract: A method and system for analyzing data records includes allocating groups of records to respective processes of a first plurality of processes executing in parallel. In each respective process of the first plurality of processes, for each record in the group of records allocated to the respective process, a query is applied to the record so as to produce zero or more values. Zero or more emit operators are applied to each of the zero or more produced values so as to add corresponding information to an intermediate data structure. Information from a plurality of the intermediate data structures is aggregated to produce output data.
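
The allocate, query, emit, and aggregate flow in this abstract can be sketched in Python as follows. This is only an illustration under stated assumptions: worker threads stand in for the parallel processes, a `Counter` stands in for the intermediate data structure, the emit step is assumed to be a simple count per value, and the names `analyze_group`, `analyze`, and `words_longer_than_three` are hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def analyze_group(records, query):
    """Apply the query to each record and emit its values into a local table."""
    intermediate = Counter()
    for record in records:
        for value in query(record):   # the query yields zero or more values
            intermediate[value] += 1  # assumed emit operator: count occurrences
    return intermediate


def analyze(records, query, workers=4):
    """Split records into groups, analyze the groups in parallel, then merge."""
    groups = [records[i::workers] for i in range(workers)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for partial in pool.map(lambda group: analyze_group(group, query), groups):
            total.update(partial)     # aggregate the intermediate structures
    return total


def words_longer_than_three(record):
    """Example query: produce every word of the record longer than three letters."""
    return [word for word in record.split() if len(word) > 3]
```

Calling `analyze(["alpha beta", "beta xy", "gamma alpha beta"], words_longer_than_three)` counts each qualifying word across all groups. Because every worker writes only to its own table, the merge at the end is the single point where results are combined.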

8.

Granted patent
Document scoring based on query analysis [in force]

Publication number: US08639690B2

Publication date: 2014-01-28

Application number: US13454424

Filing date: 2012-04-24

Applicants: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong

Inventors: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong

IPC classification: G06F17/30

CPC classification: G06Q30/0246, G06F17/30864, Y10S707/99933

Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results; generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
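
One simple reading of "the extent to which a document is selected" is a selection rate: selections divided by appearances in result sets. The Python sketch below ranks documents by that rate alone. The abstract says the score is based only "at least in part" on this signal, so the formula, the handling of documents with no appearances, and the names `selection_rate` and `rank_by_selection` are assumptions for illustration.

```python
def selection_rate(selections, impressions):
    """Fraction of result-set appearances in which the document was selected."""
    return selections / impressions if impressions else 0.0


def rank_by_selection(stats):
    """Order document ids by selection rate, highest first.

    stats maps a document id to a (selections, impressions) pair.
    """
    return sorted(stats, key=lambda doc: selection_rate(*stats[doc]), reverse=True)
```

With `{"a": (5, 100), "b": (30, 100), "c": (0, 0)}`, document `b` ranks first, `a` second, and the never-shown `c` last.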

9.

Patent application
System and Method for Analyzing Data Records [in force]

Publication number: US20120215787A1

Publication date: 2012-08-23

Application number: US13407632

Filing date: 2012-02-28

Applicants: Robert C. Pike, Sean Quinlan, Sean M. Dorward, Jeffrey Dean, Sanjay Ghemawat

Inventors: Robert C. Pike, Sean Quinlan, Sean M. Dorward, Jeffrey Dean, Sanjay Ghemawat

IPC classification: G06F17/30

CPC classification: G06F17/30501, G06F11/1482, G06F17/30545, G06F17/30598, Y10S707/99933, Y10S707/99937

Abstract: A method and system for analyzing data records includes allocating groups of records to respective processes of a first plurality of processes executing in parallel. In each respective process of the first plurality of processes, for each record in the group of records allocated to the respective process, a query is applied to the record so as to produce zero or more values. Zero or more emit operators are applied to each of the zero or more produced values so as to add corresponding information to an intermediate data structure. Information from a plurality of the intermediate data structures is aggregated to produce output data.

10.

Patent application
Document scoring based on query analysis [in force]

Publication number: US20120016874A1

Publication date: 2012-01-19

Application number: US13244863

Filing date: 2011-09-26

Applicants: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong

Inventors: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong

IPC classification: G06F17/30

CPC classification: G06Q30/0246, G06F17/30864, Y10S707/99933

Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results; generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
