Classifying functions of web blocks based on linguistic features
    11.
    发明授权
    Classifying functions of web blocks based on linguistic features 有权
    基于语言特征分类网页功能

    公开(公告)号:US07895148B2

    公开(公告)日:2011-02-22

    申请号:US11742283

    申请日:2007-04-30

    IPC分类号: G06N5/00

    CPC分类号: G06Q10/10

    摘要: A classification system trains a classifier to classify blocks of the web page into various classifications of the function of the block. The classification system trains a classifier using training web pages. To train a classifier, the classification system identifies the blocks of the training web pages, generates feature vectors for the blocks that include a linguistic feature, and inputs classification labels for each block. The classification system learns the coefficients of the classifier using any of a variety of machine learning techniques. The classification system can then use the classifier to classify blocks of web pages.

    摘要翻译: 分类系统训练分类器将网页的块分类为块的功能的各种分类。 分类系统使用训练网页训练分类器。 为了训练分类器,分类系统识别训练网页的块,为包括语言特征的块生成特征向量,并为每个块输入分类标签。 分类系统使用各种机器学习技术中的任何一种学习分类器的系数。 然后,分类系统可以使用分类器对网页块进行分类。

    METHOD AND SYSTEM FOR WEB RESOURCE LOCATION CLASSIFICATION AND DETECTION
    12.
    发明申请
    METHOD AND SYSTEM FOR WEB RESOURCE LOCATION CLASSIFICATION AND DETECTION 有权
    网页资源位置分类与检测方法与系统

    公开(公告)号:US20100010945A1

    公开(公告)日:2010-01-14

    申请号:US12539555

    申请日:2009-08-11

    IPC分类号: G06F15/18 G06F15/16 G06N5/02

    摘要: A method and system for identifying locations associated with a web resource is provided. The location system identifies three different types of geographic locations: a provider location, a content location, and a serving location. A provider location identifies the geographic location of the entity that provides the web resource. A content location identifies the geographic location that is the subject of the web resource. A serving location identifies the geographic scope that the web page reaches. An application can select to use the type of location that is of particular interest.

    摘要翻译: 提供了一种用于识别与web资源相关联的位置的方法和系统。 位置系统识别三种不同类型的地理位置:提供者位置,内容位置和服务位置。 提供商位置标识提供网络资源的实体的地理位置。 内容位置标识作为Web资源主题的地理位置。 服务位置标识网页到达的地理范围。 应用程序可以选择使用特别感兴趣的位置类型。

    HYBRID LOCATION AND KEYWORD INDEX
    13.
    发明申请
    HYBRID LOCATION AND KEYWORD INDEX 有权
    混合位置和关键字索引

    公开(公告)号:US20090019066A1

    公开(公告)日:2009-01-15

    申请号:US12234563

    申请日:2008-09-19

    IPC分类号: G06F17/00 G06F17/30

    摘要: A method and system for generating a hybrid index for indexing objects based on location and keyword attributes and performing location-based searching is provided. A search system performs a location-based search using a hybrid index that indexes both location and keyword attributes of objects. The search system generates the hybrid index either using the location attribute as the primary index or the keyword attribute as the primary index. When the location attribute is the primary index, the keyword attribute is the secondary index, and vice versa. To generate the hybrid index, the search system identifies the values for the keyword and location attributes of each object. The search system generates the primary index to map each value of a first attribute to a secondary index. The search system thus generates, for each value of the first attribute, a secondary index to map values of a second attribute to objects that have the associated values of the first and second attributes. The search system then uses the hybrid index to perform location-based searching.

    摘要翻译: 提供了一种用于基于位置和关键字属性生成用于索引对象的混合索引并执行基于位置的搜索的方法和系统。 搜索系统使用索引对象的位置和关键字属性的混合索引来执行基于位置的搜索。 搜索系统使用location属性作为主索引或keyword属性作为主索引来生成混合索引。 当location属性是主索引时,关键字属性是辅助索引,反之亦然。 为了生成混合索引,搜索系统识别每个对象的关键字和位置属性的值。 搜索系统生成主索引以将第一个属性的每个值映射到次要索引。 因此,搜索系统为第一属性的每个值生成辅助索引,以将第二属性的值映射到具有第一和第二属性的相关联值的对象。 然后,搜索系统使用混合索引来执行基于位置的搜索。

    MINING GEOGRAPHIC KNOWLEDGE USING A LOCATION AWARE TOPIC MODEL
    14.
    发明申请
    MINING GEOGRAPHIC KNOWLEDGE USING A LOCATION AWARE TOPIC MODEL 有权
    采用地理位置主题模型挖掘地理知识

    公开(公告)号:US20080319974A1

    公开(公告)日:2008-12-25

    申请号:US11766716

    申请日:2007-06-21

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30241

    摘要: Mining geographic knowledge using a location aware topic model is provided. A location system estimates topics and locations associated with documents based on a location aware topic (“LAT”) model. The location system generates the model from a collection of documents that are labeled with their associated locations. The location system generates collection level parameters based on an LDA-style model. To generate the collection level parameters, the location system estimates probabilities of latent topics, locations, and words of the collection. After the model is generated, the location system uses the collection level parameters to estimate probabilities of topics and locations being associated with target documents.

    摘要翻译: 提供使用位置感知主题模型挖掘地理知识。 位置系统基于位置感知主题(“LAT”)模型估计与文档相关联的主题和位置。 位置系统从标有其关联位置的文档集合生成模型。 位置系统基于LDA样式模型生成收集级别参数。 为了生成收集级参数,定位系统估计集合的潜在主题,位置和单词的概率。 在模型生成之后,位置系统使用集合级参数来估计与目标文档相关联的主题和位置的概率。

    Hybrid location and keyword index
    15.
    发明授权
    Hybrid location and keyword index 有权
    混合位置和关键字索引

    公开(公告)号:US07444343B2

    公开(公告)日:2008-10-28

    申请号:US11278301

    申请日:2006-03-31

    IPC分类号: G06F17/00 G06F17/30

    摘要: A method and system for generating a hybrid index for indexing objects based on location and keyword attributes and performing location-based searching is provided. A search system performs a location-based search using a hybrid index that indexes both location and keyword attributes of objects. The search system generates the hybrid index either using the location attribute as the primary index or the keyword attribute as the primary index. When the location attribute is the primary index, the keyword attribute is the secondary index, and vice versa. To generate the hybrid index, the search system identifies the values for the keyword and location attributes of each object. The search system generates the primary index to map each value of a first attribute to a secondary index. The search system thus generates, for each value of the first attribute, a secondary index to map values of a second attribute to objects that have the associated values of the first and second attributes. The search system then uses the hybrid index to perform location-based searching.

    摘要翻译: 提供了一种用于基于位置和关键字属性生成用于索引对象的混合索引并执行基于位置的搜索的方法和系统。 搜索系统使用索引对象的位置和关键字属性的混合索引来执行基于位置的搜索。 搜索系统使用location属性作为主索引或keyword属性作为主索引来生成混合索引。 当location属性是主索引时,关键字属性是辅助索引,反之亦然。 为了生成混合索引,搜索系统识别每个对象的关键字和位置属性的值。 搜索系统生成主索引以将第一个属性的每个值映射到次要索引。 因此,搜索系统为第一属性的每个值生成辅助索引,以将第二属性的值映射到具有第一和第二属性的相关联值的对象。 然后,搜索系统使用混合索引来执行基于位置的搜索。

    SERVING LOCALLY RELEVANT ADVERTISEMENTS
    16.
    发明申请
    SERVING LOCALLY RELEVANT ADVERTISEMENTS 有权
    服务于当地相关广告

    公开(公告)号:US20080052413A1

    公开(公告)日:2008-02-28

    申请号:US11467771

    申请日:2006-08-28

    IPC分类号: G07G1/14 G06F15/16

    摘要: A method and system for providing location-based advertisements to requesting devices is provided. An advertisement system aggregates advertisements by collecting advertisements from multiple advertisement sources, extracting data from the collected advertisements, and storing the extracted data in a common format. After aggregating the advertisements, the advertisement system transforms each advertisement into multiple advertisement formats that are specific to protocols supported by the various device types. When the advertisement system receives queries for advertisements, it identifies matching advertisements and ranks them based on a location. The advertisement system then selects an advertisement format that is appropriate for the requesting device.

    摘要翻译: 提供了一种用于向请求设备提供基于位置的广告的方法和系统。 广告系统通过从多个广告源收集广告来聚合广告,从收集到的广告中提取数据,并以通用格式存储提取的数据。 广告系统在聚合广告之后,将每个广告变换为由各种设备类型支持的协议特定的多种广告格式。 当广告系统接收到广告的查询时,它识别匹配的广告并且基于位置对它们进行排名。 然后,广告系统选择适合于请求设备的广告格式。

    Document representation for scalable structure
    17.
    发明授权
    Document representation for scalable structure 有权
    可扩展结构的文档表示

    公开(公告)号:US07290006B2

    公开(公告)日:2007-10-30

    申请号:US10676518

    申请日:2003-09-30

    IPC分类号: G06F17/00

    摘要: An exemplary system includes a browser to browse a web page based on a web page definition having a slicing tree defining an arrangement of rectangular regions in the web page. The web page definition can include parametric data describing adaptability parameters associated with a rectangular region. A rendering module renders an adapted web page based on the web page definition, and a proxy module generates an intermediary adapted web page definition. A method includes rendering the web page according to a slicing tree and block property data in an associated web page definition. The method may include determining a set of unsummarized blocks that maximize information fidelity.

    摘要翻译: 示例性系统包括浏览器,其基于具有定义网页中的矩形区域布置的切片树的网页定义来浏览网页。 网页定义可以包括描述与矩形区域相关联的适应性参数的参数数据。 呈现模块基于网页定义呈现适应的网页,并且代理模块生成中介适配的网页定义。 一种方法包括根据切片树渲染网页并在相关网页定义中块块属性数据。 该方法可以包括确定使信息保真度最大化的一组未知块。

    GENERATING SEARCH RESULTS BASED ON DUPLICATE IMAGE DETECTION
    18.
    发明申请
    GENERATING SEARCH RESULTS BASED ON DUPLICATE IMAGE DETECTION 有权
    基于双重图像检测生成搜索结果

    公开(公告)号:US20070237426A1

    公开(公告)日:2007-10-11

    申请号:US11278575

    申请日:2006-04-04

    IPC分类号: G06K9/54 G06K9/46 G06F17/30

    CPC分类号: G06F17/30259 G06K9/6211

    摘要: A method and system for searching for content relating to a target or query image by identifying duplicate images with associated content is provided. An image search system identifies visual parts of objects within the target image based on analysis of two or more versions of the target image. The image search system identifies visual parts based on analysis of the versions. The image search system then identifies images of an image database that have visual parts that are similar to the visual parts of the target image. The image search system may rank the identified images based on their likelihood of being duplicates of the target image and provide their associated content as the search result ordered according to the ranking of the images.

    摘要翻译: 提供了一种用于通过识别具有相关内容的重复图像来搜索与目标或查询图像相关的内容的方法和系统。 图像搜索系统基于目标图像的两个或多个版本的分析来识别目标图像内的对象的可视部分。 图像搜索系统基于对版本的分析来识别视觉部分。 图像搜索系统然后识别具有与目标图像的视觉部分相似的视觉部分的图像数据库的图像。 图像搜索系统可以基于它们是目标图像的重复的可能性来对所识别的图像进行排序,并且根据图像的排序来提供其相关联的内容作为搜索结果排序。

    Detecting Serving Area of a Web Resource
    19.
    发明申请
    Detecting Serving Area of a Web Resource 有权
    检测Web资源的服务区域

    公开(公告)号:US20070233864A1

    公开(公告)日:2007-10-04

    申请号:US11277704

    申请日:2006-03-28

    IPC分类号: G06F15/16

    摘要: Methods and systems for determining the serving area of a web resource by address, by query content, and by business category are provided. A location system may determine the serving area of a web resource based on addresses of users who access the web resource. The location system may determine the serving area for a web site (or other web resource) based on query terms that resulted in a click-through to the web site. The location system may determine the serving area of a web site (or other web resource) based on the business category of the web site and a “provider location” associated with the web site.

    摘要翻译: 提供了通过地址,查询内容和业务类别来确定网络资源的服务区域的方法和系统。 位置系统可以基于访问网络资源的用户的地址来确定网络资源的服务区域。 位置系统可以基于导致到网站的点击的查询术语来确定网站(或其他网络资源)的服务区域。 位置系统可以基于网站的业务类别和与网站相关联的“提供商位置”来确定网站(或其他网络资源)的服务区域。

    GENERATING SEARCH REQUESTS FROM MULTIMODAL QUERIES
    20.
    发明申请
    GENERATING SEARCH REQUESTS FROM MULTIMODAL QUERIES 审中-公开
    生成多个查询的搜索请求

    公开(公告)号:US20120093371A1

    公开(公告)日:2012-04-19

    申请号:US13332248

    申请日:2011-12-20

    IPC分类号: G06K9/00 G06K9/54

    摘要: A method and system for generating a search request from a multimodal query that includes a query image and query text is provided. The multimodal query system identifies images of a collection that are textually related to the query image based on similarity between words associated with each image and the query text. The multimodal query system then selects those images of the identified images that are visually related to the query image. The multimodal query system may formulate a search request based on keywords of web pages that contain the selected images and submit that search request to a search engine service.

    摘要翻译: 提供了一种用于从包含查询图像和查询文本的多模式查询生成搜索请求的方法和系统。 多模式查询系统基于与每个图像相关联的单词与查询文本之间的相似度来识别与查询图像文本相关的集合的图像。 多模式查询系统然后选择与查询图像视觉相关的识别图像的那些图像。 多模式查询系统可以基于包含所选图像的网页的关键词来制定搜索请求,并将该搜索请求提交给搜索引擎服务。