Method and system for web resource location classification and detection
    71.
    发明授权
    Method and system for web resource location classification and detection 有权
    Web资源位置分类和检测方法与系统

    公开(公告)号:US08073789B2

    公开(公告)日:2011-12-06

    申请号:US12539555

    申请日:2009-08-11

    IPC分类号: G06N5/00

    摘要: A method and system for identifying locations associated with a web resource is provided. The location system identifies three different types of geographic locations: a provider location, a content location, and a serving location. A provider location identifies the geographic location of the entity that provides the web resource. A content location identifies the geographic location that is the subject of the web resource. A serving location identifies the geographic scope that the web page reaches. An application can select to use the type of location that is of particular interest.

    摘要翻译: 提供了一种用于识别与web资源相关联的位置的方法和系统。 位置系统识别三种不同类型的地理位置:提供者位置,内容位置和服务位置。 提供商位置标识提供网络资源的实体的地理位置。 内容位置标识作为Web资源主题的地理位置。 服务位置标识网页到达的地理范围。 应用程序可以选择使用特别感兴趣的位置类型。

    Object similarity search in high-dimensional vector spaces
    72.
    发明授权
    Object similarity search in high-dimensional vector spaces 有权
    高维向量空间中的对象相似度搜索

    公开(公告)号:US07941442B2

    公开(公告)日:2011-05-10

    申请号:US11737075

    申请日:2007-04-18

    IPC分类号: G06F7/00 G06F17/30

    摘要: An object search system generates a hierarchical clustering of objects of a collection based on similarity of the objects. The object search system generates a separate hierarchical clustering of objects for multiple features of the objects. To identify objects similar to a target object, the object search system first generates a feature vector for the target object. For each feature of the feature vector, the object search system uses the hierarchical clustering of objects to identify the cluster of objects that is most “feature similar” to that feature of the target object. The object search system indicates the similarity of each candidate object based on the features for which the candidate object is similar.

    摘要翻译: 对象搜索系统基于对象的相似性生成集合的对象的分层聚类。 对象搜索系统为对象的多个特征生成对象的单独分层聚类。 为了识别与目标对象类似的对象,对象搜索系统首先生成目标对象的特征向量。 对于特征向量的每个特征,对象搜索系统使用对象的分层聚类来识别与目标对象的特征最“特征相似”的对象簇。 对象搜索系统基于候选对象相似的特征来指示每个候选对象的相似性。

    Classifying functions of web blocks based on linguistic features
    73.
    发明授权
    Classifying functions of web blocks based on linguistic features 有权
    基于语言特征分类网页功能

    公开(公告)号:US07895148B2

    公开(公告)日:2011-02-22

    申请号:US11742283

    申请日:2007-04-30

    IPC分类号: G06N5/00

    CPC分类号: G06Q10/10

    摘要: A classification system trains a classifier to classify blocks of the web page into various classifications of the function of the block. The classification system trains a classifier using training web pages. To train a classifier, the classification system identifies the blocks of the training web pages, generates feature vectors for the blocks that include a linguistic feature, and inputs classification labels for each block. The classification system learns the coefficients of the classifier using any of a variety of machine learning techniques. The classification system can then use the classifier to classify blocks of web pages.

    摘要翻译: 分类系统训练分类器将网页的块分类为块的功能的各种分类。 分类系统使用训练网页训练分类器。 为了训练分类器,分类系统识别训练网页的块,为包括语言特征的块生成特征向量,并为每个块输入分类标签。 分类系统使用各种机器学习技术中的任何一种学习分类器的系数。 然后,分类系统可以使用分类器对网页块进行分类。

    Detecting Spatial Outliers in a Location Entity Dataset
    74.
    发明申请
    Detecting Spatial Outliers in a Location Entity Dataset 有权
    在位置实体数据集中检测空间异常值

    公开(公告)号:US20100179759A1

    公开(公告)日:2010-07-15

    申请号:US12353940

    申请日:2009-01-14

    IPC分类号: G06F17/00

    CPC分类号: G01S19/40

    摘要: Disclosed herein are one or more embodiments that arrange a plurality of location entities into a hierarchy of location descriptors. One or more of the disclosed embodiments may determine whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity. Also, the other location entities and the one location entity may share a location descriptor.

    摘要翻译: 这里公开了将多个位置实体排列成位置描述符的层次结构的一个或多个实施例。 所公开的实施例中的一个或多个可以至少部分地基于一个位置实体的预定距离内的一个或多个其他位置实体的存在来确定位置实体之一是否是空间异常值。 此外,其他位置实体和一个位置实体可以共享位置描述符。

    Making Friend and Location Recommendations Based on Location Similarities
    75.
    发明申请
    Making Friend and Location Recommendations Based on Location Similarities 审中-公开
    基于位置相似性建立朋友和位置建议

    公开(公告)号:US20100153292A1

    公开(公告)日:2010-06-17

    申请号:US12332371

    申请日:2008-12-11

    IPC分类号: G06Q99/00

    摘要: Method for making a recommendation to a first user in a computing network, including calculating one or more similarity scores between the first user and one or more remaining users in the network, identifying a portion of the remaining users having a highest similarity scores, identifying one or more locations visited by the portion of the remaining users but not by the first user, determining an interest level of the first user in each location, ranking the locations based on the interest levels, and displaying the locations based on the ranking as a first recommendation.

    摘要翻译: 一种用于向计算网络中的第一用户推荐的方法,包括计算所述第一用户与所述网络中的一个或多个剩余用户之间的一个或多个相似度得分,识别具有最高相似性得分的剩余用户的一部分, 或多个由剩余用户的部分而不是由第一用户访问的位置,确定每个位置中的第一用户的兴趣级别,基于兴趣级别对位置进行排名,并且基于排名将位置显示为第一 建议。

    SEARCH ENGINE ENHANCEMENT USING MINED IMPLICIT LINKS
    76.
    发明申请
    SEARCH ENGINE ENHANCEMENT USING MINED IMPLICIT LINKS 有权
    使用精简的隐含链接搜索引擎增强

    公开(公告)号:US20100023508A1

    公开(公告)日:2010-01-28

    申请号:US12505426

    申请日:2009-07-17

    IPC分类号: G06F17/30 G06F17/00

    摘要: An implicit links enhancement system and method for search engines that generates implicit links obtained from mining user access logs to facilitate enhanced local searching of web sites and intranets. Embodiments of the implicit links search enhancement system and method includes extracting implicit links by mining users' access patterns and then using a modified link analysis algorithm to re-rank search results obtained from traditional search engines. More specifically, embodiments of the method include extracting implicit links from a user access log, generating an implicit links graph from the extracted implicit links, and computing page rankings using the implicit links graph. The implicit links are extracted from the log using a two-item sequential pattern mining technique. Search results obtained from a search engine are re-ranked based on an implicit links analysis performed using an updated implicit links graph, a modified re-ranking formula, and at least one re-ranking technique.

    摘要翻译: 一种用于搜索引擎的隐式链接增强系统和方法,用于生成从挖掘用户访问日志中获取的隐含链接,以促进对网站和内部网的增强的本地搜索。 隐式链接搜索增强系统和方法的实施例包括通过挖掘用户的访问模式来提取隐含链接,然后使用经修改的链接分析算法重新排列从传统搜索引擎获得的搜索结果。 更具体地,该方法的实施例包括从用户访问日志提取隐含链接,从提取的隐式链接生成隐式链接图,以及使用隐式链接图计算页面排名。 使用两项顺序模式挖掘技术从日志中提取隐式链接。 基于使用更新的隐式链接图,修改的重新排列公式和至少一个重新排序技术执行的隐式链接分析,从搜索引擎获得的搜索结果被重新排序。

    METHOD AND SYSTEM FOR WEB RESOURCE LOCATION CLASSIFICATION AND DETECTION
    77.
    发明申请
    METHOD AND SYSTEM FOR WEB RESOURCE LOCATION CLASSIFICATION AND DETECTION 有权
    网页资源位置分类与检测方法与系统

    公开(公告)号:US20100010945A1

    公开(公告)日:2010-01-14

    申请号:US12539555

    申请日:2009-08-11

    IPC分类号: G06F15/18 G06F15/16 G06N5/02

    摘要: A method and system for identifying locations associated with a web resource is provided. The location system identifies three different types of geographic locations: a provider location, a content location, and a serving location. A provider location identifies the geographic location of the entity that provides the web resource. A content location identifies the geographic location that is the subject of the web resource. A serving location identifies the geographic scope that the web page reaches. An application can select to use the type of location that is of particular interest.

    摘要翻译: 提供了一种用于识别与web资源相关联的位置的方法和系统。 位置系统识别三种不同类型的地理位置:提供者位置,内容位置和服务位置。 提供商位置标识提供网络资源的实体的地理位置。 内容位置标识作为Web资源主题的地理位置。 服务位置标识网页到达的地理范围。 应用程序可以选择使用特别感兴趣的位置类型。

    RECOMMENDING CONTACTS IN A SOCIAL NETWORK
    78.
    发明申请
    RECOMMENDING CONTACTS IN A SOCIAL NETWORK 有权
    在社交网络中推荐的联系人

    公开(公告)号:US20090319466A1

    公开(公告)日:2009-12-24

    申请号:US12546630

    申请日:2009-08-24

    IPC分类号: G06N5/02

    摘要: A method and system for recommending potential contacts to a target user is provided. A recommendation system identifies users who are related to the target user through no more than a maximum degree of separation. The recommendation system identifies the users by starting with the contacts of the target user and identifying users who are contacts of the target user's contacts, contacts of those contacts, and so on. The recommendation system then ranks the identified users, who are potential contacts for the target user, based on a likelihood that the target user will want to have a direct relationship with the identified users. The recommendation system then presents to the target user a ranking of the users who have not been filtered out.

    摘要翻译: 提供了一种用于向目标用户推荐潜在联系人的方法和系统。 推荐系统通过不超过最大程度的分离来识别与目标用户相关的用户。 推荐系统通过从目标用户的联系人开始并识别与目标用户的联系人,联系人的联系人等的用户来识别用户。 然后,推荐系统根据目标用户想要与所识别的用户有直接关系的可能性,对所识别的用户排列谁是目标用户的潜在联系人。 然后,推荐系统向目标用户呈现未被滤除的用户的排名。

    Method and system for clustering using generalized sentence patterns
    79.
    发明授权
    Method and system for clustering using generalized sentence patterns 有权
    使用广义句型进行聚类的方法和系统

    公开(公告)号:US07584100B2

    公开(公告)日:2009-09-01

    申请号:US10880662

    申请日:2004-06-30

    摘要: A method and system for clustering documents based on generalized sentence patterns of the topics of the documents is provided. A generalized sentence patterns (“GSP”) system identifies a “sentence” that describes the topic of a document. To cluster documents, the GSP system generates a “generalized sentence” form of the sentence that describes the topic of each document. The generalized sentence is an abstraction of the words of the sentence. The GSP system identifies clusters of documents based on the patterns of their generalized sentences. The GSP system clusters documents when the generalized sentence representations of their topics have a similar pattern.

    摘要翻译: 提供了一种基于文档主题的广义句子模式对文档进行聚类的方法和系统。 广义句型(“GSP”)系统识别描述文档主题的“句子”。 为了集群文件,GSP系统生成描述每个文档主题的句子的“广义句子”形式。 广义句是对句子的单词的抽象。 GSP系统根据其广义句子的模式识别文档簇。 GSP系统在其主题的广义句子表示具有相似模式时对文档进行聚类。

    INDEXING LARGE-SCALE GPS TRACKS
    80.
    发明申请
    INDEXING LARGE-SCALE GPS TRACKS 有权
    引导大规模GPS跟踪

    公开(公告)号:US20090216787A1

    公开(公告)日:2009-08-27

    申请号:US12037263

    申请日:2008-02-26

    IPC分类号: G06F7/00 G06F17/30

    摘要: Described is a technology by which uploaded GPS data is indexed according to spatio-temporal relationships to facilitate efficient insertion and retrieval. The indexes may be converted to significantly smaller-sized data structures when new updates to that structure are not likely. GPS data is processed into a track of spatially-partitioned segments such that each segment has a cell. Each cell has an associated temporal index (a compressed start-end tree), into which data for that cell's segments are inserted. The temporal index may include an end time index that relates each segment's end time to a matching start time index. Given query input comprising a spatial predicate and a temporal predicate, tracks may be searched for by determining which spatial candidate cells may contain matching results. For each candidate cell, the search accesses the cell's associated temporal index to find any track or tracks that correspond to the temporal predicate.

    摘要翻译: 描述了一种根据时空关系对上传的GPS数据进行索引的技术,以便于有效的插入和检索。 当该结构的新更新不太可能时,索引可能会转换为显着更小的数据结构。 GPS数据被处理成空间分割的段的轨道,使得每个段具有一个单元。 每个单元都具有关联的时间索引(压缩的开始结束树),该单元格的段的数据被插入到该时间索引中。 时间索引可以包括将每个段的结束时间与匹配的开始时间索引相关联的结束时间索引。 给定包括空间谓词和时间谓词的查询输入,可以通过确定哪些空间候选小区可以包含匹配结果来搜索轨道。 对于每个候选小区,搜索访问小区的相关联的时间索引以找到与时间谓词相对应的任何轨道或​​轨道。