Augmenting user, query, and document triplets using singular value decomposition
    82.
    发明授权
    Augmenting user, query, and document triplets using singular value decomposition 失效
    使用奇异值分解增强用户,查询和文档三元组

    公开(公告)号:US07747618B2

    公开(公告)日:2010-06-29

    申请号:US11222243

    申请日:2005-09-08

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30675 G06F17/30864

    摘要: A system for augmenting click-through data with latent information present in the click-through data for use in generating search results that are better tailored to the information needs of a user submitting a query is provided. The augmentation system creates a three-dimensional matrix with the dimensions of users, queries, and documents. The augmentation system then performs a three-order singular value decomposition of the three-dimensional matrix to generate a three-dimensional core singular value matrix and a left singular matrix for each dimension. The augmentation system finally multiplies the three-dimensional core singular value matrix by the left singular matrices to generate an augmented three-dimensional matrix that explicitly contains the information that was latent in the un-augmented three-dimensional matrix.

    摘要翻译: 提供了一种用于通过存在于点击数据中的潜在信息来增强点击数据的系统,用于生成针对提交查询的用户的信息需求更好地定制的搜索结果。 增强系统创建一个具有用户,查询和文档尺寸的三维矩阵。 然后,增强系统执行三维矩阵的三阶奇异值分解,以产生每个维度的三维核心奇异值矩阵和左奇异矩阵。 增强系统最终将三维核心奇异值矩阵乘以左奇异矩阵,以生成明确包含未增强三维矩阵中潜在信息的增强三维矩阵。

    Method and system for summarizing a document
    83.
    发明授权
    Method and system for summarizing a document 有权
    汇总文件的方法和系统

    公开(公告)号:US07698339B2

    公开(公告)日:2010-04-13

    申请号:US10918242

    申请日:2004-08-13

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30705 G06F17/30719

    摘要: A method and system for calculating the significance of a sentence within a document is provided. The summarization system calculates the significance of the sentences of a document and selects the most significant sentences as the summary of the document. The summarization system calculates the significance of a sentence based on the “important” words of the document that are contained within the sentence. The summarization system calculates the importance of words of the document using various scoring techniques and then combines the scores to classify a word as important or not important. The summarization system can then be used to identify significant sentences of the document based on the important words that a sentence contains and select significant sentences as a summary of the document.

    摘要翻译: 提供了一种用于计算文档中句子的重要性的方法和系统。 总结系统计算文档的句子的重要性,并选择最重要的句子作为文档的摘要。 总结系统根据文本中包含的“重要”字来计算句子的意义。 总结系统使用各种评分技术计算文档的单词的重要性,然后将分数组合成一个单词重要或不重要。 然后,总结系统可以用于基于句子包含的重要词语来识别文档的重要句子,并且将重要句子作为文档的摘要来选择。

    Method and system for adapting search results to personal information needs
    84.
    发明授权
    Method and system for adapting search results to personal information needs 失效
    将搜索结果适应个人信息需求的方法和系统

    公开(公告)号:US07630976B2

    公开(公告)日:2009-12-08

    申请号:US11125839

    申请日:2005-05-10

    IPC分类号: G06F17/30 G06Q30/00

    摘要: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.

    摘要翻译: 提供了一种用于将查询的搜索结果适应于提交查询的用户的信息需求的方法和系统。 搜索系统分析点击三胞胎,指示用户提交了查询,并且用户从查询的结果中选择了文档。 为了克服点击数据的大尺寸和稀疏性,搜索系统当呈现包括用户,查询和文档的输入三元组时,确定用户将通过平滑点击数据来重新找到输入文档的概率, 通过三胞胎。 然后,搜索系统基于其对输入用户的重要性的概率来订购结果的文档。

    Method and system for determining similarity of items based on similarity objects and their features
    85.
    发明授权
    Method and system for determining similarity of items based on similarity objects and their features 有权
    基于相似对象及其特征确定项目相似度的方法和系统

    公开(公告)号:US07533094B2

    公开(公告)日:2009-05-12

    申请号:US10997749

    申请日:2004-11-23

    IPC分类号: G06F17/30

    摘要: A method and system for determining similarity between items is provided. To calculate similarity scores for pairs of items, the similarity system initializes a similarity score for each pair of objects and each pair of features. The similarity system then iteratively calculates the similarity scores for each pair of objects based on the similar scores of the pairs of features calculated during a previous iteration and calculates the similarity scores for each pair of features based on the similarity scores of the pairs of objects calculated during a previous iteration. The similarity system implements an algorithm that is based on a recursive definition of the similarities between objects and between features. The similarity system continues the iterations of recalculating the similarity scores until the similarity scores converge on a solution.

    摘要翻译: 提供了一种用于确定项目之间的相似性的方法和系统。 为了计算物品对的相似性分数,相似系统初始化每对物体和每对特征的相似性得分。 然后,相似系统基于在先前迭代期间计算的特征对的类似得分迭代地计算每对对象的相似性得分,并且基于计算出的对象对的相似性得分来计算每对特征的相似性得分 在之前的迭代。 相似系统实现了一种基于对象之间和特征之间的相似性的递归定义的算法。 相似系统继续重新计算相似性分数的迭代,直到相似性得分收敛于解。

    Method and system for incrementally learning an adaptive subspace by optimizing the maximum margin criterion
    86.
    发明授权
    Method and system for incrementally learning an adaptive subspace by optimizing the maximum margin criterion 有权
    通过优化最大裕度标准来逐步学习自适应子空间的方法和系统

    公开(公告)号:US07502495B2

    公开(公告)日:2009-03-10

    申请号:US11070382

    申请日:2005-03-01

    IPC分类号: G06K9/00

    CPC分类号: G06K9/6234

    摘要: A method and system for generating a projection matrix for projecting data from a high dimensional space to a low dimensional space. The system establishes an objective function based on a maximum margin criterion matrix. The system then provides data samples that are in the high dimensional space and have a class. For each data sample, the system incrementally derives leading eigenvectors of the maximum margin criterion matrix based on the derivation of the leading eigenvectors of the last data sample. The derived eigenvectors compose the projection matrix, which can be used to project data samples in a high dimensional space into a low dimensional space.

    摘要翻译: 一种用于生成用于将数据从高维空间投影到低维空间的投影矩阵的方法和系统。 该系统基于最大裕度标准矩阵建立目标函数。 然后,系统提供处于高维空间并具有类的数据样本。 对于每个数据样本,系统基于最后数据样本的前导特征向量的推导,递增地导出最大边际准则矩阵的前导特征向量。 衍生的特征向量组成投影矩阵,可用于将高维空间中的数据样本投影到低维空间中。

    Clustering based text classification
    87.
    发明授权
    Clustering based text classification 有权
    基于聚类的文本分类

    公开(公告)号:US07366705B2

    公开(公告)日:2008-04-29

    申请号:US10921477

    申请日:2004-08-16

    CPC分类号: G06F17/3071

    摘要: Systems and methods for clustering-based text classification are described. In one aspect text is clustered as a function of labeled data to generate cluster(s). The text includes the labeled data and unlabeled data. Expanded labeled data is then generated as a function of the cluster(s). The expanded label data includes the labeled data and at least a portion of unlabeled data. Discriminative classifier(s) are then trained based on the expanded labeled data and remaining ones of the unlabeled data.

    摘要翻译: 描述了基于聚类的文本分类的系统和方法。 在一个方面,文本被聚类为标记数据的函数以生成集群。 该文本包括标记数据和未标记数据。 然后根据集群生成扩展标签数据。 扩展的标签数据包括标记的数据和至少一部分未标记的数据。 然后基于扩展的标记数据和剩余的未标记数据来训练鉴别分类器。

    Advertising keyword cross-selling
    88.
    发明申请
    Advertising keyword cross-selling 有权
    广告关键字交叉销售

    公开(公告)号:US20070143176A1

    公开(公告)日:2007-06-21

    申请号:US11300918

    申请日:2005-12-15

    IPC分类号: G06Q30/00

    摘要: Seed keywords are leveraged to provide expanded keywords that are then associated with relevant advertisers. Instances can also include locating potential advertisers based on the expanded keywords. Inverse lookup techniques are employed to determine which keywords are associated with an advertiser. Filtering can then be employed to eliminate inappropriate keywords for that advertiser. The keywords are then automatically revealed to the advertiser for consideration as relevant search terms for their advertisements. In this manner, revenue for a search engine and/or for an advertiser can be substantially enhanced through the automatic expansion of relevant search terms. Advertisers also benefit by having larger and more relevant search term selections automatically available to them, saving them both time and money.

    摘要翻译: 使用种子关键字来提供扩展的关键字,然后与相关的广告商相关联。 实例还可以包括根据扩展的关键字定位潜在的广告客户。 采用反向查找技术来确定哪些关键字与广告商相关联。 然后可以使用过滤来消除该广告客户的不合适的关键字。 然后,这些关键字会自动向广告客户显示,作为其广告的相关搜索字词。 以这种方式,可以通过自动扩展相关搜索词来大大增强搜索引擎和/或广告商的收入。 广告商也可以通过自动获得更大更多相关的搜索词选项来获益,从而节省时间和金钱。

    Augmenting user, query, and document triplets using singular value decomposition
    90.
    发明申请
    Augmenting user, query, and document triplets using singular value decomposition 失效
    使用奇异值分解增强用户,查询和文档三元组

    公开(公告)号:US20070055646A1

    公开(公告)日:2007-03-08

    申请号:US11222243

    申请日:2005-09-08

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30675 G06F17/30864

    摘要: A system for augmenting click-through data with latent information present in the click-through data for use in generating search results that are better tailored to the information needs of a user submitting a query is provided. The augmentation system creates a three-dimensional matrix with the dimensions of users, queries, and documents. The augmentation system then performs a three-order singular value decomposition of the three-dimensional matrix to generate a three-dimensional core singular value matrix and a left singular matrix for each dimension. The augmentation system finally multiplies the three-dimensional core singular value matrix by the left singular matrices to generate an augmented three-dimensional matrix that explicitly contains the information that was latent in the un-augmented three-dimensional matrix.

    摘要翻译: 提供了一种用于通过存在于点击数据中的潜在信息来增强点击数据的系统,用于生成针对提交查询的用户的信息需求更好地定制的搜索结果。 增强系统创建一个具有用户,查询和文档尺寸的三维矩阵。 然后,增强系统执行三维矩阵的三阶奇异值分解,以产生每个维度的三维核心奇异值矩阵和左奇异矩阵。 增强系统最终将三维核心奇异值矩阵乘以左奇异矩阵,以生成明确包含未增强三维矩阵中潜在信息的增强三维矩阵。