Method and apparatus for identifying synonyms and using synonyms to search
    1.
    发明授权
    Method and apparatus for identifying synonyms and using synonyms to search 有权
    用于识别同义词并使用同义词进行搜索的方法和装置

    公开(公告)号:US08392438B2

    公开(公告)日:2013-03-05

    申请号:US12863501

    申请日:2010-04-23

    IPC分类号: G06F7/00 G06F17/30

    摘要: A method and an apparatus for identifying synonym and utilizing such synonym to conduct search is disclosed. The disclosed method includes: obtaining arbitrary two words to be identified; determining whether a shortest edit distance between the two words less than or equal to an edit distance threshold; determining whether the two words to be identified exist in a preset knowledge database, and if an answer is yes then searching a smallest granularity type with highest weight value for each word in the knowledge database; and if the two word have the same smallest granularity type with highest weight value, then determining such two words are synonyms, or non-synonym otherwise. The disclosed techniques greatly improve accuracy of synonym identification and guarantee effect of synonym identification.

    摘要翻译: 公开了一种用于识别同义词并利用这种同义词进行搜索的方法和装置。 所公开的方法包括:获得待识别的任意两个字; 确定两个单词之间的最短编辑距离是否小于或等于编辑距离阈值; 确定要确定的两个词是否存在于预设知识数据库中,如果答案为是,则搜索知识数据库中每个单词的最大权重值的最小粒度类型; 并且如果这两个字具有最高重量值的相同最小粒度类型,则确定这两个字是同义词,否则确定为非同义词。 所公开的技术大大提高同义词识别的准确性并保证同义词识别的效果。

    Method and Apparatus for Identifying Synonyms and Using Synonyms to Search
    2.
    发明申请
    Method and Apparatus for Identifying Synonyms and Using Synonyms to Search 有权
    用于识别同义词并使用同义词进行搜索的方法和装置

    公开(公告)号:US20110047138A1

    公开(公告)日:2011-02-24

    申请号:US12863501

    申请日:2010-04-23

    IPC分类号: G06F17/30

    摘要: A method and an apparatus for identifying synonym and utilizing such synonym to conduct search is disclosed. The disclosed method includes: obtaining arbitrary two words to be identified; determining whether a shortest edit distance between the two words less than or equal to an edit distance threshold; determining whether the two words to be identified exist in a preset knowledge database, and if an answer is yes then searching a smallest granularity type with highest weight value for each word in the knowledge database; and if the two word have the same smallest granularity type with highest weight value, then determining such two words are synonyms, or non-synonym otherwise. The disclosed techniques greatly improve accuracy of synonym identification and guarantee effect of synonym identification.

    摘要翻译: 公开了一种用于识别同义词并利用这种同义词进行搜索的方法和装置。 所公开的方法包括:获得待识别的任意两个字; 确定两个单词之间的最短编辑距离是否小于或等于编辑距离阈值; 确定要确定的两个词是否存在于预设知识数据库中,如果答案为是,则搜索知识数据库中每个单词的最大权重值的最小粒度类型; 并且如果这两个字具有最高重量值的相同最小粒度类型,则确定这两个字是同义词,否则确定为非同义词。 所公开的技术大大提高同义词识别的准确性并保证同义词识别的效果。

    Method, apparatus and system, for rewriting search queries
    3.
    发明授权
    Method, apparatus and system, for rewriting search queries 有权
    方法,装置和系统,用于重写搜索查询

    公开(公告)号:US08880512B2

    公开(公告)日:2014-11-04

    申请号:US12863482

    申请日:2010-04-30

    IPC分类号: G06F7/00 G06F17/30

    摘要: A search system includes: a data rewriting system that obtains, from a database, one or more search term candidates that are relevant to a present search query. The data rewriting system retrieves properties of the present search query and the one or more search term candidates, where the properties describe respective matching results of the present search query and the one or more search term candidates. Based at least in part on the matching results, the data rewriting system determines whether or not the present search query needs to be rewritten, and rewrites the present search query based at least in part on the matching results to provide a rewritten present search query if it is determined that the present search query needs to be rewritten. A search engine performs a search based at least in part on the rewritten present search query.

    摘要翻译: 搜索系统包括:数据重写系统,其从数据库中获取与当前搜索查询相关的一个或多个搜索词候选者。 数据重写系统检索当前搜索查询和一个或多个搜索词候选的属性,其中属性描述当前搜索查询和一个或多个搜索词候选的各自的匹配结果。 至少部分地基于匹配结果,数据重写系统确定是否需要重写当前搜索查询,并且至少部分地基于匹配结果重写当前搜索查询,以提供重写的当前搜索查询,如果 确定需要重写当前搜索查询。 搜索引擎至少部分地基于重写的当前搜索查询来执行搜索。

    Search Method, Apparatus and System
    4.
    发明申请
    Search Method, Apparatus and System 有权
    搜索方法,仪器和系统

    公开(公告)号:US20110082860A1

    公开(公告)日:2011-04-07

    申请号:US12863482

    申请日:2010-04-30

    IPC分类号: G06F17/30

    摘要: The present disclosure describes a search method, a search apparatus and a search system. The method includes: a data rewriting system that obtains, from a database, one or more search term candidates that are relevant to a present search term. The data rewriting system retrieves properties of the present search term and the one or more search term candidates, where the properties describe respective matching results of the present search term and the one or more search term candidates. Based on the matching results, the data rewriting system determines whether or not the present search term needs to be rewritten, and rewrites the present search term based on the matching results to provide a rewritten present search term if it is determined that the present search term needs to be rewritten. A search engine performs a search based on the rewritten present search term. The disclosed method, apparatus and system avoid the approach of conducting a search based on fixed rules after the present search term is rewritten, thus reducing the probability of having ambiguity in the search process and improving the degree of search accuracy.

    摘要翻译: 本公开描述了搜索方法,搜索装置和搜索系统。 该方法包括:数据重写系统,其从数据库中获取与当前搜索项相关的一个或多个搜索词候选者。 数据重写系统检索当前搜索项和一个或多个搜索项候选的属性,其中属性描述了当前搜索项和一个或多个搜索项候选的各自的匹配结果。 基于匹配结果,数据重写系统确定当前搜索项是否需要重写,并且如果确定当前搜索项目,则基于匹配结果重写当前搜索项,以提供重写的当前搜索项 需要重写。 搜索引擎基于重写的当前搜索项来执行搜索。 所公开的方法,装置和系统避免了在当前搜索项被重写之后基于固定规则进行搜索的方法,从而降低了在搜索过程中具有模糊性并提高搜索精度的可能性。

    Ranking search results based on word weight
    5.
    发明授权
    Ranking search results based on word weight 有权
    按字重排列搜索结果

    公开(公告)号:US08856098B2

    公开(公告)日:2014-10-07

    申请号:US12804229

    申请日:2010-07-15

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30696 G06F17/30687

    摘要: Ranking search results, comprises retrieving search results that include target strings that relate to a query string; segmenting the query string and each of the target strings; pairing segments in the query string with respective segments in the target strings to form combinations; retrieving weights that correspond to the combinations; and determining a weighted word length based on the weights corresponding to each of the target strings; and ranking the target strings based on their respective weighted word lengths. Alternatively, ranking search results includes determining a minimum weight of each inserted word with respect to segments in the query string; determining a minimum weight of each deleted word with respect to segments in the target strings; determining a total edit distance for each target string; and ranking the target strings based on the total edit distances.

    摘要翻译: 排名搜索结果,包括检索包括与查询字符串相关的目标字符串的搜索结果; 分割查询字符串和每个目标字符串; 将查询字符串中的段与目标字符串中的各个段进行配对以形成组合; 检索对应于组合的权重; 以及基于与每个所述目标串相对应的权重来确定加权的字长; 并且基于它们各自的加权字长对目标字符串进行排序。 或者,排序搜索结果包括确定每个插入的单词相对于查询字符串中的段的最小权重; 确定每个删除的单词相对于目标字符串中的段的最小权重; 确定每个目标串的总编辑距离; 并根据总编辑距离对目标字符串进行排序。

    Method for generating search result and system for information search
    6.
    发明授权
    Method for generating search result and system for information search 有权
    生成搜索结果的方法和信息搜索系统

    公开(公告)号:US08849822B2

    公开(公告)日:2014-09-30

    申请号:US12863473

    申请日:2010-04-29

    IPC分类号: G06F7/00 G06F17/30

    摘要: The present disclosure discloses a method for generating a search result and an information search system. The method for generating a search result includes: receiving, by an information search system, a search request; obtaining, by searching, a plurality of pieces of matching information that match the search request; obtaining a respective amount of user response associated with each of the plurality of pieces of matching information and further obtaining a total amount of user response associated with a respective categories to which each of the plurality of pieces of matching information belongs; and ranking the plurality of pieces of information to generate a search result based on the total amount of user response associated with the respective category to which each of the plurality of pieces of matching information belongs. By using the above technical scheme, a result of more rational ranking of matching information can be displayed to a user when the user performs a search, thus improving experience of the user.

    摘要翻译: 本公开公开了一种用于生成搜索结果和信息搜索系统的方法。 用于生成搜索结果的方法包括:通过信息搜索系统接收搜索请求; 通过搜索获得与搜索请求匹配的多条匹配信息; 获得与所述多条匹配信息中的每一条相关联的相应量的用户响应,并进一步获得与所述多条匹配信息中的每一条所属于的相应类别相关联的用户响应的总量; 并且根据与多个匹配信息中的每个匹配信息所属的相应类别相关联的用户响应的总量,对多条信息进行排序以生成搜索结果。 通过使用上述技术方案,当用户执行搜索时,可以向用户显示匹配信息的更合理排序的结果,从而提高用户的体验。

    Ranking search results based on word weight
    7.
    发明申请
    Ranking search results based on word weight 有权
    按字重排列搜索结果

    公开(公告)号:US20110016111A1

    公开(公告)日:2011-01-20

    申请号:US12804229

    申请日:2010-07-15

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30696 G06F17/30687

    摘要: Ranking search results, comprises receiving a query string; retrieving a plurality of search results that include a corresponding plurality of target strings that relate to the query string; segmenting the query string and each of the plurality of target strings; pairing segments in the query string with respective segments in the target strings to form a plurality of combinations; retrieving a plurality of weights that correspond to the plurality of combinations based on a mapping of word combinations and their respective weights, wherein a weight measures semantic correlation between words in a word combination; and determining a weighted word length based on the weights corresponding to each of the plurality of target strings; and ranking the plurality of target strings based on their respective weighted word lengths. Alternatively, ranking search results includes determining a minimum weight of each inserted word with respect to segmented words in the query string; determining a minimum weight of each deleted word with respect to segmented words in the target strings; determining a total edit distance based at least in part on the minimum weight of each inserted word and the minimum weight of each deleted word; and ranking the target strings based on the total edit distances.

    摘要翻译: 排名搜索结果,包括接收查询字符串; 检索包括与所述查询串相关的对应的多个目标字符串的多个搜索结果; 分割查询字符串和多个目标字符串中的每一个; 将查询字符串中的段与目标字符串中的各个段进行配对以形成多个组合; 基于字组合及其相应权重的映射来检索与所述多个组合相对应的多个权重,其中权重测量单词组合中的单词之间的语义相关性; 以及基于与所述多个目标字符串中的每一个相对应的权重来确定加权字长; 并且基于它们各自的加权字长对多个目标字符串进行排序。 或者,排序搜索结果包括确定每个插入字相对于查询字符串中的分段字的最小权重; 确定每个被删除单词相对于目标字符串中的分段单词的最小权重; 至少部分地基于每个插入字的最小权重和每个被删除字的最小权重来确定总编辑距离; 并根据总编辑距离对目标字符串进行排序。

    Generating ranked search results using linear and nonlinear ranking models
    8.
    发明授权
    Generating ranked search results using linear and nonlinear ranking models 有权
    使用线性和非线性排名模型生成排名搜索结果

    公开(公告)号:US08346765B2

    公开(公告)日:2013-01-01

    申请号:US12802816

    申请日:2010-06-14

    IPC分类号: G06F17/30

    摘要: Generating ranked search results includes receiving a plurality of matching information items that match a search request, ranking at least some of the plurality of matching information items using a linear ranking model that linearly combines a first plurality of feature values to obtain a first set of ranked results, ranking at least some of the first set of ranked results using a nonlinear ranking model that nonlinearly combines a second plurality of feature values to obtain a second set of ranked results, and provide a search response based on the second set of ranked results.

    摘要翻译: 生成排名的搜索结果包括接收与搜索请求匹配的多个匹配信息项,使用线性组合第一多个特征值的线性排序模型来排列多个匹配信息项中的至少一些,以获得第一组排名 结果,使用非线性组合第二多个特征值以获得第二组排名结果的非线性排序模型来排列第一组排名结果中的至少一些,并且基于第二组排名结果提供搜索响应。

    Method for Generating Search Result and System for Information Search
    9.
    发明申请
    Method for Generating Search Result and System for Information Search 有权
    生成搜索结果和信息搜索系统的方法

    公开(公告)号:US20120047148A1

    公开(公告)日:2012-02-23

    申请号:US12863473

    申请日:2010-04-29

    IPC分类号: G06F17/30

    摘要: The present disclosure discloses a method for generating a search result and an information search system. The method for generating a search result includes: receiving, by an information search system, a search request; obtaining, by searching, a plurality of pieces of matching information that match the search request; obtaining a respective amount of user response associated with each of the plurality of pieces of matching information and further obtaining a total amount of user response associated with a respective categories to which each of the plurality of pieces of matching information belongs; and ranking the plurality of pieces of information to generate a search result based on the total amount of user response associated with the respective category to which each of the plurality of pieces of matching information belongs. By using the above technical scheme, a result of more rational ranking of matching information can be displayed to a user when the user performs a search, thus improving experience of the user.

    摘要翻译: 本公开公开了一种用于生成搜索结果和信息搜索系统的方法。 用于生成搜索结果的方法包括:通过信息搜索系统接收搜索请求; 通过搜索获得与搜索请求匹配的多条匹配信息; 获得与所述多条匹配信息中的每一条相关联的相应量的用户响应,并进一步获得与所述多条匹配信息中的每一条所属于的相应类别相关联的用户响应的总量; 并且根据与多个匹配信息中的每个匹配信息所属的相应类别相关联的用户响应的总量,对多条信息进行排序以生成搜索结果。 通过使用上述技术方案,当用户执行搜索时,可以向用户显示匹配信息的更合理排序的结果,从而提高用户的体验。

    Generating ranked search results using linear and nonlinear ranking models
    10.
    发明申请
    Generating ranked search results using linear and nonlinear ranking models 有权
    使用线性和非线性排名模型生成排名搜索结果

    公开(公告)号:US20100325105A1

    公开(公告)日:2010-12-23

    申请号:US12802816

    申请日:2010-06-14

    IPC分类号: G06F17/30

    摘要: Generating ranked search results includes receiving a plurality of matching information items that match a search request, ranking at least some of the plurality of matching information items using a linear ranking model that linearly combines a first plurality of feature values to obtain a first set of ranked results, ranking at least some of the first set of ranked results using a nonlinear ranking model that nonlinearly combines a second plurality of feature values to obtain a second set of ranked results, and provide a search response based on the second set of ranked results.

    摘要翻译: 生成排名的搜索结果包括接收与搜索请求匹配的多个匹配信息项,使用线性组合第一多个特征值的线性排序模型来排列多个匹配信息项中的至少一些,以获得第一组排名 结果,使用非线性组合第二多个特征值以获得第二组排名结果的非线性排序模型来排列第一组排名结果中的至少一些,并且基于第二组排名结果提供搜索响应。