Efficient computation of document similarity
    1.
    Patent application
    Efficient computation of document similarity (status: granted)

    Publication number: US20080126335A1

    Publication date: 2008-05-29

    Application number: US11606213

    Filing date: 2006-11-29

    CPC classification number: G06F17/30622 Y10S707/99935 Y10S707/99943

    Abstract: Systems, methodologies, media, and other embodiments associated with efficiently computing document similarity are described. One exemplary system embodiment includes logic to produce a gram from a string and logic to identify candidate documents based on identifying matches between query grams and document grams stored in an inverted index that relates grams to documents. The example system may also include logic to selectively partially reconstruct a candidate document from entries in the inverted index and logic to compute an edit distance between a string associated with a query and a string associated with the partially reconstructed candidate document. The example system may also include a signal logic configured to provide a signal corresponding to the edit distance.
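
    The pipeline named in the abstract (grams, an inverted index, candidate identification, partial reconstruction, edit distance) can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the patent: the gram length of 3, character-level grams, the `min_shared` threshold, the `?` filler character, and all function names.

```python
from collections import defaultdict

GRAM_SIZE = 3  # assumed gram length; the abstract does not specify one


def grams(text, n=GRAM_SIZE):
    """Return (position, gram) pairs for every character n-gram in text."""
    return [(i, text[i:i + n]) for i in range(len(text) - n + 1)]


def build_index(docs):
    """Build an inverted index mapping each gram to (doc_id, position) postings."""
    index = defaultdict(list)
    for doc_id, text in docs.items():
        for pos, gram in grams(text):
            index[gram].append((doc_id, pos))
    return index


def find_candidates(index, query, min_shared=2):
    """Return ids of documents sharing at least min_shared distinct grams with the query."""
    shared = defaultdict(set)
    for _, gram in grams(query):
        for doc_id, _ in index.get(gram, ()):
            shared[doc_id].add(gram)
    return {doc_id for doc_id, found in shared.items() if len(found) >= min_shared}


def partial_reconstruct(index, doc_id, query, filler="?"):
    """Rebuild only the span of a document covered by grams that also occur in the query.

    Positions inside the span that no matching gram covers are filled with `filler`.
    """
    chars = {}
    for gram in {g for _, g in grams(query)}:
        for posting_doc, pos in index.get(gram, ()):
            if posting_doc == doc_id:
                for offset, ch in enumerate(gram):
                    chars[pos + offset] = ch
    if not chars:
        return ""
    first, last = min(chars), max(chars)
    return "".join(chars.get(i, filler) for i in range(first, last + 1))


def edit_distance(a, b):
    """Levenshtein distance computed one dynamic-programming row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]
```

    In this sketch the original document text is never consulted after indexing: a caller would build the index once, call `find_candidates` with the query string, and then compare the query against `partial_reconstruct(index, doc_id, query)` using `edit_distance` for each candidate.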

    Efficient computation of document similarity
    2.
    Granted patent
    Efficient computation of document similarity (status: granted)

    Publication number: US07610281B2

    Publication date: 2009-10-27

    Application number: US11606213

    Filing date: 2006-11-29

    CPC classification number: G06F17/30622 Y10S707/99935 Y10S707/99943

    Abstract: Systems, methodologies, media, and other embodiments associated with efficiently computing document similarity are described. One exemplary system embodiment includes logic to produce a gram from a string and logic to identify candidate documents based on identifying matches between query grams and document grams stored in an inverted index that relates grams to documents. The example system may also include logic to selectively partially reconstruct a candidate document from entries in the inverted index and logic to compute an edit distance between a string associated with a query and a string associated with the partially reconstructed candidate document. The example system may also include a signal logic configured to provide a signal corresponding to the edit distance.

    Word matching with context sensitive character to sound correlating
    3.
    Patent application
    Word matching with context sensitive character to sound correlating (status: under examination, published)

    Publication number: US20070150279A1

    Publication date: 2007-06-28

    Application number: US11318826

    Filing date: 2005-12-27

    CPC classification number: G10L13/08

    Abstract: Systems, methods, media, and other embodiments associated with word matching with context sensitive character to sound correlating are described. One exemplary method embodiment includes automatically generating context sensitive character to sound correlation rules, making the rules available to a query processing logic, converting words into sets of sounds using the rules, and storing a data entry linking the word and set of sounds in a data store searchable by the query processing logic.
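
    The idea of converting words into sets of sounds with context sensitive rules, then storing the word alongside its sounds for lookup, can be sketched as follows. This is an illustration under stated assumptions, not the method of the patent: the abstract describes rules that are generated automatically, whereas the tiny rule table here is hand-written, the only context considered is the next character, and the names `RULES`, `to_sounds`, `add_word`, and `lookup` are hypothetical.

```python
from collections import defaultdict

# Hypothetical hand-written rules: (character, following-character context, sound).
# A context of None means "any context"; more specific rules come first.
RULES = [
    ("c", "eiy", "S"),  # 'c' before e, i, or y sounds like S
    ("c", None, "K"),   # 'c' anywhere else sounds like K
]


def to_sounds(word, rules=RULES):
    """Convert a word to a list of sound symbols using the first matching rule per character."""
    word = word.lower()
    sounds = []
    for i, ch in enumerate(word):
        nxt = word[i + 1] if i + 1 < len(word) else ""
        for rule_ch, context, sound in rules:
            if rule_ch == ch and (context is None or (nxt and nxt in context)):
                sounds.append(sound)
                break
        else:
            # No rule matched: assume the character stands for its own sound.
            sounds.append(ch.upper())
    return sounds


def add_word(store, word):
    """Store a data entry linking the word to its sound sequence."""
    store[tuple(to_sounds(word))].add(word)


def lookup(store, query):
    """Return stored words whose sound sequence equals that of the query."""
    return store.get(tuple(to_sounds(query)), set())


store = defaultdict(set)
for known_word in ("city", "cat"):
    add_word(store, known_word)
```

    With this store, a misspelled query such as `sity` maps to the same sound sequence as `city`, so `lookup(store, "sity")` finds the stored word even though the spellings differ.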
