System and method for linguistic collation
    22.
    发明授权
    System and method for linguistic collation 有权
    语言整理的系统和方法

    公开(公告)号:US07941311B2

    公开(公告)日:2011-05-10

    申请号:US10691424

    申请日:2003-10-22

    CPC classification number: G06F17/22

    Abstract: A system and method is provided for handling the collation of linguistic symbols of different languages that may have various types of compressions (e.g., from 2-to-1 to 8-to-1). A symbol table of the symbols identified as Unicode code points is generated, with each symbol tagged with a highest compression type of that symbol by sorting the compression tables of the various languages. During a sorting operation with respect to a given string, the tag of a symbol in the string is checked to identify the highest compression type of compressions beginning with that symbol, and the compression tables for the language with compression types equal or lower than the highest compression type of the symbol are searched using a binary search method to find a matching compression for the symbols in the string. A common search module is used to perform binary searches through compression tables of different compression types.

    Abstract translation: 提供了一种用于处理可能具有各种类型的按压(例如,从2到1到8对1)的不同语言的语言符号的对照的系统和方法。 生成标识为Unicode代码点的符号的符号表,每个符号通过排序各种语言的压缩表来标记该符号的最高压缩类型。 在对于给定的字符串的排序操作期间,检查字符串中的符号的标签以识别以该符号开始的最高压缩类型的压缩,以及压缩类型等于或低于最高的压缩类型的压缩表 使用二元搜索方法搜索符号的压缩类型,以找到字符串中符号的匹配压缩。 常用的搜索模块用于通过不同压缩类型的压缩表执行二进制搜索。

Patent Agency Ranking