Method and apparatus for establishing relationship between documents
    1.
    发明授权
    Method and apparatus for establishing relationship between documents 有权
    建立文件关系的方法和装置

    公开(公告)号:US07809716B2

    公开(公告)日:2010-10-05

    申请号:US11740431

    申请日:2007-04-26

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30613

    摘要: The present invention is directed to a method and apparatus for establishing documents relationship based on user's operation upon search result. When a user uses search engine to search for documents with a query in repository, the search result may be a list of ranked documents, and these documents may contain a lot of relationship in term of the specific query. If the user clicks some search result further, and if the click and open operation meet certain conditions, for example exceed a period of time, the clicked document could be deemed as related to the search query. Furthermore it could be inferred that there is a strong relationship between different documents clicked by the user. The present invention records the relationship between documents and presents it to the user when necessary.

    摘要翻译: 本发明涉及一种用于基于用户对搜索结果的操作建立文档关系的方法和装置。 当用户使用搜索引擎在存储库中搜索具有查询的文档时,搜索结果可以是排名文档的列表,并且这些文档可以在特定查询方面包含很多关系。 如果用户进一步点击一些搜索结果,并且如果点击和打开操作满足某些条件,例如超过一段时间,则点击的文档可被视为与搜索查询相关。 此外,可以推断出用户点击的不同文档之间存在很强的关系。 本发明记录文件之间的关系,并在必要时将其呈现给用户。

    METHOD AND APPARATUS FOR ESTABLISHING RELATIONSHIP BETWEEN DOCUMENTS
    2.
    发明申请
    METHOD AND APPARATUS FOR ESTABLISHING RELATIONSHIP BETWEEN DOCUMENTS 有权
    建立文件之间的关系的方法和装置

    公开(公告)号:US20070299826A1

    公开(公告)日:2007-12-27

    申请号:US11740431

    申请日:2007-04-26

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30613

    摘要: The present invention is directed to a method and apparatus for establishing documents relationship based on user's operation upon search result. When a user uses search engine to search for documents with a query in repository, the search result may be a list of ranked documents, and these documents may contain a lot of relationship in term of the specific query. If the user clicks some search result further, and if the click and open operation meet certain conditions, for example exceed a period of time, the clicked document could be deemed as related to the search query. Furthermore it could be inferred that there is a strong relationship between different documents clicked by the user. The present invention records the relationship between documents and presents it to the user when necessary.

    摘要翻译: 本发明涉及一种用于基于用户对搜索结果的操作建立文档关系的方法和装置。 当用户使用搜索引擎在存储库中搜索具有查询的文档时,搜索结果可以是排列文档的列表,并且这些文档可以在特定查询方面包含很多关系。 如果用户进一步点击一些搜索结果,并且如果点击和打开操作满足某些条件,例如超过一段时间,则点击的文档可被视为与搜索查询相关。 此外,可以推断出用户点击的不同文档之间存在很强的关系。 本发明记录文件之间的关系,并在必要时将其呈现给用户。

    Method and apparatus for preprocessing a plurality of documents for search and for presenting search result
    3.
    发明授权
    Method and apparatus for preprocessing a plurality of documents for search and for presenting search result 有权
    用于预处理多个用于搜索的文档和用于呈现搜索结果的方法和装置

    公开(公告)号:US08838650B2

    公开(公告)日:2014-09-16

    申请号:US11847285

    申请日:2007-08-29

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30864

    摘要: A method and apparatus for preprocessing a plurality of documents for search and presenting search result and a system for searching documents that comprises these apparatuses. The search result, for example, includes at least one candidate document. The candidate document is assigned a tree structure representing its content. The tree structure includes at least one node. The method may include presenting at least a portion of the tree structure corresponded to the candidate document in the search result.

    摘要翻译: 一种用于预处理用于搜索和呈现搜索结果的多个文档的方法和装置,以及用于搜索包括这些装置的文档的系统。 搜索结果例如包括至少一个候选文档。 候选文件被分配一个表示其内容的树结构。 树结构包括至少一个节点。 该方法可以包括在搜索结果中呈现对应于候选文档的树结构的至少一部分。

    Index and method for extending and querying index
    4.
    发明授权
    Index and method for extending and querying index 失效
    扩展和查询索引的索引和方法

    公开(公告)号:US07689574B2

    公开(公告)日:2010-03-30

    申请号:US11562495

    申请日:2006-11-22

    IPC分类号: G06F17/00 G06F15/16 G06F3/00

    CPC分类号: G06F17/30622

    摘要: A method, system and program storage device are provided for extending an inverted index, which comprises first and second inverted index subfiles to increase the speed of establishing and updating inverted index files. The method includes performing ordered keyword indexing operations of generating an inverted index from data sources, in which a frequency of occurrence of keywords in each of the data sources is calculated, and writing each keyword, the data sources, and the frequency of occurrence of each keyword in the corresponding data sources to the inverted index. If a number of data sources involved in the indexing operations reaches a first threshold, then writing contents of the inverted index as a smallest grid into the first inverted index subfile. If a number of smallest grids in the first inverted index subfile reaches a second threshold, then merging the smallest grids into a merged grid and writing the merged grid into the second inverted index subfile. If the number of merged grids in the second inverted index subfile reaches a third threshold, then further merging the merged grids into a larger merged grid, and writing the larger merged grid back into the first inverted index subfile.

    摘要翻译: 提供了一种用于扩展反向索引的方法,系统和程序存储装置,其包括第一和第二反向索引子文件,以增加建立和更新反向索引文件的速度。 该方法包括执行从数据源生成反向索引的有序关键字索引操作,其中计算每个数据源中的关键字的发生频率,并且写入每个关键字,数据源和每个数据源的发生频率 关键字在相应的数据源中反转索引。 如果涉及索引操作的数据源数目达到第一阈值,则将反向索引的内容作为最小格网写入第一反向索引子文件中。 如果第一反向索引子文件中的最小格数达到第二阈值,则将最小网格合并到合并的网格中,并将合并的网格写入第二个反向索引子文件。 如果第二反向索引子文件中的合并网格数达到第三阈值,则将合并的网格进一步合并到较大的合并网格中,并将较大的合并网格写回第一个反向索引子文件。

    Search ranking method for file system and related search engine
    5.
    发明授权
    Search ranking method for file system and related search engine 有权
    搜索文件系统和相关搜索引擎的排名方法

    公开(公告)号:US07644069B2

    公开(公告)日:2010-01-05

    申请号:US11679379

    申请日:2007-02-27

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30106 G06F17/30112

    摘要: The present invention provides a search ranking method suitable for a file system, including receiving a query, calculating final relevance scores of individual file items with respect to the query at least partially in accordance with energy scores of individual nodes on a current file system energy tree, and outputting a list of search results based on the final relevance scores. The file system energy tree is updated in response to an operation on the file system performed by a user, wherein the file system energy tree has a tree structure corresponding to that of the file system, and the individual nodes thereof respectively corresponds to the individual file items in the file system

    摘要翻译: 本发明提供一种适用于文件系统的搜索排序方法,包括接收查询,至少部分地根据当前文件系统能量树上各个节点的能量分数来计算关于查询的各个文件的最终相关性分数 并且基于最终相关性得分输出搜索结果的列表。 响应于由用户执行的对文件系统的操作来更新文件系统能量树,其中文件系统能量树具有与文件系统的树结构对应的树结构,并且其各个节点分别对应于单个文件 文件系统中的项目

    Search Ranking Method for File System and Related Search Engine
    6.
    发明申请
    Search Ranking Method for File System and Related Search Engine 有权
    文件系统和相关搜索引擎的搜索排名方法

    公开(公告)号:US20070276807A1

    公开(公告)日:2007-11-29

    申请号:US11679379

    申请日:2007-02-27

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30106 G06F17/30112

    摘要: The present invention provides a search ranking method suitable for a file system, comprising: receiving a query; calculating final relevance scores of individual file items with respect to the query at least partially in accordance with energy scores of individual nodes on a current file system energy tree, and outputting a list of search results based on the final relevance scores; and updating the file system energy tree in response to an operation on the file system performed by a user, wherein the file system energy tree has a tree structure corresponding to that of the file system, and the individual nodes thereof respectively corresponds to the individual file items in the file system. The present invention also provides a corresponding file system search engine and computer program product. With the present invention, files and file folders that the user is interested in are usually arranged in relatively higher positions of the list of search results in file system search. Moreover, with the increase in the user's clicks on the file, the list of search results can be dynamically adapted to changes in the user's interest or preference.

    摘要翻译: 本发明提供一种适用于文件系统的搜索排序方法,包括:接收查询; 至少部分地根据当前文件系统能量树上的各个节点的能量分数来计算关于查询的单个文件的最终相关性分数,并且基于最终相关性得分输出搜索结果的列表; 以及响应于由用户执行的对所述文件系统的操作来更新所述文件系统能量树,其中所述文件系统能量树具有对应于所述文件系统的树结构,并且其各个节点分别对应于所述单个文件 文件系统中的项目。 本发明还提供了相应的文件系统搜索引擎和计算机程序产品。 利用本发明,用户感兴趣的文件和文件夹通常被布置在文件系统搜索中的搜索结果列表的相对较高的位置。 此外,随着用户对文件的点击的增加,搜索结果的列表可以动态地适应用户兴趣或偏好的变化。

    Method and apparatus of correcting chemical names
    7.
    发明申请
    Method and apparatus of correcting chemical names 审中-公开
    化学名称校正方法和装置

    公开(公告)号:US20110082844A1

    公开(公告)日:2011-04-07

    申请号:US12924541

    申请日:2010-09-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/273

    摘要: A computer-implemented method and system for checking a chemical name. The method tokenizes the chemical name to obtain corresponding tokens; checks the chemical name according to the chemical association between chemical compositions represented by the tokens; and if the chemical name does not pass the check, replaces at least part of tokens of the chemical name that does not pass the check, and repeats the checking step. The system and method can not only help users to find and correct errors in spelling a chemical name but also check the entire chemical name at the level of chemical associations. Hence, not only chemical names that are incorrectly spelled but also ones that do not conform to chemical rules can be found, and significant help is provided to users for correcting chemical names.

    摘要翻译: 用于检查化学名称的计算机实现的方法和系统。 该方法标记化学名称以获得相应的令牌; 根据由标记代表的化学成分之间的化学关系检查化学名称; 如果化学名称未通过检查,则至少替换部分不通过检查的化学名称的令牌,并重复检查步骤。 系统和方法不仅可以帮助用户查找和纠正化学名称中的错误,还可以在化学关联级别检查整个化学名称。 因此,不仅可以找到不正确拼写的化学名称,还可以找到不符合化学规定的化学名称,并向用户提供对化学名称进行校正的重要帮助。

    TAGGING METHOD AND APPARATUS BASED ON STRUCTURED DATA SET
    8.
    发明申请
    TAGGING METHOD AND APPARATUS BASED ON STRUCTURED DATA SET 有权
    基于结构化数据集的标记方法和设备

    公开(公告)号:US20110078206A1

    公开(公告)日:2011-03-31

    申请号:US12860112

    申请日:2010-08-20

    IPC分类号: G06F17/30

    摘要: A tagging method and apparatus, including computer program products, based on a structured data set are provided, the tagging method comprising: creating classification models for respective nodes in the structured data set of an event; acquiring public opinions on the event; and tagging the opinions to corresponding nodes of the structured data set using the created classification models. The tagging method and apparatus of the present disclosure are able to provide well-ordered, focused public opinions for each event to users, and to exhibit the evolution of the public opinions along with time.

    摘要翻译: 提供了一种基于结构化数据集的包括计算机程序产品的标记方法和装置,所述标记方法包括:为事件的结构化数据集中的各个节点创建分类模型; 收集有关事件的意见; 并使用创建的分类模型将意见标记到结构化数据集的相应节点。 本公开的标签方法和装置能够为用户提供针对每个事件的良好有序的集中的舆论,并且随着时间展现舆论的演变。

    Tagging method and apparatus based on structured data set
    9.
    发明授权
    Tagging method and apparatus based on structured data set 有权
    基于结构化数据集的标记方法和装置

    公开(公告)号:US08868609B2

    公开(公告)日:2014-10-21

    申请号:US12860112

    申请日:2010-08-20

    IPC分类号: G06F17/30

    摘要: Tagging methods and apparatus, including computer program products, based on a structured data set. Classification models are created for respective nodes in the structured data set of an event. Public opinions on the event are acquired. The opinions are tagged to corresponding nodes of the structured data set using the created classification models. The tagging methods and apparatus provide well-ordered, focused public opinions for each event to users, and exhibit the evolution of the public opinions along with time.

    摘要翻译: 基于结构化数据集的标记方法和设备,包括计算机程序产品。 为事件的结构化数据集中的各个节点创建分类模型。 有关事件的舆论获得。 使用创建的分类模型将意见标记到结构化数据集的相应节点。 标签方法和设备为用户提供了每一个事件的有序,重点的舆论,随着时间的推移呈现出舆论的演变。

    Processing geographical location data in a document
    10.
    发明授权
    Processing geographical location data in a document 有权
    处理文档中的地理位置数据

    公开(公告)号:US08589780B2

    公开(公告)日:2013-11-19

    申请号:US13277405

    申请日:2011-10-20

    IPC分类号: G06F17/00

    CPC分类号: G09B29/106

    摘要: Techniques for processing geographical location data in a document comprise: obtaining geographical location data in the document; grading the geographical location data according to a predetermined condition to determine an associated relationship between the geographical location data; marking on an electronic map the associated relationship between the geographical location data; and presenting the marked electronic map.

    摘要翻译: 用于处理文档中的地理位置数据的技术包括:获得文档中的地理位置数据; 根据预定条件对地理位置数据进行分级,以确定地理位置数据之间的关联关系; 在电子地图上标记地理位置数据之间的相关关系; 并提供标记的电子地图。