Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
    1.
    发明授权
    Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance 失效
    用于构建紧凑型相似度结构并用于分析文档相关性的方法和装置

    公开(公告)号:US07949644B2

    公开(公告)日:2011-05-24

    申请号:US12152522

    申请日:2008-05-15

    CPC classification number: G06F17/30705 Y10S707/99935 Y10S707/99942

    Abstract: A computer-readable medium comprises data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The plurality of similarity-value entries are fewer than N2−N in number if the similarity values are asymmetric with regard to document pairing, and the plurality of similarity-value entries are fewer than N 2 - N 2 in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described.

    Abstract translation: 计算机可读介质包括用于提供关于N个文档对之间的相似性级别的信息的数据结构。 数据结构包括表示多对文档对象的相似度级的多个相似度条目。 每个相似度值表示给定对的一个文档相对于给定对的另一个文档的相似度级别。 每个条目的相似度值大于大于零的阈值相似度值。 如果相似性值对于文档配对是不对称的,则多个相似值条目数量少于数目中的N2-N,并且如果相似度值是相似度值,则多个相似值条目数量少于N 2 -N 2 对于文件配对。 描述了用于生成数据结构的方法和装置。

    Hybrid method for simulation optimization
    2.
    发明申请
    Hybrid method for simulation optimization 审中-公开
    混合方法进行模拟优化

    公开(公告)号:US20080312885A1

    公开(公告)日:2008-12-18

    申请号:US11811820

    申请日:2007-06-12

    CPC classification number: G06F17/11

    Abstract: A computer-implemented method of solving a system optimization problem having a plurality of parameters of unknown value is comprised of randomly generating sets of values for unknown parameters within an the optimization problem. A population of original candidate solutions is generated by applying an algorithm for deterministic optimization to each of the sets of values. The population of solutions is ranked. Additional candidate solutions are iteratively generated from at least certain of the solutions in the population. The validity of the additional candidate solutions is checked, and the valid additional candidate solutions are added to the population of solutions. The population of solutions is re-ranked and at least one solution from the population of solutions is output when a predetermined criterion is met whereby the values for the parameters in the output solution may be used for controlling a system.

    Abstract translation: 解决具有未知值的多个参数的系统优化问题的计算机实现的方法包括在优化问题内随机生成未知参数的值集合。 通过对每个值集合应用用于确定性优化的算法来生成原始候选解决方案的群体。 解决方案人数排名。 从群体中的至少某些解决方案迭代生成其他候选解决方案。 检查附加候选解决方案的有效性,并将有效的附加候选解决方案添加到解决方案群体中。 解决方案的群体被重新排序,并且当满足预定标准时输出来自解决方案群体的至少一个解决方案,由此输出解决方案中的参数的值可以用于控制系统。

    Method and apparatus for performing a semantically informed merge operation
    3.
    发明申请
    Method and apparatus for performing a semantically informed merge operation 审中-公开
    用于执行语义信息合并操作的方法和装置

    公开(公告)号:US20080294427A1

    公开(公告)日:2008-11-27

    申请号:US11802173

    申请日:2007-05-21

    CPC classification number: G06F17/2785

    Abstract: A method and apparatus for performing an informed semantic merge operation comprises selecting a source region in a document and a target region in the same or a different document. A bi-directionally coupled surface region is identified in the source region and a bi-directionally coupled surface region is identified in the target region. A first semantic object coupled to the surface region in the source region is identified and a second semantic object coupled to the surface region in the target region is identified. The subcomponents of the first semantic object are combined with the subcomponents of the second semantic object by merging.

    Abstract translation: 用于执行知情语义合并操作的方法和装置包括在相同或不同文档中选择文档中的源区域和目标区域。 在源区域中识别双向耦合表面区域,并且在目标区域中识别双向耦合的表面区域。 识别耦合到源区域中的表面区域的第一语义对象,并且识别耦合到目标区域中的表面区域的第二语义对象。 第一语义对象的子组件通过合并与第二语义对象的子组件组合。

    Method and apparatus for anchoring expressions based on an ontological model of semantic information
    4.
    发明申请
    Method and apparatus for anchoring expressions based on an ontological model of semantic information 审中-公开
    基于语义信息的本体论模型来锚定表达的方法和装置

    公开(公告)号:US20080294426A1

    公开(公告)日:2008-11-27

    申请号:US11802172

    申请日:2007-05-21

    CPC classification number: G06F17/2785

    Abstract: A method and apparatus for the recording and maintenance of semantic elements in electronically-held information objects provide for grounding semantic objects in an ontology, such that inheritance and other relations between concepts are preserved in persistent storage. The disclosed method and apparatus provide semantic document authors with a means to anchor concept references to specific, persistent, semantic objects, thereby providing the system with access to all properties of the underlying data model of the semantic objects being referenced, while also specifying the type and scope of their relations, as well as behavioral aspects of the visual and editing environment.

    Abstract translation: 用于记录和维护电子信息对象中的语义元素的方法和装置提供本体中的语义对象的接地,从而在持久存储器中保留概念之间的继承和其他关系。 所公开的方法和装置为语义文档作者提供了将概念引用锚定到特定的,持久的,语义对象的手段,从而为系统提供对被引用的语义对象的底层数据模型的所有属性的访问,同时还指定类型 他们的关系范围以及视觉和编辑环境的行为方面。

    Method and apparatus for document filtering using ensemble filters
    5.
    发明授权
    Method and apparatus for document filtering using ensemble filters 失效
    使用集成滤波器进行文档过滤的方法和装置

    公开(公告)号:US07398269B2

    公开(公告)日:2008-07-08

    申请号:US10713592

    申请日:2003-11-14

    Abstract: A technique for representing an information need and employing one or more filters to select documents that satisfy the represented information need, including a technique of creating filters that involves (a) dividing a set of documents into one or more subsets such that each subset can be used as the source of features for creating a filtering profile or used to set or validate the score threshold for the profile and (b) determining whether multiple profiles are required and how to combine them to create an effective filter. Multiple profiles can be incorporated into an individual filter and the individual filters combined to create an ensemble filter. Ensemble filters can then be further combined to create meta filters.

    Abstract translation: 用于表示信息的技术需要并采用一个或多个过滤器来选择满足所表示的信息的文档,包括创建过滤器的技术,该技术涉及(a)将一组文档划分成一个或多个子集,使得每个子集可以是 用作创建过滤配置文件或用于设置或验证配置文件的分数阈值的功能的来源,以及(b)确定是否需要多个配置文件,以及如何组合它们以创建有效的过滤器。 多个配置文件可以并入到单个过滤器中,并且各个过滤器组合以创建整体过滤器。 然后可以将组合过滤器进一步组合以创建元过滤器。

    Method and apparatus for performing semantically informed text operations
    6.
    发明申请
    Method and apparatus for performing semantically informed text operations 审中-公开
    用于执行语义信息文本操作的方法和装置

    公开(公告)号:US20080295013A1

    公开(公告)日:2008-11-27

    申请号:US11802171

    申请日:2007-05-21

    CPC classification number: G06F17/2785 G06F17/24

    Abstract: One example of a semantically informed text operation comprises selecting a source region of a document and determining if the source region has a surface region bi-directionally coupled to a semantic object. The coupled semantic object is identified as are the presentation(s) associated with the semantic object. A target region of the same or anther document is selected. Any of the presentations that cannot be expressed in the target region are eliminated to identify a set of remaining presentations. A set of semantic choices based on the remaining presentations is determined. One of the semantic choices is selected and executed in the target region.

    Abstract translation: 语义通知文本操作的一个示例包括选择文档的源区域并确定源区域是否具有双向耦合到语义对象的表面区域。 耦合语义对象被识别为与语义对象相关联的呈现。 选择相同或者其他文档的目标区域。 消除了在目标区域中无法表达的任何演示文稿,以确定一组剩余的演示文稿。 确定基于剩余呈现的一组语义选择。 在目标区域中选择并执行其中一个语义选择。

    Method and apparatus for the automated construction of models of activities from textual descriptions of the activities
    7.
    发明申请
    Method and apparatus for the automated construction of models of activities from textual descriptions of the activities 审中-公开
    从活动的文字描述中自动构建活动模式的方法和装置

    公开(公告)号:US20080294398A1

    公开(公告)日:2008-11-27

    申请号:US11807007

    申请日:2007-05-25

    CPC classification number: G06Q10/06

    Abstract: A method of automatically constructing a model of an activity from an unsupervised examination of a plurality of textual documents describing the activity is comprised of: extracting prototypical steps from the plurality of textual documents; sequencing the extracted steps; aligning the sequenced steps; and constructing the model based on the aligned steps. The model may take the form of a step vs. position matrix which identifies the prototypical steps that make up the activity and provides the probability of each step occupying each position within the activity. The model thus constitutes common sense knowledge that encodes the stereotypical steps of an activity and the stereotypical sequencing of the steps.

    Abstract translation: 一种从描述活动的多个文本文档的无监督检查自动构建活动模型的方法包括:从多个文本文档中提取原型步骤; 对提取的步骤进行排序; 调整排序步骤; 并基于对齐的步骤构建模型。 该模型可以采取步骤与位置矩阵的形式,其识别构成活动的原型步骤,并提供每个步骤占据活动内的每个位置的概率。 因此,该模型构成了常识知识,其编码活动的刻板步骤和步骤的刻板排序。

    Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
    8.
    发明授权
    Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance 失效
    用于构建紧凑型相似度结构并用于分析文档相关性的方法和装置

    公开(公告)号:US07472131B2

    公开(公告)日:2008-12-30

    申请号:US11298500

    申请日:2005-12-12

    CPC classification number: G06F17/30705 Y10S707/99935 Y10S707/99942

    Abstract: A computer-readable medium comprises data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The plurality of similarity-value entries are fewer than N2−N in number if the similarity values are asymmetric with regard to document pairing, and the plurality of similarity-value entries are fewer than N 2 - N 2 in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described.

    Abstract translation: 计算机可读介质包括用于提供关于N个文档对之间的相似性级别的信息的数据结构。 数据结构包括表示多对文档对象的相似度级的多个相似度条目。 每个相似度值表示给定对的一个文档相对于给定对的另一个文档的相似度级别。 每个条目的相似度值大于大于零的阈值相似度值。 如果相似度值对于文档配对是不对称的,则多个相似值条目数量少于数目中的N2-N,并且多个相似度值条目少于 N 2 - 如果相似度值对于文档配对,则数字中的 2 。 描述了用于生成数据结构的方法和装置。

      Methods and apparatus for interactive document clustering
      10.
      发明申请
      Methods and apparatus for interactive document clustering 审中-公开
      交互式文档聚类的方法和装置

      公开(公告)号:US20090287668A1

      公开(公告)日:2009-11-19

      申请号:US12153331

      申请日:2008-05-16

      CPC classification number: G06F16/355

      Abstract: A computer-based process is described for identifying clusters of documents that have some degree of similarity from among a set of documents that permits user interaction with the process. A plurality of seed candidate documents is identified. Candidate probes based upon the seed candidate documents are generated, and information regarding the candidate probes is displayed to a user. User input regarding the candidate probes is received, and a set of probes from which to form clusters of documents are defined based upon the user input regarding the candidate probes. A probe is selected and a cluster of documents is formed from among available documents not yet clustered using the probe. The process can be repeated to generate further clusters. The process can be implemented with a computer system, and associated programming instructions can be contained within a computer readable medium.

      Abstract translation: 描述了一种基于计算机的过程,用于识别与允许用户与过程交互的一组文档之间具有一定程度的相似性的文档簇。 识别多个种子候选文件。 生成基于种子候选文档的候选探针,并且向用户显示关于候选探针的信息。 接收关于候选探针的用户输入,并且基于关于候选探针的用户输入来定义用于形成文档簇的一组探测。 选择一个探针,并且从尚未使用探针聚类的可用文档中形成一组文档。 可以重复该过程以产生更多的聚类。 该过程可以用计算机系统实现,并且相关联的编程指令可以包含在计算机可读介质内。

    Patent Agency Ranking