Document descriptor extraction method
    1.
    发明授权
    Document descriptor extraction method 有权
    文件描述提取方法

    公开(公告)号:US07080314B1

    公开(公告)日:2006-07-18

    申请号:US09595719

    申请日:2000-06-16

    IPC分类号: G06F15/00

    CPC分类号: G06F17/2247

    摘要: The present invention discloses a document descriptor extraction method and system. The document descriptor extraction method and system creates a document descriptor by generalizing input sequences within a document; factoring the input sequences and generalized input sequences; and selecting a document descriptor from the input sequences, generalized sequences, and factored sequences, preferably using minimum descriptor length (MDL) principles. Novel algorithms are employed to perform the generalizing, factoring, and selecting.

    摘要翻译: 本发明公开了一种文档描述符提取方法和系统。 文档描述符提取方法和系统通过对文档内的输入序列进行泛化来创建文档描述符; 分解输入序列和广义输入序列; 以及优选地使用最小描述符长度(MDL)原理从输入序列,广义序列和因子序列中选择文档描述符。 采用新颖的算法进行泛化,分解和选择。

    Determination of physical topology of a communication network
    2.
    发明授权
    Determination of physical topology of a communication network 失效
    确定通信网络的物理拓扑

    公开(公告)号:US06697338B1

    公开(公告)日:2004-02-24

    申请号:US09428419

    申请日:1999-10-28

    IPC分类号: H04L1228

    CPC分类号: H04L41/12

    摘要: Physical connectivity is determined between elements such as switches and routers in a multiple subnet communication network. Each element has one or more interfaces each of which is physically linked with an interface of another network element. Address sets are generated for each interface of the network elements, wherein members of a given address set correspond to network elements that can be reached from the corresponding interface for which the given address set was generated. The members of first address sets generated for corresponding interfaces of a given network element, are compared with the members of second address sets generated for corresponding interfaces of network elements other than the given element. A set of candidate connections between an interface of the given network element and one or more interfaces of other network elements, are determined. If more than one candidate connection is determined, connections with network elements that are in the same subnet as the given network element are eliminated from the set.

    摘要翻译: 在多个子网通信网络中的诸如交换机和路由器的元件之间确定物理连接性。 每个元件具有一个或多个接口,每个接口与另一个网络元件的接口物理连接。 为网络元件的每个接口生成地址集,其中给定地址集合的成员对应于可以从生成给定地址集的相应接口到达的网络元素。 将给定网元的相应接口生成的第一地址集的成员与为给定元素以外的网元的相应接口生成的第二地址集的成员进行比较。 确定给定网络元件的接口与其他网络元件的一个或多个接口之间的一组候选连接。 如果确定了多个候选连接,则与组中与网络元素位于与给定网络元素相同的子网中的连接被消除。

    System and method for serializing updates to ancestor sites in a distributed database
    4.
    发明授权
    System and method for serializing updates to ancestor sites in a distributed database 失效
    将更新序列化到分布式数据库中的祖先站点的系统和方法

    公开(公告)号:US06499037B1

    公开(公告)日:2002-12-24

    申请号:US09391991

    申请日:1999-09-09

    IPC分类号: G06F1730

    摘要: A system for, and method of, ensuring serialization of updates from a replica site in a distributed database that is described by a copy graph and a distributed database incorporating the system or the method. In one embodiment, the system includes: (1) a directed acyclic copy graph (DAG) creation module that identifies backedges in, and removes the backedges from, the copy graph to yield a DAG and (2) a propagation module, associated with the DAG creation module, that initially employs eager updating to propagate the updates along the backedges and thereafter employs lazy updating to propagate the updates along edges of the directed acyclic copy graph to ensure the serialization.

    摘要翻译: 由复制图和包含系统或方法的分布式数据库描述的分布式数据库中的副本站点确保序列化更新的系统和方法。 在一个实施例中,系统包括:(1)有向非循环复制图(DAG)创建模块,其识别复制图中的背后的和在其中移除的形式,以产生DAG,并且(2)传播模块,与 DAG创建模块,其最初采用热切更新来沿着备份传播更新,然后采用延迟更新来沿着有向非循环复制图的边缘传播更新以确保序列化。

    System and method for aging versions of data in a main memory database
    5.
    发明授权
    System and method for aging versions of data in a main memory database 失效
    主内存数据库中数据老化版本的系统和方法

    公开(公告)号:US6125371A

    公开(公告)日:2000-09-26

    申请号:US914744

    申请日:1997-08-19

    IPC分类号: G06F17/30

    摘要: For use with a database of data records stored in a memory, a system and method for increasing a memory capacity and a memory database employing the system or the method. The system includes: (1) a time stamping controller that assigns a time stamp to transactions to be performed on the database, the time stamp operates to preserve an order of the transactions, (2) a versioning controller that creates multiple versions of ones of the data records affected by the transactions that are update transactions and (3) an aging controller, which is associated with each of the time stamping and versioning controllers, that monitors a measurable characteristic of the memory and deletes ones of the multiple versions of the ones of the data records in response to the time stamp and the measurable characteristic thereby to increase memory capacity.

    摘要翻译: 用于存储在存储器中的数据记录的数据库,用于增加存储器容量的系统和方法以及采用该系统或方法的存储器数据库。 该系统包括:(1)时间戳控制器,为在数据库上执行的事务分配时间戳,时间戳操作以保持事务的顺序,(2)版本控制器,其创建多个版本的 由更新事务的事务影响的数据记录和(3)与每个时间戳和版本控制器相关联的老化控制器,其监视存储器的可测量特性并删除其中的一个版本 的数据记录,以响应于时间戳和可测量的特性,从而增加存储器容量。

    Method and apparatus for detecting and recovering from data corruption of a database via read logging
    6.
    发明授权
    Method and apparatus for detecting and recovering from data corruption of a database via read logging 有权
    用于通过读取记录来检测和恢复数据库的数据损坏的方法和装置

    公开(公告)号:US06449623B1

    公开(公告)日:2002-09-10

    申请号:US09207927

    申请日:1998-12-09

    IPC分类号: G06F1200

    摘要: A method of detecting and recovering from data corruption of a database is characterized by the step of logging information about reads of a database in memory to detect errors in data of the database, wherein said errors in data of said database arise from one of bad writes of data to the database, of erroneous input of data to the database by users and of logical errors in code of a transaction. The read logging method may be implemented in a plurality of database recovery models including a cache-recovery model, a prior state model a redo-transaction model and a delete transaction model. In the delete transaction model, it is assumed that logical information is not available to allow a redo of transactions after a possible error and the effects of transactions that read corrupted data are deleted from history and any data written by a transaction reading Ararat data is treated as corrupted.

    摘要翻译: 从数据库的数据损坏中检测和恢复的方法的特征在于记录关于数据库在存储器中的读取的信息的步骤,以检测数据库的数据中的错误,其中所述数据库的数据中的错误来自坏写入之一 的数据,数据由用户错误地输入到数据库以及事务代码中的逻辑错误。 读取记录方法可以在包括高速缓存恢复模型,先前状态模型,重做事务模型和删除事务模型的多个数据库恢复模型中实现。 在删除事务模型中,假设逻辑信息不可用于允许在可能的错误之后重做事务,并且从历史中删除读取损坏的数据的事务的影响,并且处理由读取Ararat数据的事务写入的任何数据被处理 被破坏。

    On-line reorganization in object-oriented databases
    7.
    发明授权
    On-line reorganization in object-oriented databases 失效
    面向对象数据库的在线重组

    公开(公告)号:US06343296B1

    公开(公告)日:2002-01-29

    申请号:US09389337

    申请日:1999-09-03

    IPC分类号: G06F1730

    摘要: An on-line reorganization method of an object-oriented database with physical references involves a novel fuzzy traversal of the database, or a partition thereof, to identify the approximate parents of all migrating objects. Where the entire database is traversed the process begins from its persistent root. For traversals of a partition the process begins from each object with a reference pointing to it from outside the partition. To facilitate the identification of these inter-partitional objects an External Reference Table (“ERT”) is maintained. During the fuzzy traversal all new inserted and deleted references are tracked in a Temporary Reference Table (“TRT”). After the fuzzy traversal is completed, for each migrating object, a lock is obtained on the identified approximate parents and on all new parents in which references to the object were inserted, as indicated by the TRT. Based on the information in the TRT, locks are released on all approximate parents whose references to the object have been deleted. The references to the migrating object in the remaining set of locked parents are updated, the object is relocated and the locks are released. Alternatively, each parent of a migrating object can be individually locked, updated and released.

    摘要翻译: 具有物理引用的面向对象数据库的在线重组方法涉及数据库或其分区的新颖的模糊遍历,以识别所有迁移对象的近似父母。 在整个数据库遍历的过程中,进程从其持久根开始。 对于遍历分区,进程从每个对象开始,引用从分区外部引用它。 为了便于识别这些跨部门对象,维护了外部参考表(“ERT”)。 在模糊遍历期间,所有新插入和删除的引用将在临时参考表(“TRT”)中进行跟踪。 模糊遍历完成后,对于每个迁移对象,在所标识的近似父母以及所有新父母上获取锁定,其中插入了对象的引用,如TRT所示。 根据TRT中的信息,在对对象的引用已被删除的所有近似父类上释放锁。 在剩余的一组锁定的父项中对迁移对象的引用被更新,对象被重新定位并且锁被释放。 或者,可以单独锁定,更新和释放迁移对象的每个父项。

    Method and apparatus for detecting and recovering from data corruption of a database via read prechecking and deferred maintenance of codewords
    9.
    发明授权
    Method and apparatus for detecting and recovering from data corruption of a database via read prechecking and deferred maintenance of codewords 有权
    用于通过读取预检和代码字的延迟维护来检测和恢复数据库的数据损坏的方法和装置

    公开(公告)号:US06374264B1

    公开(公告)日:2002-04-16

    申请号:US09207926

    申请日:1998-12-09

    IPC分类号: G06F1730

    摘要: A method of detecting and recovering from data corruption of a database is characterized by the step of protecting data of the database with codewords, one codeword for each region of the database; and verifying that a codeword matches associated data before the data is read from the database to prevent transaction-carried corruption. A deferred maintenance scheme is recommended for the codewords protecting the database such that the method of detecting and recovering from data corruption of a database may comprise the steps of protecting data of the database with codewords, one codeword for each region of the database; and asynchronously maintaining the codewords to improve concurrency of the database. Moreover, the database may be audited by using the codewords and noting them in a table and protecting regions of the database with latches. Once codeword values are computed and checked against noted values in memory, a flush can cause codewords from outstanding log records to be applied to the stored codeword table.

    摘要翻译: 一种从数据库的数据损坏中检测和恢复的方法的特征在于利用码字保护数据库的数据的步骤,数据库的每个区域的一个码字; 并且在从数据库读取数据之前验证码字匹配相关联的数据以防止交易携带的损坏。 对于保护数据库的代码字,建议使用延迟维护方案,以便检测和恢复数据库的数据损坏的方法可以包括以下代码字来保护数据库的数据的步骤,数据库的每个区域的一个代码字; 并且异步地维护码字以改善数据库的并发性。 此外,可以通过使用码字来审核数据库,并在表中注明数据库,并用锁存器保护数据库的区域。 一旦根据存储器中的标记值计算和检查码字值,则刷新可以使来自未完成的日志记录的码字被应用于存储的码字表。

    System and method for physically versioning data in a main memory
database
    10.
    发明授权
    System and method for physically versioning data in a main memory database 失效
    在主内存数据库中物理版本化数据的系统和方法

    公开(公告)号:US6122645A

    公开(公告)日:2000-09-19

    申请号:US2635

    申请日:1998-01-05

    IPC分类号: G06F17/30

    摘要: For use with a database of data records organized into components, the database stored in a memory, a processing system for, and method of, physically versioning the database. In one embodiment, the processing system includes: (1) a component copier that creates a physical copy of an original component to be affected by an update transaction to be applied to the database, and that causes pointers in nodes of the physical copy to point to other nodes in the physical copy, (2) a data updater, associated with the component copier, that applies the update transaction to the physical copy to create therefrom a new physical version, the original component remaining unaffected by the update transaction and (3) a pointer updater, associated with the data updated, that employs an atomic word write to revise a component pointer, associated with the database, to cause the pointer to point to the new physical version.

    摘要翻译: 为了与组织成组件的数据记录的数据库一起使用,存储在内存中的数据库,处理系统以及对数据库进行物理版本化的方法。 在一个实施例中,处理系统包括:(1)组件复印机,其创建将要被应用于数据库的更新事务影响的原始组件的物理副本,并且使物理副本的节点中的指针指向 到物理副本中的其他节点,(2)与组件复印机相关联的数据更新器,将更新事务应用于物理副本以从其创建新的物理版本,原始组件不受更新事务的影响,(3 )与更新的数据相关联的指针更新器,其使用原子字写入来修改与数据库相关联的组件指针,以使指针指向新的物理版本。