APPARATUS AND METHODS FOR CONCEPT-CENTRIC INFORMATION EXTRACTION
    21.
    发明申请
    APPARATUS AND METHODS FOR CONCEPT-CENTRIC INFORMATION EXTRACTION 审中-公开
    概念中心信息提取的装置和方法

    公开(公告)号:US20100241639A1

    公开(公告)日:2010-09-23

    申请号:US12408450

    申请日:2009-03-20

    IPC分类号: G06F17/30

    CPC分类号: G06F16/345 G06F16/313

    摘要: Disclosed are methods and apparatus for extracting (or annotating) structured information from web content. Web content of interest from a particular domain is represented as one or more tree instances having a plurality of branching nodes that each correspond to a web object such that the tree instances correspond to one or more structured data instances. The particular domain is associated with domain knowledge that includes one or more presentation rulesets that each specifies a particular structure for a set of data instances, a domain-specific concept labeler, one or more specified properties of the web objects in the tree instances, and a concept schema that specifies a representation of the data to be extracted from the web content. A structured data instance that conforms to the concept schema is extracted from the one or more tree instances based on the domain knowledge for the particular domain. Extraction of the structured data instances is accomplished by (i) using the domain-specific concept labeler to annotate a subset of nodes of the tree instances; and (ii) using a locally adaptive concept annotator to extract the structured data instances based on the annotated segments and the local properties associated with such annotated segments. The extracted structured data instance is stored as structured output records in a database.

    摘要翻译: 公开了从网页内容中提取(或注释)结构化信息的方法和装置。 来自特定域的感兴趣的Web内容被表示为具有多个分支节点的一个或多个树实例,每个分支节点对应于web对象,使得树实例对应于一个或多个结构化数据实例。 特定域与域知识相关联,其包括一个或多个呈现规则集,每个表示规则集指定一组数据实例的特定结构,特定于域的概念标签器,树实例中的web对象的一个​​或多个指定的属性,以及 一个概念模式,指定要从Web内容中提取的数据的表示。 基于特定域的域知识,从一个或多个树实例提取符合概念模式的结构化数据实例。 结构化数据实例的提取是通过(i)使用域特定概念标签器来注释树实例的节点的子集来实现的; 以及(ii)使用本地适应性概念注释器基于所注释的段和与这些注释段相关联的本地属性来提取结构化数据实例。 提取的结构化数据实例作为结构化输出记录存储在数据库中。

    DECENTRALIZED RECORD EXPIRY
    22.
    发明申请
    DECENTRALIZED RECORD EXPIRY 有权
    分散式记录过期

    公开(公告)号:US20090089313A1

    公开(公告)日:2009-04-02

    申请号:US11863902

    申请日:2007-09-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30306

    摘要: A technique is described that reduces the complexity and resource consumption associated with performing record expiry in a distributed database system. In accordance with the technique, a record is checked to see if it has expired only when it has been accessed for a read or a write. If at the time of a read a record is determined to have expired, then it is not served. If at the time of a write a record is determined to have expired, then the write is treated as an insertion of a new record, and steps are taken to treat the insertion consistently with regard to the previous expired version. A background process is used to delete records that have not been written to or actively deleted by a client after expiration.

    摘要翻译: 描述了一种降低与在分布式数据库系统中执行记录到期相关联的复杂性和资源消耗的技术。 根据该技术,检查记录以查看它是否仅在已被访问以进行读取或写入时才过期。 如果在阅读时确定记录已经过期,则不会提供记录。 如果在写入时确定记录已经过期,则写入被视为新记录的插入,并且采取步骤以一致的方式对先前的过期版本进行处理。 使用后台进程来删除客户端到期后尚未写入或主动删除的记录。

    Methods and apparatus for contextual schema mapping of source documents to target documents
    23.
    发明申请
    Methods and apparatus for contextual schema mapping of source documents to target documents 审中-公开
    将源文档的语境模式映射到目标文档的方法和装置

    公开(公告)号:US20080027930A1

    公开(公告)日:2008-01-31

    申请号:US11496271

    申请日:2006-07-31

    IPC分类号: G06F17/30

    CPC分类号: G06F16/20 G06F16/285

    摘要: Methods and apparatus are provided for improved schema mapping of source documents to target documents. A list of matches are generated between at least one source table and at least one target table. One or more of the matches are annotated with a logical condition providing a context in which the match applies. Matches can be annotated with a logical condition, for example, by generating a set of candidate view conditions, C, to be applied to the one or more source tables. A schema match algorithm can generate the list of matches. Candidate logical conditions can be identified, for example, by (i) creating a set of views for categorical attributes in the tables and adding a view for each partitioning of the attribute values; (ii) using a classifier built on target attribute values; or (iii) evaluating internal features of a source table.

    摘要翻译: 提供了方法和设备,用于改进源文档到目标文档的模式映射。 在至少一个源表和至少一个目标表之间生成匹配列表。 匹配中的一个或多个用提供匹配适用的上下文的逻辑条件进行注释。 匹配可以用逻辑条件来注释,例如,通过生成要应用于一个或多个源表的候选视图条件C的集合。 模式匹配算法可以生成匹配列表。 可以识别候选逻辑条件,例如,(i)为表中的分类属性创建一组视图,并为属性值的每个分区添加视图; (ii)使用基于目标属性值的分类器; 或(iii)评估源表的内部特征。

    Method for determining juxtaposition of physical components with use of RFID tags
    24.
    发明授权
    Method for determining juxtaposition of physical components with use of RFID tags 有权
    使用RFID标签确定物理组件并置的方法

    公开(公告)号:US06847856B1

    公开(公告)日:2005-01-25

    申请号:US10651740

    申请日:2003-08-29

    IPC分类号: G06K17/00 G08B13/181

    CPC分类号: G06K17/0029 G06K17/00

    摘要: Radio Frequency Identification (RFID) tags are used for automatically determining the connectivity or alignment between physical components, including, for example, connectivity of network cables and device ports, as well as alignment of components assembled by automated manufacturing systems. In one embodiment of the invention, accurate determinations of the physical three-dimensional locations of cables and equipment are employed to determine which cables are plugged into which device ports of which pieces of equipment. In another embodiment of the invention, multiple RFID tags are used to determine the appropriate alignment between components being assembled by an automated manufacturing system.

    摘要翻译: 射频识别(RFID)标签用于自动确定物理组件之间的连接或对准,包括例如网络电缆和设备端口的连接,以及由自动化制造系统组装的部件的对准。 在本发明的一个实施例中,使用电缆和设备的物理三维位置的精确确定来确定哪些电缆被插入到哪个设备的哪个设备端口中。 在本发明的另一个实施例中,使用多个RFID标签来确定由自动化制造系统组装的部件之间的适当对准。

    System and method for aging versions of data in a main memory database
    25.
    发明授权
    System and method for aging versions of data in a main memory database 失效
    主内存数据库中数据老化版本的系统和方法

    公开(公告)号:US6125371A

    公开(公告)日:2000-09-26

    申请号:US914744

    申请日:1997-08-19

    IPC分类号: G06F17/30

    摘要: For use with a database of data records stored in a memory, a system and method for increasing a memory capacity and a memory database employing the system or the method. The system includes: (1) a time stamping controller that assigns a time stamp to transactions to be performed on the database, the time stamp operates to preserve an order of the transactions, (2) a versioning controller that creates multiple versions of ones of the data records affected by the transactions that are update transactions and (3) an aging controller, which is associated with each of the time stamping and versioning controllers, that monitors a measurable characteristic of the memory and deletes ones of the multiple versions of the ones of the data records in response to the time stamp and the measurable characteristic thereby to increase memory capacity.

    摘要翻译: 用于存储在存储器中的数据记录的数据库,用于增加存储器容量的系统和方法以及采用该系统或方法的存储器数据库。 该系统包括:(1)时间戳控制器,为在数据库上执行的事务分配时间戳,时间戳操作以保持事务的顺序,(2)版本控制器,其创建多个版本的 由更新事务的事务影响的数据记录和(3)与每个时间戳和版本控制器相关联的老化控制器,其监视存储器的可测量特性并删除其中的一个版本 的数据记录,以响应于时间戳和可测量的特性,从而增加存储器容量。