CONTENT CLASSES FOR OBJECT STORAGE INDEXING SYSTEMS

    Publication No.: US20180357333A1

    Publication date: 2018-12-13

    Application No.: US15781164

    Filing date: 2016-02-17

    IPC classification: G06F17/30

    Abstract: In some examples, a system may use content classes to manage indexing of object data. A content class may include a set of one or more content properties. Each content property may include a name, an expression for extracting data, and an expression type. When object data is received, such as for indexing, the expression type of each content property may be compared with the data type of the received data. Based at least in part on determining that the expression type matches the data type, the system may extract a data value from the received data in accordance with the expression. The system may save the extracted data value to a data structure in association with the name of the content property, and may subsequently use the data value and the name of the content property when creating an index for the object data.
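
    The flow described in this abstract (compare each property's expression type with the data type, extract on a match, save the value under the property name) might be sketched as follows. This is an illustration only, not the patented implementation: `ContentProperty`, `extract_json_path`, `extract_properties`, the dotted-path expression syntax, and the sample data are all hypothetical.

```python
import json
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class ContentProperty:
    name: str             # label stored alongside the extracted value
    expression: str       # hypothetical dotted path, e.g. "patient.id"
    expression_type: str  # data type the expression applies to, e.g. "json"


def extract_json_path(document: Any, path: str) -> Any:
    """Walk a dotted path through nested dicts; return None if a key is missing."""
    current = document
    for key in path.split("."):
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def extract_properties(content_class, raw: str, data_type: str) -> dict:
    """Apply only the properties whose expression type matches the data type."""
    extracted = {}
    # Only JSON is handled in this sketch; other types yield no values.
    parsed = json.loads(raw) if data_type == "json" else None
    for prop in content_class:
        if prop.expression_type != data_type:
            continue  # type mismatch: this property does not apply
        value = extract_json_path(parsed, prop.expression)
        if value is not None:
            extracted[prop.name] = value  # saved under the property name
    return extracted


# A content class is modelled here as a plain list of content properties.
content_class = [
    ContentProperty("patient_id", "patient.id", "json"),
    ContentProperty("doctor", "/record/doctor", "xml"),  # skipped for JSON input
]
raw_object = '{"patient": {"id": 42, "name": "A. Example"}}'
index_fields = extract_properties(content_class, raw_object, "json")
print(index_fields)
```

    The resulting name/value pairs stand in for the data structure that the abstract says is later used when building the index.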

    Creating NoSQL database index for semi-structured data

    Publication No.: US09953102B2

    Publication date: 2018-04-24

    Application No.: US14599296

    Filing date: 2015-01-16

    IPC classification: G06F17/30

    Abstract: Semi-structured source data is preprocessed to obtain text partitions to be stored into a data table with a first combined primary key including a structure thread primary key and a sequence value primary key. The structure thread primary key identifies a structure thread that is segmented into several consecutive intervals according to a determined or predetermined sequence. An inverted index table, created for the preprocessed text partitions, includes a second combined primary key including the structure thread primary key and a keyword primary key. Corresponding to values of the primary keys in the second combined primary key, related text partition sequence IDs are recorded as index values of the inverted index table. Index values having a same keyword primary key value but different structure thread primary key values are located in different rows in the inverted index table. The present techniques improve query efficiency of database index and facilitate updating.
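
    The two combined keys in this abstract can be illustrated with in-memory dictionaries standing in for NoSQL tables. This is a rough sketch under stated assumptions: the thread names, the whitespace tokenizer, and the `lookup` helper are hypothetical and are not taken from the patent.

```python
from collections import defaultdict

# Data table: (structure thread, sequence value) -> text partition.
data_table = {
    ("doc/title", 0): "indexing semi structured data",
    ("doc/body", 0): "structured data needs an index",
    ("doc/body", 1): "an inverted index maps keywords to partitions",
}

# Inverted index: (structure thread, keyword) -> sequence IDs of partitions.
# The same keyword under two different threads lands in two different rows.
inverted_index = defaultdict(list)
for (thread, seq), text in sorted(data_table.items()):
    for keyword in sorted(set(text.split())):
        inverted_index[(thread, keyword)].append(seq)


def lookup(thread, keyword):
    """Return the text partitions of one structure thread that contain keyword."""
    sequence_ids = inverted_index.get((thread, keyword), [])
    return [data_table[(thread, seq)] for seq in sequence_ids]


print(lookup("doc/body", "index"))
print(lookup("doc/title", "data"))
```

    Because the thread is part of the key, a query scoped to one thread reads a single row, and an update to one thread leaves the rows of other threads untouched.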

    Hierarchical storage architecture using node ID ranges

    Publication No.: US09760652B2

    Publication date: 2017-09-12

    Application No.: US10870924

    Filing date: 2004-06-21

    IPC classification: G06F17/30

    CPC classification: G06F17/30911

    Abstract: A system and method for managing and storing logically grouped hierarchical data via physical block storage is provided. Logical groups of parsed XML node data forming node ID ranges are indexed by creating and inserting an index entry into a node ID range. Index entries indicate node ID range bounds for blocks in which nodes are stored. Consulting a node ID range index facilitates XML node traversal via logical links between nodes in different blocks. Additionally, physical links between nodes within a block allow for fast node traversal. Node update including insertion and deletion as well as document order based pre-fetch and XML document re-organization is also facilitated by this architecture.
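
    The role of a node ID range index, mapping a node ID to the block whose range contains it, can be sketched with a sorted list of upper bounds and a binary search. This is a simplified illustration: integer node IDs, the `blocks` layout, and the helpers `find_block` and `fetch_node` are assumptions made for the example, not details from the patent.

```python
import bisect

# Each block stores the nodes of one contiguous node ID range (bounds inclusive).
blocks = [
    {"low": 1, "high": 40, "nodes": {1: "<catalog>", 17: "<book>"}},
    {"low": 41, "high": 90, "nodes": {41: "<title>", 63: "<author>"}},
    {"low": 91, "high": 120, "nodes": {95: "<price>"}},
]

# Range index: one entry per block, ordered by the upper bound of its range.
range_index = [(block["high"], position) for position, block in enumerate(blocks)]
upper_bounds = [high for high, _ in range_index]


def find_block(node_id):
    """Return the block whose ID range covers node_id, or None if none does."""
    pos = bisect.bisect_left(upper_bounds, node_id)
    if pos == len(upper_bounds):
        return None  # node_id is beyond the last range
    block = blocks[range_index[pos][1]]
    return block if block["low"] <= node_id else None


def fetch_node(node_id):
    """Follow a logical link: locate the block via the index, then read the node."""
    block = find_block(node_id)
    return None if block is None else block["nodes"].get(node_id)


print(fetch_node(63))
print(fetch_node(121))
```

    Only cross-block lookups go through the index; within a block, the abstract describes direct physical links between nodes, which this sketch does not model.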

    HIGH PERFORMANCE MAP EDITOR FOR BUSINESS ANALYSTS

    Publication No.: US20170249569A1

    Publication date: 2017-08-31

    Application No.: US15596575

    Filing date: 2017-05-16

    Inventor: Kien K. Huynh

    IPC classification: G06Q10/06 G06F17/22 G06F17/30

    Abstract: At least one model synchronization map table is built that facilitates navigation between elements of business objects of a business object model and corresponding elements of an extensible markup language (XML) schema definition-based (XSD-based) model. The XSD-based model includes at least one XML schema and is generated from the business object model. The business object model and the XSD-based model are displayed in a dual-view editor. The dual-view editor enables a user to toggle between and independently edit the business object model and the XSD-based model. In response to independent edits made to one of the XSD-based model and the business object model within the dual-view editor, the elements of the business object model are automatically synchronized with the corresponding elements of the XSD-based model using the at least one model synchronization map table.

    Structured searching of dynamic structured document corpuses

    Publication No.: US09659055B2

    Publication date: 2017-05-23

    Application No.: US14612386

    Filing date: 2015-02-03

    Applicant: MModal IP LLC

    IPC classification: G06F17/30 G06F17/27

    Abstract: A system includes a document corpus containing structured documents, which contain both text and annotations of the text. The system also includes a search engine which is adapted to perform structured searches of the structured documents. As new types of annotations are added to the system, the search engine is updated automatically to become capable of performing structured searches for the new types of annotations. For example, if a new natural language processing (NLP) component, adapted to generate annotations of a new type, is added to the system, then the system automatically updates a query language to include a definition of the new type of annotation. The search engine may then immediately be capable of processing structured queries which refer to the new type of annotation.

    COLUMNAR DATA ARRANGEMENT FOR SEMI-STRUCTURED DATA

    Publication No.: US20170116273A1

    Publication date: 2017-04-27

    Application No.: US15078713

    Filing date: 2016-03-23

    IPC classification: G06F17/30

    Abstract: Techniques are provided for de-normalizing semi-structured hierarchical data into a virtual table. At least a portion of semi-structured data document collection is denormalized for improving the execution of queries that involves a traversal of the semi-structured data hierarchy of the semi-structured data document collection, in an embodiment. Based on the extracted schema of the semi-structured data, a de-normalized arrangement is generated, in which the hierarchical relationship of the semi-structured data is converted into a set of columns. The denormalized arrangement is materialized by applying the de-normalized arrangement onto the semi-structured data. The materialized arrangement, the virtual table, may be stored on a persistent storage or kept in volatile memory. The virtual table may be stored in one format on the persistent storage and in another format in the volatile memory. A received query that involves a traversal of the semi-structured data hierarchy is converted to a relational query that can be executed on the virtual table, in an embodiment. The execution of the relational query on the virtual table improves the performance in generating the resulting data set.
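
    Converting a hierarchy into a set of columns can be illustrated by flattening nested JSON-like documents into rows keyed by dotted column paths. This is a minimal sketch, not the patented technique: the `flatten` helper, the dotted column naming, and the sample `orders` data are hypothetical, and no schema extraction step is modelled.

```python
def flatten(doc, prefix=""):
    """Return flat rows for one document, one row per nested array element."""
    rows = [{}]
    for key, value in doc.items():
        path = f"{prefix}{key}"
        if isinstance(value, dict):
            sub_rows = flatten(value, path + ".")
        elif isinstance(value, list):
            sub_rows = []
            for item in value:
                if isinstance(item, dict):
                    sub_rows.extend(flatten(item, path + "."))
                else:
                    sub_rows.append({path: item})
            if not sub_rows:
                sub_rows = [{}]  # keep the parent row when an array is empty
        else:
            sub_rows = [{path: value}]
        # Combine every row built so far with every row of this field.
        rows = [{**row, **sub} for row in rows for sub in sub_rows]
    return rows


orders = [
    {"id": 1, "customer": {"name": "Ada"},
     "items": [{"sku": "A", "qty": 2}, {"sku": "B", "qty": 1}]},
    {"id": 2, "customer": {"name": "Bo"},
     "items": [{"sku": "A", "qty": 5}]},
]

# The materialized "virtual table": one flat row per order item.
virtual_table = [row for doc in orders for row in flatten(doc)]

# A hierarchical question ("total quantity of SKU A across all orders")
# becomes a simple filter and aggregate over flat columns.
total_a = sum(row["items.qty"] for row in virtual_table if row["items.sku"] == "A")
print(total_a)
```

    Once the rows are flat, the traversal cost is paid once at materialization time rather than on every query, which is the performance argument the abstract makes.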

    System and method for XML subdocument selection

    Type: Granted invention patent (in force)

    Publication No.: US09542502B2

    Publication date: 2017-01-10

    Application No.: US14479588

    Filing date: 2014-09-08

    IPC classification: G06F17/30

    CPC classification: G06F17/30911

    Abstract: Methods for XML subdocument selection and corresponding systems and computer-readable mediums. A method includes receiving a document having fragments with attribute/value pairs and receiving logical expressions that define relationships between fragments of the document. The method includes analyzing the logical expressions according to the document and creating an index based on the analysis that includes names of the fragments to be candidates for selection into subdocuments. The method includes extracting, from the document, all fragments named in the index and creating, in the index, an entry for each attribute/value pair. The method includes creating a plurality of subdocuments corresponding to the document and storing the subdocuments, including the respective related fragments.

    Verifiable trust for data through wrapper composition

    Type: Granted invention patent (in force)

    Publication No.: US09537650B2

    Publication date: 2017-01-03

    Application No.: US12832400

    Filing date: 2010-07-08

    Abstract: A digital escrow pattern for data services can include selective access for obscured data at a remote site or in a cloud service, distributing trust across multiple entities to avoid a single point of data compromise. Based on the pattern, a “trustworthy envelope” for any kind of payload enables curtained access through a variety of decorations or seals placed on the envelope that allow for a gamut of trust ranging with guarantees such as, but not limited to, confidentiality, privacy, anonymity, tamper detection, integrity, etc. Verifiable trust is provided through families of techniques that are referred to as wrapper composition. Multiple concentric and/or lateral transform wrappers or layers can wholly or partially transform data, metadata or both to mathematical transform (e.g., encrypt, distribute across storage, obscure) or otherwise introduce lack of visibility to some or all of the data, metadata or both.
