Data record compression with progressive and/or selective decomposition

    公开(公告)号:US09710517B2

    公开(公告)日:2017-07-18

    申请号:US14703622

    申请日:2015-05-04

    申请人: QBASE, LLC

    IPC分类号: G06F17/30

    摘要: Disclosed herein are systems and methods for compressing structured or semi-structured data in a horizontal manner achieving compression ratios similar to vertical compression. Collections include structured or semi-structured data include a number of fields and are described using a schema. Fields include information having semantic similarity and are compressed using methods suitable for compressing the type of data. Data of a collection is compressed after fragmentation or may be normalized prior to compression. Data with semantic similarity is compressed using token tables and/or n-gram tables, where higher weighted, consisting of the product of frequency and length, occurring values may be stored in the lower numbered indices of the data table. Records include record descriptor bytes, field descriptor bytes, zero or more array descriptor bytes, zero or more object descriptor bytes, or bytes representing the data associated with the record. Data is indexed or compressed by a suitable module.

    Event detection through text analysis using trained event template models
    3.
    发明授权
    Event detection through text analysis using trained event template models 有权
    通过使用经过训练的事件模板模型进行文本分析的事件检测

    公开(公告)号:US09177254B2

    公开(公告)日:2015-11-03

    申请号:US14558300

    申请日:2014-12-02

    申请人: QBASE, LLC

    摘要: A system and method for detecting events based on input data from a plurality of sources. The system may receive input from a plurality of sources containing information about possible events. A method for event detection involves pre-processing and normalizing a data input from a plurality of sources, extracting and disambiguating events and entities, associate event and entities, correlate events and entities associated from a data input to results from a different data sources to determine if an event has occurred, and store the detected events in a data storage.

    摘要翻译: 一种用于基于来自多个源的输入数据来检测事件的系统和方法。 系统可以从包含关于可能事件的信息的多个源接收输入。 一种用于事件检测的方法涉及预处理和归一化来自多个源的数据输入,提取和消除事物和实体的关联,关联事件和实体,将与数据输入相关联的事件和实体与来自不同数据源的结果相关联以确定 如果事件已经发生,并将检测到的事件存储在数据存储器中。

    PLUGGABLE ARCHITECTURE FOR EMBEDDING ANALYTICS IN CLUSTERED IN-MEMORY DATABASES
    4.
    发明申请
    PLUGGABLE ARCHITECTURE FOR EMBEDDING ANALYTICS IN CLUSTERED IN-MEMORY DATABASES 有权
    嵌入式内存数据库嵌入式分析的可扩展架构

    公开(公告)号:US20150154283A1

    公开(公告)日:2015-06-04

    申请号:US14558055

    申请日:2014-12-02

    申请人: QBASE, LLC

    IPC分类号: G06F17/30

    摘要: Disclosed are pluggable, distributed computing-system architectures allowing for embedding analytics to be added or removed from nodes of a system hosting an in-memory database. The disclosed system includes an API that may be used to create customized, application specific analytics modules. The newly created analytics modules may be easily plugged into the in-memory database. Each user query submitted to the in-memory database may specify different analytics be applied with differing parameters. All analytics modules operate on the in-memory image of the data, inside the in-memory database platform. All the analytics modules, may be capable of performing on-the-fly analytics, which may allow a dynamic and comprehensive processing of search results.

    摘要翻译: 公开了可插拔的分布式计算系统架构,允许嵌入分析以从托管内存数据库的系统的节点添加或删除。 所公开的系统包括可以用于创建定制的,特定于应用的分析模块的API。 新创建的分析模块可以轻松地插入到内存数据库中。 提交到内存数据库的每个用户查询可以指定使用不同参数应用不同的分析。 所有分析模块都在内存数据库平台内部的数据内存映像中进行操作。 所有分析模块可能能够执行即时分析,这可能允许对搜索结果进行动态和全面的处理。

    Data record compression with progressive and/or selective decomposition
    5.
    发明授权
    Data record compression with progressive and/or selective decomposition 有权
    使用渐进和/或选择性分解的数据记录压缩

    公开(公告)号:US09025892B1

    公开(公告)日:2015-05-05

    申请号:US14557900

    申请日:2014-12-02

    申请人: QBASE, LLC

    摘要: Disclosed herein are systems and methods for compressing structured or semi-structured data in a horizontal manner achieving compression ratios similar to vertical compression. Collections include structured or semi-structured data include a number of fields and are described using a schema. Fields include information having semantic similarity and are compressed using methods suitable for compressing the type of data. Data of a collection is compressed after fragmentation or may be normalized prior to compression. Data with semantic similarity is compressed using token tables and/or n-gram tables, where higher weighted, consisting of the product of frequency and length, occurring values may be stored in the lower numbered indices of the data table. Records include record descriptor bytes, field descriptor bytes, zero or more array descriptor bytes, zero or more object descriptor bytes, or bytes representing the data associated with the record. Data is indexed or compressed by a suitable module.

    摘要翻译: 本文公开了用于以水平方式压缩结构化或半结构化数据的系统和方法,其实现类似于垂直压缩的压缩比。 集合包括结构化或半结构化数据包括多个字段,并使用模式进行描述。 字段包括具有语义相似性的信息,并使用适合于压缩数据类型的方法进行压缩。 集合的数据在分段之后被压缩,或者可以在压缩之前被归一化。 使用令牌表和/或n-gram表压缩具有语义相似性的数据,其中由频率和长度的乘积组成的较高加权可以存储在数据表的较低编号的索引中。 记录包括记录描述符字节,字段描述符字节,零个或多个数组描述符字节,零个或多个对象描述符字节或表示与记录相关联的数据的字节。 数据由合适的模块索引或压缩。

    Search suggestions using fuzzy-score matching and entity co-occurrence

    公开(公告)号:US09507834B2

    公开(公告)日:2016-11-29

    申请号:US14950874

    申请日:2015-11-24

    申请人: QBASE, LLC

    IPC分类号: G06F17/30

    摘要: A method for generating search suggestions by using fuzzy-score matching and entity co-occurrence in a knowledge base is disclosed. Embodiments of the method may be employed in any search system that may include an entity extraction computer module that may perform partial entity extractions from provided search queries, a fuzzy-score matching computer module that may generate algorithms based on the type of entity extracted and perform a search against an entity co-occurrence knowledge base. The entity co-occurrence knowledge base, which may include a repository where entities may be indexed as entities to entities, entities to topics, or entities to facts among others, may return fast and accurate suggestions to the user to complete the search query. The suggestions may include alternates to the partial query provided by the user that may enhance and save time when performing searches.

    Implementation of clustered in-memory database
    8.
    发明授权
    Implementation of clustered in-memory database 有权
    集群内存数据库的实现

    公开(公告)号:US09430547B2

    公开(公告)日:2016-08-30

    申请号:US14558254

    申请日:2014-12-02

    申请人: QBASE, LLC

    IPC分类号: G06F17/30

    摘要: An in-memory database system and method for administrating a distributed in-memory database, comprising one or more nodes having modules configured to store and distribute database partitions of collections partitioned by a partitioner associated with a search conductor. Database collections are partitioned according to a schema. Partitions, collections, and records, are updated and removed when requested by a system interface, according to the schema. Supervisors determine a node status based on a heartbeat signal received from each node. Users can send queries through a system interface to search managers. Search managers apply a field processing technique, forward the search query to search conductors, and return a set of result records to the analytics agents. Analytics agents perform analytics processing on a candidate results records from a search manager. The search conductors comprising partitioners associated with a collection, search and score the records in a partition, then return a set of candidate result records after receiving a search query from a search manager.

    摘要翻译: 一种用于管理分布式存储器内数据库的内存数据库系统和方法,包括一个或多个节点,其具有被配置为存储和分发由与搜索引导器相关联的分割器分割的集合的数据库分区的模块。 数据库集合根据模式进行分区。 根据模式,系统界面请求时更新和删除分区,集合和记录。 主管根据从每个节点接收的心跳信号确定节点状态。 用户可以通过系统界面向搜索经理发送查询。 搜索管理员应用现场处理技术,将搜索查询转发到搜索引擎,并将一组结果记录返回给分析代理。 分析代理对搜索管理器的候选结果记录执行分析处理。 搜索导体包括与收集相关联的分割器,搜索并对分区中的记录进行分数,然后在从搜索管理器接收到搜索查询之后返回一组候选结果记录。

    Method for entity-driven alerts based on disambiguated features
    9.
    发明授权
    Method for entity-driven alerts based on disambiguated features 有权
    基于消歧特征的实体驱动警报的方法

    公开(公告)号:US09336280B2

    公开(公告)日:2016-05-10

    申请号:US14558121

    申请日:2014-12-02

    申请人: QBASE, LLC

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3053 G06F17/3071

    摘要: A method for entity-driven alerts based on disambiguated features, is disclosed. According to an embodiment, disclosed method may refer to entity-driven alerts based on trending or new knowledge of a disambiguated feature. The alerts may be sent to a user when new knowledge is discovered about the disambiguated feature, a new association (such as new features, facts, quotations, or topic IDs related, among others) with the feature of interest, and/or new trending changes are emerging about the feature of interest. According to various embodiments, method for entity-driven alerts based on disambiguated features may reduce the number of false positives resulting in a normal search query. Which in turn, may increase the efficiency of monitoring, allowing for broadened universe of alerts.

    摘要翻译: 公开了一种基于消歧特征的实体驱动警报的方法。 根据实施例,所公开的方法可以指基于消歧特征的趋势或新知识的实体驱动的警报。 当发现关于消歧特征的新知识时,可以将警报发送给用户,具有感兴趣特征的新关联(例如新特征,事实,引用或与其相关的主题ID)和/或新趋势 关于兴趣特征的变化正在出现。 根据各种实施例,基于消歧特征的实体驱动警报的方法可以减少导致正常搜索查询的误报数量。 这反过来可能会提高监控的效率,从而扩大警报的范围。

    Non-exclusionary search within in-memory databases

    公开(公告)号:US09916368B2

    公开(公告)日:2018-03-13

    申请号:US15052528

    申请日:2016-02-24

    申请人: QBASE, LLC

    IPC分类号: G06F17/30

    摘要: Methods for non-exclusionary searching within clustered in-memory databases are disclosed. The non-exclusionary search methods may allow the execution of searches where the results may include records where fields specified in the query are not populated or defined. The disclosed methods include the application of fuzzy matching and scoring algorithms, which enables the system to search, score and compare records with different schemata. This may significantly improve the recall of relevant records.