CODEBOOK MANAGEMENT BASED ON DATA SOURCE GROUPING

    公开(公告)号:US20240419329A1

    公开(公告)日:2024-12-19

    申请号:US18822209

    申请日:2024-09-01

    Abstract: A system and method for codebook management is disclosed. Training datasets are obtained from various data sources. A similarity score is generated for each training dataset with reference to the other training datasets. In response to detecting a similarity score above a predetermined threshold for one or more of the other training datasets, a combined codebook is created based on training datasets that have a similarity score above a predetermined threshold. Based on the similarity score, multiple data sources are combined into a group, and the combined codebook is used for the data sources within the group. A mismatch performance metric can be computed for the combined codebook, and a revised combined codebook can be regenerated in response to the mismatch performance metric being above a predetermined threshold.

    System and method for multi-type data compression or decompression with a virtual management layer

    公开(公告)号:US12166506B2

    公开(公告)日:2024-12-10

    申请号:US18657719

    申请日:2024-05-07

    Abstract: A system and methods for multi-type data compression or decompression with a virtual management layer, comprising. It incorporates a virtual management layer to organize incoming data types and select a compression or decompression system that utilizes a technique best suited for a particular data type. Associated data sets may be flagged prior to compression or decompression so that associated types may be preserved together after the compression or decompression process is complete. This approach allows each data type to be compressed or decompressed using a technique that is the most efficient for a particular data type. Additionally, the approach allows all information associated with a particular data set to be compressed or decompressed in some way.

    SYSTEM AND METHOD FOR DYADIC DISTRIBUTION-BASED COMPRESSION AND ENCRYPTION

    公开(公告)号:US20240372562A1

    公开(公告)日:2024-11-07

    申请号:US18770652

    申请日:2024-07-12

    Abstract: A system and method for simultaneous compression and encryption of data. The system analyzes input data to determine its properties and creates a transformation matrix based on these properties. Using this matrix, the input data is transformed into a modified distribution, generating a main data stream of transformed data and a secondary stream of transformation information. The main data stream is compressed, and both streams are combined into a single output. The system implements security measures to protect against various attacks, including side-channel vulnerabilities. By using a dyadic distribution algorithm, the system achieves both compression and encryption in a single pass over the data, offering significant efficiency gains. The system can operate in both lossless and lossy modes, providing flexibility for different application requirements. This approach offers a unique solution for data transmission and storage scenarios where both data reduction and security are critical concerns.

    Event-driven data transmission using codebooks with protocol adaption

    公开(公告)号:US12136934B2

    公开(公告)日:2024-11-05

    申请号:US18644019

    申请日:2024-04-23

    Abstract: A system and method for event-driven data communication using codebooks with protocol adaption. The system initiates with a request for propagation information from an application to a first transaction manager. The first transaction manager configures a packet describing its location, potentially containing one or more protocol appendices, or encoded data using a codebook. This packet is provided to the application for transmission to another application with a second transaction manager. Upon receiving a protocol request from the second transaction manager, the first transaction manager communicates using a selected protocol decoded from the protocol appendix. If the selected protocol is supported, the transaction proceeds, completing successfully. This system enables transparent encoding, negotiation, and selection of communication protocols, allowing efficient transactions between different transaction managers.

    SYSTEM AND METHOD FOR RANDOM-ACCESS MANIPULATION OF COMPACTED DATA FILES

    公开(公告)号:US20240362189A1

    公开(公告)日:2024-10-31

    申请号:US18768606

    申请日:2024-07-10

    CPC classification number: G06F16/1752 G06F3/0608 G06F3/0641 G06F3/067

    Abstract: A system and method for random-access manipulation of compacted data files, utilizing a reference codebook, a random-access engine, a data deconstruction engine, and a data deconstruction engine. The system may receive a data query pertaining to a data read or data write request, wherein the data file to be read from or written to is a compacted data file. A random-access engine may facilitate data manipulation processes by transforming the codebook into a hierarchical representation and then traversing the representation scanning for specific codewords associated with a data query request. In an embodiment, an estimator module is present and configured to utilize cardinality estimation to determine a starting codeword to begin searching the compacted data file for the data associated with the data query. The random-access engine may encode the data to be written, insert the encoded data into a compacted data file, and update the codebook as needed.

    SYSTEM AND METHOD FOR DATA COMPACTION AND ENCRYPTION OF ANONYMIZED DATA RECORDS

    公开(公告)号:US20240329836A1

    公开(公告)日:2024-10-03

    申请号:US18737474

    申请日:2024-06-07

    Abstract: A system and method for data compaction and encryption of anonymized data records. A dataset may be pre-processed by dividing into sourceblocks at reasonable intervals and tallying each sourceblock's frequency, creating a tally record of tokens and count values. This tally record may then be anonymized and transmitted to a data deconstruction engine which combined with a library manager creates a codebook and performs optimization techniques on the codebook. The data deconstruction engine and library manager may be distributed across multiple nodes or devices. The received anonymized tally record may be parsed into individual tokens by identifying the tokens with the highest count value. The tokens may then be sent descending order of count value to the library manger where each token may be assigned a codeword. A half-backed codebook is then created using the tokens and each token's unique codeword, before sending the half-backed codebook to a system user.

    SYSTEM AND METHOD FOR CODEBOOK MANAGEMENT BASED ON DATA SOURCE GROUPING

    公开(公告)号:US20240248602A1

    公开(公告)日:2024-07-25

    申请号:US18593931

    申请日:2024-03-03

    Abstract: A system and method for codebook management is disclosed. Training datasets are obtained from various data sources. A similarity score is generated for each training dataset with reference to the other training datasets. In response to detecting a similarity score above a predetermined threshold for one or more of the other training datasets, a combined codebook is created based on training datasets that have a similarity score above a predetermined threshold. Based on the similarity score, multiple data sources are combined into a group, and the combined codebook is used for the data sources within the group. A mismatch performance metric can be computed for the combined codebook, and a revised combined codebook can be regenerated in response to the mismatch performance metric being above a predetermined threshold.

Patent Agency Ranking