Systems and methods for pre-processing string data for network transmission

    公开(公告)号:US12081639B1

    公开(公告)日:2024-09-03

    申请号:US18319008

    申请日:2023-05-17

    CPC classification number: H04L69/04

    Abstract: Systems and methods for pre-processing string data for network transmission are disclosed. A system can extract first sequences from a sequence file and generate respective encoded sequences based on the first sequences extracted from the sequence file. The system can generate a hash table that stores the respective encoded sequences. The system can combine at least two entries in the hash table based on a comparison of data generated from at least two of the respective plurality of encoded sequences. The system can transmit an output file including a plurality of decoded sequences generated based on the hash table.

    SYSTEMS AND METHODS FOR REMOVING DATA FROM TEXT STRINGS

    公开(公告)号:US20240248932A1

    公开(公告)日:2024-07-25

    申请号:US18585304

    申请日:2024-02-23

    Abstract: Systems and methods for removing data from strings are disclosed. A system can access a first hash table that stores representations of a first set of strings, where the representations have predetermined number of characters. The system can generate a second hash table that stores a second representations of a string of a second set of strings, where the second representations have the predetermined number of characters. Upon determining that the first hash table includes at least one of the plurality of second representations of the string included in the second hash table, the system can increment a counter associated with the string. The system can generate a third set of strings by removing the string from the second set of strings responsive to determining that the counter satisfies a threshold, and transmit the third set of strings to a computing system.

    SYSTEMS AND METHODS FOR PRE-PROCESSING STRING DATA FOR NETWORK TRANSMISSION

    公开(公告)号:US20240430344A1

    公开(公告)日:2024-12-26

    申请号:US18820699

    申请日:2024-08-30

    Abstract: Systems and methods for pre-processing string data for network transmission are disclosed. A system can extract first sequences from a sequence file and generate respective encoded sequences based on the first sequences extracted from the sequence file. The system can generate a hash table that stores the respective encoded sequences. The system can combine at least two entries in the hash table based on a comparison of data generated from at least two of the respective plurality of encoded sequences. The system can transmit an output file including a plurality of decoded sequences generated based on the hash table.

    Systems and methods for removing human genetic data from genetic sequences

    公开(公告)号:US11914653B1

    公开(公告)日:2024-02-27

    申请号:US18100074

    申请日:2023-01-23

    Abstract: Systems and methods for removing data from strings are disclosed. A system can access a first hash table that stores representations of a first set of strings, where the representations have predetermined number of characters. The system can generate a second hash table that stores a second representations of a string of a second set of strings, where the second representations have the predetermined number of characters. Upon determining that the first hash table includes at least one of the plurality of second representations of the string included in the second hash table, the system can increment a counter associated with the string. The system can generate a third set of strings by removing the string from the second set of strings responsive to determining that the counter satisfies a threshold, and transmit the third set of strings to a computing system.

Patent Agency Ranking