Random access of data encoded by polynucleotides

    Publication No.: US10774379B2

    Publication date: 2020-09-15

    Application No.: US15459268

    Filing date: 2017-03-15

    Abstract: This disclosure describes frameworks and techniques related to the random access of digital data encoded by polynucleotides. Digital data of a data file can be encoded as a series of nucleotides and one or more polynucleotide sequences can be generated that encode the digital data for the data file. The bits of the digital data can be segmented to produce multiple polynucleotide sequences that encode the bits of the digital data with each polynucleotide sequence encoding an individual segment of the digital data. The individual segments can be grouped together and associated with a group identifier. Each data file can be associated with a number of group identifiers and the number of segments in each group can be within a specified range. Primers corresponding to the group identifiers can be used to selectively access the polynucleotides that encode the digital data of a data file.
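The segmentation and grouping described in this abstract can be illustrated with a minimal sketch. The 2-bits-per-base mapping, the segment size, and the rule that assigns consecutive segments to the same group are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import Dict, List

# Assumed mapping (not from the patent): 00 -> A, 01 -> C, 10 -> G, 11 -> T.
BASES = "ACGT"


def bytes_to_bases(data: bytes) -> str:
    """Encode each byte as four nucleotides, most significant bit pair first."""
    out = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            out.append(BASES[(byte >> shift) & 0b11])
    return "".join(out)


def segment_and_group(
    data: bytes, segment_bytes: int, max_group_size: int
) -> Dict[int, List[str]]:
    """Split data into segments, encode each one, and bucket them under group IDs.

    Hypothetical grouping rule: consecutive segments share a group until the
    group holds max_group_size segments.
    """
    segments = [
        data[i : i + segment_bytes] for i in range(0, len(data), segment_bytes)
    ]
    groups: Dict[int, List[str]] = {}
    for index, segment in enumerate(segments):
        group_id = index // max_group_size
        groups.setdefault(group_id, []).append(bytes_to_bases(segment))
    return groups


groups = segment_and_group(b"hello DNA", segment_bytes=2, max_group_size=3)
for group_id, sequences in groups.items():
    print(group_id, sequences)
```

In a real system each group identifier would map to a primer pair attached to every strand in the group, so that one group can be retrieved without reading the rest of the pool.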

    ERROR CORRECTION FOR NUCLEOTIDE DATA STORES
    Patent application

    Publication No.: US20170141793A1

    Publication date: 2017-05-18

    Application No.: US15004827

    Filing date: 2016-01-22

    Abstract: This disclosure provides techniques for adding error correction to information in a data store that encodes information as a sequence of bases in polynucleotides. Errors may be introduced through creation of the database (e.g., oligonucleotide synthesis) and/or reading information from the database (e.g., polynucleotide sequencing). Additional polynucleotides added to the database can provide error correction through redundancy. The sequence of polynucleotides that provide error correction may be designed by performing an invertible summary operation on information to be stored in the database. One example of an invertible summary operation is the exclusive or operation (XOR). This disclosure also provides techniques for storing metadata related to organization of a database and structure of information on polynucleotides within the database. Metadata may be encoded in polynucleotides and added to the data store. The polynucleotides holding metadata may be designed with unique primer sites so that the metadata can be selectively amplified and sequenced.
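XOR is invertible in the sense the abstract relies on: given the XOR of two payloads and either payload, the other can be recovered. A minimal sketch, assuming equal-length byte payloads (the payload values below are made up for illustration):

```python
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bytewise XOR of two equal-length payloads."""
    if len(a) != len(b):
        raise ValueError("payloads must have the same length")
    return bytes(x ^ y for x, y in zip(a, b))


# Two hypothetical payloads that would each be stored on their own strand.
payload_a = b"DATA"
payload_b = b"BITS"

# A redundant strand would carry the XOR of the two payloads.
parity = xor_bytes(payload_a, payload_b)

# If the strand holding payload_b is lost or unreadable, it can be rebuilt
# from the surviving payload and the parity payload.
recovered_b = xor_bytes(payload_a, parity)
print(recovered_b)
```

The same identity works in the other direction, so a single redundant strand protects against the loss of either of the two strands it summarizes.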

    Modifications to polynucleotides for sequencing

    Publication No.: US10689684B2

    Publication date: 2020-06-23

    Application No.: US15431897

    Filing date: 2017-02-14

    Abstract: This disclosure describes techniques to improve the sequencing of polynucleotides by decreasing the likelihood of errors occurring during a sequencing calibration process. In implementations, regions of polynucleotides that are used for the calibration process can be modified to reduce a number of polynucleotides that have a same nucleotide at one or more positions of the calibration regions. In some cases, the calibration regions can be modified by adding a sequence to the polynucleotides that replaces the original calibration regions. Also, the calibration regions can be modified by rearranging the nucleotides at the different positions of the calibration regions. Additionally, the calibration regions can be modified by adding sequences of varying length to the polynucleotides being sequenced to produce polynucleotides having varying length with different calibration regions.
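The problem the abstract targets can be made concrete by measuring, for each position in the calibration region, what fraction of strands share the most common nucleotide. The sketch below assumes a 4-base calibration region and uses made-up spacer sequences to illustrate the "sequences of varying length" option; neither value comes from the patent.

```python
from collections import Counter
from typing import List


def dominant_fraction(strands: List[str], region_len: int) -> List[float]:
    """For each of the first region_len positions, return the share of strands
    that carry the most common base at that position."""
    fractions = []
    for pos in range(region_len):
        counts = Counter(strand[pos] for strand in strands)
        fractions.append(max(counts.values()) / len(strands))
    return fractions


# Four strands that start with the same bases: every calibration position
# is dominated by a single nucleotide.
strands = ["ACGTACGT"] * 4
before = dominant_fraction(strands, region_len=4)

# Hypothetical spacers of length 0 to 3 shift each strand by a different
# offset, so the calibration positions see a mix of bases.
spacers = ["", "T", "GT", "CGT"]
shifted = [spacer + strand for spacer, strand in zip(spacers, strands)]
after = dominant_fraction(shifted, region_len=4)

print(before)
print(after)
```

A decoder would need to know or detect the spacer lengths in order to strip them after sequencing; that step is outside this sketch.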

    MODIFICATIONS TO POLYNUCLEOTIDES FOR SEQUENCING

    Publication No.: US20180230509A1

    Publication date: 2018-08-16

    Application No.: US15431897

    Filing date: 2017-02-14

    CPC classification number: C12Q1/6806 C12Q1/6869 G16B30/00 C12Q2545/101

    Abstract: This disclosure describes techniques to improve the sequencing of polynucleotides by decreasing the likelihood of errors occurring during a sequencing calibration process. In implementations, regions of polynucleotides that are used for the calibration process can be modified to reduce a number of polynucleotides that have a same nucleotide at one or more positions of the calibration regions. In some cases, the calibration regions can be modified by adding a sequence to the polynucleotides that replaces the original calibration regions. Also, the calibration regions can be modified by rearranging the nucleotides at the different positions of the calibration regions. Additionally, the calibration regions can be modified by adding sequences of varying length to the polynucleotides being sequenced to produce polynucleotides having varying length with different calibration regions.

    Generating pluralities of primer and payload designs for retrieval of stored nucleotides

    Publication No.: US10787699B2

    Publication date: 2020-09-29

    Application No.: US15427808

    Filing date: 2017-02-08

    Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.
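The abstract does not list the scoring criteria, so the sketch below assumes two commonly used ones, GC content within a window and a cap on homopolymer run length, purely to show the shape of a score-then-select step. The thresholds and candidate sequences are illustrative, not taken from the patent.

```python
from typing import List


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def longest_homopolymer(seq: str) -> int:
    """Length of the longest run of one repeated base."""
    longest = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        longest = max(longest, run)
    return longest


def score_primer(
    seq: str, gc_low: float = 0.4, gc_high: float = 0.6, max_run: int = 3
) -> int:
    """Award one point per satisfied criterion (assumed criteria, see lead-in)."""
    score = 0
    if gc_low <= gc_content(seq) <= gc_high:
        score += 1
    if longest_homopolymer(seq) <= max_run:
        score += 1
    return score


candidates: List[str] = [
    "ACGTGCATCGTAGCTAGCTA",  # balanced GC, no long runs
    "AAAAAAAATTTTTTTTAAAA",  # no GC, long runs
    "GCGCGCGCGGGGCCGCGCGC",  # all GC, one run of four
]

# Keep only candidates that satisfy every criterion.
selected = [primer for primer in candidates if score_primer(primer) == 2]
print(selected)
```

A fuller scorer would likely also consider melting temperature, self-complementarity, and similarity to other primers in the set.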

    PRIMER DESIGN FOR RETRIEVAL OF STORED POLYNUCLEOTIDES

    Publication No.: US20180223340A1

    Publication date: 2018-08-09

    Application No.: US15427344

    Filing date: 2017-02-08

    Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.

    RANDOM ACCESS OF DATA ENCODED BY POLYNUCLEOTIDES

    Publication No.: US20180265921A1

    Publication date: 2018-09-20

    Application No.: US15459268

    Filing date: 2017-03-15

    Abstract: This disclosure describes frameworks and techniques related to the random access of digital data encoded by polynucleotides. Digital data of a data file can be encoded as a series of nucleotides and one or more polynucleotide sequences can be generated that encode the digital data for the data file. The bits of the digital data can be segmented to produce multiple polynucleotide sequences that encode the bits of the digital data with each polynucleotide sequence encoding an individual segment of the digital data. The individual segments can be grouped together and associated with a group identifier. Each data file can be associated with a number of group identifiers and the number of segments in each group can be within a specified range. Primers corresponding to the group identifiers can be used to selectively access the polynucleotides that encode the digital data of a data file.
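Selective access with primers can be modeled, very loosely, as filtering a pool of strands by the sequences at their two ends. The sketch below is a string-matching simplification: it ignores strand complementarity and all PCR chemistry, and the primer sequences and strand layout (forward primer, payload, reverse primer target) are assumptions for illustration.

```python
from typing import Dict, List, Tuple


def select_by_primers(pool: List[str], fwd: str, rev: str) -> List[str]:
    """Return the payloads of strands flanked by the given primer sequences."""
    return [
        strand[len(fwd) : len(strand) - len(rev)]
        for strand in pool
        if strand.startswith(fwd) and strand.endswith(rev)
    ]


# Hypothetical primer pairs, one per group identifier.
group_primers: Dict[int, Tuple[str, str]] = {
    0: ("ACGTAC", "TGCATG"),
    1: ("GGATCC", "CCTAGG"),
}

# A mixed pool holding strands from both groups.
pool = [
    "ACGTAC" + "CGGACGCC" + "TGCATG",
    "GGATCC" + "CAAC" + "CCTAGG",
    "ACGTAC" + "CGTACGTA" + "TGCATG",
]

fwd, rev = group_primers[0]
group0_payloads = select_by_primers(pool, fwd, rev)
print(group0_payloads)
```

Only the strands tagged for group 0 are returned, which is the software analogue of amplifying one group out of the pool before sequencing.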

    Primer Design for Retrieval of Stored Nucleotides

    Publication No.: US20180223341A1

    Publication date: 2018-08-09

    Application No.: US15427808

    Filing date: 2017-02-08

    Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.

    Primer and payload design for retrieval of stored polynucleotides

    Publication No.: US10793897B2

    Publication date: 2020-10-06

    Application No.: US15427344

    Filing date: 2017-02-08

    Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.
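One way to picture the bit randomization mentioned at the end of this abstract is to XOR the data with a seeded pseudo-random keystream and keep a seed for which the encoded payload shares only short runs with the primer. Everything specific here is an assumption for illustration: the keystream source, the 2-bits-per-base mapping, the overlap measure (longest common substring), and the limit of 7 bases. The check also ignores the primer's reverse complement.

```python
import random
from typing import Tuple

BASES = "ACGT"  # assumed mapping: 00 -> A, 01 -> C, 10 -> G, 11 -> T


def bytes_to_bases(data: bytes) -> str:
    """Encode each byte as four nucleotides, most significant bit pair first."""
    return "".join(
        BASES[(byte >> shift) & 0b11] for byte in data for shift in (6, 4, 2, 0)
    )


def mask(data: bytes, seed: int) -> bytes:
    """XOR data with a seeded keystream; applying it twice restores the data."""
    rng = random.Random(seed)
    return bytes(d ^ rng.getrandbits(8) for d in data)


def longest_shared_run(a: str, b: str) -> int:
    """Length of the longest substring common to a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        cur = [0] * (len(b) + 1)
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def randomize_payload(
    data: bytes, primer: str, max_overlap: int = 7, max_seeds: int = 256
) -> Tuple[int, str]:
    """Try seeds in order and return the first whose payload meets the limit."""
    for seed in range(max_seeds):
        bases = bytes_to_bases(mask(data, seed))
        if longest_shared_run(bases, primer) <= max_overlap:
            return seed, bases
    raise ValueError("no seed met the overlap limit")


primer = "ACGTGCATCGTAGCTAGCTA"
# Data chosen so that its unrandomized encoding starts with the primer itself.
data = bytes([0x1B, 0x93, 0x6C, 0x9C, 0x9C]) + b"xyz"

raw_overlap = longest_shared_run(bytes_to_bases(data), primer)
seed, randomized = randomize_payload(data, primer)
print(raw_overlap, seed, longest_shared_run(randomized, primer))
```

The chosen seed has to be stored or otherwise recoverable, since decoding applies the same mask again to restore the original bits.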
