-
Publication No.: US10774379B2
Publication date: 2020-09-15
Application No.: US15459268
Filing date: 2017-03-15
Inventor: Yuan-Jyue Chen, Karin Strauss, Luis H. Ceze, Lee Organick
IPC: C12Q1/6869, C07H21/02, C12P19/34, G06F16/16, G16B99/00, G16B50/00, C07H21/00, C12Q1/00, G16B50/40
Abstract: This disclosure describes frameworks and techniques related to the random access of digital data encoded by polynucleotides. Digital data of a data file can be encoded as a series of nucleotides and one or more polynucleotide sequences can be generated that encode the digital data for the data file. The bits of the digital data can be segmented to produce multiple polynucleotide sequences that encode the bits of the digital data with each polynucleotide sequence encoding an individual segment of the digital data. The individual segments can be grouped together and associated with a group identifier. Each data file can be associated with a number of group identifiers and the number of segments in each group can be within a specified range. Primers corresponding to the group identifiers can be used to selectively access the polynucleotides that encode the digital data of a data file.
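The segmentation and grouping described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the 2-bit-per-base mapping, the fixed segment size, the integer group identifiers, and the function names (`bytes_to_bases`, `segment_and_group`, `select_group`) are all assumptions chosen for illustration. Filtering by `group_id` stands in for primer-based selective access.

```python
# Illustrative sketch only; mapping, sizes, and names are assumptions.
BASES = "ACGT"  # assumed mapping: 00 -> A, 01 -> C, 10 -> G, 11 -> T


def bytes_to_bases(data: bytes) -> str:
    """Encode each byte as four nucleotides, two bits per base."""
    out = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            out.append(BASES[(byte >> shift) & 0b11])
    return "".join(out)


def segment_and_group(data: bytes, segment_bytes: int = 4, max_group_size: int = 3):
    """Split data into segments and assign each segment a group identifier.

    Every group holds at most max_group_size segments, so the number of
    segments per group stays within a specified range.
    """
    segments = [data[i:i + segment_bytes] for i in range(0, len(data), segment_bytes)]
    records = []
    for index, segment in enumerate(segments):
        records.append({
            "group_id": index // max_group_size,  # hypothetical identifier scheme
            "index": index,
            "payload": bytes_to_bases(segment),
        })
    return records


def select_group(records, group_id: int):
    """Stand-in for primer-based access: keep only records in one group."""
    return [r for r in records if r["group_id"] == group_id]


records = segment_and_group(bytes(range(16)))
first_group = select_group(records, 0)
```

In a physical system the group identifier would correspond to a primer target sequence rather than an integer; the integer is used here only to keep the sketch short.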
-
Publication No.: US20170141793A1
Publication date: 2017-05-18
Application No.: US15004827
Filing date: 2016-01-22
Inventor: Karin Strauss, Siena Dumas Ang, Luis H. Ceze, James Bornholt
CPC classification number: H03M13/05, G16B15/00, G16B50/00, H03M13/09, H03M13/098, H03M13/373, H03M13/611
Abstract: This disclosure provides techniques for adding error correction to information in a data store that encodes information as a sequence of bases in polynucleotides. Errors may be introduced through creation of the database (e.g., oligonucleotide synthesis) and/or reading information from the database (e.g., polynucleotide sequencing). Additional polynucleotides added to the database can provide error correction through redundancy. The sequence of polynucleotides that provide error correction may be designed by performing an invertible summary operation on information to be stored in the database. One example of an invertible summary operation is the exclusive or operation (XOR). This disclosure also provides techniques for storing metadata related to organization of a database and structure of information on polynucleotides within the database. Metadata may be encoded in polynucleotides and added to the data store. The polynucleotides holding metadata may be designed with unique primer sites so that the metadata can be selectively amplified and sequenced.
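The XOR example named in this abstract can be sketched in a few lines of Python. This is a simplified illustration operating on bytes rather than nucleotide sequences; the payload values and the function name `xor_bytes` are assumptions. Because XOR is invertible, storing the XOR of two payloads alongside them lets either payload be rebuilt from the other two items.

```python
# Illustrative sketch only; payloads and names are assumptions.
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Return the bytewise XOR of two equal-length payloads."""
    if len(a) != len(b):
        raise ValueError("payloads must have equal length")
    return bytes(x ^ y for x, y in zip(a, b))


payload_a = b"DNA!"
payload_b = b"data"

# The redundant item that would be stored as an additional polynucleotide.
parity = xor_bytes(payload_a, payload_b)

# If payload_a is lost or misread, it can be rebuilt from parity and payload_b.
recovered_a = xor_bytes(parity, payload_b)
# The same holds in the other direction.
recovered_b = xor_bytes(parity, payload_a)
```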
-
Publication No.: US10689684B2
Publication date: 2020-06-23
Application No.: US15431897
Filing date: 2017-02-14
Inventor: Yuan-Jyue Chen, Karin Strauss, Luis H. Ceze, Lee Organick, Randolph Lopez, Georg Seelig
IPC: C12Q1/6806, C12Q1/6869, G16B30/00
Abstract: This disclosure describes techniques to improve the sequencing of polynucleotides by decreasing the likelihood of errors occurring during a sequencing calibration process. In implementations, regions of polynucleotides that are used for the calibration process can be modified to reduce a number of polynucleotides that have a same nucleotide at one or more positions of the calibration regions. In some cases, the calibration regions can be modified by adding a sequence to the polynucleotides that replaces the original calibration regions. Also, the calibration regions can be modified by rearranging the nucleotides at the different positions of the calibration regions. Additionally, the calibration regions can be modified by adding sequences of varying length to the polynucleotides being sequenced to produce polynucleotides having varying length with different calibration regions.
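One of the options in this abstract, adding sequences of varying length so that fewer polynucleotides share a nucleotide at a given calibration position, can be sketched in Python. The spacer sequences, the four-base calibration window, and the function names (`max_base_fraction`, `stagger`) are hypothetical and chosen only to make the effect visible; the patent does not specify them here.

```python
# Illustrative sketch only; spacers, window size, and names are assumptions.
from collections import Counter


def max_base_fraction(seqs, window: int):
    """For each of the first `window` positions, return the fraction of
    sequences that share the most common base at that position."""
    fractions = []
    for pos in range(window):
        counts = Counter(seq[pos] for seq in seqs)
        fractions.append(max(counts.values()) / len(seqs))
    return fractions


def stagger(seqs, spacers):
    """Prepend spacers of varying length, cycling through the spacer list."""
    return [spacers[i % len(spacers)] + seq for i, seq in enumerate(seqs)]


# Four reads that begin with the same bases, as when they share a primer.
reads = ["ACGTAC"] * 4
before = max_base_fraction(reads, window=4)

# Hypothetical spacers of length 0 to 3 shift the reads out of phase.
after = max_base_fraction(stagger(reads, ["", "T", "GA", "CTG"]), window=4)
```

Before staggering, every position in the window is occupied by a single base across all reads; afterwards, no position is dominated by one base.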
-
Publication No.: US20180230509A1
Publication date: 2018-08-16
Application No.: US15431897
Filing date: 2017-02-14
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yuan-Jyue Chen, Karin Strauss, Luis H. Ceze, Lee Organick, Randolph Lopez, Georg Seelig
CPC classification number: C12Q1/6806, C12Q1/6869, G16B30/00, C12Q2545/101
Abstract: This disclosure describes techniques to improve the sequencing of polynucleotides by decreasing the likelihood of errors occurring during a sequencing calibration process. In implementations, regions of polynucleotides that are used for the calibration process can be modified to reduce a number of polynucleotides that have a same nucleotide at one or more positions of the calibration regions. In some cases, the calibration regions can be modified by adding a sequence to the polynucleotides that replaces the original calibration regions. Also, the calibration regions can be modified by rearranging the nucleotides at the different positions of the calibration regions. Additionally, the calibration regions can be modified by adding sequences of varying length to the polynucleotides being sequenced to produce polynucleotides having varying length with different calibration regions.
-
Publication No.: US10787699B2
Publication date: 2020-09-29
Application No.: US15427808
Filing date: 2017-02-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yuan-Jyue Chen, Karin Strauss, Luis H. Ceze, Siena Dumas Ang, Sergey Yekhanin
IPC: C12Q1/6811, G16B25/00
Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.
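The two ideas in this abstract, scoring candidate primers against fitness criteria and randomizing the data bits before encoding, can be sketched in Python. The abstract does not list the criteria, so the two used here (GC content between 40% and 60%, and no homopolymer run longer than three bases) are common primer-design heuristics supplied as assumptions. The seeded XOR keystream is likewise only one possible invertible randomization, and all function names are hypothetical.

```python
# Illustrative sketch only; criteria, thresholds, and names are assumptions.
import random


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    return sum(base in "GC" for base in seq) / len(seq)


def longest_homopolymer(seq: str) -> int:
    """Length of the longest run of one repeated base."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    longest = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        longest = max(longest, run)
    return longest


def primer_score(seq: str) -> int:
    """Count how many of the assumed criteria a candidate primer meets."""
    score = 0
    if 0.4 <= gc_fraction(seq) <= 0.6:
        score += 1
    if longest_homopolymer(seq) <= 3:
        score += 1
    return score


def randomize(data: bytes, seed: int) -> bytes:
    """XOR data with a seeded pseudo-random keystream.

    Applying the function again with the same seed restores the input.
    """
    rng = random.Random(seed)
    return bytes(b ^ rng.randrange(256) for b in data)


candidates = ["ACGTACGTACGTACGTACGT", "AAAAAAAATTTTTTTT"]
selected = [c for c in candidates if primer_score(c) == 2]

scrambled = randomize(b"example payload", seed=7)
restored = randomize(scrambled, seed=7)
```

Randomizing the bits makes it less likely that a payload happens to contain a long stretch resembling a primer target, though this sketch does not measure that overlap.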
-
Publication No.: US20180223340A1
Publication date: 2018-08-09
Application No.: US15427344
Filing date: 2017-02-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yuan-Jyue Chen, Luis H. Ceze, Sergey Yekhanin, Siena Dumas Ang, Karin Strauss
Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.
-
Publication No.: US20180265921A1
Publication date: 2018-09-20
Application No.: US15459268
Filing date: 2017-03-15
Inventor: Yuan-Jyue Chen, Karin Strauss, Luis H. Ceze, Lee Organick
Abstract: This disclosure describes frameworks and techniques related to the random access of digital data encoded by polynucleotides. Digital data of a data file can be encoded as a series of nucleotides and one or more polynucleotide sequences can be generated that encode the digital data for the data file. The bits of the digital data can be segmented to produce multiple polynucleotide sequences that encode the bits of the digital data with each polynucleotide sequence encoding an individual segment of the digital data. The individual segments can be grouped together and associated with a group identifier. Each data file can be associated with a number of group identifiers and the number of segments in each group can be within a specified range. Primers corresponding to the group identifiers can be used to selectively access the polynucleotides that encode the digital data of a data file.
-
Publication No.: US20180223341A1
Publication date: 2018-08-09
Application No.: US15427808
Filing date: 2017-02-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yuan-Jyue Chen, Karin Strauss, Luis H. Ceze, Siena Dumas Ang, Sergey Yekhanin
IPC: C12Q1/68
Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.
-
Publication No.: US12157102B2
Publication date: 2024-12-03
Application No.: US17863033
Filing date: 2022-07-12
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
Inventor: Bichlien H. Nguyen, Douglas P. Kelley, Karin Strauss, Robert Carlson, Hsing-Yeh Parker, John Mulligan, Luis H. Ceze, Yuan-Jyue Chen, Douglas Carmean
IPC: B01J19/00
Abstract: A system includes a synthesizer unit having a fluid input to receive fluids and a communication input to receive commands to synthesize data-encoded DNA sequences and cleave the DNA. A first flexible chemistry reaction chamber module may be fluidically coupled to the synthesizer unit to receive the data-encoded DNA sequences and amplify the sequences. A deposition unit may be fluidically coupled to the first flexible chemistry reaction chamber module to receive the amplified DNA sequences and encapsulate the amplified DNA sequences into one or more wells in a storage plate for storage and retrieval to and from a plate storage unit. Retrieved DNA may be processed and read by further units.
-
Publication No.: US10793897B2
Publication date: 2020-10-06
Application No.: US15427344
Filing date: 2017-02-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yuan-Jyue Chen, Luis H. Ceze, Sergey Yekhanin, Siena Dumas Ang, Karin Strauss
IPC: C12Q1/6811, G16B30/00
Abstract: This disclosure describes techniques to improve the accuracy of random access of data stored in polynucleotide sequence data storage systems. Primers used in polynucleotide sequence replication and amplification can be scored against a number of criteria that indicate the fitness of sequences of nucleotides to function as primers. Primers having scores that indicate a particular fitness to function as primers can be added to a specific group of primers. The primers from the group of primers can be used in amplification and replication of polynucleotide sequences that encode digital data. Additionally, an amount of overlap between primer targets and payloads encoding digital data can be determined. Minimizing the amount of overlap between primer targets and payloads can improve the efficiency of polynucleotide replication and amplification. The bits of the digital data can be randomized to minimize the amount of overlap between payloads encoding the digital data and primer targets.