Abstract:
An ultra-fast solution to the problem of comparing genomes across sequencing technologies and genome freezes, while preserving privacy, is presented. A method for transforming a standard genome representation (i.e., a list of variants relative to a reference) into a "fingerprint" of the genome does not require knowledge of the technology, reference and encoding used, and yields fingerprints that can be readily compared to ascertain relatedness between two genome representations. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. This enables scaling up a variety of important genome analyses, including determinations of degree of relatedness, recognizing duplicative sequenced genomes in a set, and many others. Because the original genome representation cannot be reconstructed from its fingerprint, the method also has significant implications for privacy-preserving genome analytics.
Abstract:
A method of preparing a sequencing library based on a blood sample, a method of sequencing a nucleic acid, a method for determining a fetal genetic abnormality of a chromosomal aneuploidy in a sample obtained from a pregnant female subject, an apparatus to prepare a sequencing library based on a blood sample, an assembly to sequence a nucleic acid, and a system for determining a fetal genetic abnormality of a chromosomal aneuploidy in a sample obtained from a pregnant female subject are provided.
Abstract:
같은 단백질로 번역되는 3개의 핵산 염기서열(코돈)의 조합을 이용함으로써 유전자 라이브러리 합성 후, 핵산 염기서열의 다양한 라이브러리 서열의 분석 시 쉽게 에러를 찾아낼 수 있는 방식을 제안한다. 이를 이용해 단백질 서열은 같지만 핵산 염기서열은 서로 다른 유전자 라이브러리를 만들 수 있다는 것을 보여준다. 이는 생체 내에서 특정 유전자를 발현시키기 위해 최적화되어 있는 코돈의 사용을 변화시킴으로써 코돈 변화에 따른 유전자 발현의 상관관계를 측정할 수 있는 새로운 실험방법을 제공한다. 또한, 같은 방식으로 단백질 서열 역시 일부 유사단백질 서열로 변환시킨 유전자 라이브러리를 동시에 합성 및 분석할 수 있으며, 이를 통해 발현된 유전자의 일부 단백질 서열의 변화가 해당 유전자의 기능에 어떠한 영향을 미치는지 확인할 수 있는 방법을 제공한다.
Abstract:
The present disclosure provides methods for optimizing barcode design for multiplex DNA sequencing. Also disclosed are DNA barcodes optimized for use with particular sequencing technologies.
Abstract:
Aspects of the invention relate to methods, compositions and algorithms for designing and producing a target nucleic acid. The method can include: (1) providing a plurality of blunt-end double-stranded nucleic acid fragments having a restriction enzyme recognition sequence at both ends thereof; (2) producing via enzymatic digestion a plurality of cohesive-end double-stranded nucleic acid fragments each having two different and non-complementary overhangs; (3) ligating the plurality of cohesive-end double-stranded nucleic acid fragments with a ligase; and (4) forming a linear arrangement of the plurality of cohesive-end double-stranded nucleic acid fragments, wherein the unique arrangement comprises the target nucleic acid. In certain embodiments, the plurality of blunt-end double-stranded nucleic acid fragments can be provided by: releasing a plurality of oligonucleotides synthesized on a solid support; and synthesizing complementary strands of the plurality of oligonucleotides using a polymerase based reaction.
Abstract:
The invention provides a method for increasing the stability and/or activity of a polypeptide at low pH and/or elevated temperatures. The invention further provides a method for increasing the melting termperature of a polypeptide. Also provided are paleoenzymologically reconstructed thioredoxin polypeptides having activity at higher temperatures and/or lower pH than extant thioredoxin polypepetides, as well as paleoenzymologically reconstructed thioredoxin polypeptides having higher melting temperatures than extant thioredoxin polypepetides.
Abstract:
The present invention relates to methods and algorithms for identifying, synthesizing and co-assembling combinatorial libraries of polynucleotide variants.
Abstract:
The present disclosure relates to CBH II chimera fusion polypetides, nucleic acids encoding the polypeptides, and host cells for producing the polypeptides.