Abstract:
Methods, computer systems, and computer program products for biopolymer engineering. A variant set for a biopolymer of interest is constructed by identifying, using a plurality of rules, a plurality of positions in the biopolymer of interest and, for each respective position in the plurality of positions, substitutions for the respective position. The plurality of positions and the substitutions for each respective position in the plurality of positions collectively define a biopolymer sequence space. A variant set comprising a plurality of variants of the biopolymer of interest is selected. A property of all or a portion of the variants in the variant set is measured. A sequence-activity relationship is modeled between (i) one or more substitutions at one or more positions of the biopolymer of interest represented by the variant set and (ii) the property measured for all or the portion of the variants in the variant set. The variant set is redefined to comprise variants that include substitutions in the plurality of positions that are selected based on a function of the sequence-activity relationship.
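The loop described above (define a sequence space, measure a variant set, model the sequence-activity relationship, redefine the set) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the method of the abstract: the positions, residues, `toy_measure` assay, and `HIDDEN_CONTRIBUTIONS` values are all hypothetical, and the model used here is a simple mean-difference estimate chosen for brevity because the abstract does not name a model.

```python
import itertools

# Hypothetical output of the position-selection rules:
# position -> candidate residues at that position (illustrative values only).
SUBSTITUTIONS = {3: ["A", "V", "L"], 7: ["K", "R"], 12: ["D", "E", "N"]}

# Hypothetical stand-in for a laboratory measurement (not from the source).
HIDDEN_CONTRIBUTIONS = {(3, "V"): 1.0, (7, "R"): 0.5, (12, "E"): 2.0}


def sequence_space(subs):
    """Enumerate every variant as a dict mapping position -> residue."""
    positions = sorted(subs)
    combos = itertools.product(*(subs[p] for p in positions))
    return [dict(zip(positions, combo)) for combo in combos]


def toy_measure(variant):
    """Placeholder assay: sums the hidden per-substitution contributions."""
    return sum(HIDDEN_CONTRIBUTIONS.get(item, 0.0) for item in variant.items())


def estimate_effects(variants, measurements):
    """Mean property with a residue minus the mean without it, per position."""
    effects = {}
    for key in {item for v in variants for item in v.items()}:
        pos, res = key
        with_res = [m for v, m in zip(variants, measurements) if v[pos] == res]
        without = [m for v, m in zip(variants, measurements) if v[pos] != res]
        if with_res and without:
            effects[key] = sum(with_res) / len(with_res) - sum(without) / len(without)
    return effects


def redefine(subs, effects):
    """Keep, at each position, the residue with the highest estimated effect."""
    return {
        pos: [max(residues, key=lambda r: effects.get((pos, r), float("-inf")))]
        for pos, residues in subs.items()
    }


variant_set = sequence_space(SUBSTITUTIONS)  # here the whole space is measured
measured = [toy_measure(v) for v in variant_set]
next_round = redefine(SUBSTITUTIONS, estimate_effects(variant_set, measured))
```

In this toy case the whole space is small enough to measure exhaustively. A real variant set would normally be a small sample of the space, in which case a regression model that separates the contributions of co-occurring substitutions would likely be a better choice than a mean difference.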
Abstract:
Methods and systems for providing biological results in the form of systematically varied libraries of sequences or as data representing sequences or physical preparations of systematically varied libraries and/or selections from systematically varied libraries.
Abstract:
The present invention generally relates to methods of rapidly and efficiently searching biologically-related data space. More specifically, the invention includes methods of identifying bio-molecules with desired properties, or which are most suitable for acquiring such properties, from complex bio-molecule libraries or sets of such libraries. The invention also provides methods of modeling sequence-activity relationships. As many of the methods are computer-implemented, the invention additionally provides digital systems and software for performing these methods.
Abstract:
A method of determining a property that affects expression of polynucleotides in an expression system. A plurality of polynucleotides, each encoding a polypeptide sequence, is constructed. An amino acid is encoded a plurality of times in both a first and second polynucleotide in the plurality. The amino acid is encodable by a plurality of synonymous codons including a first codon. The first codon is in the first polynucleotide at a first frequency relative to other synonymous codons, and is in the second polynucleotide at a second frequency relative to other synonymous codons. The first and second frequencies are different. Each polynucleotide is individually expressed in the expression system to measure an expression property value of the polynucleotides, thereby determining a property that affects expression of polynucleotides. The property is an effect that a frequency of use of one or more codons has on expression of polynucleotides in the expression system.
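The "frequency relative to other synonymous codons" can be made concrete with a short calculation. The sketch below assumes in-frame DNA sequences and uses leucine as the example amino acid. The two sequences and the helper name `relative_codon_frequency` are hypothetical illustrations, not taken from the source.

```python
# Leucine's six synonymous codons in the standard genetic code (DNA alphabet).
LEUCINE_CODONS = {"CTT", "CTC", "CTA", "CTG", "TTA", "TTG"}


def relative_codon_frequency(dna, codon, synonymous):
    """Fraction of the synonymous-codon occurrences in `dna` that are `codon`."""
    # Split the in-frame sequence into codons, ignoring any trailing partial codon.
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    hits = [c for c in codons if c in synonymous]
    if not hits:
        raise ValueError("amino acid is not encoded in this polynucleotide")
    return hits.count(codon) / len(hits)


# Two hypothetical polynucleotides that both encode Met-Leu-Leu-Leu-Leu.
first = "ATGCTGCTGCTGTTA"
second = "ATGCTGTTATTATTA"
first_frequency = relative_codon_frequency(first, "CTG", LEUCINE_CODONS)
second_frequency = relative_codon_frequency(second, "CTG", LEUCINE_CODONS)
```

Both sequences encode the same polypeptide but use CTG at different relative frequencies, which is the kind of contrast the method relies on when the polynucleotides are expressed and compared.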
Abstract:
Methods and devices for more efficiently engineering diversity into recombinant polypeptides and/or nucleic acids are provided herein. For example, a variety of methods of selecting and/or assessing potential crossover sites in an amino acid sequence or a nucleotide sequence are provided, as well as the resulting chimeric product sequences. These methods include, e.g., consideration of structural, functional and/or statistical data in the selection and assessment of sequences and crossover sites for use in recombination.
Abstract:
A method of designing a polynucleotide sequence encoding a polypeptide sequence of a predetermined polypeptide is provided. A frequency lookup table corresponding to an expression system is obtained. The table comprises a plurality of sequence elements and a plurality of frequency ranges, each frequency range for a corresponding sequence element. Each frequency range is a range of frequencies with which a corresponding sequence element can occur in a polynucleotide. The polynucleotide sequence is defined using the frequency lookup table by determining, for each respective sequence element in the frequency lookup table, whether the respective sequence element encodes a portion of the polypeptide sequence. When the respective sequence element encodes a portion of the polypeptide sequence, the sequence element is incorporated into the polynucleotide at a frequency of occurrence that is within the frequency range specified for the respective sequence element in the lookup table. The polynucleotide sequence is then output.
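One way to picture the lookup-table design step is the sketch below. It assumes the sequence elements are codons and that each range is a percentage of that amino acid's occurrences. The table values, the helper names, and the midpoint-targeting allocation are hypothetical choices for illustration. The abstract does not specify how a frequency inside each range is chosen.

```python
from collections import Counter

# Hypothetical lookup table for an unnamed expression system:
# amino acid -> {codon: (min_percent, max_percent)} among synonymous codons.
FREQUENCY_TABLE = {
    "M": {"ATG": (100, 100)},
    "K": {"AAA": (60, 80), "AAG": (20, 40)},
    "L": {"CTG": (40, 60), "CTC": (20, 40), "TTA": (10, 30)},
}


def allocate_counts(ranges, n):
    """Split n occurrences across codons, aiming at the midpoint of each range."""
    weights = {codon: lo + hi for codon, (lo, hi) in ranges.items()}  # 2 x midpoint
    total = sum(weights.values())
    counts = {codon: n * w // total for codon, w in weights.items()}
    # Largest-remainder rounding so the counts always sum to n.
    by_remainder = sorted(weights, key=lambda c: n * weights[c] % total, reverse=True)
    for codon in by_remainder[: n - sum(counts.values())]:
        counts[codon] += 1
    return counts


def design_polynucleotide(protein, table):
    """Pick a codon for each residue so usage follows the allocated counts."""
    pools = {}
    for aa, n in Counter(protein).items():
        counts = allocate_counts(table[aa], n)
        pools[aa] = [codon for codon, k in counts.items() for _ in range(k)]
    return "".join(pools[aa].pop() for aa in protein)


def within_ranges(dna, protein, table):
    """Check that every used amino acid's codon percentages fall in range."""
    seen = Counter(dna[i:i + 3] for i in range(0, len(dna), 3))
    usage = Counter(protein)
    return all(
        lo * usage[aa] <= 100 * seen[codon] <= hi * usage[aa]
        for aa, ranges in table.items() if usage[aa]
        for codon, (lo, hi) in ranges.items()
    )


protein = "M" + "K" * 10 + "L" * 10  # hypothetical polypeptide
dna = design_polynucleotide(protein, FREQUENCY_TABLE)
```

For short polypeptides the rounded counts may fall outside a narrow range, so `within_ranges` is worth calling on the result rather than assuming success. The sketch also places identical codons in runs. A fuller design would likely shuffle or interleave them.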
Abstract:
Novel Sorangium polyketide synthases, domains thereof, and polynucleotides encoding them are provided. Chimeric polyketide synthases that include domains, or subsets of domains, patterned on the novel polyketide synthases are also provided. Methods to prepare polyketide combinatorial libraries are described, as are recombinant host cells in which polyketides are produced.