EXPERIMENT AND MACHINE-LEARNING TECHNIQUES TO IDENTIFY AND GENERATE HIGH AFFINITY BINDERS

    公开(公告)号:US20220380753A1

    公开(公告)日:2022-12-01

    申请号:US17333272

    申请日:2021-05-28

    申请人: X Development LLC

    IPC分类号: C12N15/10 G16B35/00 G16B15/30

    摘要: The present disclosure relates to in vitro experiments and in silico computation and machine-learning based techniques to iteratively improve a process for identifying binders that can bind any given molecular target. Particularly, aspects of the present disclosure are directed to obtaining sequence data for aptamers that bind to a target, where the sequence data has a first signal to noise ratio, generating, by a search process, a first set of aptamer sequences derived from the sequence data, obtaining subsequent sequence data for subsequent aptamers that bind to the target, where the subsequent aptamers includes aptamers synthesized from the first set of aptamer sequences, and the subsequent sequence data has a second signal to noise ratio greater than the first signal to noise ratio, generating, by a linear machine-learning model, a second set of aptamer sequences derived from the subsequent sequence data, and outputting the second set of aptamer sequences.

    Systems and methods for artificial intelligence-guided biomolecule design and assessment

    公开(公告)号:US11450407B1

    公开(公告)日:2022-09-20

    申请号:US17384104

    申请日:2021-07-23

    申请人: Pythia Labs, Inc.

    IPC分类号: G16B15/20 G16B35/00 G16B15/30

    摘要: Described herein are systems and methods for designing and testing custom biologic molecules in silico which are useful, for example, for the treatment, prevention, and diagnosis of disease. In particular, in certain embodiments, the biomolecule engineering technologies described herein employ artificial intelligence (AI) software modules to accurately predict performance of candidate biomolecules and/or portions thereof with respect to particular design criteria. In certain embodiments, the AI-powered modules described herein determine performance scores with respect to design criteria such as binding to a particular target. AI-computed performance scores may, for example, be used as objective functions for computer implemented optimization routines that efficiently search a landscape of potential protein backbone orientations and binding interface amino-acid sequences. By virtue of their modular design, AI-powered scoring modules can be used separately, or in combination, such as in a pipeline approach where different structural features of a custom biologic are optimized in succession.

    SAPOSIN LIPOPROTEIN PARTICLES AND LIBRARIES FROM CRUDE MEMBRANES

    公开(公告)号:US20220270707A1

    公开(公告)日:2022-08-25

    申请号:US17739029

    申请日:2022-05-06

    IPC分类号: G16B15/30 G16B35/00 G01N33/68

    摘要: The invention is directed to a method for studying or identifying a biologically active agent that binds to a membrane protein, the method comprising: (i) obtaining one or more 2D and/or 3D structures of a saposin lipoprotein particle comprising the membrane protein, (ii) modeling the binding of said biologically active agent to said membrane protein present in said saposin lipoprotein particle using said one or more 2D and/or 3D structures and/or resolving the binding sites and interactions between said biologically active agent and said membrane protein in the 2D and/or 3D structure.

    Set membership testers for aligning nucleic acid samples

    公开(公告)号:US11335437B2

    公开(公告)日:2022-05-17

    申请号:US15809908

    申请日:2017-11-10

    摘要: Disclosed are methods and tools for rapidly aligning reads to a reference sequence. These methods and tools employ Bloom filters or similar set membership testers to perform the alignment. The reads may be short sequences of nucleic acids or other biological molecules and the reference sequences may be sequences of genomes, chromosomes, etc. The Bloom filters include a collection of hash functions, a bit array, and associated logic for applying reads to the filter. Each filter, and there may be multiple of these used in a particular application, is used to determine whether an applied read is present in a reference sequence. Each Bloom filter is associated with a single reference sequence such as the sequence of a particular chromosome. In one example, chromosomal abundance is determined by aligning reads from a sequencer to multiple chromosomes, each having an associated Bloom filter or other set membership tester.

    Methods and systems for genetic analysis

    公开(公告)号:US11299783B2

    公开(公告)日:2022-04-12

    申请号:US17235776

    申请日:2021-04-20

    申请人: Personalis, Inc.

    摘要: This disclosure provides systems and methods for sample processing and data analysis. Sample processing may include nucleic acid sample processing and subsequent sequencing. Some or all of a nucleic acid sample may be sequenced to provide sequence information, which may be stored or otherwise maintained in an electronic storage location. The sequence information may be analyzed with the aid of a computer processor, and the analyzed sequence information may be stored in an electronic storage location that may include a pool or collection of sequence information and analyzed sequence information generated from the nucleic acid sample. Methods and systems of the present disclosure can be used, for example, for the analysis of a nucleic acid sample, for producing one or more libraries, and for producing biomedical reports. Methods and systems of the disclosure can aid in the diagnosis, monitoring, treatment, and prevention of one or more diseases and conditions.