-
公开(公告)号:US11615324B2
公开(公告)日:2023-03-28
申请号:US17174677
申请日:2021-02-12
申请人: Ro5 Inc.
发明人: Aurimas Pabrinkis , Alwin Bucher , Gintautas Kamuntavi{hacek over (c)}ius , Alvaro Prat , Orestis Bastas , {hacek over (Z)}ygimantas Jo{hacek over (c)}ys , Roy Tal , Charles Dazler Knuff
IPC分类号: G06N3/08 , G06N5/022 , G06K9/62 , G06F16/951
摘要: A system and method for de novo drug discovery using machine learning algorithms. In a preferred embodiment, de novo drug discovery is performed via data enrichment and interpolation/perturbation of molecule models within the latent space, wherein molecules with certain characteristics can be generated and tested in relation to one or more targeted receptors. Filtering methods may be used to determine active novel molecules by filtering out non-active molecules and contain activity predictors to better navigate the molecule-receptor domain. The system may comprise neural networks trained to reconstruct known ligand-receptors pairs and from the reconstruction model interpolate and perturb the model such that novel and unique molecules are discovered. A second preferred embodiment trains a variational autoencoder coupled with a bioactivity model to predict molecules exhibiting a range of desired properties.
-
2.
公开(公告)号:US20220358373A1
公开(公告)日:2022-11-10
申请号:US17865834
申请日:2022-07-15
申请人: Ro5 Inc.
发明人: Alwin Bucher , Gintautas Kamuntavicius , Alvaro Prat , Orestis Bastas , Zygimantas Jocys , Roy Tal
摘要: A system and method for optimizing the latent space in generative machine learning models, and applications of the optimizations for use in the de novo generation of molecules for both ligand-based and pocket-based generation. The ligand-based optimizations comprise a tunable reward system based on a multi-property model and further define new measurable metrics: molecular novelty and uniqueness. The pocket-based optimizations comprise an initial multi-property optimization followed up by either a seed-based optimization or a relaxed-based optimization.
-
公开(公告)号:US20230325687A1
公开(公告)日:2023-10-12
申请号:US18189861
申请日:2023-03-24
申请人: Ro5 Inc.
发明人: Aurimas Pabrinkis , Alwin Bucher , Gintautas Kamuntavicius , Alvaro Prat , Orestis Bastas , Zygimantas Jocys , Roy Tal , Charles Dazler Knuff
IPC分类号: G06N5/022 , G06F18/22 , G06F16/951 , G06N3/08
CPC分类号: G06N5/022 , G06F16/951 , G06F18/22 , G06N3/08
摘要: A system and method for de novo drug discovery using machine learning algorithms. In a preferred embodiment, de novo drug discovery is performed via data enrichment and interpolation/perturbation of molecule models within the latent space, wherein molecules with certain characteristics can be generated and tested in relation to one or more targeted receptors. Filtering methods may be used to determine active novel molecules by filtering out non-active molecules and contain activity predictors to better navigate the molecule-receptor domain. The system may comprise neural networks trained to reconstruct known ligand-receptors pairs and from the reconstruction model interpolate and perturb the model such that novel and unique molecules are discovered. A second preferred embodiment trains a variational autoencoder coupled with a bioactivity model to predict molecules exhibiting a range of desired properties.
-
公开(公告)号:US20230290114A1
公开(公告)日:2023-09-14
申请号:US18161879
申请日:2023-01-30
申请人: Ro5 Inc.
发明人: Alvaro Prat , Alwin Bucher , Roy Tal
IPC分类号: G06V10/74 , G06N5/022 , G06F16/951 , G06N3/08 , G16B15/00 , G16B40/00 , G16B45/00 , G16B50/10 , G06F18/22
CPC分类号: G06V10/761 , G06N5/022 , G06F16/951 , G06N3/08 , G16B15/00 , G16B40/00 , G16B45/00 , G16B50/10 , G06F18/22 , G06V30/40
摘要: A system and method for pharmacophore-conditioned generation of molecules. The system and method modifies a conditional variational autoencoder (CVAE) such that the latent space in generation of a molecule is not conditioned on the pharmacophore space of the molecule. This allows for generation of pharmacophore descriptors independently from the conditional on which CVAE has been trained, removing a substantial impediment to the use of CVAEs for exploration of pharmacophore descriptors of a molecule.
-
5.
公开(公告)号:US11610139B2
公开(公告)日:2023-03-21
申请号:US17865834
申请日:2022-07-15
申请人: Ro5 Inc.
发明人: Alwin Bucher , Gintautas Kamuntavicius , Alvaro Prat , Orestis Bastas , Zygimantas Jocys , Roy Tal
IPC分类号: G06N3/08 , G06N5/022 , G06F16/951 , G06K9/62 , G16B15/00 , G16B40/00 , G16B45/00 , G16B50/10
摘要: A system and method for optimizing the latent space in generative machine learning models, and applications of the optimizations for use in the de novo generation of molecules for both ligand-based and pocket-based generation. The ligand-based optimizations comprise a tunable reward system based on a multi-property model and further define new measurable metrics: molecular novelty and uniqueness. The pocket-based optimizations comprise an initial multi-property optimization followed up by either a seed-based optimization or a relaxed-based optimization.
-
公开(公告)号:US20220188655A1
公开(公告)日:2022-06-16
申请号:US17398190
申请日:2021-08-10
申请人: Ro5 Inc.
发明人: Gintautas Kamuntavicius , Aurimas Pabrinkis , Orestis Bastas , Alwin Bucher , Alvaro Prat , Mikhail Demtchenko , Sam Christian Macer , Zygimantas Jocys , Roy Tal , Charles Dazler Knuff
摘要: A system and method that takes in a data set comprising molecular structure data and properties of interest, e.g., ADMET, EC50, IC50, etc., and determines the substructures that cause or do not cause the property of interest. The substructures may then be used to filter out potentially harmful new proposed/generated molecules or create a new data set of known active/inactive substructures of a property of interest that may fulfill other obligations. The system comprises a substructure extraction module which further comprises a scaffold extraction module and a comparison module. A scaffold extraction module clusters, searches, and extracts substructures in question while a comparison module compares the bioactivity of each molecule with and without each substructure in question to determine the substructures effect on the property of interest.
-
公开(公告)号:US11354582B1
公开(公告)日:2022-06-07
申请号:US17540142
申请日:2021-12-01
申请人: Ro5 Inc.
IPC分类号: G06G7/48 , G06G7/58 , G06N5/02 , G06F16/951 , G06K9/62 , G06N3/08 , G16B15/00 , G16B40/00 , G16B45/00 , G16B50/10
摘要: A system and method for automated retrosynthesis which can reliably identify valid and practical precursors and reaction pathways. The methodology involves a k-beam recursive process wherein at each stage of recursion, retrosynthesis is performed using a library of molecule disconnection rules to identify possible precursor sets, validation of the top k precursor sets is performed using a transformer-based forward reaction prediction scoring system, the best candidate of the top k precursor sets is selected, and a database is searched to determine whether the precursors are commercially available. The recursion process is repeated until a valid chain of chemical reactions is found wherein all precursors necessary to synthesize the target molecule are found to be commercially available.
-
8.
公开(公告)号:US11256995B1
公开(公告)日:2022-02-22
申请号:US17237458
申请日:2021-04-22
申请人: Ro5 Inc.
发明人: Alwin Bucher , Alvaro Prat , Orestis Bastas , Aurimas Pabrinkis , Gintautas Kamuntavi{hacek over (c)}ius , Mikhail Demtchenko , Sam Christian Macer , Zeyu Yang , Cooper Stergis Jamieson , {hacek over (Z)}ygimantas Jo{hacek over (c)}ys , Roy Tal , Charles Dazler Knuff
IPC分类号: G06N5/00 , G06N5/02 , G06N3/08 , G06K9/62 , G06F16/951
摘要: A system and method that predicts whether a given protein-ligand pair is active or inactive, the ground-truth protein-ligand complex crystalline-structure similarity, and an associated bioactivity value. The system and method further produce 3-D visualizations of previously unknown protein-ligand pairs that show directly the importance assigned to protein-ligand interactions, the positive/negative-ness of the saliencies, and magnitude. Furthermore, the system and method make enhancements in the art by accurately predicting protein-ligand pair bioactivity from decoupled models, removing the need for docking simulations, as well as restricting attention of the machine learning between protein and ligand atoms only.
-
公开(公告)号:US20220351053A1
公开(公告)日:2022-11-03
申请号:US17845813
申请日:2022-06-21
申请人: Ro5 Inc.
发明人: Povilas Norvaisas , Roy Tal , Zygimantas Jocys , Charles Dazler Knuff , Alvaro Prat , Gintautas Kamuntavicius , Hisham Abdel Aty , Orestis Bastas , Nikola Nonkovic
摘要: A system and method for feedback-driven automated drug discovery which combines machine learning algorithms with automated research facilities and equipment to make the process of drug discovery more data driven and less reliant on intuitive decision-making by experts. In an embodiment, the system comprises automated research equipment configured to perform automated assays of chemical compounds, a data platform comprising drug databases and an analysis engine, a bioactivity and de novo modules operating on the data platform, and a retrosynthesis system operating on the drug discovery platform, all configured in a feedback loop that drives drug discovery by using the outcome of assays performed on the automated research equipment to feed the bioactivity module and retrosynthesis systems, which identify new molecules for testing by the automated research equipment.
-
10.
公开(公告)号:US20220284316A1
公开(公告)日:2022-09-08
申请号:US17828533
申请日:2022-05-31
申请人: Ro5 Inc.
发明人: Alwin Bucher , Alvaro Prat , Orestis Bastas , Gintautas Kamuntavicius , Zeyu Yang , Charles Dazler Knuff , Zygimantas Jocys , Roy Tal , Hisham Abdel Aty
摘要: A system and method for accelerating the calculations of free energy differences by automating FEP-path-decision-making and replacing the standard series of alchemical interpolations typically created by molecular dynamic (MD) simulations with voxelated interpolated states. A novel machine learning approach comprising a restricted variational autoencoder (ResVAE) is used which can reduce the computational-cost associated with interpolations by restricting the dimensions of a molecular latent space. The ResVAE generates a model based on flow-based transformations of a 3D-VAE latent point that is trained to maximize the log-likelihood of MD samples which enables the model to compute transformations more efficiently between molecules and also handle deletions of atoms more efficiently during iterative FEP calculation steps.
-
-
-
-
-
-
-
-
-