-
公开(公告)号:US12033405B1
公开(公告)日:2024-07-09
申请号:US18297342
申请日:2023-04-07
Applicant: X Development LLC
Inventor: Vadim Tschernezki , Lance Co Ting Keh , Hongxu Ma , Allen Richard Zhao , Jie Jacquot
IPC: G06V20/64 , G06F18/214 , G06F18/2413 , G06N3/08 , G06N20/00
CPC classification number: G06V20/64 , G06F18/214 , G06F18/2413 , G06N3/08 , G06N20/00
Abstract: Methods, systems, and apparatuses, including computer programs encoded on a computer storage medium, for machine learning classification based on separate processing of multiple views. In some implementations, a system obtains image data for multiple images showing different views of an object. A machine learning model is used to generate a separate output based on each the multiple images individually. The outputs for the respective images are combined to generate a combined output. A predicted characteristic of the object is determined based on the combined output. An indication of the predicted characteristic of the object is provided.
-
2.
公开(公告)号:US20220380753A1
公开(公告)日:2022-12-01
申请号:US17333272
申请日:2021-05-28
Applicant: X Development LLC
Inventor: Ivan Grubisic , Ray Nagatani , Lance Co Ting Keh , Andrew Weitz , Kenneth Jung , Ryan Poplin
Abstract: The present disclosure relates to in vitro experiments and in silico computation and machine-learning based techniques to iteratively improve a process for identifying binders that can bind any given molecular target. Particularly, aspects of the present disclosure are directed to obtaining sequence data for aptamers that bind to a target, where the sequence data has a first signal to noise ratio, generating, by a search process, a first set of aptamer sequences derived from the sequence data, obtaining subsequent sequence data for subsequent aptamers that bind to the target, where the subsequent aptamers includes aptamers synthesized from the first set of aptamer sequences, and the subsequent sequence data has a second signal to noise ratio greater than the first signal to noise ratio, generating, by a linear machine-learning model, a second set of aptamer sequences derived from the subsequent sequence data, and outputting the second set of aptamer sequences.
-
公开(公告)号:US11353394B2
公开(公告)日:2022-06-07
申请号:US16948760
申请日:2020-09-30
Applicant: X Development LLC
Inventor: Gearoid Murphy , Artem Goncharuk , Lance Co Ting Keh , Diosdado Rey Banatao , Sujit Sanjeev
Abstract: The present disclosure relates to techniques for deformulating the spectra of arbitrary compound formulations such as polymer formulations into their chemical components. Particularly, aspects of the present disclosure are directed to obtaining an initial set of spectra for a plurality of samples comprising pure samples and composite samples, constructing a basis set of spectra for a plurality of pure samples based on the initial set of spectra, and providing or outputting the basis set of spectrum. The basis set of spectra is constructed in an iterative process that attempts to decompose, using a decomposition algorithm or model, the spectrum from the initial set of spectra in order to differentiate the pure samples from the composite samples. The basis set of spectra may then be used to deduce the composition of a material from a spectrogram.
-
公开(公告)号:US20240086423A1
公开(公告)日:2024-03-14
申请号:US17898236
申请日:2022-08-29
Applicant: X Development LLC
Inventor: Lance Co Ting Keh , Ivan Grubisic , Ryan Poplin , Jon Deaton , Hayley Weir
IPC: G06F16/28 , G06N3/0455 , G06N3/08
CPC classification number: G06F16/285 , G06N3/0455 , G06N3/08
Abstract: Some techniques relate to projecting aptamer representations into an embedding space and clustering the representations. A cluster-specific binding metric can be defined for each cluster based on aptamer-specific binding metrics of aptamers associated with the cluster. A subset of the clusters can be selected based on the cluster-specific binding metrics. Identifications of aptamers assigned to the subset of clusters can then be output.
-
公开(公告)号:US11651602B1
公开(公告)日:2023-05-16
申请号:US17060002
申请日:2020-09-30
Applicant: X Development LLC
Inventor: Vadim Tschernezki , Lance Co Ting Keh , Hongxu Ma , Allen Richard Zhao , Jie Jacquot
CPC classification number: G06V20/64 , G06K9/627 , G06K9/6256 , G06N3/08 , G06N20/00
Abstract: Methods, systems, and apparatuses, including computer programs encoded on a computer storage medium, for machine learning classification based on separate processing of multiple views. In some implementations, a system obtains image data for multiple images showing different views of an object. A machine learning model is used to generate a separate output based on each the multiple images individually. The outputs for the respective images are combined to generate a combined output. A predicted characteristic of the object is determined based on the combined output. An indication of the predicted characteristic of the object is provided.
-
公开(公告)号:US20230101523A1
公开(公告)日:2023-03-30
申请号:US17936181
申请日:2022-09-28
Applicant: X Development LLC
Inventor: Ryan Poplin , Lance Co Ting Keh , Ivan Grubisic , Ray Nagatani
Abstract: The present disclosure relates to in vitro experiments and in silico computation and machine-learning based techniques to iteratively improve a process for identifying binders that can bind a target. Particularly, aspects of the present disclosure are directed to obtaining initial sequence data, identifying, by a first machine-learning model having model parameters learned from the initial sequence data, a first set of aptamer sequences, obtaining, using an in vitro binding selection process, subsequent sequence data including sequences from the first set of aptamer sequences, identifying, by a second machine-learning model having model parameters learned from the subsequent sequence data, a second set of aptamer sequences, determining, using one or more in vitro assays, analytical data for aptamers synthesized from the second set of aptamer sequences, and identifying a final set of aptamer sequences from the second set of aptamer sequences based on the analytical data associated with each aptamer.
-
7.
公开(公告)号:US20220383981A1
公开(公告)日:2022-12-01
申请号:US17333287
申请日:2021-05-28
Applicant: X Development LLC
Inventor: Ivan Grubisic , Ray Nagatani , Lance Co Ting Keh , Andrew Weitz , Kenneth Jung , Ryan Poplin
Abstract: The present disclosure relates to in vitro experiments and in silico computation and machine-learning based techniques to iteratively improve a process for identifying binders that can bind any given molecular target. Particularly, aspects of the present disclosure are directed to obtaining initial sequence data for aptamers that bind to a target, measuring a first signal to noise ratio within the initial sequence data, provisioning, based on the first signal to noise ratio, a first machine-learning system, generating, by the first machine-learning system, a first set of aptamer sequences, obtaining subsequent sequence data for aptamers that bind to the target, measuring a second signal to noise ratio within the subsequent sequence data, provisioning, based on the second signal to noise ratio, a second machine-learning system, generating, by the second machine-learning system, a second set of aptamer sequences, and outputting the second set of aptamer sequences.
-
公开(公告)号:US20220101277A1
公开(公告)日:2022-03-31
申请号:US17033512
申请日:2020-09-25
Applicant: X Development LLC
Inventor: Diosdado Rey Banatao , Karen R. Davis , Neil Treat , Artem Goncharuk , Charles Spirakis , Sujit Sanjeev , Gearoid Murphy , Lance Co Ting Keh , Rebecca Radkoff , Taoran Dai
Abstract: Systems and methods for managing chemical recycling processes include accessing characterization data of a feedstock, the characterization data comprising one or more spectra collected according to one or more spectroscopic methods. The methods include predicting, using the characterization data, a set of constituent materials included in the feedstock. The methods include predicting a material composition of the feedstock using the predicted set of constituent materials. The methods include identifying, at least in part using the predicted material composition of the feedstock, one or more target products. The methods include generating a set of chemical reaction schemas enabling a conversion of at least part of the feedstock into the one or more target products. The methods also include storing identifications of the material composition of the feedstock, the one or more target products, and the set of chemical reaction schemas in a data store.
-
公开(公告)号:US11630057B2
公开(公告)日:2023-04-18
申请号:US17658765
申请日:2022-04-11
Applicant: X Development LLC
Inventor: Gearoid Murphy , Artem Goncharuk , Lance Co Ting Keh , Diosdado Rey Banatao , Sujit Sanjeev
Abstract: The present disclosure relates to techniques for deformulating the spectra of arbitrary compound formulations such as polymer formulations into their chemical components. Particularly, aspects of the present disclosure are directed to obtaining an initial set of spectra for a plurality of samples comprising pure samples and composite samples, constructing a basis set of spectra for a plurality of pure samples based on the initial set of spectra, and providing or outputting the basis set of spectrum. The basis set of spectra is constructed in an iterative process that attempts to decompose, using a decomposition algorithm or model, the spectrum from the initial set of spectra in order to differentiate the pure samples from the composite samples. The basis set of spectra may then be used to deduce the composition of a material from a spectrogram.
-
公开(公告)号:US20230081439A1
公开(公告)日:2023-03-16
申请号:US17471903
申请日:2021-09-10
Applicant: X Development LLC
Inventor: Ryan Poplin , Ivan Grubisic , Lance Co Ting Keh , Ray Nagatani
Abstract: A latent space is defined to represent sequences using training data and a machine-learning model. The training data identifies sequences of molecules and binding-approximation metrics that characterizes whether the molecules bind to a particular target and/or that approximate an extent to which the molecule is more likely to bind to the particular target than some other molecules. Supplemental training data is accessed that identifies other sequences of other molecules and binding affinity scores quantifying binding strengths between the molecules and the particular target. Projections of representations of the other sequences in the supplemental training data are projected in the latent space using the binding affinity scores. An area or position of interest within the latent space is identified based on the projections. A particular sequence represented within or at the area or position of interest or at the position of interest is identified for downstream processing.
-
-
-
-
-
-
-
-
-