-
公开(公告)号:US20240013856A1
公开(公告)日:2024-01-11
申请号:US17874158
申请日:2022-07-26
申请人: Illumina, Inc.
IPC分类号: G16B20/00 , G16B40/00 , G16B50/00 , G16B40/20 , G16B30/00 , G06N3/047 , G06N3/048 , G06N3/084 , G06N3/04 , G06N3/08
CPC分类号: G16B20/00 , G16B40/00 , G16B50/00 , G16B40/20 , G16B30/00 , G06N3/047 , G06N3/048 , G06N3/084 , G06N3/04 , G06N3/08 , G06F18/24
摘要: The technology disclosed relates to splice site prediction and aberrant splicing detection. In particular, it relates to a splice site predictor that includes a convolutional neural network trained on training examples of donor splice sites, acceptor splice sites, and non-splicing sites. An input stage of the convolutional neural network feeds an input sequence of nucleotides for evaluation of target nucleotides in the input sequence. An output stage of the convolutional neural network translates analysis by the convolutional neural network into classification scores for likelihoods that each of the target nucleotides is a donor splice site, an acceptor splice site, and a non-splicing site.
-
公开(公告)号:US12119088B2
公开(公告)日:2024-10-15
申请号:US17899539
申请日:2022-08-30
申请人: ILLUMINA, INC.
IPC分类号: G06F16/907 , G06F18/21 , G06F18/214 , G06F18/23 , G06F18/23211 , G06F18/24 , G06F18/2415 , G06F18/2431 , G06N3/04 , G06N3/08 , G06N3/084 , G06N7/01 , G06V10/26 , G06V10/44 , G06V10/75 , G06V10/762 , G06V10/764 , G06V10/77 , G06V10/778 , G06V10/82 , G06V10/98 , G06V20/69 , G16B40/00 , G16B40/20 , G06N5/046 , G06V20/40
CPC分类号: G16B40/20 , G06F16/907 , G06F18/214 , G06F18/217 , G06F18/23 , G06F18/23211 , G06F18/24 , G06F18/2415 , G06F18/2431 , G06N3/04 , G06N3/08 , G06N3/084 , G06N7/01 , G06V10/267 , G06V10/454 , G06V10/751 , G06V10/763 , G06V10/764 , G06V10/7715 , G06V10/7784 , G06V10/82 , G06V10/993 , G06V20/69 , G16B40/00 , G06N5/046 , G06V20/47
摘要: A system, a method and a non-transitory computer readable storage medium for base calling are described. The base calling method includes processing through a neural network first image data comprising images of clusters and their surrounding background captured by a sequencing system for one or more sequencing cycles of a sequencing run. The base calling method further includes producing a base call for one or more of the clusters of the one or more sequencing cycles of the sequencing run.
-
公开(公告)号:US20240055072A1
公开(公告)日:2024-02-15
申请号:US18478763
申请日:2023-09-29
申请人: Illumina, Inc.
IPC分类号: G16B20/00 , G06N3/08 , G06N3/084 , G06N3/047 , G16B30/00 , G06N3/04 , G16B40/00 , G06N3/048 , G16B50/00 , G16B40/20
CPC分类号: G16B20/00 , G06N3/08 , G06N3/084 , G06N3/047 , G16B30/00 , G06N3/04 , G16B40/00 , G06N3/048 , G16B50/00 , G16B40/20 , G06F18/24
摘要: The technology disclosed relates to splice site prediction and aberrant splicing detection. In particular, it relates to a splice site predictor that includes a convolutional neural network trained on training examples of donor splice sites, acceptor splice sites, and non-splicing sites. An input stage of the convolutional neural network feeds an input sequence of nucleotides for evaluation of target nucleotides in the input sequence. An output stage of the convolutional neural network translates analysis by the convolutional neural network into classification scores for likelihoods that each of the target nucleotides is a donor splice site, an acceptor splice site, and a non-splicing site.
-
公开(公告)号:US11837324B2
公开(公告)日:2023-12-05
申请号:US16160980
申请日:2018-10-15
申请人: Illumina, Inc.
IPC分类号: G16B20/00 , G16B40/00 , G16B50/00 , G16B40/20 , G16B30/00 , G06N3/047 , G06N3/048 , G06N3/084 , G06N3/04 , G06N3/08 , G06F18/24
CPC分类号: G16B20/00 , G06N3/04 , G06N3/047 , G06N3/048 , G06N3/08 , G06N3/084 , G16B30/00 , G16B40/00 , G16B40/20 , G16B50/00 , G06F18/24
摘要: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
-
公开(公告)号:US11749380B2
公开(公告)日:2023-09-05
申请号:US17180542
申请日:2021-02-19
申请人: Illumina, Inc.
IPC分类号: G06N3/08 , G16B40/10 , G16B30/20 , C12Q1/6869
CPC分类号: G16B40/10 , G06N3/08 , G16B30/20 , C12Q1/6869
摘要: The technology disclosed relates to artificial intelligence-based base calling. The technology disclosed relates to accessing a progression of per-cycle analyte channel sets generated for sequencing cycles of a sequencing run, processing, through a neural network-based base caller (NNBC), windows of per-cycle analyte channel sets in the progression for the windows of sequencing cycles of the sequencing run such that the NNBC processes a subject window of per-cycle analyte channel sets in the progression for the subject window of sequencing cycles of the sequencing run and generates provisional base call predictions for three or more sequencing cycles in the subject window of sequencing cycles, from multiple windows in which a particular sequencing cycle appeared at different positions, using the NNBC to generate provisional base call predictions for the particular sequencing cycle, and determining a base call for the particular sequencing cycle based on the plurality of base call predictions.
-
公开(公告)号:US11488009B2
公开(公告)日:2022-11-01
申请号:US16160978
申请日:2018-10-15
申请人: Illumina, Inc.
摘要: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
-
公开(公告)号:US12106829B2
公开(公告)日:2024-10-01
申请号:US18352029
申请日:2023-07-13
申请人: Illumina, Inc.
IPC分类号: G06N3/08 , G16B30/20 , G16B40/10 , C12Q1/6869
CPC分类号: G16B40/10 , G06N3/08 , G16B30/20 , C12Q1/6869
摘要: The technology disclosed relates to artificial intelligence-based base calling. The technology disclosed relates to accessing a progression of per-cycle analyte channel sets generated for sequencing cycles of a sequencing run, processing, through a neural network-based base caller (NNBC), windows of per-cycle analyte channel sets in the progression for the windows of sequencing cycles of the sequencing run such that the NNBC processes a subject window of per-cycle analyte channel sets in the progression for the subject window of sequencing cycles of the sequencing run and generates provisional base call predictions for three or more sequencing cycles in the subject window of sequencing cycles, from multiple windows in which a particular sequencing cycle appeared at different positions, using the NNBC to generate provisional base call predictions for the particular sequencing cycle, and determining a base call for the particular sequencing cycle based on the plurality of base call predictions.
-
公开(公告)号:US20240055078A1
公开(公告)日:2024-02-15
申请号:US18352029
申请日:2023-07-13
申请人: Illumina, Inc.
CPC分类号: G16B40/10 , G06N3/08 , G16B30/20 , C12Q1/6869
摘要: The technology disclosed relates to artificial intelligence-based base calling. The technology disclosed relates to accessing a progression of per-cycle analyte channel sets generated for sequencing cycles of a sequencing run, processing, through a neural network-based base caller (NNBC), windows of per-cycle analyte channel sets in the progression for the windows of sequencing cycles of the sequencing run such that the NNBC processes a subject window of per-cycle analyte channel sets in the progression for the subject window of sequencing cycles of the sequencing run and generates provisional base call predictions for three or more sequencing cycles in the subject window of sequencing cycles, from multiple windows in which a particular sequencing cycle appeared at different positions, using the NNBC to generate provisional base call predictions for the particular sequencing cycle, and determining a base call for the particular sequencing cycle based on the plurality of base call predictions.
-
公开(公告)号:US11783917B2
公开(公告)日:2023-10-10
申请号:US16826126
申请日:2020-03-20
申请人: Illumina, Inc.
IPC分类号: G06V10/82 , G06K9/62 , G06N3/08 , G16B40/00 , G06N3/04 , G06F16/907 , G06N3/084 , G06N7/00 , G06V10/75 , G06N5/046
CPC分类号: G06K9/6218 , G06F16/907 , G06K9/628 , G06K9/6222 , G06K9/6232 , G06K9/6256 , G06K9/6262 , G06K9/6267 , G06K9/6277 , G06N3/04 , G06N3/08 , G06N3/084 , G06N7/005 , G06V10/751 , G06V10/82 , G16B40/00 , G06N5/046
摘要: The technology disclosed processes input data through a neural network and produces an alternative representation of the input data. The input data includes per-cycle image data for each of one or more sequencing cycles of a sequencing run. The per-cycle image data depicts intensity emissions of one or more analytes and their surrounding background captured at a respective sequencing cycle. The technology disclosed processes the alternative representation through an output layer and producing an output and base calls one or more of the analytes at one or more of the sequencing cycles based on the output.
-
公开(公告)号:US11676685B2
公开(公告)日:2023-06-13
申请号:US16826134
申请日:2020-03-20
申请人: Illumina, Inc.
IPC分类号: G06K9/00 , G16B40/20 , G06N3/08 , G16B40/00 , G06N3/04 , G06F16/907 , G06N3/084 , G06V10/82 , G06F18/23 , G06F18/24 , G06F18/213 , G06F18/214 , G06F18/21 , G06F18/2415 , G06F18/2431 , G06F18/23211 , G06N7/01 , G06V10/762 , G06V10/764 , G06V10/77 , G06V10/778 , G06V10/44 , G06V10/98 , G06N5/046
CPC分类号: G16B40/20 , G06F16/907 , G06F18/213 , G06F18/214 , G06F18/217 , G06F18/23 , G06F18/23211 , G06F18/24 , G06F18/2415 , G06F18/2431 , G06N3/04 , G06N3/08 , G06N3/084 , G06N7/01 , G06V10/454 , G06V10/763 , G06V10/764 , G06V10/7715 , G06V10/7784 , G06V10/82 , G06V10/993 , G16B40/00 , G06N5/046
摘要: The technology disclosed assigns quality scores to bases called by a neural network-based base caller by (i) quantizing classification scores of predicted base calls produced by the neural network-based base caller in response to processing training data during training, (ii) selecting a set of quantized classification scores, (iii) for each quantized classification score in the set, determining a base calling error rate by comparing its predicted base calls to corresponding ground truth base calls, (iv) determining a fit between the quantized classification scores and their base calling error rates, and (v) correlating the quality scores to the quantized classification scores based on the fit.
-
-
-
-
-
-
-
-
-