MACHINE-LEARNING MODELS FOR SELECTING OLIGONUCLEOTIDE PROBES FOR ARRAY TECHNOLOGIES

    公开(公告)号:US20230340571A1

    公开(公告)日:2023-10-26

    申请号:US18307482

    申请日:2023-04-26

    摘要: This disclosure describes methods, non-transitory computer readable media, and systems that can use a machine-learning model to classify or predict a probability of an oligonucleotide probe yielding an accurate genotype call or hybridizing with a target oligonucleotide—based on the oligonucleotide probe's nucleotide-sequence composition. To intelligently identify oligonucleotide probes that are more likely to yield accurate downstream genotyping—or more likely to successfully hybridize with target oligonucleotides—some embodiments of the disclosed machine-learning model include customized layers trained to detect motifs or other nucleotide-sequence patterns that correlate with favorable or unfavorable probe accuracy. By intelligently processing the nucleotide sequences of candidate oligonucleotide probes before implementing a microarray for a particular target oligonucleotide, the disclosed system can identify oligonucleotide probes with better genotyping accuracy (or better binding accuracy) than existing microarray systems for use in a microarray.