APPARATUSES, SYSTEMS, AND METHODS FOR EXTRACTING MEANING FROM DNA SEQUENCE DATA USING NATURAL LANGUAGE PROCESSING (NLP)

    Publication No.: US20220139498A1

    Publication Date: 2022-05-05

    Application No.: US17088734

    Filing Date: 2020-11-04

    Abstract: Apparatuses, systems, and methods are provided that may analyze deoxyribonucleic acid (DNA) sequence data using a natural language processing (NLP) model to, for example, identify genetic elements such as known and/or novel cis-regulatory elements (e.g., known and/or putative novel drought-responsive cis-regulatory elements (DREs)). Apparatuses, systems, and methods are also provided that may identify transcriptional regulators (e.g., upstream transcriptional regulators of a putative novel DRE) based on NLP model data and expression genome-wide association study (eGWAS) data. Apparatuses, systems, and methods are also provided that may verify putative novel cis-regulatory elements based on a comparison of NLP model output data and other model output data.
