Methods and systems for predicting DNA accessibility in the pan-cancer genome

发明授权

US10467523B2 Methods and systems for predicting DNA accessibility in the pan-cancer genome 有权

请登陆查看更多内容

专利标题： Methods and systems for predicting DNA accessibility in the pan-cancer genome
申请号： US15818462

申请日： 2017-11-20
公开(公告)号： US10467523B2

公开(公告)日： 2019-11-05
发明人: Kamil Wnuk , Jeremi Sudol , Shahrooz Rabizadeh , Patrick Soon-Shiong , Christopher Szeto , Charles Vaske
申请人： NantOmics, LLC , Nant Holdings IP, LLC
申请人地址： US CA Culver City US CA Culver City
专利权人： Nant Holdings IP, LLC,NantOmics, LLP
当前专利权人： Nant Holdings IP, LLC,NantOmics, LLP
当前专利权人地址： US CA Culver City US CA Culver City
代理机构： Mauriel Kapouytian Woods LLP
代理商 Liang Huang; Andrew A. Noble
主分类号： G06N3/04
IPC分类号： G06N3/04 ; G16B40/00 ; G06N3/08 ; G06N7/00

Methods and systems for predicting DNA accessibility in the pan-cancer genome

摘要：

Techniques are provided for predicting DNA accessibility. DNase-seq data files and RNA-seq data files for a plurality of cell types are paired by assigning DNase-seq data files to RNA-seq data files that are at least within a same biotype. A neural network is configured to be trained using batches of the paired data files, where configuring the neural network comprises configuring convolutional layers to process a first input comprising DNA sequence data from a paired data file to generate a convolved output, and fully connected layers following the convolutional layers to concatenate the convolved output with a second input comprising gene expression levels derived from RNA-seq data from the paired data file and process the concatenation to generate a DNA accessibility prediction output. The trained neural network is used to predict DNA accessibility in a genomic sample input comprising RNA-seq data and whole genome sequencing for a new cell type.

公开/授权文献

US20180144261A1 METHODS AND SYSTEMS FOR PREDICTING DNA ACCESSIBILITY IN THE PAN-CANCER GENOME 公开/授权日：2018-05-24

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/04	..体系结构，例如，互连拓扑