SYSTEMS AND METHODS FOR AUTOMATING RNA EXPRESSION CALLS IN A CANCER PREDICTION PIPELINE

    公开(公告)号:US20210272649A1

    公开(公告)日:2021-09-02

    申请号:US17324949

    申请日:2021-05-19

    申请人: Tempus Labs, Inc.

    摘要: Systems and methods are provided for performing quality control analysis. The method obtains, in electronic form, a batch dataset comprising, for each respective sample in a batch of samples, a corresponding plurality of sequence reads derived from the respective sample by targeted or whole transcriptome RNA sequencing and corresponding metadata for the respective sample. The method determines for the batch dataset a cohort-matched reference batch, where the cohort-matched reference batch is balanced for tissue site, tumor purity, cancer type, sequencer identity, or date sequenced. The method performs one or more global batch quality control tests on the batch dataset using at least the cohort-matched reference batch. The method removes respective samples from the batch dataset that fail any one of the one or more global batch quality control tests or flagging for manual inspection respective samples that fail any one of the one or more global batch quality control tests.

    METHODS OF NORMALIZING AND CORRECTING RNA EXPRESSION DATA

    公开(公告)号:US20200098448A1

    公开(公告)日:2020-03-26

    申请号:US16581706

    申请日:2019-09-24

    申请人: TEMPUS LABS, INC.

    IPC分类号: G16B30/00 G16B5/00 G06F16/215

    摘要: A platform to perform normalization and correction on gene expression datasets and combines different datasets into a standard dataset using a framework configured to continuously incorporate new gene expression data. The framework determines a series of conversion factors that are used to on-board new gene expression datasets, such as unpaired datasets, where these conversion factors are able to correct for variations in data type, variations in gene expressions, and variations in collection systems.