SYSTEMS AND METHODS FOR VISUALIZING A PATTERN IN A DATASET

    Publication number: US20190332963A1

    Publication date: 2019-10-31

    Application number: US16442800

    Filing date: 2019-06-17

    Abstract: A visualization system comprising a persistent memory, storing a dataset, and a non-persistent memory implements a pattern visualizing method. The dataset contains discrete attribute values for each first entity of a first type in a plurality of first entities of the first type and discrete attribute values for each first entity of a second type in a plurality of first entities of the second type for each second entity in a plurality of second entities. The dataset is compressed by blocked compression and represents discrete attribute values in both compressed sparse row and column formats. The discrete attribute values are clustered to assign each second entity to a cluster in a plurality of clusters.
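
The abstract states that the dataset holds its discrete attribute values in both compressed sparse row and compressed sparse column formats. As a minimal sketch of what that dual representation might look like, the Python below builds both from a small dense matrix. The function names, the example matrix, and the choice of rows as second entities and columns as first entities are assumptions made for illustration; the blocked compression and the clustering step are not shown.

```python
def to_csr(dense):
    """Return (data, indices, indptr) for the non-zero entries, row by row."""
    data, indices, indptr = [], [], [0]
    for row in dense:
        for col, value in enumerate(row):
            if value != 0:
                data.append(value)    # the stored non-zero value
                indices.append(col)   # its column position within the row
        indptr.append(len(data))      # where the next row starts in `data`
    return data, indices, indptr


def to_csc(dense):
    """Compressed sparse column is the CSR form of the transposed matrix."""
    transposed = [list(column) for column in zip(*dense)]
    return to_csr(transposed)


def csr_row(csr, i):
    """Look up one row as {column: value} without scanning other rows."""
    data, indices, indptr = csr
    start, end = indptr[i], indptr[i + 1]
    return dict(zip(indices[start:end], data[start:end]))


# Hypothetical counts: 3 second entities (rows) by 4 first entities (columns).
counts = [
    [0, 3, 0, 1],
    [2, 0, 0, 0],
    [0, 0, 4, 0],
]

csr = to_csr(counts)  # fast access to all values of one row
csc = to_csc(counts)  # fast access to all values of one column
```

Keeping both layouts trades extra storage for fast access along either axis: the row form answers "all values for one second entity" and the column form answers "all values for one first entity" without a full scan.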

    SYSTEMS, METHODS, AND MEDIA FOR DE NOVO ASSEMBLY OF WHOLE GENOME SEQUENCE DATA

    Publication number: US20170235876A1

    Publication date: 2017-08-17

    Application number: US15242256

    Filing date: 2016-08-19

    CPC classification number: G16B30/00

    Abstract: Described are computer-implemented methods, systems, and media for de novo phased diploid assembly of nucleic acid sequence data generated from a nucleic acid sample of an individual utilizing nucleic acid tags to preserve long-range sequence context for the individual such that a subset of short-read sequence data derived from a common starting sequence shares a common tag. The phased diploid assembly is achieved without alignment to a reference sequence derived from organisms other than the individual. The methods, systems, and media described are computer-resource efficient, allowing scale-up.
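
The abstract relies on short reads that derive from a common starting sequence sharing a common tag. The sketch below illustrates only that grouping idea, assuming each read arrives as a hypothetical `(tag, sequence)` pair; the tag and sequence values are made up, and the assembly and phasing steps themselves are not represented.

```python
from collections import defaultdict


def group_reads_by_tag(reads):
    """Collect read sequences under the tag they carry, preserving input order."""
    groups = defaultdict(list)
    for tag, sequence in reads:
        groups[tag].append(sequence)
    return dict(groups)


# Hypothetical tagged reads: (tag, short-read sequence).
reads = [
    ("AAGT", "ACGTAC"),
    ("CCTA", "TTGACA"),
    ("AAGT", "GTACGG"),
]

groups = group_reads_by_tag(reads)
```

Reads that land in the same group can then be treated as sharing long-range context, which is the property the abstract says the tags preserve.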

    SYSTEMS AND METHODS FOR DETERMINING THE INTEGRITY OF TEST STRINGS WITH RESPECT TO A GROUND TRUTH STRING

    Publication number: US20210134393A1

    Publication date: 2021-05-06

    Application number: US16934994

    Filing date: 2020-07-21

    Abstract: Systems and methods for analyzing first and second strings against a ground truth string are provided. A construct representing a plurality of components is obtained, each component for a different portion of the truth string. The construct comprises a plurality of measurement string sampling pools each having an identifier and a corresponding plurality of measurement samplings corresponding to one or two of the components. Each sampling has the identifier and a portion of the first or second string. Samplings are assigned to first, second or third classes when coding a portion of the first string, second string, or both the first and second string. First and second positions are tested for sequence events by calculating a plurality of sequence event models using assumptions on the components having samplings encompassing the first and second positions and class assignments. These assumptions are updated using the calculated models and the models are recalculated.
