Distributed data analytics
    1.
    发明授权

    公开(公告)号:US10706970B1

    公开(公告)日:2020-07-07

    申请号:US15719231

    申请日:2017-09-28

    摘要: An apparatus in one embodiment comprises a distributed data processing system in which multiple processing devices communicate with one another over at least one network. The distributed data processing system is configured to obtain reads of biological samples of respective microbiomes, with each of the biological samples containing genomic material from a plurality of distinct microorganisms of its corresponding one of the microbiomes, and to perform distributed data analytics to detect a disease, infection or contamination that involves genomic material from multiple ones of the distinct microorganisms in one or more of the microbiomes. Performing distributed data analytics illustratively comprises performing local analytics in respective ones of a plurality of data zones, and performing global analytics utilizing results of the local analytics performed in the respective data zones. Each of the data zones may comprise, for example, one or more sequencing centers utilized to generate a corresponding subset of the reads within that data zone.

    Distributed data analytics
    2.
    发明授权

    公开(公告)号:US11749412B2

    公开(公告)日:2023-09-05

    申请号:US16921303

    申请日:2020-07-06

    摘要: An apparatus in one embodiment comprises a distributed data processing system in which multiple processing devices communicate with one another over at least one network. The distributed data processing system is configured to obtain reads of biological samples of respective sample sources, with each of the biological samples containing genomic material from a plurality of distinct microorganisms within an environment of a corresponding one of the sample sources, and to perform distributed data analytics to characterize an actual or potential outbreak of at least one of a disease, an infection and a contamination that involves genomic material from multiple ones of the distinct microorganisms in one or more of the sample sources. Performing distributed data analytics illustratively comprises performing local analytics in respective ones of a plurality of data zones, and performing global analytics utilizing results of the local analytics performed in the respective data zones.

    DISTRIBUTED DATA ANALYTICS
    3.
    发明申请

    公开(公告)号:US20200335223A1

    公开(公告)日:2020-10-22

    申请号:US16921303

    申请日:2020-07-06

    摘要: An apparatus in one embodiment comprises a distributed data processing system in which multiple processing devices communicate with one another over at least one network. The distributed data processing system is configured to obtain reads of biological samples of respective sample sources, with each of the biological samples containing genomic material from a plurality of distinct microorganisms within an environment of a corresponding one of the sample sources, and to perform distributed data analytics to characterize an actual or potential outbreak of at least one of a disease, an infection and a contamination that involves genomic material from multiple ones of the distinct microorganisms in one or more of the sample sources. Performing distributed data analytics illustratively comprises performing local analytics in respective ones of a plurality of data zones, and performing global analytics utilizing results of the local analytics performed in the respective data zones.

    Distributed data analytics
    4.
    发明授权

    公开(公告)号:US11854707B2

    公开(公告)日:2023-12-26

    申请号:US17708534

    申请日:2022-03-30

    摘要: An apparatus in one embodiment comprises a distributed data processing system in which multiple processing devices communicate with one another over at least one network. The distributed data processing system is configured to obtain reads of biological samples of respective sample sources, with each of the biological samples containing genomic material from a plurality of distinct microorganisms within an environment of a corresponding one of the sample sources, and to perform distributed data analytics to provide surveillance functionality characterizing at least one of a disease, an infection and a contamination as involving genomic material from multiple ones of the sample source. Performing distributed data analytics illustratively comprises performing local analytics in respective ones of a plurality of data zones, and performing global analytics utilizing results of the local analytics performed in the respective data zones.

    DISTRIBUTED DATA ANALYTICS
    5.
    发明申请

    公开(公告)号:US20220223296A1

    公开(公告)日:2022-07-14

    申请号:US17708534

    申请日:2022-03-30

    摘要: An apparatus in one embodiment comprises a distributed data processing system in which multiple processing devices communicate with one another over at least one network. The distributed data processing system is configured to obtain reads of biological samples of respective sample sources, with each of the biological samples containing genomic material from a plurality of distinct microorganisms within an environment of a corresponding one of the sample sources, and to perform distributed data analytics to provide surveillance functionality characterizing at least one of a disease, an infection and a contamination as involving genomic material from multiple ones of the sample source. Performing distributed data analytics illustratively comprises performing local analytics in respective ones of a plurality of data zones, and performing global analytics utilizing results of the local analytics performed in the respective data zones.

    Methods and apparatus implementing data model for disease monitoring, characterization and investigation

    公开(公告)号:US10528875B1

    公开(公告)日:2020-01-07

    申请号:US15281248

    申请日:2016-09-30

    摘要: A method comprises receiving metagenomics data, configuring a data model characterizing relationships between aspects of the metagenomics data, and processing the metagenomics data in accordance with the configured data model in order to characterize at least one of a disease, infection or contamination. The data model comprises an abundance score element that relates portions of the metagenomics data comprising reads of biological samples to one or more genomic sequences of an ecogenome, and a comparative score element that relates portions of the metagenomics data comprising characteristics of multiple patients to one another with respect to the disease, infection or contamination. The data model further relates the abundance score element to the comparative score element via one or more additional elements of the data model corresponding to respective other aspects of the metagenomics data. The metagenomics data may comprise metagenomics sequencing results from metagenomics sequencing centers associated with respective data zones.