- Patent Title: Systems and methods for grouping and collapsing sequencing reads
-
Application No.: US16667642Application Date: 2019-10-29
-
Publication No.: US11688489B2Publication Date: 2023-06-27
- Inventor: Chen Zhao , Kevin Eric Wu , Sven Bilke
- Applicant: Illumina, Inc.
- Applicant Address: US CA San Diego
- Assignee: Illumina, Inc.
- Current Assignee: Illumina, Inc.
- Current Assignee Address: US CA San Diego
- Agency: Sheppard, Mullin, Richter & Hampton LLP
- Main IPC: G01N33/48
- IPC: G01N33/48 ; G16B30/10 ; G06F16/22 ; G06F16/2457 ; G16B30/20

Abstract:
Disclosed herein are systems and methods for collapsing sequencing reads and identifying similar sequencing reads. In one example, a method includes generating a plurality of first identifier subsequences from a first identifier sequence of each nucleotide sequencing read and generating a first signature for the nucleotide sequencing read by applying hashing to the plurality of first identifier subsequences. The method may include assigning the nucleotide sequencing read to a first particular bin of a first data structure based on the first signature and determining a nucleotide sequence for each first particular bin of the first data structure with one or more nucleotide sequencing reads assigned.
Information query