Invention Grant
- Patent Title: Method and apparatus for biological sequence processing fastq files comprising lossless compression and decompression
-
Application No.: US15993095Application Date: 2018-05-30
-
Publication No.: US11360940B2Publication Date: 2022-06-14
- Inventor: Zhe Liu , Jun Zhang
- Applicant: HUAWEI TECHNOLOGIES CO., LTD.
- Applicant Address: CN Guangdong
- Assignee: HUAWEI TECHNOLOGIES CO., LTD.
- Current Assignee: HUAWEI TECHNOLOGIES CO., LTD.
- Current Assignee Address: CN Guangdong
- Agency: Fish & Richardson P.C.
- Main IPC: G06F16/174
- IPC: G06F16/174 ; G16B50/50 ; G16B50/00 ; H03M7/30 ; G16B20/00 ; G16B20/20 ; G16B50/40 ; G16B30/00

Abstract:
This application provides a biological sequence data processing method including selecting a target base from bases in a biological sequence fastq file according to characteristic information of each base. A base patch file is generated by using characteristic information of the target base. Lossless compression is performed on the biological sequence fastq file to obtain a compressed fastq file, and lossless compression is performed on the base patch file to obtain a compressed patch file. The compressed patch file and the compressed fastq file are decompressed. In response to determining that characteristic information of the target base in the decompressed compressed patch file is inconsistent with characteristic information of the target base in the decompressed compressed fastq file, the characteristic information of the target base in the decompressed compressed fastq file is modified to the characteristic information of the target base in the decompressed compressed patch file.
Public/Granted literature
- US20180365260A1 Biological Sequence Data Processing Method And Apparatus Public/Granted day:2018-12-20
Information query