发明授权
- 专利标题: Compression of genomic data file
- 专利标题(中): 压缩基因组数据文件
-
申请号: US13428794申请日: 2012-03-23
-
公开(公告)号: US08972201B2公开(公告)日: 2015-03-03
- 发明人: Sharmila Shekhar Mande , Monzoorul Hague Mohammed , Anirban Dutta , Tungadri Bose
- 申请人: Sharmila Shekhar Mande , Monzoorul Hague Mohammed , Anirban Dutta , Tungadri Bose
- 申请人地址: IN Mumbai
- 专利权人: Tata Consultancy Services Limited
- 当前专利权人: Tata Consultancy Services Limited
- 当前专利权人地址: IN Mumbai
- 代理机构: Barnes & Thornburg LLP
- 优先权: IN3655/MUM/2011 20111224
- 主分类号: G06F19/22
- IPC分类号: G06F19/22
摘要:
Systems and methods for compression of a genomic data file are described herein. In one embodiment, genomic sequences, sequence headers, and quality sequences associated with a plurality of data streams provided in a genomic data file are identified. Each of the genomic sequences includes at least one of primary characters and secondary characters. Further, the secondary characters from each of the genomic sequences may be removed to obtain an intermediate genomic sequence file and a quality score corresponding to the secondary character may be modified in quality sequences to obtain an intermediate quality sequence file. Based on the intermediate genomic sequence file and the intermediate quality sequence file, a modified genomic sequence file and a modified quality sequence file, respectively are generated. A compressed genomic data file is obtained using at least the modified genomic sequence and the modified quality sequence.
公开/授权文献
- US20130166518A1 Compression Of Genomic Data File 公开/授权日:2013-06-27
信息查询