Invention Grant
- Patent Title: Encoding and storing text using DNA sequences
-
Application No.: US16143671Application Date: 2018-09-27
-
Publication No.: US11017170B2Publication Date: 2021-05-25
- Inventor: Changchuan Yin
- Applicant: AT&T Intellectual Property I, L.P.
- Applicant Address: US GA Atlanta
- Assignee: AT&T Intellectual Property I, L.P.
- Current Assignee: AT&T Intellectual Property I, L.P.
- Current Assignee Address: US GA Atlanta
- Agency: Guntin & Gust, PLC
- Agent Dana B. Lemoine
- Main IPC: G06F40/284
- IPC: G06F40/284 ; G06F40/242

Abstract:
Text can be encoded into DNA sequences. Each word from a document or other text sample can be encoded in a DNA sequence or DNA sequences and the DNA sequences can be stored for later retrieval. The DNA sequences can be stored digitally, or actual DNA molecules containing the sequences can be synthesized and stored. In one example, the encoding technique makes use of a polynomial function to transform words based on the Latin alphabet into k-mer DNA sequences of length k. Because the whole bits required for the DNA sequences are smaller than the actual strings of words, storing documents using DNA sequences may compress the documents relative to storing the same documents using other techniques. In at least one example, the mapping between words and DNA sequences is one-to-one and the collision ratio for the encoding is low.
Information query