Invention Application
- Patent Title: IDENTIFYING PERSONALLY IDENTIFIABLE INFORMATION WITHIN AN UNSTRUCTURED DATA STORE
-
Application No.: US17583866Application Date: 2022-01-25
-
Publication No.: US20220147653A1Publication Date: 2022-05-12
- Inventor: Vasyl Pihur , Subhash Sankuratripati , Dachuan Huang , Leah Fortier
- Applicant: Snap Inc.
- Applicant Address: US CA Santa Monica
- Assignee: Snap Inc.
- Current Assignee: Snap Inc.
- Current Assignee Address: US CA Santa Monica
- Main IPC: G06F21/62
- IPC: G06F21/62

Abstract:
Methods and systems for identifying personally identifiable information (PII) are disclosed. In some aspects, frequency maps of fields storing known PII information are generated. The frequency maps may count occurrences of unique bigrams in the PII fields. A field of interest may then be analyzed to generate a second frequency map. Correlations between the first frequency maps and the second frequency map may be generated. If one of the correlations meets certain criterion, the disclosed embodiments may determine that the field of interest does or does not include PII. Access control for the field of interest may then be based on whether the field includes PII. In some aspects, a storage location of data included in the field of interest may be based on whether the field includes PII.
Public/Granted literature
- US11797709B2 Identifying personally identifiable information within an unstructured data store Public/Granted day:2023-10-24
Information query