Invention Grant
- Patent Title: Entropy-based detection of sensitive information in code
- Patent Title (中): 基于熵的检测代码中的敏感信息
-
Application No.: US13858448Application Date: 2013-04-08
-
Publication No.: US09336381B1Publication Date: 2016-05-10
- Inventor: David James Kane-Parry , Thibault Candebat
- Applicant: AMAZON TECHNOLOGIES, INC.
- Applicant Address: US NV Reno
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US NV Reno
- Agency: Lindauer Law, PLLC
- Main IPC: G06F21/00
- IPC: G06F21/00 ; G06F21/50

Abstract:
Techniques are described for identifying security credentials or other sensitive information based on an entropy-based analysis of information included in documents such as source code files, object code files, or other types of files. A baseline information entropy may be determined for one or more documents, indicating a baseline level of randomness for information in the document(s). One or more of the documents may be analyzed to identify the presence of high entropy portions that have an information entropy above a threshold value. The threshold value may be based on the baseline information entropy, or based on other criteria such as a programming language of the document(s). Because security credentials may have a higher level of information entropy than the surrounding code, any high entropy portions of the document(s) may be identified as potential security risks.
Information query