Invention Application
- Patent Title: OBFUSCATED IDENTIFIER DETECTION METHOD BASED ON NATURAL LANGUAGE PROCESSING AND RECORDING MEDIUM AND APPARATUS FOR PERFORMING THE SAME
-
Application No.: US17278781Application Date: 2020-11-25
-
Publication No.: US20220156370A1Publication Date: 2022-05-19
- Inventor: Jeong Hyun YI , Geochang JEON
- Applicant: Foundation of Soongsil University-Industry Cooperation
- Applicant Address: KR Seoul
- Assignee: Foundation of Soongsil University-Industry Cooperation
- Current Assignee: Foundation of Soongsil University-Industry Cooperation
- Current Assignee Address: KR Seoul
- Priority: KR10-2020-0154542 20201118
- International Application: PCT/KR2020/016745 WO 20201125
- Main IPC: G06F21/56
- IPC: G06F21/56 ; G06F8/40 ; G06F40/279 ; G06F40/166

Abstract:
An obfuscated identifier detection method based on natural language processing includes: converting an input obfuscated apk to smali code level, inspecting an obfuscated string in identifiers of the smali code acquired from a smali code converter, extracting information necessary for deobfuscation and frequency of the identifiers when there is the obfuscated string, storing frequency, type and name information of identifiers calculated from information extracted from an unobfuscated apk, and acquiring and deobfuscating an identifier type name having a most similar frequency in an identifier name database (DB) using information extracted from an obfuscated information extractor. Accordingly, it is possible to reduce delay in analysis and achieve faster analysis by automatically renaming the code that is difficult to understand due to identifier conversion obfuscation.
Information query