-
公开(公告)号:US20240311145A1
公开(公告)日:2024-09-19
申请号:US18596194
申请日:2024-03-05
申请人: SEOUL NATIONAL UNIVERSITY R&DB FOUNDATION , RESEARCH & BUSINESS FOUNDATION SUNGKYUNKWAN UNIVERSITY
发明人: Yunheung PAEK , Hyungjoon KOO , Sunwoo AHN , Seonggwan AHN
IPC分类号: G06F8/75
CPC分类号: G06F8/751
摘要: A binary code similarity detection device performs a preprocessing operation of generating an assembly expression for the binary code by converting a machine language of an input binary code into an assembly language, extracting an assembly function or a command from the binary code converted to the assembly language, and detects a similarity to the assembly expression of a pre-stored binary code by inputting the assembly expression generated by the preprocessing operation to a trained model based on bidirectional encoder representations from transformers (BERT), and the trained model is generated by performing a pre-training step of causing the assembly expression to be understood and a fine-tuning step of inputting an assembly expression of a first binary code and an assembly expression of a second binary code to a pre-trained model and then fine-tuning the pre-trained model based on a similarity between the first binary code and the second binary code.