On-line Access: 2024-08-27

Received: 2023-10-17

Revision Accepted: 2024-05-08

Journal of Zhejiang University SCIENCE C

http://doi.org/10.1631/FITEE.2400088


A survey of binary code representation technology


Author(s):  Taiyan WANG, Qingsong XIE, Lu YU, Zulie PAN, Min ZHANG

Affiliation(s):  College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China

Corresponding email(s):   zhangmindy@nudt.edu.cn

Key Words:  Binary analysis, Binary code representation, Binary code feature selection, Binary code feature embedding


Taiyan WANG, Qingsong XIE, Lu YU, Zulie PAN, Min ZHANG. A survey of binary code representation technology[J]. Frontiers of Information Technology & Electronic Engineering. http://doi.org/10.1631/FITEE.2400088

Abstract:
Binary analysis is a foundational technology that supports numerous applications in software engineering and security research. As software grows in scale and software architectures evolve in complexity, binary analysis faces new challenges. To break through existing bottlenecks, researchers have applied artificial intelligence (AI) technology to the understanding and analysis of binary code. The core problem is binary code representation, i.e., how to use intelligent methods to generate representation vectors that carry semantic information about binary code and can be applied to multiple downstream binary analysis tasks. In this paper, we provide a comprehensive survey of recent advances in binary code representation technology and present the workflow of existing research in two parts: binary code feature selection methods and binary code feature embedding methods. The feature selection part covers two topics: first, the abstract definition and classification of features are systematically explained; second, the process of constructing specific representations of features is introduced in detail. In the feature embedding part, the embedding methods are classified into four categories according to how they use text embedding models and graph embedding models. Finally, we summarize the overall development of existing research and discuss some potential research directions for binary code representation technology.
