JZUS - Journal of Zhejiang University SCIENCE

Frontiers of Information Technology & Electronic Engineering

ISSN 2095-9184 (print), ISSN 2095-9230 (online)

2022 Vol.23 No.3 P.361-381

Generic, efficient, and effective deobfuscation and semantic-aware attack detection for PowerShell scripts

Chunlin XIONG, Zhenyuan LI, Yan CHEN, Tiantian ZHU, Jian WANG, Hai YANG, Wei RUAN

College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China; Department of Electrical Engineering and Computer Science, Northwestern University, Evanston, IL 60208, USA; College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China; Magic Shield Co., Ltd., Hangzhou 310027, China; College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China

chunlinxiong94@zju.edu.cn, ruanwei@zju.edu.cn

Abstract: In recent years, PowerShell has increasingly been reported as appearing in a variety of cyber attacks. However, because the PowerShell language is dynamic by design and can construct script fragments at different levels, state-of-the-art static analysis based PowerShell attack detection approaches are inherently vulnerable to obfuscations. In this paper, we design the first generic, effective, and lightweight deobfuscation approach for PowerShell scripts. To precisely identify the obfuscated script fragments, we define obfuscation based on the differences in the impacts on the abstract syntax trees of PowerShell scripts and propose a novel emulation-based recovery technology. Furthermore, we design the first semantic-aware PowerShell attack detection system that leverages the classic objective-oriented association mining algorithm and newly identifies 31 semantic signatures. The experimental results on 2342 benign samples and 4141 malicious samples show that our deobfuscation method takes less than 0.5 s on average and increases the similarity between the obfuscated and original scripts from 0.5% to 93.2%. By deploying our deobfuscation method, the attack detection rates for Windows Defender and VirusTotal increase substantially from 0.33% and 2.65% to 78.9% and 94.0%, respectively. Moreover, our detection system outperforms both existing tools with a 96.7% true positive rate and a 0% false positive rate on average.

Key words: PowerShell; Abstract syntax tree; Obfuscation and deobfuscation; Malicious script detection

Chinese Summary <49> 通用、有效且轻量的PowerShell解混淆和语义敏感的攻击检测方法

熊春霖¹，李振源¹，陈焰²，朱添田³，王箭¹，杨海⁴，阮伟⁵
¹浙江大学计算机科学与技术学院，中国杭州市，310027
²西北大学电气工程与计算机科学系，美国伊利诺伊州埃文斯顿市，60208
³浙江工业大学计算机科学与技术学院，中国杭州市，310023
⁴杭州奇盾信息技术有限公司，中国杭州市，310027
⁵浙江大学控制科学与工程学院，中国杭州市，310027
摘要：近年来，PowerShell攻击越来越多见诸报道。然而，由于PowerShell语言的动态特性，且可在不同级别构造脚本片段，即使基于最先进的静态脚本分析的PowerShell攻击检测方法，其本质上也容易受到混淆的影响。本文为PowerShell脚本设计了一种通用、有效且轻量的去混淆方法。首先，为精准识别模糊脚本片段，根据混淆方法对PowerShell抽象语法树的影响，提出一种全新混淆片段检测方法，在此基础上提出一种基于仿真的恢复技术。此外，设计了一个语义敏感的PowerShell攻击检测系统，该系统利用经典的面向目标的关联挖掘算法，新识别31个用于恶意脚本检测的语义特征。在2342个良性样本和4141个恶意样本上的实验结果表明，所提去混淆方法平均耗时不到0.5秒，且将模糊脚本和原始脚本的相似度从0.5%提至93.2%。采用该去混淆方法，Windows Defender和VirusTotal的攻击检测率分别从0.33%和2.65%提至78.9%和94.0%。实验还表明，我们的检测系统优于现有两种工具（平均真正例率为96.7%，假正例率为0%）。

关键词组：PowerShell；抽象语法树；混淆和解混淆；恶意脚本检测

Share this article to： More

Go to Contents

References:

Open peer comments: Debate/Discuss/Question/Opinion

<1>

DOI:

10.1631/FITEE.2000436

CLC number:

TP309

Download Full Text:

Click Here

Downloaded:

9848

Download summary:

Downloaded:

1060

Clicked:

9360

Cited:

On-line Access:

2024-08-27

Received:

2023-10-17

Revision Accepted:

2024-05-08

Crosschecked:

2020-12-29

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE

CONTENTS

INSTR. FOR AUTHOR

FOR REVIEWER

ABOUT JZUS

Publishing Service