###
计算机系统应用英文版:2021,30(5):39-46
本文二维码信息
码上扫一扫!
基于梯度提升决策树的变形宏病毒检测
(1.中国科学院 深圳先进技术研究院, 深圳 518071;2.深信服科技股份有限公司, 深圳 518071)
Obfuscated Macro Malware Detection Based on Gradient Boosting Decision Tree
(1.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518071, China;2.Sangfor Technologies Inc., Shenzhen 518071, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 794次   下载 1233
Received:September 07, 2020    Revised:September 25, 2020
中文摘要: 宏病毒在高级持续性威胁中被广泛运用. 其变形成本低廉且方式灵活, 导致传统的基于病毒规则库的反病毒系统难于有效对抗. 提出一种基于梯度提升决策树的变形宏病毒检测方法. 该方法以病毒专家经验为指导, 实施大规模特征工程, 基于词法分析对变形宏病毒做细粒度建模, 并使用海量样本训练模型. 实验表明, 该方法能够准确检测企业级用户网络中传播的真实变形宏病毒和主流变形工具生成的变形宏病毒; 对400万个宏程序样本进行10折交叉验证, 准确率和召回率分别达到99.41%和97.34%, 优于现有其他方法.
Abstract:Macro malware is widely used in advanced persistent threats. Macro obfuscation is low-cost and flexible, rendering traditional rule-based anti-malware systems insufficient. A gradient-boosting-decision-tree-based approach to detecting obfuscated macro malware is proposed. The approach performs large-scale feature engineering guided by the expertise of malware specialists, with fine-grained modeling for obfuscated macro malware carried out on top of lexical analysis, and massive samples are used to train the model. Experimental results show that the approach is able to precisely detect real-world obfuscated macro malware found in the network of enterprise customers, as well as those variants generated by mainstream obfuscation tools; 10-fold cross validation is carried out for a total of 4000 000 macro programs, giving a precision of 99.41% and a recall of 97.34%, which outperforms existing works.
文章编号:     中图分类号:    文献标志码:
基金项目:中国博士后科学基金(2020M682837)
引用文本:
闫华,刘嘉,位凯志,古亮.基于梯度提升决策树的变形宏病毒检测.计算机系统应用,2021,30(5):39-46
YAN Hua,LIU Jia,WEI Kai-Zhi,GU Liang.Obfuscated Macro Malware Detection Based on Gradient Boosting Decision Tree.COMPUTER SYSTEMS APPLICATIONS,2021,30(5):39-46