###
计算机系统应用英文版:2022,31(1):11-20
本文二维码信息
码上扫一扫!
基于ADMM算法的网络连接数据变量选择
(中国科学技术大学 管理学院统计与金融系, 合肥 230026)
Variable Selection of Network-linked Data Based on ADMM Algorithm
(Department of Statistics and Finance, School of Management, University of Science and Technology of China, Hefei 230026, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 723次   下载 1131
Received:March 21, 2021    Revised:April 21, 2021
中文摘要: 随着科技的发展,网络连接数据在统计学习、机器学习等领域的应用越来越普遍.在线性回归模型中,目前关于网络连接数据的变量选择研究主要针对的是同质性样本,即样本的个体效应α相同,但在现实中大多数样本的个体效应存在异质性,在不考虑异质性的情况下会使得模型的估计和预测产生较大偏差.因此,当网络数据中个体效应存在组异质性时,本文提出一种新的变量选择方法SNC.利用网络凝聚效应,我们对变量系数和相连样本个体效应的差异性进行联合惩罚,同时采用ADMM算法进行求解,并证明了算法的收敛性.数值模拟和实证分析显示,我们的方法提高了变量选择的准确性并且降低了预测误差.
Abstract:With the development of science and technology, the application of network-linked data in statistical learning, machine learning and other fields becomes increasingly common. In linear regression models, the current research on the variable selection of network-linked data mainly focuses on the homogeneous samples, namely that the individual effects of the samples are the same. In reality, however, the individual effects of most samples are heterogeneous. As a result, the neglect of the heterogeneity will lead to large deviations in the estimation and prediction of the models. Therefore, this paper proposes a new variable selection method SNC to cope with the situation when there is group heterogeneity in network-linked data. Using the network agglomeration effect, we carry out a joint penalty for the difference between the variable coefficient and the individual effect of the connected samples and solve the problem with ADMM algorithm, with the convergence of the algorithm proved. The results of numerical simulation and example analysis show that this method improves the accuracy of variable selection and reduces the prediction error.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(12101584,11601501,11671374,71731010,71921001);中国博士后科学基金(2021TQ0326,2021M703100);2021年合肥市博士后科研活动项目
引用文本:
方佳佳,李阳,郑泽敏.基于ADMM算法的网络连接数据变量选择.计算机系统应用,2022,31(1):11-20
FANG Jia-Jia,LI Yang,ZHENG Ze-Min.Variable Selection of Network-linked Data Based on ADMM Algorithm.COMPUTER SYSTEMS APPLICATIONS,2022,31(1):11-20