###
计算机系统应用英文版:2022,31(2):161-167
本文二维码信息
码上扫一扫!
基于自监督网络的DDPG算法的建筑能耗控制
殷雨竹1,2,3, 陈建平2,3, 傅启明1,2,3, 陆悠1,2,3, 吴宏杰1,2,3
(1.苏州科技大学 电子与信息工程学院, 苏州 215009;2.苏州科技大学 江苏省建筑智慧节能重点实验室, 苏州 215009;3.苏州科技大学 苏州市移动网络技术与应用重点实验室, 苏州 215009)
Building Energy Consumption Control Based on DDPG Algorithm of Self-supervised Network
(1.School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China;2.Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China;3.Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 548次   下载 919
Received:April 08, 2021    Revised:May 11, 2021
中文摘要: 针对强化学习方法训练能耗控制系统时所存在奖赏稀疏的问题, 将一种基于自监督网络的深度确定策略梯度(deep deterministic policy gradient, DDPG)方法应用到建筑能耗控制问题中. 首先, 处理状态和动作变量作为自监督网络前向模型的输入, 预测下一个状态特征向量, 同时将预测误差作为好奇心设计内部奖赏, 以解决奖赏稀疏问题. 然后, 采用数据驱动的方法训练建筑能耗模型, 构建天气数据作为输入、能耗数据作为输出. 最后, 利用基于自监督网络的DDPG方法求解最优控制策略, 并以此设定空气处理装置(air handling unit, AHU)的最优排放温度, 减少设备能耗. 实验结果表明, 该方法能够在保持建筑环境舒适的基础上, 实现较好的节能效果.
Abstract:In view of the sparse reward problem in the training of energy consumption control systems using reinforcement learning methods, a deep deterministic policy gradient (DDPG) method based on the self-supervised network is applied to the building energy consumption control. First, the processing state and action variables are regarded as the input of the self-supervised network forward model, predicting the feature vector of the next state and using the prediction error as the internal reward of curiosity to solve the sparse reward problem. Then, a data-driven method is used to train the building energy consumption model with weather data as input and energy consumption data as output. Finally, the DDPG method based on the self-supervised network is used to develop the optimal control strategy, and the optimal discharge temperature of the air handling unit (AHU) is set based on the strategy to reduce the energy consumption of the equipment. Experimental results show that this method can achieve good energy-saving effects on the basis of maintaining a comfortable building environment.
文章编号:     中图分类号:    文献标志码:
基金项目:国家重点研发计划(2020YFC200660);国家自然科学基金(62072324,61876217,61876121,61772357);江苏省重点研发计划(BE2017663)
Author NameAffiliationE-mail
YIN Yu-Zhu School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
CHEN Jian-Ping Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
FU Qi-Ming School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
fqm_1@126.com 
LU You School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
WU Hong-Jie School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
Author NameAffiliationE-mail
YIN Yu-Zhu School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
CHEN Jian-Ping Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
FU Qi-Ming School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
fqm_1@126.com 
LU You School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
WU Hong-Jie School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
Jiangsu Province Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou 215009, China
Suzhou Key Laboratory of Mobile Network Technology and Application, Suzhou University of Science and Technology, Suzhou 215009, China 
 
引用文本:
殷雨竹,陈建平,傅启明,陆悠,吴宏杰.基于自监督网络的DDPG算法的建筑能耗控制.计算机系统应用,2022,31(2):161-167
YIN Yu-Zhu,CHEN Jian-Ping,FU Qi-Ming,LU You,WU Hong-Jie.Building Energy Consumption Control Based on DDPG Algorithm of Self-supervised Network.COMPUTER SYSTEMS APPLICATIONS,2022,31(2):161-167