融合反事实语义增强与因果注意力的领域泛化

doi:10.15888/j.cnki.csa.010131

AIPUB归智期刊联盟

微信公众号

网站二维码

首页 > 过刊浏览>年第卷第期 >1-10. DOI:10.15888/j.cnki.csa.010131

PDF HTML阅读 XML下载导出引用引用提醒

融合反事实语义增强与因果注意力的领域泛化
DOI:
                        10.15888/j.cnki.csa.010131
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:宁夏自然科学基金 (2025AAC030154)

Domain Generalization via Counterfactual Semantic Enhancement and Causal Attention

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对深度学习模型在分布偏移场景中泛化能力不足的问题, 在Mamba状态空间模型的基础上, 提出一种融合反事实语义增强和因果注意力机制的领域泛化方法, 通过设计反事实语义增强模块, 实现前景-背景解耦与重组生成反事实特征, 显式构建“前景保持、背景干预”的因果情境, 有效削弱背景-标签的伪相关性, 强化模型对因果语义前景的挖掘能力, 引导其关注稳定可靠的语义关联; 进一步提出因果注意力机制, 将上述模块提取到的因果语义信息显式嵌入Mamba状态更新过程, 以提高特征的因果一致性. 整体模型结构实现了对前景与背景信息的动态区分与融合. 在标准领域泛化基准上的实验结果表明, 本文方法在PACS、OfficeHome、VLCS和TerraIncognita数据集上平均准确率分别达到91.9%、77.0%、81.1%和54.9%, 均优于现有SOTA方法, 证实本文方法显著提高了模型对前景语义区域的关注一致性, 展现出优越的可解释性与泛化性能.

Abstract:

To address the limited generalization capability of deep learning models under distribution shifts, this study proposes a domain generalization method based on the Mamba state-space model that integrates counterfactual semantic enhancement with a causal attention mechanism. By designing a counterfactual semantic enhancement module, foreground-background decoupling and recombination are achieved to generate counterfactual features, explicitly constructing a causal scenario of “foreground preservation and background intervention”. This effectively mitigates spurious background-label correlations, enhances the model’s ability to extract causal semantic foreground representations, and guides it to focus on stable and reliable semantic associations. Furthermore, a causal attention mechanism is introduced to explicitly embed the causal semantic information extracted by the module into the Mamba state update process, improving the causal consistency of features. The overall architecture enables dynamic discrimination and integration of foreground and background information. Experimental results on standard domain generalization benchmarks demonstrate that the proposed method achieves average accuracy rates of 91.9%, 77.0%, 81.1%, and 54.9% on the PACS, OfficeHome, VLCS, and TerraIncognita datasets, respectively, outperforming existing state-of-the-art methods. These results confirm that the proposed method significantly improves the consistency of the model’s focus on foreground semantic regions, thus demonstrating superior interpretability and generalization performance.

参考文献

相似文献

引证文献

引用本文

魏成亮,刘进锋.融合反事实语义增强与因果注意力的领域泛化.计算机系统应用,,():1-10

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-09-09
最后修改日期:2025-10-09
录用日期:
在线发布日期: 2026-03-02
出版日期:

微信公众号

网站二维码

引用本文

分享

相关视频

文章指标

历史

文章二维码