融合反事实语义增强与因果注意力的领域泛化
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

宁夏自然科学基金 (2025AAC030154)


Domain Generalization via Counterfactual Semantic Enhancement and Causal Attention
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对深度学习模型在分布偏移场景中泛化能力不足的问题, 在Mamba状态空间模型的基础上, 提出一种融合反事实语义增强和因果注意力机制的领域泛化方法, 通过设计反事实语义增强模块, 实现前景-背景解耦与重组生成反事实特征, 显式构建“前景保持、背景干预”的因果情境, 有效削弱背景-标签的伪相关性, 强化模型对因果语义前景的挖掘能力, 引导其关注稳定可靠的语义关联; 进一步提出因果注意力机制, 将上述模块提取到的因果语义信息显式嵌入Mamba状态更新过程, 以提高特征的因果一致性. 整体模型结构实现了对前景与背景信息的动态区分与融合. 在标准领域泛化基准上的实验结果表明, 本文方法在PACS、OfficeHome、VLCS和TerraIncognita数据集上平均准确率分别达到91.9%、77.0%、81.1%和54.9%, 均优于现有SOTA方法, 证实本文方法显著提高了模型对前景语义区域的关注一致性, 展现出优越的可解释性与泛化性能.

    Abstract:

    To address the limited generalization capability of deep learning models under distribution shifts, this study proposes a domain generalization method based on the Mamba state-space model that integrates counterfactual semantic enhancement with a causal attention mechanism. By designing a counterfactual semantic enhancement module, foreground-background decoupling and recombination are achieved to generate counterfactual features, explicitly constructing a causal scenario of “foreground preservation and background intervention”. This effectively mitigates spurious background-label correlations, enhances the model’s ability to extract causal semantic foreground representations, and guides it to focus on stable and reliable semantic associations. Furthermore, a causal attention mechanism is introduced to explicitly embed the causal semantic information extracted by the module into the Mamba state update process, improving the causal consistency of features. The overall architecture enables dynamic discrimination and integration of foreground and background information. Experimental results on standard domain generalization benchmarks demonstrate that the proposed method achieves average accuracy rates of 91.9%, 77.0%, 81.1%, and 54.9% on the PACS, OfficeHome, VLCS, and TerraIncognita datasets, respectively, outperforming existing state-of-the-art methods. These results confirm that the proposed method significantly improves the consistency of the model’s focus on foreground semantic regions, thus demonstrating superior interpretability and generalization performance.

    参考文献
    相似文献
    引证文献
引用本文

魏成亮,刘进锋.融合反事实语义增强与因果注意力的领域泛化.计算机系统应用,,():1-10

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-09
  • 最后修改日期:2025-10-09
  • 录用日期:
  • 在线发布日期: 2026-03-02
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62661041 传真: Email:csa@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号