Abstract: In relation extraction tasks, dependency trees or syntactic trees are usually built to obtain deeper and richer structural information. Graph neural networks, as powerful representation learning methods for graph-structured data, can better model such complex structures. This study surveys relation extraction methods based on graph neural networks, aiming to provide a deep understanding of the latest research progress and trends in this field. Firstly, it briefly introduces the classification and structure of relational graph neural networks; it then elaborates on the core technologies and application scenarios of relation extraction methods based on graph neural networks, including sentence-level and document-level methods and joint entity-relation extraction methods. The advantages, disadvantages, and performance of each method are analyzed and compared, and possible future research directions and challenges are discussed.
Abstract: Retinal blood vessel image segmentation provides valuable diagnostic assistance for various eye diseases such as glaucoma and diabetic retinopathy. Currently, deep learning, with its powerful ability to discover abstract features, is expected to meet the need to extract feature information from retinal blood vessel images for automatic segmentation, and it has become a research hotspot in this field. To better grasp the research progress in this field, this study summarizes the relevant datasets and evaluation metrics and elaborates on the application of deep learning in retinal blood vessel image segmentation. It focuses on the basic ideas, network structures, and improvements of various segmentation methods, analyzes the limitations and challenges faced by existing retinal blood vessel image segmentation methods, and looks ahead to future research directions in this field.
Abstract: With the development of the Internet and connectivity technology, the data generated by sensors is becoming increasingly complex. Deep learning methods have made great progress in anomaly detection for high-dimensional data. The graph deviation network (GDN) learns the relationships between sensor nodes to predict anomalies and has achieved promising results. Since the GDN model fails to handle the time dependence and instability of abnormal data, an external attention autoencoder based on GDN (AEEA-GDN) is proposed to extract features more deeply. In addition, an adaptive learning mechanism is introduced during model training to help the network better adapt to changes in abnormal data. Experimental results on three real-world sensor datasets show that the AEEA-GDN model detects anomalies more accurately than baseline methods and has better overall performance.
Abstract: Detecting outliers is crucial for practical applications in large and high-dimensional datasets. Outlier detection is the process of identifying data points that deviate from the typical data distribution. This process primarily involves density estimation. Substantial advancements are achieved by models like the deep autoencoder Gaussian mixture model, which initially reduces dimensionality and subsequently estimates density. However, it introduces noise into the low-dimensional latent space and faces limitations in optimizing the density estimation module, such as the requirement to ensure positive definiteness of the covariance matrix. To overcome these constraints, this study introduces the deep autoencoder normalizing flow (DANF) for unsupervised outlier detection. The model employs deep autoencoders to produce low-dimensional latent space representations and reconstruction errors for individual input samples. These outputs are subsequently fed into a normalizing flow (NF) for transformation into a Gaussian distribution. Experimental results on several widely recognized benchmark datasets reveal that the DANF model consistently surpasses state-of-the-art outlier detection methods. The most notable improvement is a remarkable 26.43% increase in the F1-score evaluation metric.
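The scoring step of such flow-based detectors can be illustrated with a minimal sketch: once a normalizing flow has mapped a sample's representation to an (approximately) standard Gaussian, the log-density under that Gaussian serves directly as a normality score, and low-density samples are flagged as outliers. This is a generic illustration of the idea, not the DANF implementation; the flow transform itself is omitted.

```python
import math

def gaussian_log_density(z):
    # log N(z; 0, I): log-density of a latent vector under a standard Gaussian.
    d = len(z)
    return -0.5 * (d * math.log(2 * math.pi) + sum(v * v for v in z))

def outlier_score(z):
    # Higher score = less likely under the Gaussian = more anomalous.
    return -gaussian_log_density(z)
```

A point far from the origin in the transformed space (e.g. `[3.0, 3.0]`) receives a higher outlier score than one near it, which is what thresholding for detection relies on.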
Abstract: Link prediction mines potential future relationships between nodes from known network topology and node attributes. It is an effective means of predicting missing links and identifying false links, and it has practical significance for studying the structural evolution of social networks. Traditional link prediction methods are based on the similarity of either node information or path information. However, the former considers a single index, which limits prediction accuracy, while the latter is unsuitable for large-scale networks due to its excessive computational complexity. Through an analysis of network topology, this study proposes a social network link prediction method based on the interacting degree of nodes (IDN). The method first introduces the concept of node efficiency, based on the path characteristics between nodes in the network, which improves the accuracy of link prediction between nodes without common neighbors. To further explore the relevant attributes of common neighbors, the method analyzes the topology of common neighbors between nodes and innovatively integrates path characteristics with local information to define the IDN of a social network, which accurately captures the degree of similarity between nodes and thus enhances link prediction ability. Finally, this study validates the IDN method on six real network datasets. The experimental results show that, compared with current mainstream algorithms, the proposed method achieves better prediction performance on both the AUC and Precision evaluation indexes, with average improvements of 22% and 54%, respectively. Therefore, the proposed node interacting degree is highly feasible and effective for link prediction.
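The node-efficiency idea can be sketched in a few lines: efficiency is commonly defined as the reciprocal of the shortest-path length, so even node pairs without common neighbors receive a nonzero similarity whenever a path exists. The sketch below illustrates this general notion only; it is not the paper's full IDN computation.

```python
from collections import deque

def shortest_path_length(adj, u, v):
    # BFS shortest-path length in an unweighted graph; None if unreachable.
    if u == v:
        return 0
    seen = {u}
    queue = deque([(u, 0)])
    while queue:
        node, dist = queue.popleft()
        for nb in adj[node]:
            if nb == v:
                return dist + 1
            if nb not in seen:
                seen.add(nb)
                queue.append((nb, dist + 1))
    return None

def node_efficiency(adj, u, v):
    # Efficiency = 1 / shortest-path length; 0 for disconnected pairs.
    d = shortest_path_length(adj, u, v)
    return 0.0 if d is None or d == 0 else 1.0 / d

# Illustrative path graph 1 - 2 - 3 - 4 (adjacency sets).
path_graph = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
```

Here nodes 1 and 4 share no common neighbors, yet their efficiency is 1/3, giving the predictor a usable signal.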
Abstract: Session-based recommendation algorithms statically model only a single user preference and fail to capture preference fluctuations caused by the environment, which reduces recommendation accuracy. Therefore, this study proposes a session recommendation method that integrates dual-branch dynamic preferences. Firstly, a heterogeneous hypergraph is used to model different types of information, and a dual-branch aggregation mechanism is designed to acquire and integrate the information in the heterogeneous hypergraph and learn the relationships among multiple types of nodes; a price-embedded enhancer is then used to strengthen the relationship between categories and prices. Secondly, a two-layer preference encoder is designed: a multi-scale temporal Transformer extracts the user's dynamic price preference, while a soft attention mechanism and reverse position encoding learn the user's dynamic interest preference. Finally, a gating mechanism integrates the user's multiple types of dynamic preferences to make recommendations. Experiments on two datasets, Cosmetics and Diginetica-buy, show significant improvements in the Precision and MRR evaluation metrics compared with other algorithms.
Abstract: Unlike appearance-based methods, whose input may introduce background noise, skeleton-based gait representation methods take key joints as input, which avoids such noise interference. Meanwhile, most skeleton-based representation methods ignore the significance of prior knowledge of human body structure or tend to focus on local features. This study proposes a skeleton-based gait recognition framework, GaitBody, to capture more distinctive features from gait sequences. Firstly, the study leverages a temporal multi-scale convolution module with a large kernel size to learn multi-granularity temporal information. Secondly, it introduces topology information of the human body into a self-attention mechanism to exploit spatial representations. Moreover, to make full use of temporal information, the most salient temporal information is generated and introduced into the self-attention mechanism. Experiments on the CASIA-B and OUMVLP-Pose datasets show that the method achieves state-of-the-art performance in skeleton-based gait recognition, and ablation studies show the effectiveness of the proposed modules.
Abstract: A new method for short-term power load forecasting is proposed to address issues such as complex and non-stationary load data and large prediction errors. Firstly, this study utilizes the maximal information coefficient (MIC) to analyze the correlation of feature variables and selects those relevant to the power load sequence. Meanwhile, since the variational mode decomposition (VMD) method is susceptible to subjective factors, the study employs the rime optimization algorithm (RIME) to optimize VMD and decompose the original power load sequence. Then, the long- and short-term time-series network (LSTNet) is improved as the prediction model by replacing the recurrent LSTM layer with BiLSTM and incorporating the convolutional block attention module (CBAM). Comparative and ablation experiments demonstrate that RIME-VMD reduces the root mean square error (RMSE) of the LSTM, GRU, and LSTNet models by more than 20%, significantly improving their prediction accuracy, and can be adapted to different prediction models. Compared with LSTM, GRU, and LSTNet, the proposed BLSTNet-CBAM model reduces the RMSE by 35.54%, 6.78%, and 1.46%, respectively, improving the accuracy of short-term power load forecasting.
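The comparisons above rest on the RMSE metric and relative reductions of it; both are standard formulas and can be computed as follows (a generic sketch, not tied to the models in the abstract):

```python
import math

def rmse(y_true, y_pred):
    # Root mean square error between actual and forecast load values.
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def pct_reduction(baseline, improved):
    # Relative RMSE reduction in percent, as quoted above (e.g. 35.54%).
    return 100.0 * (baseline - improved) / baseline
```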
Abstract: In current multi-modal emotion analysis of videos, the influence of modality representation learning on modality fusion and final classification results has not been adequately considered. To this end, this study proposes a multi-modal emotion analysis model that integrates cross-modal representation learning. Firstly, the study utilizes BERT and LSTM to extract internal information from the text, audio, and visual modalities separately, followed by cross-modal representation learning to obtain more information-rich unimodal features. In the modal fusion stage, a gating mechanism is fused into an improved traditional Transformer fusion mechanism to control the information flow more accurately. Experimental results on the publicly available CMU-MOSI and CMU-MOSEI datasets demonstrate that the accuracy and F1-score of the model are improved compared with traditional models, validating its effectiveness.
Abstract: This study proposes a two-stage path planning method for the inner-wall operation of a mobile robot in multi-room environments. In the first stage, to cope with sensor failures caused by dust or fog during wall operations and with incomplete path planning when a room has many exits, the study proposes a wall-following path planning method with automatic start-point selection, which generates wall-following paths offline based on grid maps. In the second stage, to handle dynamic obstacle avoidance during point-to-point path planning, it proposes a point-to-point path planning method based on the prioritized experience replay soft actor-critic (PSAC) algorithm, which introduces a prioritized experience replay strategy into the soft actor-critic (SAC) algorithm to achieve dynamic obstacle avoidance. Comparison experiments on wall-following path planning and dynamic obstacle avoidance verify the effectiveness of the proposed method in indoor wall-following and point-to-point path planning.
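The prioritized experience replay idea that PSAC adds to SAC can be sketched minimally: transitions are sampled with probability proportional to a power of their TD error, so informative experiences are replayed more often. Below is such a sketch; the capacity, the exponent alpha, and the simple oldest-first eviction are illustrative choices, not the paper's settings, and importance-sampling weights are omitted.

```python
import random

class PrioritizedReplayBuffer:
    """Minimal proportional prioritized replay (no sum-tree, no IS weights)."""

    def __init__(self, capacity, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha            # how strongly TD errors skew sampling
        self.data = []
        self.priorities = []

    def add(self, transition, td_error=1.0):
        if len(self.data) >= self.capacity:   # evict oldest when full
            self.data.pop(0)
            self.priorities.pop(0)
        self.data.append(transition)
        self.priorities.append((abs(td_error) + 1e-6) ** self.alpha)

    def sample(self, k, rng=random):
        # Sample k transitions with probability proportional to priority.
        total = sum(self.priorities)
        weights = [p / total for p in self.priorities]
        return rng.choices(self.data, weights=weights, k=k)
```

A production implementation would typically use a sum-tree for O(log n) updates and correct the sampling bias with importance weights.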
Abstract: Recently, reinforcement learning techniques have achieved success in sequence recommendation systems, as they can learn effective recommendation strategies from long-term user feedback signals. However, the design of the model’s reward function faces the challenge of low discriminability. This limits the model’s ability to learn the value differences between different user feedback signals, leading to suboptimal recommendation strategies. Existing studies mainly ensure discriminability of the reward function by adjusting decay factors, but this relies on expert prior knowledge and lacks a theoretical foundation. In order to more reasonably design the reward function and enhance its discriminability, this study analyzes the recommendation system based on counterfactual reasoning and proposes a sequence recommendation algorithm CAL4Rec based on counterfactual discriminability enhancement. Firstly, the proposed method uses structural causal graphs to describe the sequence recommendation process and creatively defines causally identifiable value reward discriminability using causal graphs. Secondly, this method uses a counterfactual generative adversarial self-supervised learning process to optimize the recommendation strategy network and learn the user’s true preferences. Extensive comparative and ablation experiments were conducted on a series of sequence recommendation benchmark datasets for CAL4Rec, and the experimental results show that CAL4Rec’s improvement is effective for various network implementation structures (average 2.34%).
Abstract: Missing data affects data quality, which may lead to inaccurate results and reduce the reliability of models. Missing value filling reduces bias and facilitates subsequent analysis. Most missing value filling algorithms assume a weak correlation or even no correlation between multiple missing values and give little consideration to the correlation between missing values and to the order of filling. Filling missing values independently in the sales domain under-utilizes the information carried by the missing values, which greatly affects filling accuracy. To address these problems, this study takes the sales field as its research objective and explores an updating mechanism for multiple missing values based on the multidimensional characteristics of sales behavior and the spatial distribution characteristics of the output values of different models. The work then studies an incremental filling method for multiple missing values in sales data: based on feature correlation, it orders the missing features and fuses already-filled data as an information element to incrementally fill subsequent missing values. The algorithm takes into account both the generalization of the model and the information correlation between missing data, and it combines multi-model fusion to effectively fill multiple missing values. Finally, the effectiveness of the proposed algorithm is verified by extensive experimental comparisons on a real chain-drugstore sales dataset.
Abstract: Images generated by deep-learning-based low-light image enhancement algorithms generally suffer from problems such as amplified noise and loss of detail. Moreover, the performance of end-to-end deep learning algorithms largely depends on the feature extraction ability of the backbone network, so exploring more effective backbone structures can improve the performance of low-light enhancement tasks. This study proposes an image enhancement algorithm based on a composite backbone network fusion strategy, which integrates backbone networks from different image enhancement algorithms to improve the overall network's feature extraction ability. The algorithm fuses feature information from the different backbones layer by layer and guides the composite features into the decoder. It then fully utilizes different upsampling methods to stack the fused backbone features, ultimately generating images under normal lighting conditions. Quantitative and qualitative comparisons with existing mainstream algorithms show that the proposed method significantly improves the brightness of low-light images while preserving their detailed features. On objective indicators, it achieves a peak signal-to-noise ratio of 24.35 dB and a structural similarity of 0.871 on the LOL-V2 dataset, effectively alleviating the problems of amplified noise and detail loss after image enhancement.
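The 24.35 dB figure refers to peak signal-to-noise ratio, which for 8-bit images is computed from the mean squared error as sketched below (the standard formula, not the paper's evaluation code):

```python
import math

def psnr(ref, out, max_val=255.0):
    # Peak signal-to-noise ratio between flattened 8-bit pixel sequences.
    mse = sum((r - o) ** 2 for r, o in zip(ref, out)) / len(ref)
    if mse == 0:
        return float("inf")   # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)
```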
Abstract: Leakage tolerance allows a scheme to leak some secret information while remaining secure, enhancing the robustness of a signature scheme; it is suitable for most settings where equipment and communication lines cannot be perfectly protected. A short signature is generally only half the length of an ordinary signature, which can greatly reduce the communication data volume of narrowband real-time interactive systems. This study proposes a short signature scheme in which the signature key is associated with the information to be signed, and the scheme tolerates partial leakage. The efficiency and security of the scheme are analyzed, and its security is proved under a leakage-tolerant oracle. The experimental results show that the scheme performs well and is suitable for applications with limited transmission bandwidth.
Abstract: The variability in size, shape, color, and texture, along with the blurred demarcation of the bowel wall, makes colon polyp segmentation significantly challenging. In single-branch networks, continuous sampling loses detail information and lacks interaction between different feature levels, leading to poor segmentation results. To address this problem, this study proposes a two-branch colon polyp segmentation network based on local-global feature interaction. The network utilizes a dual-branch structure consisting of a CNN and a Transformer, systematically capturing the precise local details and the global semantic features of the polyp at each layer. To make full use of the complementary nature of feature information at different levels and scales, and to let deep semantic features guide and enhance shallow detailed features, the study designs a feature cooperative interaction module that dynamically senses and aggregates cross-level feature interaction information. To enhance the features of the polyp lesion region while reducing background noise, a feature enhancement module utilizes spatial and channel attention mechanisms. Additionally, a skip-connection mechanism in conjunction with attention gates further highlights boundary information, improving segmentation accuracy in edge regions. Experiments show that the proposed network achieves better mDice and mIoU scores than the baseline network on multiple polyp segmentation datasets, with higher segmentation accuracy and stability.
Abstract: In the era of big data, the number of algorithms used for data processing is exploding. Current management methods for large numbers of algorithms usually classify and label them or store task flows composed of algorithms on a task-by-task basis, while paying insufficient attention to the topological relationships between algorithms in the task set. With the accumulation of domain knowledge and task flows, the dependencies between algorithms become increasingly important. Based on the requirements of massive algorithm management, this study proposes a management method that splits branched dependencies into unbranched dependencies. By searching for topological relationships through pointers in an index-free adjacency graph database, it avoids join operations and has innate advantages in managing algorithm dependencies. In addition, this study proposes connection points to highlight the reusability of algorithm modules; these are used to represent dependency edges in the graph model. The positions of algorithm modules in different task flows can be distinguished, so that an algorithm module reused by multiple tasks needs to be represented by only one node in the graph. Finally, the proposed algorithm relationship management method is validated on specific projects and shown to have significant advantages in scenarios where the number of algorithms is large and algorithm modules are highly reusable.
Abstract: The multi-client brain tumor classification method based on the convolutional block attention module (CBAM) extracts tumor region details from MRI images inadequately, its channel attention and spatial attention interfere with each other under the federated learning framework, and its accuracy in classifying medical tumor data from multiple sources is low. To address these problems, this study proposes a brain tumor classification method that combines the federated learning framework with an enhanced CBAM-ResNet18 network. The method leverages federated learning to work collaboratively with brain tumor data from multiple sources. It replaces the ReLU activation function with Leaky ReLU to mitigate neuron death. The channel attention module within CBAM is changed from dimension reduction followed by dimension increase to dimension increase followed by dimension reduction, significantly enhancing the network's ability to extract image details. Furthermore, the channel attention module and spatial attention module in CBAM are shifted from a cascade structure to a parallel structure, so that the network's feature extraction capability is unaffected by the order of processing. A publicly available brain tumor MRI dataset from Kaggle is used in the study. The results demonstrate that FL-CBAM-DIPC-ResNet achieves remarkable performance, with accuracy, precision, recall, and F1-score of 97.78%, 97.68%, 97.61%, and 97.63%, respectively, which are 6.54%, 4.78%, 6.80%, and 7.00% higher than those of the baseline model.
These experimental findings validate that the proposed method not only overcomes data islands and enables data fusion from multiple sources but also outperforms the majority of existing mainstream models in terms of performance.
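The accuracy, precision, recall, and F1 figures quoted above follow the standard confusion-matrix definitions, sketched here for reference (not the authors' evaluation code):

```python
def classification_metrics(tp, fp, fn, tn):
    # Standard binary-classification metrics from confusion-matrix counts.
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, precision, recall, f1
```

For multi-class tumor labels, these per-class values are typically macro- or micro-averaged.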
Abstract: Residential demand forecasting is affected by multiple factors and is non-linear. To address this issue, the study modifies the original neighborhood rough set (NRS) and then combines it with extreme learning machines (ELMs) to forecast residential demands. Specifically, the modified NRS (MNRS) algorithm constructs a neighborhood relationship matrix based on the neighborhood radii and standard deviations of different conditional attributes, thereby overcoming the failure of the original NRS algorithm to set the optimal neighborhood value for different conditional attributes. Then, the Pearson correlation coefficient is introduced into output attribute importance ranking to overcome the influence among conditional attributes, and the minimal redundant attribute-based reduction set is obtained to serve as the indicator system for residential demand forecasting. Finally, the residential demand indicator system is input into the ELM model to output an accurate forecasted value. Experimental results show that the MNRS-ELM forecasting model not only effectively reduces the operational complexity but also achieves higher prediction accuracy.
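The Pearson correlation coefficient used above for attribute importance ranking is the standard formula; a minimal version:

```python
import math

def pearson(x, y):
    # Pearson correlation coefficient between two equal-length sequences.
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)
```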
Abstract: Relation extraction methods based on distant supervision can reduce the cost of manually annotating datasets and have been widely used in constructing domain knowledge graphs. However, existing distantly supervised relation extraction methods are not domain-specific and neglect the use of domain entity feature information. To solve these problems, this study proposes PCNN-EFMA, a relation extraction model that integrates entity features and multiple types of attention mechanisms. The model adopts distant supervision and multi-instance techniques and is no longer limited by manual annotation. Meanwhile, to reduce the impact of noise in distant supervision, the model uses two types of attention: sentence attention and inter-bag attention. In addition, it integrates entity feature information into the word embedding layer and sentence attention, enhancing the model's feature selection ability. Experiments show that the model's PR curve is better on the domain dataset, and its average accuracy on P@N exceeds that of the PCNN-ATT model.
Abstract: To address the problems of noise interference and missed detection of small objects in water surface object detection, this study proposes an improved You Only Look Once version 8 (YOLOv8) algorithm for water surface small object detection, namely, YOLOv8-WSSOD. Specifically, to reduce the noise interference caused by the complex water surface environment during downsampling in the backbone network, the study proposes the C2f-BiFormer (C2fBF) module, constructed on BiFormer's bi-level routing attention mechanism, to retain fine-grained contextual feature information during feature extraction. Then, to address the missed detection of small objects on the water surface, a smaller detection head is added to enhance the network's sensitivity to small objects. At the neck end, the ghost-shuffle convolution (GSConv) and Slim-neck structures are used to reduce the model's complexity while maintaining precision. Finally, the limitations of the complete intersection over union (CIoU) loss function are overcome by the minimum point distance-based IoU (MPDIoU) loss function to improve the model's detection precision. The experimental results show that, compared with the original YOLOv8 algorithm, the proposed algorithm increases the mean average precision mAP@0.5 and mAP@0.5:0.95 on small objects on the water surface by 4.6% and 2.2%, respectively. Furthermore, the modified algorithm, achieving a detection speed of 86 f/s, is readily usable for fast and accurate detection of small objects on the water surface.
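The MPDIoU criterion mentioned above augments IoU with normalized squared distances between the two boxes' top-left and bottom-right corners; the corresponding loss is 1 minus this value. A sketch of the metric, assuming `(x1, y1, x2, y2)` boxes and known image width and height (a generic rendering of the formula, not the paper's code):

```python
def mpdiou(box_a, box_b, img_w, img_h):
    # Boxes as (x1, y1, x2, y2). MPDIoU = IoU - d1^2/D - d2^2/D,
    # where d1, d2 are top-left / bottom-right corner distances and
    # D = img_w^2 + img_h^2 normalizes them.
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    iou = inter / union if union else 0.0
    norm = img_w ** 2 + img_h ** 2
    d1_sq = (ax1 - bx1) ** 2 + (ay1 - by1) ** 2
    d2_sq = (ax2 - bx2) ** 2 + (ay2 - by2) ** 2
    return iou - d1_sq / norm - d2_sq / norm
```

Unlike plain IoU, the corner-distance terms keep the gradient informative even for non-overlapping boxes.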
Abstract: Faced with large-scale image defects and irregular damage areas, existing image restoration methods often produce results with structural inconsistencies and blurry texture details. This study proposes MSFGAN, an image restoration algorithm using a generated edge map and multi-scale feature fusion (a multi-scale feature network model based on edge conditions). The model adopts a two-stage network design, using the edge map as a restoration condition to constrain the structure of the restoration results. Firstly, the Canny operator extracts the edges of the image to be restored, and a complete edge map is generated. Then, the complete edge map is combined with the damaged image for restoration. To address common issues in image restoration algorithms, an attention mechanism multi-fusion convolution block (AM block) is proposed, integrating an attention mechanism for feature extraction and fusion of damaged images. Skip connections are introduced in the decoder part of the restoration network to fuse high-level semantics and low-level features, achieving high-quality detail and texture restoration. Test results on the CelebA and Places2 datasets show that MSFGAN improves restoration quality compared with current methods: at 20%–30% mask ratios, the average SSIM improvement is 0.0791 and the average PSNR improvement is 1.535 dB. Ablation experiments validate the effectiveness of the proposed optimizations and innovations in image restoration tasks.
Abstract: Crowd density detection algorithms based on deep learning have made great progress, but there is still much room for improving their accuracy and robustness in real, complex scenes. Factors such as inconsistent object scales and background interference in complex scenes make crowd density detection a challenging task. To address this problem, this study proposes a crowd density detection network based on multi-scale feature fusion. The network first uses images of different resolutions to interactively extract coarse- and fine-grained crowd features and introduces a multi-level feature fusion mechanism to make full use of multi-level scale information. Secondly, the study utilizes spatial and channel attention mechanisms to highlight the weight of crowd characteristics, focus on crowds of interest, reduce background interference, and generate high-quality density maps. Experimental results show that the proposed multi-scale feature fusion network achieves better accuracy and robustness than representative crowd density detection methods on multiple typical public datasets.
Abstract: In recent years, unstructured road segmentation has become one of the important research directions in the field of computer vision. Most existing methods are suitable for structured road segmentation and cannot meet the accuracy and real-time requirements of unstructured road segmentation. To address the above issues, this study improves the short-term dense concatenate (STDC) network by introducing residual connections to better integrate multi-scale semantic information. Additionally, it proposes a position attention-aware spatial pyramid pooling (PA-ASPP) module to enhance the network’s position awareness ability for specific regions such as roads. Experiments are conducted on two datasets, RUGD and RELLIS-3D, and the proposed method achieves a mean intersection over union (MIoU) of 50.78% and 49.96% on the test sets of the two datasets, respectively.
Abstract: In recent years, underwater acoustic target recognition has received considerable attention. However, due to the time-varying and space-varying nature of the underwater acoustic channel, as well as the complex and variable characteristics of underwater target sound sources, underwater acoustic signal recognition faces significant challenges. Traditional recognition methods struggle to capture sufficient representation information of the targets and lack robustness against noise, resulting in suboptimal recognition performance. To address these issues, this study proposes an underwater acoustic signal recognition method based on the multi-branch external attention network (MEANet), which can effectively extract features and perform recognition in complex marine environments. MEANet consists of a multi-branch backbone network, channel and spatial attention modules, and external attention modules. Firstly, the input data is fed through multiple parallel branches of the backbone network to extract features at different levels from the acoustic signals. Secondly, the channel and spatial attention modules weight the channel and spatial dimensions of the signals. Finally, the external attention module integrates external memory units and additional computations to guide feature extraction and prediction, significantly improving the recognition rate and robustness of the model. Experimental results demonstrate that the proposed MEANet achieves a recognition rate of 98.84% on the ShipsEar dataset, outperforming other comparative algorithms.
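External attention replaces self-attention's query-key product with a small learnable memory shared across all samples. A pure-Python sketch of the core computation (for brevity it uses a single softmax over the memory dimension, whereas the original external-attention formulation uses double normalization; the memory matrices here are placeholders, not learned weights):

```python
import math

def matmul(A, B):
    # Naive matrix product of two list-of-list matrices.
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def softmax(row):
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    s = sum(exps)
    return [v / s for v in exps]

def external_attention(F, Mk, Mv):
    # F: n x d input features; Mk, Mv: m x d external memory (key/value) units.
    scores = matmul(F, [list(col) for col in zip(*Mk)])   # n x m similarities
    attn = [softmax(row) for row in scores]               # normalize over memory slots
    return matmul(attn, Mv)                               # n x d output
```

Because the memory is shared across samples, its size m is fixed and small, making the cost linear in the number of input features rather than quadratic as in self-attention.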
Abstract: As the resources of edge servers are limited, designing a reasonable resource management and task scheduling scheme is an important research problem. To improve the utility of system services, this study proposes a joint resource allocation and computation offloading strategy. Firstly, the optimal matching of communication and computing resources is obtained by binary search and the Lagrange multiplier method. Then, the offloading decision is made by a whale optimization algorithm integrated with multiple strategies: a nonlinear exponential-power strategy for adjusting the convergence factor, an adaptive weight strategy balancing the exploration and exploitation stages, and a wandering strategy combining triangle walk and Lévy flight. Besides, the study introduces a penalty function into the fitness evaluation to satisfy the user-access constraint. Finally, it formulates a V-shaped transfer function to make binary offloading decisions. Simulation results show that, compared with other benchmark schemes on various evaluation indicators, the proposed strategy effectively increases network throughput and significantly improves system utility.
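A V-shaped transfer function maps each component of a whale's continuous position to a flip probability for the 0/1 offloading decision. A common choice, used here purely for illustration (the paper's exact function may differ), is V(x) = |tanh(x)|:

```python
import math
import random

def v_transfer(x):
    # V-shaped transfer function: maps a real value to a probability in [0, 1].
    return abs(math.tanh(x))

def binary_offload_decision(position, rng=random.Random(0)):
    # Component i is offloaded (1) with probability v_transfer(position[i]);
    # the fixed-seed rng keeps the sketch reproducible.
    return [1 if rng.random() < v_transfer(x) else 0 for x in position]
```

Position components near zero thus mostly yield local execution (0), while large-magnitude components almost always yield offloading (1).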
Abstract: Existing scene text recognizers are prone to being troubled by blurred text images, leading to poor performance in practical applications. Therefore, several scene text image super-resolution (STISR) models have been proposed as pre-processors for text recognizers to improve the quality of input images. However, real-world training samples for the STISR task are difficult to collect. In addition, existing STISR models only learn to transform low-resolution (LR) text images into high-resolution (HR) text images while ignoring the blurring patterns from HR to LR images. This study proposes a blurring pattern aware module (BPAM), which learns blurring patterns from existing real-world HR-LR pairs and transfers them to other HR images to generate LR images with different degrees of degradation. Therefore, the proposed BPAM can produce massive HR-LR pairs for STISR models to compensate for the deficiency of training data, significantly improving performance. The experimental results show that when equipped with the proposed BPAM, the performance of SOTA STISR methods can be further improved. For instance, the SOTA method TG achieves a 5.8% improvement in recognition accuracy when CRNN is used for evaluation.
Abstract: The visually impaired are a vulnerable group in society and face many obstacles when traveling independently. Providing safe and reliable auxiliary equipment for the visually impaired reflects the progress of social civilization. This study introduces the key technologies for obstacle detection and identification and the related path planning algorithms for assisting visually impaired travel. The study mainly analyzes path planning algorithms applied after obstacle detection, comprehensively compares the application characteristics and scenarios of various technologies, and discusses the research progress of related methods in visually impaired assistive devices. In addition, it summarizes the current application status of multi-technology integration in intelligent assistance equipment. On this basis, combined with the advancement of technologies such as artificial intelligence and embedded devices, the future development direction of travel assistance equipment for the visually impaired is discussed.
Abstract: Currently, the application of blockchain in the supply chain is receiving increasing attention from the industry.
However, due to the presence of a large number of complex transactions in the supply chain, selecting trustworthy primary nodes poses a challenge. Therefore, based on machine learning classification algorithms and PBFT (practical Byzantine fault tolerance), this study proposes a blockchain PBFT optimization method applied to the supply chain. The integrated framework for the supply chain and blockchain is analyzed, and K-nearest neighbors (K-NN) is applied to optimize the primary node selection rules of the PBFT consensus algorithm based on the features of the nodes participating in supply chain consensus. Experimental results show that trust evaluation classification of consensus nodes can effectively address efficiency issues caused by view switching, thereby improving the consensus performance of the blockchain in terms of throughput, latency, fault tolerance, and other aspects. The proposed method is practical and provides ideas for the application of blockchain in other industries.
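The K-NN trust classification described above can be sketched as a majority vote over the k nearest labeled nodes in feature space. This is a minimal illustration, not the paper's implementation; the node features (latency, historical success rate) and trust labels are hypothetical:

```python
import math
from collections import Counter

def knn_classify(query, samples, labels, k=3):
    """Majority-vote K-NN: label a candidate node by the labels of its
    k nearest labeled nodes under Euclidean distance."""
    dists = sorted((math.dist(query, s), lab) for s, lab in zip(samples, labels))
    top = [lab for _, lab in dists[:k]]
    return Counter(top).most_common(1)[0][0]

# Hypothetical node features: (response latency, historical success rate)
nodes = [(0.1, 0.98), (0.2, 0.95), (0.9, 0.40), (0.8, 0.35), (0.15, 0.97)]
trust = ["trusted", "trusted", "untrusted", "untrusted", "trusted"]

# Classify a candidate primary node by its three nearest neighbors
label = knn_classify((0.12, 0.96), nodes, trust, k=3)
```

A node classified as "trusted" would then be eligible for primary node selection, filtering out candidates likely to trigger view switching.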
Abstract: Controllable text summarization models can generate summaries that conform to user preferences. Previous summarization models focus on controlling a single attribute alone rather than a combination of multiple attributes. When multiple control attributes must be satisfied, the traditional Seq2Seq multi-attribute controllable text summarization model cannot integrate all control attributes, accurately reproduce key information in the texts, or handle out-of-vocabulary words. Therefore, this study proposes a model based on an extended Transformer and a pointer generator network (PGN). The extended Transformer expands the Transformer's single-encoder single-decoder form into a dual encoder that extracts dual text semantic information and a single decoder that can fuse guidance signal features. Then the PGN either copies words from the source text or generates new summary words from the vocabulary, solving the OOV (out of vocabulary) problem that often occurs in summarization tasks. Additionally, to efficiently encode position information, the model utilizes relative position representations in the attention layer to introduce the sequence information of the texts. The model can be leveraged to control many important summary attributes, including length, topic, and specificity. Experiments on the public dataset MACSum show that compared with previous methods, the proposed model performs better at ensuring summary quality while conforming more closely to the attribute requirements given by users.
Abstract: Marine organisms are diverse; organisms of the same phylum show strong inter-class similarity, while organisms of different phyla differ greatly. Therefore, this study proposes a multi-hierarchical classification method for marine organisms, which utilizes the similarity among species to help the network learn biological prior knowledge. Additionally, this study designs a C-MBConv module and improves the EfficientNetV2 network architecture by combining it with the multi-hierarchical classification method; the improved architecture is called CM-EfficientNetV2. The experiments show that CM-EfficientNetV2 achieves higher accuracy than the original EfficientNetV2, with an accuracy improvement of 1.5% on the intertidal marine biology dataset of the Nanji Islands and 2% on CIFAR-100.
Abstract: With the development of the Internet of Things (IoT), efficient consensus algorithms are the key to applying blockchain technology to the IoT. This study proposes an improved PBFT consensus algorithm, the binary K-means practical Byzantine fault tolerance algorithm (BK-PBFT), to address the high communication overhead, the lack of consideration for consensus power consumption, and the high consensus latency in IoT scenarios. Firstly, it obtains the geographic coordinates of the nodes, calculates their comprehensive evaluation values, and divides the nodes into a two-layer multi-center cluster structure by the binary K-means algorithm. Then, PBFT consensus is performed on the blocks first in the lower-layer clusters and then in the upper-layer cluster. Finally, the clusters validate and store the blocks to complete the consensus. Additionally, this study proves that the algorithm achieves the minimum number of communication rounds when nodes are evenly distributed across clusters and derives the optimal number of clusters under the minimum communication overhead. The analysis and simulation results show that the proposed algorithm can effectively reduce communication overhead, consensus power consumption, and consensus latency.
Abstract: Traffic flow prediction is an important method for achieving urban traffic optimization in intelligent transportation systems. Accurate traffic flow prediction holds significant importance for traffic management and guidance. However, due to the high spatiotemporal dependence, the traffic flow exhibits complex nonlinear characteristics. Existing methods mainly consider the local spatiotemporal features of nodes in the road network, overlooking the long-term spatiotemporal characteristics of all nodes in the network. To fully explore the complex spatiotemporal dependencies in traffic flow data, this study proposes a Transformer-based traffic flow prediction model called multi-spatiotemporal self-attention Transformer (MSTTF). This model embeds temporal and spatial information through position encoding in the embedding layer and integrates various self-attention mechanisms, including adjacent spatial self-attention, similar spatial self-attention, temporal self-attention, and spatiotemporal self-attention, to uncover potential spatiotemporal dependencies in the data. The predictions are made in the output layer. The results demonstrate that the MSTTF model achieves an average reduction of 10.36% in MAE compared to the traditional spatiotemporal Transformer model. Particularly, when compared to the state-of-the-art PDFormer model, the MSTTF model achieves an average MAE reduction of 1.24%, indicating superior predictive performance.
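The various self-attention mechanisms combined in such Transformer-based predictors all build on the same scaled dot-product attention primitive. As a generic sketch (the MSTTF-specific spatial/temporal variants differ in how queries, keys, and masks are constructed, which is not shown here):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Standard scaled dot-product attention: softmax(Q K^T / sqrt(d)) V."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

# Self-attention over a toy sequence: 4 time steps, feature dimension 8
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 8))
out = scaled_dot_product_attention(X, X, X)
```

Temporal self-attention would attend over time steps of one node, while spatial variants attend over nodes at one time step; both reuse this same computation.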
Abstract: To address the difficulty that neural networks have in obtaining enough information to correctly classify images from a small amount of labeled data, this study proposes a new relational network, SDM-RNET, which combines stochastic depth networks and multi-scale convolution. First, a stochastic depth network is introduced into the model's embedding module to deepen the model. Then, in the feature extraction stage, multi-scale depthwise separable convolution replaces ordinary convolution for feature fusion. After the backbone network, deep and shallow feature fusion is applied to obtain richer image features before the model finally learns to predict image categories. Compared with other few-shot image classification methods on the mini-ImageNet, RP2K, and Omniglot datasets, the proposed method achieves the highest accuracy on 5-way 1-shot and 5-way 5-shot classification tasks.
Abstract: Multimodal sentiment analysis aims to assess users’ sentiment by analyzing the videos they upload on social platforms. The current research on multimodal sentiment analysis primarily focuses on designing complex multimodal fusion networks to learn the consistency information among modalities, which enhances the model’s performance to some extent. However, most of the research overlooks the complementary role played by the difference information among modalities, resulting in sentiment analysis biases. This study proposes a multimodal sentiment analysis model called DERL (dual encoder representation learning) based on dual encoder representation learning. This model learns modality-invariant representations and modality-specific representations by a dual encoder structure. Specifically, a cross-modal interaction encoder based on a hierarchical attention mechanism is employed to learn the modality-invariant representations of all modalities to obtain consistency information. Additionally, an intra-modal encoder based on a self-attention mechanism is adopted to learn the modality-specific representations within each modality and thus capture difference information. Furthermore, two gate network units are designed to enhance and filter the encoded features and enable a better combination of modality-invariant and modality-specific representations. Finally, during fusion, potential similar sentiment between different multimodal representations is captured for sentiment prediction by reducing the L2 distance among them. Experimental results on two publicly available datasets CMU-MOSI and CMU-MOSEI show that this model outperforms a range of baselines.
Abstract: UAV images contain many small targets against complex backgrounds, which easily leads to a high false detection rate in target detection. To solve these problems, this study proposes a small target detection algorithm for UAV images based on high-order depthwise separable convolution. Firstly, by combining the CSPNet structure with the ConvMixer network, the study utilizes depthwise separable convolution kernels to obtain richer gradient combination information and introduces a recursively gated convolution C3 module to improve the high-order spatial interaction ability of the model and enhance the sensitivity of the network to small targets. Secondly, the detection head is decoupled into two heads that separately output the classification and position information of the feature map, accelerating model convergence. Finally, the EIoU border loss function is leveraged to improve the accuracy of the detection boxes. The experimental results on the VisDrone2019 dataset show that the detection accuracy of the model reaches 35.1%, and the missed and false detection rates are significantly reduced, so the model can be effectively applied to small target detection in UAV images. The generalization ability of the model is tested on the DOTA 1.0 and HRSID datasets, and the experimental results show that the model has good robustness.
Abstract: Ciphertext-policy attribute-based encryption (CP-ABE) can provide fine-grained access control while guaranteeing data privacy. Considering that existing CP-ABE-based access control schemes cannot effectively protect critical data in edge computing, this study proposes a blockchain-based lightweight access control scheme over ciphertext (BLAC) in edge computing. In BLAC, a lightweight CP-ABE algorithm based on elliptic curve cryptography is designed, and fast elliptic curve scalar multiplication is adopted to realize encryption and decryption. Additionally, most of the encryption and decryption operations are securely offloaded to edge servers, enabling user devices with limited computing power to efficiently complete fine-grained access control over ciphertext data. Meanwhile, a distributed key management method based on blockchain is designed, which enables multiple edge servers to collaboratively distribute private keys to users via the blockchain. Security analysis and performance evaluation show that BLAC guarantees data confidentiality, resists collusion attacks, and supports forward security, while achieving high user-side computational efficiency and low server-side decryption and storage overhead.
Abstract: Liver cancer is a malignant tumor that originates from liver cells, and its diagnosis has long been both a difficult medical problem and a research hotspot in various fields. Early diagnosis of liver cancer can reduce its mortality rate. Histopathological image examination is the gold standard for oncology diagnosis, as the images display the cells and structures of tissue slices and can be employed to determine cell types, tissue structures, and the number and morphology of abnormal cells, as well as to evaluate the specific condition of the tumor. This study focuses on the application of convolutional neural networks in liver cancer diagnosis algorithms for pathological images, including liver tumor detection, image segmentation, and preoperative prediction. The design ideas, improvement goals, and methods of each convolutional neural network algorithm are elaborated in detail to provide clearer reference ideas for researchers. Additionally, the advantages and disadvantages of convolutional neural network algorithms in diagnosis are summarized and analyzed, and potential research hotspots and related difficulties in the future are discussed.
Abstract: Aiming at the inaccurate 3D human pose predictions caused by factors such as occlusion and pose complexity, this study proposes an improved 3D human pose estimation algorithm to obtain accurate 3D human poses and enhance estimation performance. It adopts the graph attention block from the spatio-temporal graph attention convolutional network to construct the entire network. On this basis, the network structure of the global multi-head graph attention part is improved to facilitate better information propagation and fusion among nodes and to capture semantic information not explicitly represented in the graph. Kinematic constraints are introduced as well, and a bone length loss is added on top of the MPJPE loss. By modeling local and global spatial node information, the model learns the kinematic constraints of human skeletal movements, including local kinematic connections, symmetry, and global poses. Empirical results show that the improved model effectively enhances the performance of human pose estimation. Compared to the original model on the Human3.6M dataset, it achieves a 1.8% improvement in mean per joint position error (MPJPE) and a 1.3% improvement in Procrustes-aligned MPJPE (P-MPJPE) after rigid alignment of predicted and true joints.
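The combination of MPJPE with an added bone length term can be sketched as follows. This is a minimal NumPy illustration under assumed conventions (joint arrays of shape [joints, 3], a hypothetical bone list, and a hypothetical weighting factor lam), not the paper's exact loss:

```python
import numpy as np

def mpjpe(pred, gt):
    """Mean per joint position error: average Euclidean distance per joint."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())

def bone_length_loss(pred, gt, bones):
    """Penalize deviation of predicted bone lengths from ground-truth lengths."""
    diffs = [abs(np.linalg.norm(pred[i] - pred[j]) -
                 np.linalg.norm(gt[i] - gt[j])) for i, j in bones]
    return float(np.mean(diffs))

def total_loss(pred, gt, bones, lam=0.1):
    """MPJPE plus a weighted bone length term (lam is a hypothetical weight)."""
    return mpjpe(pred, gt) + lam * bone_length_loss(pred, gt, bones)

# Toy 3-joint chain (hip -> knee -> ankle); bones given as joint index pairs
gt   = np.array([[0.0, 0.0, 0.0], [0.0, -0.4, 0.0], [0.0, -0.8, 0.0]])
pred = np.array([[0.0, 0.0, 0.0], [0.0, -0.5, 0.0], [0.0, -0.9, 0.0]])
loss = total_loss(pred, gt, bones=[(0, 1), (1, 2)])
```

The bone term rewards predictions whose limb lengths match the skeleton even when individual joints are displaced, which is how kinematic constraints enter the objective.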
Abstract: Training deep neural networks (DNNs) in mission-critical scenarios consumes increasingly more resources, which stimulates model stealing from cloud prediction APIs and violates the intellectual property rights of model owners. To trace illegal public model copies, DNN model fingerprinting provides a promising copyright verification option for model owners who want to preserve model integrity. However, existing fingerprinting schemes are mainly based on output-level traces (e.g., mis-prediction behavior on special inputs), which limits their stealthiness during fingerprint verification. This study proposes a novel task-agnostic fingerprinting scheme based on saliency map traces of model predictions. The proposed scheme puts forward a constrained manipulation objective over saliency maps to construct clean-label and natural fingerprint samples, thus significantly improving the stealthiness of model fingerprints. According to extensive evaluation results on three typical tasks, the scheme substantially enhances the fingerprint effectiveness of existing schemes while keeping the model fingerprints highly stealthy.
Abstract: Inaccurate phase estimation in single-channel speech enhancement tasks causes poor quality of the enhanced speech. To this end, this study proposes a speech enhancement method based on a deep complex axial self-attention convolutional recurrent network (DCACRN), which enhances both amplitude and phase information in the complex domain. Firstly, a complex convolutional network-based encoder extracts complex features from the input speech signal, and a convolutional skip module is introduced to map the features into a high-dimensional space for feature fusion, which enhances information interaction and gradient flow. Then an encoder-decoder structure based on the axial self-attention mechanism is designed to enhance the model's temporal modeling and feature extraction abilities. Finally, the speech signal is reconstructed by the decoder, and a hybrid loss function is adopted to optimize the network model and improve the quality of the enhanced speech. The experiments are conducted on the public datasets Valentini and DNS Challenge, and the results show that the proposed method improves both the perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI) metrics compared to other models. On the non-reverberant dataset, PESQ is improved by 12.8% over DCTCRN and 3.9% over DCCRN, which validates the effectiveness of the proposed model in speech enhancement tasks.
Abstract: At present, most recognition of students' classroom behavior is based on single-frame images and ignores behavioral coherence, so video information cannot be fully utilized to accurately depict students' classroom behavior. Therefore, this study proposes an improved YOWO algorithm that effectively employs video information to identify students' classroom behavior. First, teaching videos are collected from real classroom teaching at a university, and an AVA-format video dataset containing five types of students' classroom behavior is produced. Second, the temporal shift module (TSM) is adopted to enhance the model's ability to obtain temporal context information. Finally, a non-local operation module is utilized to improve the model's ability to extract key location information. The experimental results show that optimizing the YOWO model yields better recognition performance. On the classroom behavior dataset, the mAP of the improved algorithm is 95.7%, 4.6% higher than that of the original YOWO algorithm. The number of parameters in the model is reduced by 32.3% to 81.97×10⁶, and the computation amount is decreased by 9.6% to 22.6 GFLOPs. The detection speed is 24.03 f/s, an increase of about 3 f/s.
Abstract: By directly processing each view of the original data, multi-view subspace clustering algorithms typically obtain latent subspace representation matrices. However, these methods often underestimate the influence of redundant data, making it challenging to capture accurate clustering structure in the latent subspace representation. Furthermore, the K-means algorithm used to produce the clustering results easily neglects the local structure of the data within the subspaces, leading to unstable results. To address these problems, this study proposes a multi-view subspace clustering method that acquires high-quality subspace representations. Specifically, the study initially obtains a robust representation through a feature decomposition method. Then, it constructs a joint latent subspace representation for multiple views. Next, it uses spectral rotation to obtain clustering results and employs orthogonal constraints on the partition matrix to reconstruct the subspaces, thereby enhancing clustering performance. Finally, an iterative optimization algorithm is applied to solve the relevant optimization problems. Experiments are conducted on five benchmark datasets, and the results demonstrate that the proposed algorithm is more effective than recent multi-view clustering algorithms.
Abstract: Gait recognition is the process of identifying individuals based on their walking patterns. Currently, most gait recognition methods employ shallow neural networks for feature extraction, which perform well on indoor gait datasets but poorly on the newly released outdoor gait datasets. To address the complex challenges posed by outdoor gait datasets, this study proposes a deep gait recognition model based on video residual neural networks. In the feature extraction phase, a deep 3D convolutional neural network (3D CNN) is constructed from the proposed video residual blocks to extract the spatio-temporal dynamic features of the entire gait sequence. Subsequently, temporal pooling and horizontal pyramid mapping are introduced to reduce the feature resolution of the sampled data and extract local gait features. The training process is driven by a joint loss function, and finally the loss functions are balanced and the feature space is adjusted by BNNeck. The experiments are conducted on three publicly available gait datasets, covering both indoor (CASIA-B) and outdoor (GREW, Gait3D) settings. The experimental results verify that the model outperforms other models in accuracy and convergence speed on outdoor gait datasets.
Abstract: Mobile edge computing and ultra-dense network technologies have obvious advantages in improving the computing power of mobile devices and enhancing network capacity. However, in scenarios where the two converge, how to effectively reduce co-channel interference among base stations and reduce the delay and energy consumption of task transmission is an important research topic. Therefore, this study designs a distributed wireless resource management algorithm based on multi-base-station game equilibrium. The wireless resource management problem among small base stations is transformed into a game problem, and a reward-driven strategy selection algorithm is proposed. The base stations iteratively update the selection probabilities of their strategies, which finally optimizes sub-channel allocation and transmission power regulation. Simulation results show that the proposed algorithm has advantages in improving channel utilization and reducing the latency and energy consumption of task transmission.
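A reward-driven probability update of the kind described above can be sketched in the style of a learning automaton. This is an assumed linear reward-inaction rule, not the paper's exact update; the strategy set and learning rate are hypothetical:

```python
def update_probs(probs, chosen, reward, lr=0.1):
    """Reward-driven update (linear reward-inaction style): if the chosen
    strategy earned a reward, shift probability mass toward it; otherwise
    leave the distribution unchanged."""
    if not reward:
        return probs[:]
    new = [p * (1 - lr) for p in probs]  # shrink all probabilities
    new[chosen] += lr                    # move the freed mass to the winner
    return new

# Three candidate (sub-channel, power) strategies, initially uniform
probs = [1 / 3] * 3
for _ in range(50):  # suppose strategy 1 keeps earning rewards
    probs = update_probs(probs, chosen=1, reward=True)
```

After repeated rewards the distribution concentrates on the rewarded strategy, which is how the iterative updates converge toward an equilibrium allocation.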
Abstract: With the continuous evolution of computer technology, process simulation, which utilizes simulation models to mimic business process behavior, is becoming increasingly widely employed in various industries. It can be adopted to predict and optimize system performance, assess the impact of decisions, provide a decision-making basis for managers, and reduce experimental cost and time. Currently, how to efficiently develop a trustworthy simulation model has attracted widespread attention. This study traces, summarizes, and analyzes the relevant literature on methods for building business process simulation models. Meanwhile, the processes, advantages, disadvantages, and progress of process model-based, system dynamics-based, and deep learning-based simulation modeling approaches are presented. Finally, the challenges and future directions of process simulation are discussed to provide references for future research in this field.
Abstract: The security of electric energy plays an important role in national security. With the development of power 5G communication, a large number of power terminals have positioning demands. The traditional global positioning system (GPS) is vulnerable to spoofing, so how to effectively improve the security of GPS positioning has become an urgent problem. This study proposes a base-station-assisted GPS spoofing detection algorithm for power 5G terminals. It uses the highly secure base station positioning to verify GPS positioning that may be spoofed and introduces a consistency factor (CF) to measure the consistency between the GPS positioning and the base station positioning. If the CF is greater than a threshold, the GPS positioning is classified as spoofed; otherwise, it is judged as normal. The experimental results show that the accuracy of the algorithm is 99.98%, higher than that of traditional machine-learning-based classification algorithms. In addition, the proposed scheme is also faster than those algorithms.
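The threshold test described above can be sketched as follows. The CF is assumed here to be the planar distance between the two position fixes, and the 50 m threshold is hypothetical; the paper's exact CF definition may differ:

```python
import math

def consistency_factor(gps_pos, bs_pos):
    """A simple consistency factor: Euclidean distance (in meters) between
    the GPS fix and the base-station fix in a local planar frame.
    (Assumed definition; the paper's exact CF may differ.)"""
    return math.dist(gps_pos, bs_pos)

def is_spoofed(gps_pos, bs_pos, threshold=50.0):
    """Flag the GPS fix as spoofed when the CF exceeds the threshold."""
    return consistency_factor(gps_pos, bs_pos) > threshold

# Consistent fixes: CF is about 11 m, below the threshold
normal = is_spoofed((100.0, 200.0), (110.0, 195.0))
# Widely divergent fixes: CF is roughly 943 m, above the threshold
spoofed = is_spoofed((100.0, 200.0), (900.0, 700.0))
```

Because the base station fix is hard to spoof, a large divergence from the GPS fix is treated as evidence of GPS spoofing rather than ordinary positioning noise.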
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3