• Online First

    Select All
    Display Type: |
    • Inverse Target Interference for Image Data Augmentation

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009507

      Abstract (14) HTML (0) PDF 2.21 M (18) Comment (0) Favorites

      Abstract:Mixed sample data enhancement methods focus only on the model’s forward representation of the category to which the image belongs while ignoring the reverse determination of whether the image belongs to a specific category. To address the problem of uniquely describing image categories and affecting model performance, this study proposes a method of image data augmentation with inverse target interference. To prevent overfitting of the network model, the method first modifies the original image to increase the diversity of background and target images. Secondly, the idea of reverse learning is adopted to enable the network model to correctly identify the category that the original image belongs to while fully learning the attributes of the populated image that do not belong to that category to increase the confidence of the network model in identifying the category that the original image belongs to. In conclusion, to verify the method’s effectiveness, the study utilizes different network models to perform many experiments on five datasets including CIFAR-10 and CIFAR-100. Experimental results show that compared to other state-of-the-art data augmentation methods, the proposed method can significantly enhance the model’s learning effect and generalization ability in complex settings.

    • Colon Polyp Segmentation Fusing Multi-scale Gate Convolution and Window Attention

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009509

      Abstract (13) HTML (0) PDF 1.22 M (22) Comment (0) Favorites

      Abstract:Accurate segmentation of colon polyps is important to remove abnormal tissue and reduce the risk of polyps converting to colon cancer. The current colon polyp segmentation model has the problems of high misjudgment rate and low segmentation accuracy in the segmentation of polyp images. To achieve accurate segmentation of polyp images, this study proposes a colon polyp segmentation model (MGW-Net) combining multi-scale gated convolution and window attention. Firstly, it designs an improved multi-scale gate convolution module (MGCM) to replace the U-Net convolutional block to achieve full extraction of colon polyp image information. Secondly, to reduce the information loss at the skip connection and make full use of the information at the bottom of the network, the study builds a multi-information fusion enhancement module (MFEM) by combining improved dilated convolution and hybrid enhanced residual window attention to optimize the feature fusion at the skip connection. Experimental results on CVC-ClinicDB and Kvasir-SEG data sets show that the similarity coefficients of MGW-Net are 93.8% and 92.7%, and the average crossover ratio is 89.4% and 87.9%, respectively. Experimental results on CVC-ColonDB, CVC-300, and ETIS datasets show that MGW-Net has strong generalization performance, which verifies that MGW-Net can effectively improve the accuracy and robustness of colon polyp segmentation.

    • Sparse Convolutional Network with Global Context Enhancement for Anti-external Force Damage Detection of Power Grid

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009514

      Abstract (10) HTML (0) PDF 2.40 M (24) Comment (0) Favorites

      Abstract:In the anti-external force damage inspection of transmission lines, the current lightweight target detection algorithm deployed at the edge has insufficient detection accuracy and slow reasoning speed. To solve the above problems, this study proposes a sparse convolution network (SCN) with global context enhancement for anti-external force damage detection of the power grid, Fast-YOLOv5. Based on the YOLOv5 algorithm, the FasterNet+ network is designed as a new feature extraction network, which can maintain detection accuracy, improve the reasoning speed of the model, and reduce computational complexity. In the bottleneck layer of the algorithm, an ECAFN module with efficient channel attention is designed, which improves the detection effect by adaptively calibrating the feature response in the channel direction, efficiently obtaining the cross-channel interactive information and further reducing the amount of parameters and calculation. The study proposes the detection layer of the sparse convolutional network SCN replacement model with context enhancement to enhance the foreground focus feature and improve the prediction ability of the model by capturing the global context information. The experimental results show that compared with the original model, the accuracy of the improved model is increased by 1.9%, and the detection speed is doubled, reaching 56.2 FPS. The amount of parameters and calculation are reduced by 50% and 53% respectively, which is more in line with the requirements for efficient detection of transmission lines.

    • Super-resolution Reconstruction of Remote Sensing Images with Cross-scale Hybrid Attention

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009525

      Abstract (16) HTML (0) PDF 1.38 M (23) Comment (0) Favorites

      Abstract:To address the inadequacy of existing remote sensing image super-resolution reconstruction models in long-term feature similarity and multi-scale feature relevance, this study proposes a novel remote sensing image super-resolution reconstruction algorithm based on a cross-scale hybrid attention mechanism. Initially, the study introduces a global layer attention (GLA) mechanism and employs layer-wise attention to weight and merge global features across different levels, thereby modeling the extended dependency between low-resolution and high-resolution image features. Concurrently, it designs a cross-scale local attention (CSLA) mechanism to identify and integrate local information patches in multi-scale low-resolution feature maps that correspond with high-resolution images, enhancing the model’s ability to restore image details. Finally, the study proposes a local information-aware loss function to guide the image reconstruction process, further improving the visual quality and detail preservation of the reconstructed images. Experiments on UC-Merced datasets demonstrate that the proposed method outperforms most mainstream methods in terms of average PSNR/SSIM across three magnification factors and exhibits superior quality and detail preservation in visual results.

    • Degraded Document Images Binarization Based on Multidimensional Side Window Clustering Segmentation

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009526

      Abstract (10) HTML (0) PDF 1.80 M (26) Comment (0) Favorites

      Abstract:The distribution of grayscale values in calligraphic character document images exhibits significant variations under poor lighting conditions, resulting in lower image contrast in low-light areas and degradation of morphological texture features of the strokes. Traditional methods typically focus on local information such as mean, squared deviation, and entropy, while giving less consideration to morphological texture, rendering them insensitive to the features of low-contrast areas. To address these issues, this study proposes a binarization method called clustering segmentation-based side-window filter (CS-SWF) specifically designed for degraded calligraphic documents. Firstly, this method utilizes multi-dimensional SWF to describe pixel chunks with similar morphological features. Then, with multiple correction rules, it utilizes downsampling to extract low-latitude information and correct feature regions. Finally, the clustered blocks in the feature map are classified to obtain the binarization results. To evaluate the performance of the proposed method, it is compared with existing methods using F-measure (FM), peak signal-to-noise ratio (PSNR), and distance reciprocal distortion (DRD) as indicators. Experimental results on a self-constructed dataset consisting of 100 handwritten degraded document images demonstrate that the proposed binarization method exhibits greater stability in low-contrast dark regions and outperforms the comparison algorithm in terms of accuracy and robustness.

    • Construction and Characterisation of WUI Fire Causal Factor Network Based on Text Mining

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009527

      Abstract (8) HTML (0) PDF 1.15 M (25) Comment (0) Favorites

      Abstract:To prevent and reduce the occurrence of WUI fires, this study mines the key causal factors of WUI fires and clarifies the action mechanism between the causal factors. First, this study obtains the causal factors from WUI fire accident cases based on the proposed mining technology and uses the Apriori algorithm to obtain association rules between the causal factors. Then it uses the complex network theory to construct the WUI fire causal factor network, calculate the topological parameters of the network, and analyze the characteristics of the WUI fire causal network. Finally, the study introduces the risk index of the WUI fire causal chain, mines the high-risk connecting edges, and proposes the chain breaking measures. The results show that the WUI fire causal factor network has a small-world characteristic, and high temperature, strong wind, and drought have a greater influence on other causal factors. Burning waste, plant fire, emergency response speed, human arson, and strong wind have important roles in the conversion of different causal factors, which should be controlled better. The most risky side of the network is burning waste → plant fire, and the risk chain can be cut off by enacting regulations such as the prohibition of unauthorized burning waste, to achieve the prevention and active control of WUI fires.

    • Multi-level Feature Interaction Transformer for Multi-organ Image Segmentation

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009528

      Abstract (14) HTML (0) PDF 3.57 M (25) Comment (0) Favorites

      Abstract:Clinical diagnoses can be facilitated through the utilization of multi-organ medical image segmentation. This study proposes a multi-level feature interaction Transformer model to address the issues of weak global feature extraction capability in CNN, weak local feature extraction capability in Transformer, and the quadratic computational complexity problem of Transformer for multi-organ medical image segmentation. The proposed model employs CNN for extracting local features, which are then transformed into global features through Swin Transformer. Multi-level local and global features are generated through down-sampling, and each level of local and global features undergo interaction and enhancement. After the enhancement at each level, the features are cross-fused by multi-level feature fusion modules. The features, once again fused, pass through up-sampling and segmentation heads to produce segmentation masks. The proposed model is experimented on the Synapse and ACDC datasets, achieving average dice similarity coefficient (DSC) and average 95th percentile Hausdorff distance (HD95) values of 80.16% and 19.20 mm, respectively. These results outperform representative models such as LGNet and RFE-UNet. The proposed model is effective for multi-organ medical image segmentation.

    • Siamese Low-light Video Enhancement Network with Fusion of Local and Global Features

      Online: April 19,2024 DOI: 10.15888/j.cnki.csa.009533

      Abstract (13) HTML (0) PDF 2.34 M (26) Comment (0) Favorites

      Abstract:Videos captured in low illumination environments often carry problems such as low contrast, high noise, and unclear details, which seriously affect computer vision tasks such as target detection and segmentation. Most of the existing low-light video enhancement methods are constructed based on convolutional neural networks. Since convolution cannot make full use of the long-range dependencies between pixels, the generated video often suffers from loss of details and color distortion in some regions. To address the above problems, this study proposes a Siamese low-light video enhancement network coupling local and global features. The model obtains local features of video frames through a deformable convolution-based local feature extraction module and designs a lightweight self-attention module to capture the global features of video frames. Finally, the extracted local and global features are fused by a feature fusion module, which guides the model to generate enhanced videos with more realistic colors and details. The experimental results show that the proposed method can effectively improve the brightness of low-light videos and generate videos with richer colors and details. It also outperforms the methods proposed in recent years in evaluation metrics such as peak signal-to-noise ratio and structural similarity.

    • Land Cover Classification of Time-series SAR Images Using Mult-TWDTW Algorithm

      Online: April 07,2024 DOI: 10.15888/j.cnki.csa.009518

      Abstract (17) HTML (0) PDF 3.11 M (88) Comment (0) Favorites

      Abstract:Synthetic aperture radar (SAR) images provide an important time-series data source for land cover classification. The existing time-series matching algorithms can fully exploit the similarity among time-series features to obtain satisfactory classification results. In this study, the classic time-series matching algorithm named time-weighted dynamic time warping (TWDTW), which comprehensively considers shape similarity and phenological differences, is introduced to guide SAR-based land cover classification. To solve the problem that the traditional TWDTW algorithm only considers the similarity matching of a single feature on the time series, this study proposes a multi-feature fusion-based TWDTW (Mult-TWDTW) algorithm. In the proposed method, three features, namely, the backscattering coefficient, interferometric coherence, and the dual-polarization radar vegetation index (DpRVI), are extracted, and the Mult-TWDTW model is designed by fusing multiple features based on the TWDTW algorithm. To verify the effectiveness of the proposed method, the study implements land cover classification in the Danjiangkou area using time-series data obtained from the Sentinel-1A satellite. Then, the Mult-TWDTW algorithm is compared with the multi-layer perception (MLP), one-dimensional convolutional neural network (1D-CNN), K-means, and support vector machine (SVM) algorithms as well as the TWDTW algorithm using a single feature. The experimental results show that the Mult-TWDTW algorithm obtains the best classification results, manifested as its overall accuracy and Kappa coefficient reaching 95.09% and 91.76, respectively. In summary, the Mult-TWDTW algorithm effectively fuses the information of multiple features and can enhance the potential of time-series matching algorithms in the classification of multiple types of land covers.

    • Agricultural Product Recommendation Algorithm Based on Fine-grained Feature Interactive Selection Network

      Online: April 07,2024 DOI: 10.15888/j.cnki.csa.009519

      Abstract (11) HTML (0) PDF 767.58 K (100) Comment (0) Favorites

      Abstract:In the digital era, an increasing number of people prefer shopping on e-commerce platforms. With the development of agricultural product e-commerce platforms, consumers find it challenging to discover suitable products among numerous choices. To enhance user satisfaction and purchase intent, agricultural product e-commerce platforms need to recommend appropriate products based on user preferences. Considering various agricultural features such as season, region, user interests, and product attributes, feature interactions can better capture user demands. This study introduces a new model, fine-grained feature interaction selection networks (FgFisNet). The model effectively learns feature interactions using both the inner product and Hadamard product by introducing fine-grained interaction layers and feature interaction selection layers. During the training process, it automatically identifies important feature interactions, eliminates redundant ones, and feeds the significant feature interactions and first-order features into a deep neural network to obtain the final click through rate (CTR) prediction. Extensive experiments on a real dataset from agricultural e-commerce demonstrate significant economic benefits achieved by the proposed FgFisNet method.

    Prev 1 2 3 Next Last
    Result 43 Jump to Page GO
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063