3D Dense Captioning Method Based on Multi-level Context Voting

doi:10.15888/j.cnki.csa.008997

WeChat

Mobile website

Home > Archive>Volume 32, Issue 3, 2023 >291-299. DOI:10.15888/j.cnki.csa.008997

PDF HTML XML Export Cite reminder

3D Dense Captioning Method Based on Multi-level Context Voting
DOI:
                        10.15888/j.cnki.csa.008997
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Traditional three-dimensional (3D) dense captioning methods have problems such as insufficient consideration of point-cloud context information, loss of feature information, and thin hidden state information. Therefore, a multi-level context voting network is proposed. It uses the self-attention mechanism to capture the context information of point clouds in the voting process and utilizes it at multiple levels to improve the accuracy of object detection. Meanwhile, the temporal fusion of hidden state and attention module is designed to fuse the hidden state of the current moment with the attention result of the previous moment to enrich the information of the hidden state and thus improve the expressiveness of the model. In addition, a “two-stage” training method is adopted in the model, which can effectively filter out the generated low-quality object proposals and enhance the description effect. Extensive experiments on official datasets ScanNet and ScanRefer show that this method achieves more competitive results compared to baseline methods.

Reference

Cited by

Get Citation

吴春雷,郝宇钦,李阳.基于多层级上下文投票的三维密集字幕.计算机系统应用,2023,32(3):291-299

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:August 03,2022
Revised:September 07,2022
Adopted:
Online: December 09,2022
Published:

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

WeChat

Mobile website

Get Citation

Share

Article Metrics

History