###
计算机系统应用英文版:2017,26(2):18-24
本文二维码信息
码上扫一扫!
旅游自动应答语义模型分析与实践
王彦1,2, 左春1,2,3, 曾炼4
(1.中国科学院软件研究所 软件工程技术研究开发中心, 北京 100190;2.中国科学院大学, 北京 100049;3.中科软科技股份有限公司, 北京 100190;4.金童软件科技有限公司, 南京 210019)
Analysis and Practice of Semantic Model in Tourism Auto-Answering System
(1.Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China;2.University of Chinese Academy of Sciences, Beijing 100049, China;3.Sinosoft Co. Ltd., Beijing 100190, China;4.Jintongsoft Co. Ltd., Nanjing 210019, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 1300次   下载 2075
Received:May 18, 2016    Revised:July 25, 2016
中文摘要: 针对常见问答系统采用的以词法分析为基础的浅层语义模型难以有效挖掘用户问句深层语义的问题,本文立足于旅游问答应用领域,采用组合范畴语法对旅游问句进行句法分析,使用Lambda演算式表示问句语义,以此构建旅游领域问句的语义模型,以便于通过精确的问句语义快速查找应答结果.研究首先进行旅游领域数据采集与语料标注的准备性工作,并针对语料对旅游问句的句式句法进行分析;然后采用基于概率的组合范畴语法的监督学习过程,通过训练获得较为可靠的旅游问句语义词典;最后根据语义词典及其他相关知识,学习用户问句语义,构建旅游自动应答语义分析系统,着重于问句解析和相应的语义模型的构建.通过在评测集上的验证,这种语义解析方法在解析效果上有比较明确的提升.
Abstract:According to the weakness of shallow semantics models based on lexical analysis which are commonly used in QA system, shallow semantics models cannot accurately analyze the deep semantics of users' questions. This paper focuses on the tourism QA application field, adopts the combined category grammar (CCG) to parse the question sentences, and uses lambda calculus to express the question semantics, so that semantic models on tourism questions can be derived. And it's convenient to search answers according to such accurate semantic quickly. The research first carries out data acquisition and corpus tagging preparatory work,including the analysis of tourism question corpus both in sentence pattern and syntax. Then the supervised learning process based on a probabilistic CCG algorithm is used to train a reliable semantic dictionary. At last, an automated answering system is built according to the semantic dictionary and related knowledge, which is mainly about the question parsing and building of corresponding semantic models. The final result on evaluation dataset shows that the semantic analysis method has relatively clear improvement in analytical performance.
文章编号:     中图分类号:    文献标志码:
基金项目:
引用文本:
王彦,左春,曾炼.旅游自动应答语义模型分析与实践.计算机系统应用,2017,26(2):18-24
WANG Yan,ZUO Chun,ZENG Lian.Analysis and Practice of Semantic Model in Tourism Auto-Answering System.COMPUTER SYSTEMS APPLICATIONS,2017,26(2):18-24