Computer Systems Applications, 2011, 20(3): 41-44
Batch Extraction Information System from Dynamic Web
(1.Ningxia Wanwei IT Technology Co, Yinchuan 750000, China;2.Science College of Ningxia Medical University, Yinchuan 750000, China)
Received: July 16, 2010; Revised: August 19, 2010
Abstract: To meet the widespread demand for obtaining information from Web pages, this paper analyzes the key techniques for extracting it, such as URL addresses, HTML pages, and the HtmlParse parsing library. Taking the retrieval of business yellow-page information from Google Maps as an example, a system prototype was designed and implemented following the techniques and steps for automatically extracting data from it. Related problems and their corresponding solutions are also discussed.
Keywords: Web page; HtmlParse; Google Maps; information extraction; system
Funding: Ningxia Science and Technology Key Research Program (KGX-01-10-01)
Cite this article:
MA Long,ZHANG Chun-Tao,YANG De-Ren.Batch Extraction Information System from Dynamic Web.COMPUTER SYSTEMS APPLICATIONS,2011,20(3):41-44