(小三号黑体)
摘 要
随着信息的发展,出现了越来越多的非结构化信息。并且非结构化信息在政府和企业等的决策中扮演着重要的角色。如何将非结构化数据有效的管理起来,能够进行数据和知识挖掘,提取当中的隐含信息,提供一种形象的可视分析,为政府和企业决策提供支持成为当今亟待解决的主要问题。
本文以北京市科委的指数统计文档为研究对象,主要任务是针对以北京市科
(小四号宋体)
委的指数统计文档为代表的非结构化信息的抽取和企业指标信息的可视分析。主要工作包括三个方面:第一,设计了一套以北京市科委的指数统计文档编写规范为标准的确实可行的信息抽取算法;第二,针对抽取出来的指标信息,借助于Dundas可视化工具进行可视分析;第三,完成了一个满足客户需求的企业信息库管理系统。
论文从项目背景出发,介绍了系统开发的背景和研究价值。然后,详细介绍
了企业指标信息智能处理的可行性和算法设计,以及企业指标信息可视分析的原理及其实现。再次,论文详细阐述了系统的需求,具体介绍了企业信息库管理系统的设计及其实现,最后论文针对企业信息库管理系统进行了分析和评价,并指明了下一步的改进计划。
(小四号黑体)
(小四号宋体)
关键词:非结构化信息;信息可视化;可视分析
(小三号Times New Roman加粗)
Abstract
With the development of information, there has been an increasing number of unstructured information. And it plays an important role in decision of government and enterprise, etc. How to manage the unstructured information efficiently, mine the data and knowledge, extract the implicit information, provide a visual image analysis, and then support the government and enterprise's decision have become the main issues to be settled urgently.
In this question for discussion, we mainly have a research in indicator of enterprise documents from the Beijing Science and Technology Commission and try to obtain the indicators of the unstructured information, and then provide a visual image analysis. It includes three aspects: First, to design a set of practical information extraction algorithm; second, through the use of the Dundas Chart toolbox, providing visual analysis; third, completed Enterprise Information Management System which meet customers requirement.
The beginning of the dissertation introduced the background of the project, introduced the background of the system and research value. Second, detailing information extraction algorithms and principles of Information Visualization. Third, the dissertation elaborated the system's requirement, specifically introduced the system design and implementation. Finally, some possible improvements and future works were presented.
(小四号Times New Roman加粗)
(小四号Times New
(小四号Times New Rom
Key words: Unstructured Information; Information Visualization; Visual Analysis