引用本文:王德文,肖凯,肖磊.基于Hive的电力设备状态信息数据仓库[J].电力系统保护与控制,2013,41(9):125-130.
WANG De-wen,XIAO Kai,XIAO Lei.Data warehouse of electric power equipment condition information based on Hive[J].Power System Protection and Control,2013,41(9):125-130
【打印本页】   【下载PDF全文】   查看/发表评论  【EndNote】   【RefMan】   【BibTex】
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览 5424次   下载 473 本文二维码信息
码上扫一扫!
分享到: 微信 更多
基于Hive的电力设备状态信息数据仓库
王德文1, 肖凯1, 肖磊1
华北电力大学控制与计算机工程学院,河北 保定 071003
摘要:
随着智能变电站的建设及其状态监测系统的发展,电力设备状态信息数据规模逐渐增大。针对现有电力数据仓库在海量状态数据存储查询和分析处理方面的不足,提出基于Hive的电力设备状态信息数据仓库及其多维数据快速查询与分析方法。通过对状态监测系统与生产管理系统(PMS)的分析,将电力设备静态信息与状态监测信息存储到Hive数据仓库中。设计了基于Hive的电力设备状态信息数据仓库的系统架构和海量状态数据存储结构,采用Hadoop 分布式文件系统(HDFS)对数据进行分布式存储管理,MapReduce作为海量数据查询分析
关键词:  智能变电站  电力设备状态信息  数据仓库  Hive  HDFS
DOI:10.7667/j.issn.1674-3415.2013.09.021
分类号:
基金项目:国家自然科学基金(61074078);中央高校基本科研业务费专项资金(12MS113)
Data warehouse of electric power equipment condition information based on Hive
WANG De-wen,XIAO Kai,XIAO Lei
Abstract:
With the continuous construction of smart substation and the development of its condition monitoring system, the data size of electric power equipment on condition monitoring is leaping. Aiming at deficiencies of current electric power data warehouse on massive condition monitoring data storage, query and analysis, a method of data warehouse based on Hive for fast query and analysis on multidimensional data is proposed. First, through analyzing condition monitoring system and production management system, the static information and condition monitoring information of electric power equipment are stored in Hive data warehouse. Second, the architecture of the data warehouse and storage structure of massive condition data are designed, adopting Hadoop distributed file system (HDFS) for distributed storage and management, MapReduce as computing model of massive data query and analysis, and Hive Query Language (HiveQL) as a control tool of data warehouse. The process of data warehouse is given respectively. Finally, an experimental platform of data warehouse for electric power equipment condition information based on Hive is established, results of multidimensional data queries on 5 nodes and 10 nodes cluster show that this method has good scalability, and can meet the needs of fast query on large scaled multidimensional condition data of electric power equipment.
Key words:  smart substation  electric power equipment condition information  data warehouse  Hive  HDFS
  • 1
X关闭
  • 1
X关闭