I. Introduction
It is imperative to build a smart grid (SG) with renewable energy resources such as solar energy, wind power, hydropower, geothermal, bio energy, etc. [1]–[3]. With the rapid technology development, the new inventions such as electrical vehicle, large capacity batteries, smart sensors, etc. are dramatically changing the traditional power systems [3]–[5]. The pervasively located smart sensors provide massive heterogeneous data such as voltage, frequency, real-time electrical price, wind speed, etc., which are significant indispensable for the SG operation, maintenance, and management. However, the useful knowledge usually hides in the collected big data, and lacks of spatial information. Therefore, in this paper, based on the Apache Spark, an open source cluster computing framework, a platform is built to effectively collect, store, and parallelly process the big data. Furthermore, with the Google Earth, a global geographic information system (GIS), the discovered knowledge can be cooperated with spatial information and visualized in various ways.