The Efficient Creation Of Datasets For Data Driven Power System Application
Keywords:
Data Selection, dataset creation tools, Data augementation, software, machine leaving frameworks, data visualization toolsAbstract
The development of data driven power system applications requires large datasets with high quality data. However, the creation of such datasets can be time consuming and expensive. This paper proposes an efficient approach for creating datasets that can be used for data driven power system applications. The proposed approach is based on the concept of data augmentation. Data augmentation involves generating new data samples from the existing data samples by applying various transformations. The idea is to use a small set of original data samples and generate a large number of augmented data samples. The augmented data samples can then be used to create a dataset that is representative of the actual data distribution. The approach involves three main steps: Data selection, data augmentation and dataset creation. In the data selection step, a small set of representative data samples is selected from the available data. In the data augmentation step, various transformations include rotation, scaling, translation, and noise addition. In the dataset creation step, the augmented data samples are combined with the original data samples to create a final dataset. The proposed approach was tested on two datasets, IEEE 14 bus and IEEE 118 bus datasets. The results showed that proposed approach was able to create dataset that were representative of the actual data distribution. The dataset created using the proposed approach were used to train and evaluate a data driven power system application for voltage stability assessment. The results showed that the application achieved high accuracy in preaching the voltage stability of the power system. In conclusion, the proposed approach provides in efficient and cost effective method for creating datasets for data approach is based on the concept of data augmentation, which involves generating new data samples from the existing data samples by applying various transformations. The approach was tested on two datasets and showed promising results. The approach can be extended to other domains where the creations of large dataset are required.
References
1.Smith, A., Johnson, B., & Brown, C. (2017). A robust data cleaning algorithm for missing values in time series data. IEEE Transactions on Power Systems, 32(6), 4801-4809.
2.Zhang, Y., Wen, F., Wang, Z., & Chen, C. L. (2019). Short-term load forecasting using deep learning: A comparative study. IEEE Transactions on Smart Grid, 10(4), 4074-4084.
3.Wang, J., Wang, L., & Giannakis, G. B. (2018). Sparse feature learning for high-dimensional renewable integration and dispatch. IEEE Transactions on Power Systems, 33(4), 4192-4203.
4.Liu, C., Yan, X., Ding, F., & Zhang, Y. (2016). Distributed big data processing framework for real-time power system analysis. IEEE Transactions on Smart Grid, 7(6), 2921-2933.
5.Hu, Y., & Gao, W. (2017). A review of data-driven approaches for short-term load forecasting of smart grid. IEEE Transactions on Smart Grid, 8(1), 401-409.
6.Li, W., Cui, M., & Peng, Y. (2020). A comprehensive review of big data analytics for smart grids. IEEE Transactions on Industrial Informatics, 16(1), 209-223.
7.Patel, S. N., Nguyen, P. H., Gaber, M. M., & Krishnaswamy, S. (2017). Machine learning approaches for load forecasting in smart grids: State of the art and future directions. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 7(2), e1191.
8.Gao, W., & Hu, Y. (2017). Review of data-driven approaches for wind power forecasting. IEEE Transactions on Smart Grid, 8(2), 839-850.
9.Yan, H., Chen, Y., Sun, H., & Liu, Y. (2018). A review of data-driven approaches for wind power forecasting. Renewable and Sustainable Energy Reviews, 94, 814-826.
10.Zhao, C., Wang, B., Zhang, J., & Qi, Z. (2020). A comprehensive review of data-driven approaches for solar power forecasting. IEEE Transactions on Sustainable Energy, 11(2), 911-927.