Professional Documents
Culture Documents
253
Using the function D and the PMUs data traits enabled us regions compared with Figure 1. Using basic algorithm has
to develop a more efficient algorithm to calculate the best taken 8 iterations while taking advantage of the proposed
initial centroids. In this application, the total number of data is initialization algorithm reduced it to three iterations.
24031, therefore, batches with sizes less than 100 can easily Decreasing the iterations by one-third has a significant impact
be considered as noise or at most unnecessary. It should be on the clustering computational burden applications.
mentioned that we have reached the same results by ignoring Figure 2 illustrates three plots of data (VII) and each of
batches size 1000 but 100 will be the most conservative batch them can be expanded to phases abc. The same thing is
reduction size. applicable using the signals sequences. Now, any
The algorithm merges the batches with close average points. combination of these data can be used for decision-making
This is not really critical in real systems as we are assuming to and clustering processes regardless of the number of
have enough data during normal working condition, and dimensions. The proposed code has no limitation in data
abnormal working condition has smaller periods of time dimensions which enables the CPU to utilize any combination
compared to normal steady state. At this stage, we have a of preferred data.
limited number of batches, and each of them includes sets of
data with a certain distance in their averages. These batches C-3
will be organized in a rising manner based on their sizes. 180
C-1
Obviously, batches with larger sizes are more important, at
100
least indicating clusters with bigger size. Finally, the top k
ones of these batches will be chosen to represent the clusters
0
Ia
and their center of mass will be calculated. These centroids C-2
state changes and how the PMU data and subsequently 100
C-2
and just being fed from UG. At t=0.3 the downstream DGs
connect to the network improving the voltage, reducing the -100
sensed current, and significantly changing the phase in that
specific bus. At t=0.6 a bolted 3-phase happened in the middle 80 1.5
of the network which is obvious through the PMU output and 60
1
40
will be cleared at t=0.9. After fault clearance, the system still 20 0.5
0 0
is being fed from both sides. That is, the system has the same Ia Va
state as the second period marked by C-2 in figure.
Figure 5. PMU5 data clustered using proposed algorithm.
Figure 2 illustrates PMU5 VIIa data plotted based on the
known scenario periods. Hence, the transient parts are in cyan Figures 6 and 7 present clustering results for the same
and the actual clusters have been categorized by different scenario but using data from PMU3. Figure 7 depicts the
colors and arrows showing the cluster names which are the clustering result using the proposed algorithm for 3 scenario
same in other PMU5 cluster results. That is, the cyan colored periods. Connecting the DG at t=0.3 causing the system state
data is to clarify the borders and system states (on bus 5) to go from C-1 to C-2 has a minor location change based on
presenting in 3-D for the designed scenario. Figure 4 bus 3 signals. Hence, installed PMU at bus 3 has been chosen
presents the clustered data using basic k-Means algorithm and as the C-1 and C-2 are closely located to each other, which is
Figure 5 shows the resulted clusters using the proposed more challenging for clustering. Clustered data in Figure 7
algorithm. One can observe in Figure 5 that basic algorithm show the correct distinction between C-1 and C-2 knots;
could not differentiate the C-1 cluster region and has specified however, Figure 6 illustrates how k-Means may result in a
parts of the transients to it. The actual C-1, however, has been totally wrong result of random initial centroids. That is, both
included in the green cluster section of C-2 which is not C-1 and C-2 knots have been classified inside a single cluster
correct. On the other hand, Figure 5 clusters using the and parts of transient have been mistakenly considered as the
proposed algorithm have distinct borders and correct cluster third cluster. Using basic k-Means method yields results by 9
254
iterations while the proposed algorithm again decreases it by Figure 8 presents a complicated scenario case which has
one-third. Data pre-analyzing, using proposed algorithm, been clustered using the proposed algorithm. This cluster
results in precise and stable clustering in any number of result has been yielded by six iterations when the same
executions. It has been mentioned before that k-Means clusters region and iterations have been yielded in any
clustering using the basic algorithm does not have a stable execution. On the other hand, the basic algorithm has
output. That is, Figures 4-7 are selected from many executions significantly different cluster regions per each execution with
which run by the authors. The basic code may even result in some critical mistakes in categorizing two cluster knots in a
really accurate clustering output or aim to a totally surprising single cluster in many runs. Basic code took a number of
clustered data as of local minimums in objective function. The iterations with a minimum of twenty-one and a maximum of
number of iterations, however, never reaches less than two forty-eight observed, and the iteration average of total runs is
times the iterations using the proposed algorithm. thirty-four which is really high compared to six iterations
using the proposed algorithm.
180
5 CONCLUSIONS
100
C-3
C-1
This article presented the newly developed algorithm to pre-
analyze the data and calculate the initial centroids for
0
Ia
-100
REFERENCES
[1] T. Kanungo, D. M. Mount, N. S. Netanyahu, C. D. Piatko, R.
Silverman, and A. Y. Wu, "An efficient k-means clustering algorithm:
0
Analysis and implementation," IEEE Trans. Pattern Anal. Mach. Intell.
0.5 vol.24, no.7, pp.881,892, Jul 2002.
C-2
200
250 [2] X. Hui, J. Wu, and J. Chen, "k-Means clustering versus validation
1
150 measures: A data-distribution rerspective," IEEE Trans. Syst., Man. B,
100
1.5 50
0 vol.39, no.2, pp.318-331, April 2009
Va Ia
[3] G. A. Jimnez-Estvez, L. S. Vargas, and V. Marianov, "Determination
Figure 7. PMU3 data clustered with proposed algorithm of feeder areas for the design of large distribution networks," IEEE
Trans. Power Del., vol.25, no.3, pp.1912-1922, July 2010
[4] M. J. Li, M. K. Ng, Yiu-Ming Cheung, and J. Z. Huang, "Agglomerative
fuzzy k-Means clustering algorithm with selection of number of
C-1 clusters," IEEE Trans. Knowl. Data Eng., vol.20, no.11, pp.1519-1534,
6 C-4
Nov. 2008
5
[5] G. F. Tzortzis and C. L. Likas, "The global kernel k -means algorithm
4 for clustering in feature space," IEEE Trans. Neural Netw., vol.20, no.7,
C-2 C-5 pp.1181-1194, July 2009
3
c
2 smart grid systems Int. J. Cyber-Sec. Digit. Forensics, vol.3, no.1, pp.
1
34-38, April 2014.
C-3
0
8
6 10
8
4 6
2 4
2
Ib 0 0
Ia
255