Professional Documents
Culture Documents
Quantitative Management
Vol. 4, No. 2, pp. 191-210, 2007
QTQM
ICAQM 2007
______________________________________________________________________
Abstract: This paper proposes a practical methodology for the solution of multi-objective system
reliability optimization problems. The new method is based on the sequential combination of
multi-objective evolutionary algorithms and data clustering on the prospective solutions to yield a smaller,
more manageable sets of prospective solutions. Existing methods for multiple objective problems involve
either the consolidation of all objectives into a single objective, or the determination of a Pareto-optimal
set. In this paper, a new approach, involving post-Pareto clustering is proposed, offering a compromise
between the two traditional approaches. In many real-life multi-objective optimization problems, the
Pareto-optimal set can be extremely large or even contain an infinite number of solutions. Broad and
detailed knowledge of the system is required during the decision making process in discriminating among
the solutions contained in the Pareto-optimal set to eliminate the less satisfactory trade-offs and to select
the most promising solution(s) for system implementation. The well-known reliability optimization
problem, the redundancy allocation problem (RAP), was formulated as a multi-objective problem with the
system reliability to be maximized, and cost and weight of the system to be minimized. A multiple stage
process was performed to identify promising solutions. A Pareto-optimal set was initially obtained using
the fast elitist nondominated sorting genetic algorithm (NSGA-II). The decision-making stage was then
performed with the aid of data clustering techniques to prune the size of the Pareto-optimal set and
obtain a smaller representation of the multi-objective design space; thereby making it easier for the
decision-maker to find satisfactory and meaningful trade-offs, and to select a preferred final design
solution.
Keywords: Data mining, multiple objective genetic algorithms (MOGA), system reliability optimization.
______________________________________________________________________
1. Introduction
ystem design and reliability optimization problems are inherently multi-objective and
conflicting in nature. In multi-objective optimization problems, there are two primary
approaches to identify solution(s). The first involves quantifying the relative importance of
the attributes and aggregating them into some kind of overall objective function, e.g., utility
or value function. Solving the optimization problem with this approach generates an
optimal solution, but only for a specified set of quantified weights or specific utility
function. Unfortunately, the precise value of the objective function weights used or the
form of a selected utility function dictates the final solution, and thus, broad and detailed
knowledge of the system is demanded.
The second approach involves populating a number of feasible solutions along a
Pareto frontier and the final solution is a set of non-dominated solutions. In this case, the
Pareto set can contain a large number (in some cases, thousands) of solutions. From the
decision-makers perspective, consideration of all the nondominated solutions can be
prohibitive and inefficient.
192
This paper takes the view that, for many multi-objective engineering design
optimization problems, a balance between single solutions and Pareto-optimal sets can be
advantageous. A practical approach to intelligently reduce the size of the Pareto set is
presented. This creates a reduced set of perspective solutions; thereby, making selection of
the final solution easier.
The redundancy allocation problem (RAP) was formulated as a multi-objective
problem as the system reliability is to be maximized and cost and weight of the system are
to be minimized. Although the RAP has been studied in great detail, one area that has not
been sufficiently studied is multiple objective optimization in general, and more specifically,
the post-Pareto analysis stage. The solution to the multi-objective RAP is a set of
nondominated solutions. This set is often too large for a decision-maker to make an
intelligent identification of the most suitable compromise solution. In this paper, a practical
approach is described to determine a reasonable-sized set of promising solutions. The
resulting pruned Pareto set is simply a nondominated sub-set of a more manageable size to
offer meaningful contrasting design options.
n1
ns
193
programming approach and solved a more difficult design problem. They considered a
system with 14 subsystems and constraints on both cost and weight. For each subsystem,
there were three or four different component choices each with different reliability, cost and
weight. Bulfin & Liu [3] used integer programming and they formulated the problem as a
knapsack problem using surrogate constraints.
These previous mathematical programming approaches are only applicable to a
limited or restricted problem domain and require simplifying assumptions, which limits the
search to an artificially restricted search space. In these formulations, once a component
selection is made, only the same component type can be used to provide redundancy. This
restriction is required so the selected mathematical programming tool can be applied, but it
is not an actual imposition of the engineering design problem. Thus, the resulting solution
is only optimal for a restricted solution space, and better solutions can be found if the
restriction is no longer imposed.
Genetic Algorithms (GAs) offer some distinct advantages compared to alternative
methods used to solve the RAP. Coit & Smith [6, 7] used GAs to obtain solutions to the
RAP. In their research, they solved 33 variations of the Fyffe problem using a GA.
Similarly, Tabu Search (Kulturel-Konak, et al. [21]) has been used to obtain solutions to this
problem.
Several reliability optimization formulations considering multiple criteria have also
been presented in the literature. A multi-objective formulation to maximize system
reliability and minimize the system cost was considered by Sakawa [27] using the surrogate
worth trade-off method. Inagaki et al. [17] used interactive optimization to design a system
with minimum cost and weight and maximum reliability.
Dhingra [11] used goal programming and goal-attainment to generate Pareto-optimal
solutions to solve a special case of a multi-objective RAP. Busacca et al. [4] proposed a
multi-objective GA approach that was applied to a design problem with the aim to identify
the optimal system configuration and components with respect to reliability and cost
objectives.
Recently, Kulturel-Konak, et al. [22] solved this problem using Tabu Search method
considering three objective functions, maximization of system reliability and minimization
of cost and weight of the system.
The following notation will be used throughout the reminder of the paper:
Notation:
R, C, W = system level reliability, cost and weight or constraint limits
s = number of subsystems
(x
i ,1 , x i ,2 ,..., x i ,mi
194
NOTE: an objective function weight, i, and a component weight, wij , are conceptually
very different.
2.2. Problem Formulation
The use of redundant elements increases system reliability, but also increases the
procurement cost and system weight. A complex question arises then regarding how to
optimally allocate redundant elements. The answer depends on the structure of the
designed system, the decision-makers priorities and preferences, and ultimately, some
selected optimality criterion.
Different problem formulations of the RAP have been presented in the literature. For
instance, Problem P1 maximizes the system reliability given restrictions on the system cost,
C, and the system weight, W. Alternatively; Problem P2 is formulated as a multi-objective
optimization problem by using the weighted sum approach. Problem P3 is a multi-criteria
formulation of the RAP, in which a Pareto-optimal set of solutions is obtained. The
formulation of the three problems is summarized below:
Problem P1: Reliability maximization
s
max Ri(x i )
i =1
Subject to:
s mi
c ij x ij C
i =1 j =1
s mi
wij x ij W
i =1 j =1
mi
xij {0,1,2,..} .
Given the overall restrictions on system cost of C and weight of W, the problem is to
determine which design alternative to select with the specified level of component
reliability, and how many redundant components to use in order to achieve the maximum
reliability.
Problem P2: Weighted sum method formulation
s
s mi
s mi
max 1 R ( x ) 2C ( x) 3W ( x ) = 1 Ri ( xi ) 2 c ij x ij 3 wij x ij
=
=
=
=
1
1
1
1
i
j
i
j
i =1
Subject to:
mi
i = 1 ,
i =1
195
x ij {0,1,2,..} .
The solution for this multi-objective formulation is determined by combining the three
objectives into a single objective problem. This requires the user to a priori specify objective
function weights to represent the relative importance to the individual objective function
and one single set of weighting coefficients yields only one optimal solution. Therefore,
selection of the correct set of weights is critical and it will dictate the final solution. Often,
decision-makers lack the training or detailed knowledge to precisely select the weights. This
is undesirable because even small alterations to the weights can lead to very different final
solutions.
Problem P3: Multi-objective formulation
s
s mi
s mi
i =1 j =1
i =1 j =1
Subject to
mi
xij {0,1,2,..} .
For the multi-objective RAP, the objectives are to determine the optimal design
configuration that will maximize system reliability, minimize the total cost and minimize
the system weight. A Pareto-optimal set of solutions can be obtained by using any
multi-objective evolutionary algorithm (MOEA) available. However, there may be too
many prospective solutions for the decision-maker to fully consider before ultimately
selecting a unique design configuration to implement. The Problem P3 formulation is the
one addressed in this paper.
196
to:
g j (x) 0
j = 1, 2, , J
hq ( x ) = 0
q = 1, 2, , Q.
In the vector function, fi(x), some of the objectives are often in conflict with others,
and some have to be minimized while others are maximized. The constraints define the
feasible region X, and any point x X defines a feasible solution. There is rarely a situation
in which all the fi(x) values have an optimum in X at a common point x. Therefore, it is
necessary to establish certain criteria to determine what is considered as an optimal
solution, and this criteria is nondominance. Thus, solutions to a multi-objective
optimization problem are mathematically expressed in terms of nondominance.
Without loss of generality, in a minimization problem for all objectives, a solution x1
dominates a solution x2, if and only if the two following conditions are true:
x1 is strictly better than x2 in at least one objective, i.e., fi(x1) < fi(x2) for at least one i.
Then, the optimal solutions to a multi-objective optimization problem are the set of
nondominated solutions and they are usually known as Pareto-optimal set (Zeleny [32]).
3.1. Single-Objective Approaches
197
practitioners.
3.2. Multiple Objective Genetic Algorithms
NSGA (nondominated sorting genetic algorithm) developed by Srinivas and Deb [29]
In general, MOEAs are suited to multi-objective optimization because they are able to
capture multiple Pareto-optimal solutions in a single run and may exploit similarities of
solutions by recombination. However, MOEAs produce many different Pareto-optimal
solutions and the final Pareto set (or sub-set) may be very large since it increases with the
number of objectives. Then, the decision-maker has to consider this potentially huge Pareto
set and make his/her choice.
198
3.2.1. NSGA-II
The fast elitist non-dominated sorting genetic algorithm II (NSGA-II) was proposed by
Deb et al. ([9], [10]). This is the algorithm used for this research. It is an improved version
of the original nondominated sorting genetic algorithm (NSGA) that reduces the
computational complexity and maintains the solutions of the best Pareto front found
including them into the next generation; thereby preventing non-elitism.
A crowding distance parameter is incorporated in this algorithm that serves as an
estimate of the population density by calculating the perimeter of the largest cuboid
enclosing an individual without including any other member of the population. This
parameter implies that, between any two solutions being considered, the solution with the
higher rank is preferable and a solution that is located in a less crowded region is preferable
to a solution that lies in a high solution density region.
NSGA-II uses the concept of controlled elitism to tune the mutation rate and the
elitism rate to attain equilibrium between the two. Controlled elitism limits the maximum
number of individuals belonging to each front by employing a geometric decreasing
function that is governed by the reduction rate, thereby preventing the algorithm to
premature converge to a local optima.
This algorithm is highly efficient in obtaining good Pareto-optimal fronts for any
number of objectives and can accommodate any number of constraints. These features of
NSGA-II are attractive for the purpose of this paper.
199
grouping is done by calculating the centroid for each group, and assigning each observation
to the group with the closest centroid. For the membership function, each data point
belongs to its nearest center, forming a partition of the data. The objective function that the
k-means algorithm optimizes is:
n
KM ( X , C ) = min || v i c j ||2 ,
i =1 j {1,.., k}
where:
vi = ith data vector
cj = jth cluster centroid
j
mKM (cl | v i ) =
0 ; otherwise.
b (i ) a (i )
,
max{a (i ), b (i )}
where, a(i) is the average distance from the ith data point to all the other data points in its
cluster, and b(i) is the average distance from the ith data point to all the points in the nearest
neighbor cluster.
The value of s(i) ranges from +1 to 1. A value of +1, indicates points that are very
distant from neighboring clusters; a value of 0, indicates points that are not distinctly in one
cluster or another, and a value of 1, indicates points that are probably assigned to the
wrong cluster.
For a given cluster, Xj, a cluster silhouette, Sj, can be determined. Sj characterizes the
heterogeneity and isolation properties of such a cluster. It is calculated as the average of the
200
sum of all samples silhouette widths in Xj. Moreover, for any partition, a global silhouette
value or silhouette index, GSu, can be used as an effective validity index for a partition U.
GSu =
1 c
Sj
c j =1
It has been demonstrated that this equation can be applied to estimate the optimal
number of clusters for a partition U (Rousseeuw [25]). In this case the partition with the
maximum silhouette index value is taken as the optimal partition.
4.2. Description of the New Approach
After obtaining the Pareto set from a particular MOEA, in this case, from the
NSGA-II, a MATLAB code was developed to perform the steps of the proposed
technique. From normalized data, the code runs the k-means algorithm, from two to a
specified number of means; it calculate average silhouette values and the clustering with the
highest average silhouette width, GSu , is selected as the optimal number of clusters in
the Pareto-optimal set. The pseudocode of the algorithmic implementation is shown in
Figure 2.
201
Notice that the k-means algorithm can converge to a local optimum, in this case, a
partition of points in which moving any single point to a different cluster increases the total
sum of distances. This problem can be overcome by performing several replicates, each
with a new set of initial cluster centroid positions. That is, each of the replicates begins
from a different randomly selected set of initial centroids. The final solution is the one with
the lowest total sum of distances over all replicates.
It is important to mention that for purposes of this paper, the multiobjective
formulation of the RAP considered only three objective functions. However, the proposed
algorithm can be applied to a multiobjective problem with more objective functions, such as
maximize reliability, minimize cost, minimize weight, minimize volume, etc.
For C=2 to MC
*maximum number of centroids*
For Z=1 to R *number of replicates*
Randomly select initial values for C
For each vi X, assign all vi to cj C according to nearest cj
Recompute cj
Until no change in cj
Return C, KM(X,C) and membership
Store values for C, KM(X,C) and membership
Z=Z+1
end
end
Select the minimum KM(X,C) obtained for all replicates
end
Obtain silhouette values, s(i)
Choose the cluster with the maximum silhouette width, GSu, of all centroids considered
W
6
8
2
4
4
The multi-objective RAP with three objectives was formulated as shown in Problem P3,
to maximize system reliability, minimize total cost and minimize system weight. The
202
NSGA-II algorithm was used to solve the problem and 75 solutions were found in the
Pareto set. Figure 3 shows the Pareto-optimal set of solutions found by NSGA-II. The
maximum and minimum values of reliability, cost and weight found on the solutions
contained in the Pareto-optimal set are presented in Table 2.
i = 1,2,, n,
where fimin(x) = minimum value for f i ( x) found in the Pareto optimal set.
fimax(x) = maximum value for f i ( x) found in the Pareto optimal set.
To use the above equation, all the objective functions were considered to be minimized,
thus reliability was multiplied by -1. It is important to remark that by using a different type
of normalization function, the clustering outcome may look different.
To determine the optimal number of clusters, silhouette plots were used as suggested
by Rousseeuw [25], and several runs were performed for different values of k with several
replicates for each value of k. For this particular data set, 5 was found to be the optimum
number of clusters. The five clusters are shown in Figure 4 from normalized data. Cluster 1
contains 4 solutions; there were 15 solutions in cluster 2, 5 in cluster 3, 30 in cluster 4 and
21 solutions in cluster 5. The decision-maker now considers only five clusters instead of 75
203
unique solutions. This demonstrates that this new approach, combining MOEA and data
clustering, is of practical benefit.
204
205
Since the original cluster 5 already contained promising trade-offs, plotting the
solutions in two dimensions can be of graphical aid for the decision-maker. Figures 7, 8
and 9 plot reliability vs. cost, reliability vs. weight and cost vs. weight, respectively, from
normalized (0 to 1) objective function data.
Figure 7.
206
Cluster 1
Cluster 2
Cluster 3
Cluster 4
Cluster 5
Cluster 6
Cluster 7
Cluster 8
Cluster 9
Cluster 10
Cluster 11
Weight
45
37
35
41
38
51
46
35
49
34
39
For this particular multi-objective RAP, clusters 2, 4 and 7 seem to contain desirable
solutions. For ease of interpretation and analysis, the 11 representative solutions of this
knee region are plotted in two dimensions in Figure 10 for reliability vs. cost. From this
figure, one can easily notice that solutions #47, #44 and #40 belonging to clusters 2, 4 and
7 respectively are the ones that are presented to the decision-maker as good trade-offs if no
previously defined objective function preference have been specified by the decision-maker.
For example, solution #44, shown in Figure 11, achieves a reliability of 0.986416 at a
cost of 55 and a weight of 41. For system implementation, the configuration is composed
of one component of type 1, one component of type 2 and one component of type 5 for
subsystem 1; two components of type 1 for subsystem 2, and one component of type 1 and
one component of type 3 for subsystem 3.
207
6. Conclusion
The RAP was formulated as a multiple objective system reliability optimization
problem, with system reliability to be maximized and system cost and weight to be
minimized. A Pareto-optimal set of solutions was obtained using the fast elitist NSGA-II
algorithm. Then, the developed algorithm was used to find groups of similar solutions
contained in the Pareto front.
In our approach, a clustering validation technique has been integrated into the
k-means clustering algorithm to give a relatively automatic clustering process. The only
parameters defined by the user are the maximum number of clusters to be analyzed and the
desired number of replicates to carry out. This is to avoid the bias due to the selection of
the initial centroids that usually can be observed. This was performed by selecting different
values as initial centroids and then comparing the results obtained until a minimum was
found. To determine the optimal number of clusters, k, in the set, the silhouette method
208
was applied. A value of the silhouette width, s(i), was obtained for the several values of k
desired to investigate. The clustering with the highest average silhouette width, GSu, was
selected as the optimal number of clusters in the Pareto-optimal set.
With this approach, the decision-maker obtained a pruned Pareto-subset of just k
particular solutions. Moreover, clustering analysis was useful to focus search on the knee
region of the Pareto front. The knee region is characterized by those solutions of the
Pareto-optimal set where a small improvement in one objective would lead to a large
deterioration in at least one other objective. The clusters formed in this region contain those
solutions that are likely to be more relevant for the decision-maker. The analysis of these
trade-offs continue until a solution or a small portion of the nondominated set is located. In
this paper, we can see that no objective function preferences are known or required. If there
are objective function rankings or priorities available, then a similar clustering approach can
be done after filtering of undesirable solutions, as described by Taboada et al. [31].
References
1.
2.
3.
4.
5.
6.
7.
Coit, D. W. and Smith A. (1996b). Penalty guided genetic search for reliability design
optimization. Computers and Industrial Engineering, 30(4), 895-904.
8.
Cox D. (1957). Note on grouping. Journal of the American Statistical Association, 52,
543547.
9.
Deb, K., Agarwal, S., Pratap, A. and Meyarivan, T. (2000a). A Fast Elitist
Non-Dominated Sorting Genetic Algorithm for Multi-Objective Optimization: NSGA-II.
KanGAL Report Number 200001, Indian Institute of Technology. Kanpur, India.
10. Deb, K., Agarwal, S., Pratap, A. and Meyarivan, T. (2000b). A Fast Elitist
Non-Dominated Sorting Genetic Algorithm for Multi-Objective Optimization: NSGA-II.
Proceedings of the Parallel Problem Solving from Nature VI Conference, 849-858. Paris,
France.
11. Dhingra, A. K. (1992). Optimal apportionment of reliability and redundancy in series
systems under multiple objectives. IEEE Transactions on Reliability, 41(4), 576-582.
12. Erickson, M., Mayer, A. and Horn, J. (2001). The niched Pareto genetic algorithm 2
applied to the design of groundwater remediation systems. First International Conference
on Evolutionary Multi-Criterion Optimization, 681-695.
13. Fonseca, C. M. and Fleming, P. J. (1993). Genetic algorithms for multiobjective
optimization: formulation, discussion and generalization. Proceedings of the Fifth
International Conference on Genetic Algorithms, 416-423. San Mateo, California.
209
14. Fyffe, D. E., Hines, W. W. and Lee, N. K. (1968). System reliability allocation and a
computational algorithm. IEEE Transactions on Reliability, 17, 64-69.
15. Holland J. (1975). Adaptation in natural and artificial systems. U. Michigan Press, MI.
16. Horn, J., Nafpliotis, N. and Goldberg, D. E. (1994). A niched Pareto genetic algorithm
for multiobjective optimization. In Proceedings of the First IEEE Conference on
Evolutionary Computation, IEEE World Congress on Computational Intelligence, 1, 82-87,
Piscataway, New Jersey. IEEE Service Center.
17. Inagaki T., Inoue, K. and Akashi, H. (1978). Interactive optimization of system
reliability under multiple objectives. IEEE Transactions on Reliability, 27, 264-267.
18. Jain A. K. and Dubes R. C. (1988). Algorithms for Clustering Data. Englewood Cliffs:
Prentice Hall.
19. Kaufman L. and Rousseeuw P. J. (1990). Finding Groups in Data. An Iintroduction to
Cluster Analysis. Wiley-Interscience.
20. Keeney, R. L. and Raiffa H. (1976). Decisions with Multiple Objectives: Preferences and
Tradeoffs. John Wiley & Sons.
21. Kulturel-Konak, S., Smith, A. and Coit D. W. (2003). Efficiently solving the
redundancy allocation problem using Tabu search. IIE Transactions, 35(6), 515-526.
22. Kulturel-Konak, S., Baheranwala, F. and Coit D. W. (2005). Pruned Pareto-Optimal
sets for the system redundancy allocation problem based on multiple prioritized
objectives. Journal of Heuristics (in press).
23. MacQueen J. (1967). Some methods for classification and analysis of multivariate
observations. In L. M. LeCam and J. Neyman, editors, Proceedings of the Fifth Berkeley
Symposium on Mathematical Statistics and Probability, 1, 281297.
24. Rao, S. S. (1991). Optimization Theory and Application. New Delhi: Wiley Eastern
Limited.
25. Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and
validation of cluster analysis. Journal of Computational and Applied Mathematics, 20,
53-65.
26. Rousseeuw P., Trauwaert E. and Kaufman L. (1989). Some silhouette-based graphics
for clustering interpretation. Belgian Journal of Operations Research, Statistics and
Computer Science, 29(3).
27. Sakawa, M. (1978). Multiobjective optimization by the surrogate worth trade-off
method. IEEE Transactions on Reliability, 27, 311-314.
28. Schaffer, J. D. (1985). Multiple objective optimization with vector evaluated genetic
algorithms. In genetic algorithms and their applications, Proceedings of the First
International Conference on Genetic Algorithms, 93-100. Hillsdale, New Jersey.
29. Srinivas, N. and K. Deb. (1994). Multiobjective optimization using nondominated
sorting in genetic algorithms. Journal of Evolutionary Computation, 2(3), 221-248.
30. Steuer, R. E. (1989). Multiple Criteria Optimization: Theory, Computation, and Application.
Reprint Edition, Krieger Publishing Company, Malabar, Florida.
31. Taboada, H., Baheranwala, F., Coit, D. W. and Wattanapongsakorn, N. (2006).
Practical solutions of Multi-Objective optimization: An application to system
reliability design problems. Reliability Engineering & System Safety, 92(3), 314-322.
32. Zeleny, M. (1982). Multiple Criteria Decision Making. McGraw-Hill series in
Quantitative Methods for Management.
33. Zitzler, E. and Thiele, L. (1999). Multiobjective evolutionary algorithms: A
210
comparative case study and the strength Pareto approach. IEEE Transactions on
Evolutionary Computation, 3(4), 257-271.
Authors Biographies:
Heidi A. Taboada is a PhD Candidate in the Department of Industrial & Systems
Engineering at Rutgers University, Piscataway, NJ. She has an MS in Industrial & Systems
Engineering from Rutgers University (2005), an MS in Industrial Engineering (Minor in
Quality) from Instituto Tecnolgico de Celaya (2002), and a BS in Biochemical
Engineering from Instituto Tecnolgico de Zacatepec (2000). Her research interests lie in
the areas of applied operations research, multiple objective optimization, biologically
inspired methods and algorithms, including evolutionary computation, reliability modeling
and optimization, and data mining.
David W. Coit is an Associate Professor in the Department of Industrial & Systems
Engineering at Rutgers University. He received a BS degree in Mechanical Engineering
from Cornell University, an MBA from Rensselaer Polytechnic Institute, and MS & PhD in
Industrial Engineering from the University of Pittsburgh. In 1999, he was awarded a
CAREER grant from NSF to study reliability optimization. He also has over ten years of
experience working for IIT Research Institute (IITRI), Rome, NY (IITRI is now called
Alion Science & Technology), where he was a reliability analyst, project manager, and an
engineering group manager. His current research involves reliability prediction &
optimization, risk analysis, and multi-criteria optimization considering uncertainty. He is a
member of IIE and INFORMS.