Combining System Dynamics and Decision Analysis for Rapid
Strategy Selection
Nathaniel
Department of Computer Science, University of Saskatchewan!
Abstract
An important class of decision problems involve the selection of a policy in the presence
of both uncertainty regarding future eventualities and a system exhibiting complex policy
response. Within this paper, we examine the performance benefits of performing strategy
selection for such problems using a hybrid modeling approach that combines decision
analysis and system dynamics. Within this approach, a modeler uses a decision tree to
encapsulate choices, uncertainties, and consequences (the last computed by a system.
dynamics model). While this hybrid technique offers many additional advantages in
tems of expressiveness and the encouragement of systematic investigation of policy
space, this paper focuses on the performance gains it provides. In particular, the use of
decision trees to represent such decision problems pemmits the use of dynamic
programming, which can dramatically decrease the costs of identifying a preferred policy
beyond what is possible by evaluating possible policies in tum. The paper quantifies
these performance advantages by means of recurrence relations for arbitrary trees, and
derives inductively proven closed-form expressions of performance gain for complete
trees. The results suggest that the hybrid method yields speedups exponential in the depth.
of the tree for both complete and randomly generated trees.
1 Introduction
Consider a situation characterized by unfolding inty in which we wish to use a
system dynamics model to make a set of policy decisions’. Under such circumstances,
our decision-making at a certain point in time will frequently depend on the sequence of
chance events that has occurred up to that point.
Employing the terminology of decision analysis, we use the term “decision rule” to refer
to a function that specifies our sequence of policies given any possible history of events.
In many problems, we seek to identify the best decision mule, recognizing that in most
cases the best decision rule will need to be adaptive — that is, will vary in its choice of a
decision based on the event history experienced until this point.
The purpose of this paper is to communicate the computational advantages of a novel
hybrid technique for identifying the optimal decision rules in such situations. To do this,
we contrast the computational costs of two possible techniques for identifying an optimal
decision rule. The first technique for identifying a preferred decision rule requires
evaluating each decision rule in tum within the system dynamics model and selecting the
* Address correspondence and reprint requests to Nathaniel Osgood, Department of Computer Science,
280.6 Thorvaldson Building, 110 Science Place, University of Saskatchewan, Saskatoon, SK S7N 5C9
Canada. Email: osgood@cs.usask.ca
? We make no assumptions here as to the etiology of this uncertainty. Some of it may lie with respect to
occurrence of extemal events, while in other cases it may reflect degree of confidence conceming intemal
factors or the effectiveness of our decisions.
tule that scores the best. The altemative approach (described comprehensively in a
forthcoming paper [Osgood & Kaufman, 2005]) involves constructing a decision tree to
represent the decisions and uncertainties and performing a dynamic-programming based
backwards induction algorithm on that decision tree in order to amive at a preferred
decision mle. This new? technique is general, scalable and can reduce the cost of strategy
identification by a factor exponential in the depth of the tree.
The advantages of this new technique will be particularly pronounced for complex
decision problems involving many levels of decisions and events, heavyweight system.
dynamics models and cases in which the model includes endogenous uncertainty. An
important priority for realizing the full advantages of the technique is the provision of
software support for the hybrid models.
The next section introduces basic terminology and a simple motivating example to which.
we will refer throughout the paper. Section 2.6 discusses the traditional technique for
identifying a preferred decision mule and illustrates how this technique would be applied
to the example. Section 4 briefly characterizes the hybrid technique for decision rule
selection and illustrates its application on the example. Section 5 provides recursive
expressions for the computational costs of both techniques and demonstrates how the
costs scale empirically with full and randomly generated decision trees. Section 0
summarizes paper results and discusses the prospects for realistic application of the
hybrid algorithm for strategy identification.
2 ASimple Example
2.1 The Problem
This section introduces a very simple system dynamics policy choice problem in order to
serve as a concrete point of reference throughout the paper. In this stylized example, we
wish to design a capacity expansion plan for our electrical infrastructure over the
timeframe from 0 to tz. Because the electrical demand anticipated will vary widely
depending on the level of economic growth experienced over that same timeframe, we
wish to consider adaptive plans that offer flexibility to react to changing economic
conditions. Suppose further that we have at hand a descriptively complex system
dynamics model capable of simulating the dynamic economic and environmental
consequences of an electrical infrastructure build out sequence in the context of a
specified pattem of ongoing economic growth (and thus demand) as well as a single-
attributed preference function’ that allows us to express our level of preference regarding
the consequences of different scenarios. This preference function might, for example,
° while simple, single-decision decision trees have previously been employed to guide exploration of
system dynamics model, the author is not aware of previous explicit use of multiple-decision decision trees
together with system dynamics. There are many cases in which exploration of scenarios or decision rules
using a system dynamics model have followed the structure of an implicit decision tree, and there are likely
many cases in which a very simple, single-level form of backwards induction was implicitly used to derive
an optimal decision in the presence of uncertainty (for example, determining the preferred choice under
different possible eventualities). The author is not aware of any cases in which this process has been
fommmlized and extended to multi-level decision trees.
’ For scoring each consequence we specify the selection of a preference function rather than some more
general metric in order to allow for proper handling of uncertainty (risk) when evaluating decision rules.
consist of some discounted utility measure based on the price of electricity, and level of
electrical brownouts or blackouts experienced, etc.
It should be emphasized that this example is as simple as possible for the sake of clarity
in order to provide a concrete sense of some of the principles and tradeoffs. The example
does not demonstrate important aspects that extend from the full generality of the
framework.
2.2 Modeling Approximations
Suppose for simplicity that we wish to consider only two possible stages of capacity
expansion and divide our planning horizon of [0, tz) into two smaller timeframes: [0 to
tw) and [tv, tz). Suppose further that we choose to dichotomize the level of economic
growth for each of these time horizons into one of two cases (high economic growth vs.
low economic growth), and that we specify the subjective likelihood that high economic
growth will occur within each of these timeframes with probability pg and p,. For
expository simplicity, we assume that the probability of high economic growth in both
periods is independent of the event and decision sequence up until that point. Suppose
that we wish to consider only plans involving one of two possible decisions for each of
the time windows of interest: The choice of either expanding or not expanding our
generation capacity (where these expansions or non-expansions are assumed to begin at
either 0 or ty).
2.3 The Corresponding Decision Tree
Figure 1 depicts a decision tree laying out the set of uncertainties, decisions and
consequences associated with this example.
Expand capacity
Initial Des
No expansion.
The decision tree is oriented so that time flows from left to right. The tree includes
square decision nodes that represent choices to be made at a particular point in time (e.g.
choice of add vs. do not add capacity), circular event nodes representing uncertainties
over time (e.g. occurrence of high vs. low economic growth), and triangular terminal
nodes. Each terminal node is associated with the consequences of a particular scenario —
the unique sequence of specific events and particular decisions that lie on the path from
the root node to that terminal node.
2.4 Decision Rules
Recall that a decision nule describes the decisions that will be made in response to any
sequence of events. Table 1 lists the set of possible decision rules for the problem at
hand. Because the decision at time 0 is not preceded by any events, it is not conditional
on any event occurrence. By contrast, the decision at time ty may be conditional on the
growth of the economy for the time period [0, ty). It bears noting that there are quite a
number of decision rules despite the very simple structure the decision tree (just two
decisions and two events). As we will see, the number of decision rules will in general
rise rapidly with the depth of the tree.
Table 1: Enumeration of possible decision rules for the example shown in Figure 1. Each row
corresponds to a different decision rule.
RulelId | Decision rule (expressed informally) Texminal
Decision atO | Decision at ty Nodes
1 Do not expand | Do not expand 11,12,15,16
2 Do not expand _| If e=Low, expand else do not expand | 11,12,13,14
3 Do not expand | If e=High, expand else do not expand | 9,10,15,16
4 Do not expand | Expand 9,10,13,14
5 Expand Do not expand 347.8
6 Expand If e=Low, expand else do not expand | 3,4,5,6
7 Expand If e=High, expand else do not expand | 1,2,7,8
8 Expand Expand 1,2,5,6
For this example, we must choose between 8 decision rules, reflecting the fact that each
decision rule must choose at two points in time between two possible choices, and the
fact that the second decision can be made in response to two possible eventualities (low
economic growth vs. high economic growth for [0, tw).
2.5 Relationship between Decision Rules and Terminal Nodes
Understanding the relationship between decision rules and terminal nodes is central to
understanding the tradeoffs and analysis presented in this paper. We therefore pause to
make two remarks on this relationship.
In all but the most degenerate trees, a particular decision rule will be associated with
many terminal nodes. (See Table 1) Each of these terminal nodes represents a scenario —
a particular sequence of events and decisions — that could occur if the decision rule were
in place. The path to this terminal node captures both the sequence of events and the
decision rule’s particular responses to those events that define the scenario. For example,
under decision rule 3, different pattems of events could lead to scenarios associated with
any of the terminal nodes labeled 9, 10, 15, and 16. This is depicted graphically in Figure
2. The fact that there are four such terminal nodes reflects the fact that in this simple
example there are 2 possible events, each with 2 possible outcomes. To understand the
effectiveness of a decision mule, it is necessary to calculate the consequences of the
various scenarios with which it can be associated. Conversely, once equipped with a
preference function for judging consequences we can use the preference function to
express our preference for decision rules as well. We do this by considering the set of
possible scenarios with which each decision rule is associated and the likelihood with
which each of those scenarios can occur under that decision rule.
Figure 2: Representation of a specific decision rule in the decision tree. Depending on which event
sequence takes place (which emboldened path is taken), this decision rule could lead to any of the
‘scenarios represented by terminal nodes 9, 10, 15 and 16.
We noted above that each decision rule will typically be associated with several terminal
nodes. Conversely, a particular terminal node t within a decision tree will in general be
associated with a set of decision rules. The decision rules in that set happen to be
identical in their response to the particular sequence of events represented on the path to
terminal node t, even though they may differ dramatically in terms of the decisions they
would make for other sequences of events. Evaluating the scenario associated with the
terminal node tells us something about the behavior of all of the decision niles. For
example, terminal node 15 in Figure 1 could result from either decision mle 3 or decision.
mule 1. This reflects the fact that both decision mule 3 and decision mule 1 both forego any
expansion in response to a scenario in which low early growth is followed by high later
growth.
2.6 Identifying a Preferred Decision Rule
Consider now the problem of identifying the best decision rule given our problem.
structure. Using the terminology introduced above, we seek to find an optimal decision
tule for expanding our electricity generation capacity over the timeframe from 0 to tg in
the light of uncertainty regarding economic growth from time 0 to time tz.
The next two sections discuss two general approaches for this type of planning problem.
The first of these approaches involves enumeration of the decision rules and evaluation of
each decision rule in tum; the other involves identifying the optimal decision mle through
toll-back on a decision tree. For each case, we will discuss the application of the general
technique to the specific example shown in Figure 1.
3 Identifying Preferred Decision Rule Endogenously
The first approach for identifying an optimal decision rule involves iterating through each.
of the possible decision ules in tum. Each such nule is evaluated by executing the system.
dynamics model on each of the possible scenarios that can . The decision mule is
then scored using the weighted average of the preference function applied to each of
these event-history-specific scenarios where the weighting reflects the likelihood of
occurrence of that scenario. Following Raiffa, we term this procedure a normal form
analysis.
Consider applying this technique to the example whose decision tree is shown in Figure 1.
We begin by considering the set of decision rules that must be compared, as shown in
Table 1. For each possible decision rule, we encode the decision rule as one or more
formulas in the system dynamics modeling package. These formulas determine which
decisions to make at each point in time, possibly taking into account the event and
decision history experienced to this point. The decisions and event structure of this
example is simple enough that the rules shown in Table 1 could likely be implemented in
just one or two formulas in the system dynamics model.
For example, consider the evaluation of rule 3 in Table 1, which foregoes a capacity
expansion at time 0 and then performs one starting at time tq if and only if the economy
experienced high growth from [0, ty). Recall that for expository simplicity we are
assuming that the occurrence of economic growth is exogenous to the model, being
specified for example by a time series or as a parameter specifying growth rate. A
capacity decision in the system dynamics model might be represented as a formula
goveming the flow into an appropriate stock (eg. “generation capacity under
construction”). The formula goveming this flow could be designed so that for a window
of time starting at ty a flow would be generated into the stock if the parameter specifying
the rate of economic growth during [0, ty) indicated a high level of growth.
In order to evaluate a decision rule, it is necessary to run the simulation on each possible
scenario (possible sequence of events and decisions)’ that can occur under that decision
° Im a deterministic model, each of these scenarios would typically be executed in a distinct system
dynamics run. Fora stochastic model, several scenarios may be executed together in a single Monte Carlo
mun.
7 Tf event occurrence were endogenous (as might be the case if events occurrence were dictated by a
stochastic process and by endogenous dynamics) then multiple sequences of events might be examined
within a single Monte-Carlo nm. For example, if electrical generating capacity itself were to
probabilistically influence the economic growth experienced, we would not seek to exogenously impose an
assumed economic growth rate, but allow it to be generated by the system dynamics model itself. This issue
is discussed in much greater depth in a companion paper [Osgood & Kaufman, 2004]. While both
approaches examined in this paper scale gracefully to handle endogenous events, for simplicity we confine
tule. Each simulation nm computes the consequences of a particular scenario. For each
such run we record the likelihood of scenario occurrence and the result of applying the
preference function to the scenario consequences (typically performed endogenously to
the system dynamics model).
For the example, once we have specified the formulas encoding decision rule 3 in the
system dynamics model, we would have to run the model separately for each possible
scenario that can result during the application of this decision rule. These scenarios
(associated with terminal nodes 9, 10, 15, 16 in Figure 1) correspond to different possible
combinations of events — i.e. combinations of high and low early and late economic
growth (LL, LH, HL, HH). After rmning the model on each of these four scenarios in
tum, we record the preference-adjusted outcome (as specified by the appropriate variable
in the system dynamics model) and the probability of the occurrence of each ((1-pg)(1-p,),
(1-pe)pL, Pe(1-p1), DePe).
When the consequences for all possible scenarios are explored for a given decision mle
(eg. 9, 10, 15 and 16 for nile 3), we can complete the evaluation of the decision node.
We do this by determining the expected value of the set of scores that resulted from
applying the preference function to each of the terminal nodes executed. For decision
mule 3, for example, the decision rule’s score would be computed by the formula:
PEPLG + Pe l— PG. + A Pe) PLGs + A- PE)A- PL)Go-
Where we use the notation ¢, to indicate the result of applying the preference function to
the consequence for scenario (terminal node) i.
By performing this procedure for each possible decision rule in tum, we can artive at a
tableaux of the expected preferences for all decision rules. We then select as optimal the
decision mule with the highest expected preference.
Given that each evaluation of the system dynamics model will in general be expensive
(particularly if stochastic effects require running a large set of particular realizations), one
statistic of considerable interest is the total number of particular terminal nodes (specific
scenarios, each corresponding to a unique event and decision history) that require
evaluation in order to identify the optimal strategy. As suggested by Table 1, for the
particular example at hand we need to evaluate 4 terminal nodes in the tree in order to
evaluate each decision rule. Given that we evaluate each decision rule in tum, the total
number of particular terminal node executions required in order to identify the best
possible decision rule is thus 8*4=32.
4 Identifying an Preferred Decision Rule Using a
Decision Tree
The section above described a procedure for evaluating decision rules endogenously
within the system dynamics package. In this section, we introduce an altemative hybrid
approach that supplements the system dynamics model with a decision tree model
specific to the problem. This method relies on the system dynamics model for simulation
of each scenario (terminal node) but eliminates the need to enumerate all possible
decision mules, instead using dynamic programming (via backwards induction) to rapidly
identify a preferred decision rule from the set of preference-adjusted scenario outcomes.
our analysis here to the case where we must exogenously specify event occurrence, with each event history
requiring a single execution of the system dynamics model.
For a case where all events are exogenous®, the hybrid technique begins with the
construction of an explicit representation of the decision tree corresponding to the
situation of interest (for the example, see Figure 1). This initial tree should include a
specification of the branch probabilities but should for the time being leave unspecified
the consequences associated with each terminal node.
Following the construction of the tree, the modeler proceeds with a system dynamics
evaluation of the terminal nodes of the decision tree, each of which represents a particular
scenario of interest characterized by a unique history of chance events and decisions. For
each such terminal node, the system dynamics model is parameterized to represent the
scenario (decision and event history) of interest and the model is rm. During or (less
frequently) following the run, a preference function is applied to the model-computed.
consequences — typically trajectories of particular variables in the system dynamics
models. We then associate the (scalar) value of the preference function with the terminal
node in the decision tree.
For the example we have been using, consider executing terminal node 1 in Figure 1. For
this terminal node, the system dynamics model would first be parameterized to represent
a scenario involving an initial expansion of generation capacity, followed by a high level
of growth over the interval [0,tw), followed by a further expansion of production capacity
starting at time ty and finally culminating in another period of high economic growth
from [ty, tz). Following parameterization, the system dynamics model would be run to
produce the consequences of interest. Because the system dynamics model in this case
includes its own intemal preference function, the model would itself apply this function
to the results of this evaluation and report the result as part of its final output. The final
output value for this variable would be set as the consequence of terminal node 1 in the
decision tree model, for use during the next (rollback) stage.
Two points about this process bear emphasizing.
e The results of executing a given terminal node is wholly determined by the model
itself and, the specific history of events and decisions that play out during its
execution’. The results of executing a given terminal node are not determined by
any other feature of the broader decision rule with which this terminal node may
be associated. As noted in Section 2.5, a particular terminal node will in general
be associated with (can result from) many different decision rules. Although they
may differ entirely in their treatment of other sequences of events, the decision.
tules will share identical responses to the particular sequence of events that gives
tise to the terminal node. As a result, the pattem of causation leading up to the
® This section discusses the application of this approach to the simpler case in which no endogenous
uncertainty is present. The handling of endogenous uncertainty requires a more general approach in which
the tree is constructed dynamically as Monte Carlo ensembles are run on for meta-scenarios consisting of
successively different combinations of possible decisions and any exogenous uncertainties. Contiguous
events are aggregated and approximated so as to allow for meaningful decision rule induction. Within this
approach, each meta-scenario output consists of a risk profile of preference-adjusted outcomes, from which
(because of the properties of preference functions) we extract the mean value for rollback. A companion
paper provides a comprehensive introduction to this combination of system dynamics ands decision
analysis [Osgood & Kaufman 2005] for the more general case where endogenous uncertainty is present and.
discusses many other advantages offered by this technique.
° For a model including endogenous stochastics, any random number generator state would also have to be
included in this list.
terminal node will be identical for these different decision rules and the system.
dynamics state generated by and output generated from executions of these
different decision rules on this particular event history will be identical.
The system dynamics model evaluations here are being performed once for each
possible scenario (in this case, the terminal nodes labeled from 1 to 16) and not
for each possible decision rule. As we will see, in most decision trees there are
far more possible decision mules than there are scenarios.
After having determined the preference-adjusted consequences associated with each
terminal node (scenario), the modeler can use the dynamic programming algorithm of
backwards induction (“rollback”) to identify the preferred decision rule. This process
operates recursively on the decision tree by determining the preferred decision for each
decision node and requires no further system dynamics runs. To help clarify its operation
and running time, we describe the backwards induction algorithm “BI(-)” below in
procedural pseudocode.
Texminal Nodes. ‘The rollback value of a terminal node is simply the value of the
preference function applied to the (time-indexed) system dynamics output for that
node. As noted above, for simplicity we will typically compute the preference
function for the output directly in the system dynamics model.
BI(t) :
retum f(SDOutput(t))
Event Nodes. For event (chance) nodes, the backwards induction algorithm
associates the value of the event node with the expected value of the results of
applying the backwards algorithm to its children.
BI(e):
rum >) pBl)
cechild(e)
Decision Nodes. For decision nodes, the backwards induction algorithm
determines the optimal choice for the node and sets the value of the node to be the
value of that choice. The optimal choice for the node is the choice whose
expected preference value is greatest.
BI(d):
vMax =-1
cMax =nil
force in Child(d)
v =BI(0)
if v >vMax
vMax =v
cMax =c
Optimal Decision(d) =cMax
Retum vMax
In contrast to the normal form analysis presented in the previous section, this technique
requires one run of the system dynamics model for every scenario (terminal node of the
decision tree); the particular example at hand requires 16 execution of the system
dynamics model (one for each of the terminal nodes labeled 1-16) rather than the 32
required for the traditional algorithm. We note that this cost is far below that required by
the normal form analysis. As we will see in the next section, this is typical of evaluation.
of decision rules on nor-trivial trees.
5 Performance Comparison
This section examines the performance burden associated with each of the two techniques
for identifying a preferred decision rule.
Before starting the analysis we note again our assumption that because executing the
system dynamics model is expected to be relatively computationally expensive compared
to construction of or simple arithmetic operations on decision trees. The latter operations
are linear in the count of nodes in the tree, likely with a very small coefficient. As a result,
the performance cost for each technique is measured in terms of the number system
dynamics model runs required.
We will characterize below these performance costs in recursive fashion. We also
provide closed form solutions for these performance costs for the special case of
complete trees of altemating decision and event nodes and which have uniform branching
c for event nodes and d for decision nodes. The correctness of these formulas is
established via an inductive proof included within this paper’s appendix.
5.1 Performance when Identifying Preferred Decision Rule
Endogenously
The first form of analysis we will examine is the normal-form analysis discussed in
Section 3. Recall that when we rely upon the system dynamics model to endogenously
represent decision rules we enumerate the different decision rules, evaluating each in tum.
In order to evaluate each rule, the rule must be encoded using one or more formulas in the
systems dynamics model. In order to evaluate a decision rule, we must execute the
system dynamics model to compute the consequences resulting from adhering to this
decision mule. As we saw in Table 1 for the example, this evaluation process typically
involves several particular runs — each of which characterizes the performance of that
decision rule under different potential event histories.’° Because the simple structure of
the example, each decision mule is uniformly associated with 4 possible scenarios — one
for each of the possible combinations of economic growth in the early and late periods.
As a first step towards quantifying the performance of this approach, we first compute the
count of decision rules that must be encoded in this fashion. We will then consider how
many particular system dynamics runs are required for evaluating the set of all decision
tules for an arbitrary decision tree.
1° Note that for a model with endogenous stochastics, a particular rum of the system dynamics model would
typically conceptually correspond to the evaluation of all terminal nodes at serve as leaves for some subtree
rooted within the decision tree.
5.1.1 Decision Rule Count
Taken forr Ds
_/ < i,
\\Not Taken forr :
Na Dy
Figure 3: Decision nodes requiring specification within a rule; because the root decision has chosen
the upper choice, decisions D, and D, do not require specification.
An important insight for quantifying the count of decision mules associated with a
decision tree is the realization that while decision miles in general specify particular
decisions for decision nodes, they need not specify a specific decision for every decision
node in a decision tree. Instead, decision rules must specify decisions for every logically
consistent set of decision nodes in a decision tree.
To understand this distinction and the notion of “consistency” to which it appeals,
consider the decision tree depicted in Figure 3 in light of a specific decision mule r in
which the decision associated with the top branch of the root (decision) node has been
specified. A decision nile such as r must specify particular decisions to obtain for the
root decision node as well as descendant decision nodes D, and D,. By contrast, decision
tule r need not specify a decision for either D, or Da because the set of decisions that
have been decided upon in r (in particular the decision associated with the root node) will
make it impossible to reach a situation where these D, or Dg are required. We refer to
this phenomenon informally as “decision node occlusion”: Selection of a particular
choice (child) for a decision node d obviates the need for a decision node to specify
decisions within subtrees rooted at other children of d. This observation will have
substantial implications in limiting the count of decision mules that are possible for a
particular decision tree.
A general formula for the number of decision rules can easily be expressed recursively.
For a root node n, we can express a formula s(n) for the number of distinct decision rules
associated with the tree rooted at n. We define s(n) recursively based on the type of
node:
e Event node e For a subiree rooted at an event node, it is not possible for a
decision maker to determine which event will occur. As a result, a given decision
tule will have to independently specify whatever decisions require specification
for each possible outcome of this chance event. For the example shown in Figure
1, adecision mle for a tree rooted at a chance node associated with early growth
level of the economy would have to specify distinct sets of decisions for the case
where the economy experiences high early economic growth and the case where
the economy experiences low early economic growth. Because the decision rule
can independently specify any decisions required for each event outcome, the set
of possible decision rules required under node e is the set of possible
combinations (cartesian product) of the decision rules possible for the subtrees
associated with each possible outcome (child) of the event. The size of that set is
just the product of the number of possible decision rules required for each child of
s@= [] so
the event node, i.e. ceciild(e)
e Decision node d: For the case of a decision node, a specific decision rule need.
only specify exactly one possible outcome of this decision — but will also have to
specify particular decisions required by any decisions beneath this branch (but not
under any other branch). The set of decision rules that are possible for the subtree
rooted at d is simply the union of the set of decision mules that are possible for the
subtrees rooted at each particular child of d, with one twist: Each decision rule
from child c is extended with an extra element that specifies that the parent
(decision) node has made choice c. The count of particular decision rules
associated with this node is thus the same as the cardinality of the union of the set
of decision rules associated with the children of the decision node. ie
s(d) = > s(c).
cechild(d)
e Terminal nodet There is only one (empty) decision mule possible for this node:
Thus, s(t)=1.
The values of these recurrences will depend on the structure of the particular decision tre
on which they are invoked.
Consider a complete tree of altemating layers of decisions and event nodes, where the
decision nodes have d children and the event (choice) nodes have c children. Based on
these recurrences, we can arrive at a closed-form expression for the count of strategies at
height h of the tree (where the terminal nodes are considered to be at height 0). Slightly
abusing the notation introduced above, we can express the number of decision niles
required in the tree as a function of the height in this complete tree:
( J21_ gran |
2) gmath-a.2)
cl
sth)=d
This paper’s appendix includes a straightforward inductive proof for the correctness of
this formula.
5.1.2 Count of Runs Required to Evaluate a Decision Tree
The previous section provided a recursive characterization the number of distinct
decision rules possible for a decision tree of arbitrary structure. For a normal form
analysis, we must iterate through the set of decision rules, evaluating each in tum. Each
terminal node (scenario) execution required by this process must execute within the
system dynamics model and is expected to impose a heavy performance cost. This
section builds on the last section by providing a recursive formula for the count of
particular scenario executions (runs) r(n) required for a decision tree rooted at node n.
The formula for r(n) is based upon the count of decision rules s(n) that are required in
that tree.
Recall from Section 2.5 that in general, each terminal node can be reached by a common
sequence of events under many distinct decision rules. Under normal form analysis a
given terminal node may therefore require execution many times, as part of the
evaluation of many different decision rules. For example, in Table 1, each possible
terminal node (e.g. 1) must be executed to evaluate each of two distinct decision rules.
Not surprisingly, this holds more generally: The number of times a given terminal node
must be evaluated is equal to the number of decision rules that include it. Thus, for the
example under consideration, each terminal node is evaluated twice by the procedure for
nonmal form analysis.
An important insight for quantifying the count of decision mules that could result in
evaluation of a given terminal node T is to consider the general characteristics of those
decision rules in light of T’s location in the decision tree. In particular, consider the
decision nodes located on the path from the root of the decision tree to T. Because the
path terminates in a terminal node (T), we know that each such decision node d on the
path must have some child on the path to T. This child represents a particular specific
choice for decision d, and any decision rule that can lead to T must include that choice.
As a result of this reasoning, given a terminal node T we will in general know many
components of the decision rules that can lead to the scenario represented by T. These
decision rules are not fully fixed — they may vary with respect to the values they associate
with other decision nodes which are not on this path but which are consistent with it (See
Section 5.1.1). These other decision nodes may vary across all of their logically
consistent values.
Casting this insight into a quantitative form, we can construct a recursive expression for
the count r(n) of terminal nodes (scenarios) requiring execution for a given tree rooted at
n
¢ Terminal node. Fora trivial subtree consisting just of a terminal node, we have
just a single nm to execute: r(t)=1.
e Event node. As we saw in Section 5.1.1, for an event node a given decision rule
must specify values for decisions associated with all different possible chance
eventualities associated with the event. There is a combinatorial effect here in
tems of the number of runs required: If a given branch c of the event node itself
requires m distinct runs to evaluate when taken in isolation, each of these runs
must be executed for each of the possible decision rules that must be specified for
all other branches of the event node (note that in computing the number of runs
1(c) required under node c, we have already considered the decision mules lying
within the subtree rooted atc.) Thusr(e)= >’ r(c) [] sh).
cechild(e) bechild(e)&b+c
e Decision node. For a decision node, a particular decision rule must specify one
and only one decision to be made. Because these decisions are mutually
exclusive, a given mn associated with a particular one of the decisions c (i.e. lying
within the subtree rooted at c) needs only to be executed for a subset of decision
tules — namely, decision rules that happen to specify that particular decision c as
the outcome of this decision node. As a result, when we consider the number of
quns required by the decision node as a whole, there are no combinatorial effects
between the count of runs required by each child of that node. Specifically, the
total number of runs required for evaluating this decision node is just the sum of
the number of runs required to evaluate each possible decision of the decision
nodeintum. Thus r(d)= > r(c).
cechild(d)
Just as was the case for the recurrences giving the count of strategies associated with a
decision tree, the number of runs required will vary with the stricture of the tree. To give
a sense of run count scaling, we consider again the complete decision tree introduced
above, in which decision nodes and event nodes form altemating layers, where the
parents of the terminal nodes are all decision nodes, and where decision nodes are
associated with d children and event nodes with c. Once again slightly abusing the
notation above by overloading r to as a function of node height within a complete tree,
we can specify the number of runs required to conduct normal form analysis of a tree of
height h (treating the terminal nodes as lying at height 0) as follows:
naadild
As for the strategy count, we have constructed a straightforward inductive proof of the
correciness of this formula. This proof is included within the appendix to this paper.
5.2 Performance for Preferred Decision Rule Identification
Using a Decision Tree
Consider now the case in which we identify a preferred decision rule using a decision tree
and backwards induction (in short, make use of extensive form analysis). As discussed in
aprevious section, we assume that the performance is dominated by the system dynamics
Tuns required for scenario evaluation and ignore the cost of rollback. In this case,
because rollback proceeds after evaluating each of the terminal nodes, the number of
terminal node executions is simply equal to the number of terminal nodes. While we can.
write this mathematically as |ne Nodes(t) | IsTerminalNode(n)| (where t is the entire
tree), for consistency with the above we describe a recursive procedure for calculating it.
Given the root node n, r(n) will be the count of terminal node executions required to
identify the preferred decision mule using backwards induction.
¢ Terminal node. r(t)=1.
¢ Decisionnode. (d= >) ro).
cechild(d)
e Eventnode. 1(e)= > r(c)
cechild(e)
In order to provide a concrete sense of performance, consider again the case of a
complete tree of height h with altemating layers of decisions and event nodes, where each
decision and event node has d and c children respectively, where the terminal nodes are
considered to lie at height 0, and where the parents of the terminal nodes are decision.
nodes. In this case, the number of terminal nodes (runs) requiring evaluation is
h h
reta)=c id 2
h a)
[2a] 244
(1+c)c* 4+¢ a +mod(h,2)
As for the cases above, we have demonstrated the correctness of this fonmula via
inductive proof, although details are omitted here.
5.3 Comparisons
Runs Runs per Terminal Node
ny (BI
x saan JR, 4 woe ag a cea |-mod(h,2)
y (140) * -mod(h.2) mig a)
da
Extensive] 3/5) i
Foam agi?
Table 2 Closed form expressions for the count of ics executions required to identify a
preferred strategy for a complete tree of height h, with alternating rows of event nodes (each with c
dhildren) and decision nodes (each with d children), where all the parents of the terminal nodes are
decision trees.
Tree Height | #Decision Rules | Extensive form| Normal form run
runoountre() | country)
5 128 32 512
6 16,384 64 131,072
7 32,768 128 262,144
8 1,073,741,824 256 17,179,869,184
9 2,147,483,648 512 34,359, 738,368
Table 3: Summary of decision rule count for complete binary trees of the form noted above, and a
comparison of the number of system dynamics model runs required using normal formand extensive
form:
The performance disadvantages of the normal form approach can be readily appreciated
from Table 2 and Table 3. As seen from Table 2, for the normal form approach the
number of runs required is super-exponential in the height of the tree — the exponent on
the d tem rises geometric with the height of the tree. The extensive form approach is
substantially faster; because the number of terminal nodes in the tree rises geometrically
in the height of the tree, the number of terminal nodes whose evaluation is required
necessarily remains geometric in the tree height. However, the use of backwards
induction eliminates the super-exponential component in the number of runs required to
evaluate the tree.
Performance was also examined on random trees. Such trees are associated with
asymmetric structure and stochastic depth and branch counts for both event and decision.
nodes. For such trees, the gap in performance remained substantial but appeared.
somewhat less dramatic than for full trees. A later paper will describe these results in
greater detail.
5.4 Splitting Decision Rules between System Dynamics and
Decision Trees
It is worth remarking on a technique for reducing the performance costs of identifying a
preferred policy (decision rule) by decision rule order reduction. Some historic models
which have explicitly used both decision trees and system dynamics models have avoided.
the expenses associated with exploring a broad set of decision rules by factoring the
choice of decision rules into two parts, with one component of a decision rule represented.
in the decision tree and the other component within the system dynamics model.
Using this technique, the modeler first identifies certain particularly competitive subspace
of the space of possible decision rules. Frequently this subspace is a thin manifold of
decision rules that can be specified using just a few parameters. Such parameterized
decision rules (and the decisions that make them up) are then encoded within the system.
dynamics model. The parameters of those system dynamics decision rules are then set by
explicit decisions within the decision tree. In this way, a very large number of particular
decisions is reduced to just the few choices of parameter values needed to identify one of
member of the subspace of highly competitive decision mules.
High Economic Growth
10%resavemagin ~~ Consequence
\ Low Economic Growth Co 9
High Economic Growth.
12.5% reserve margin Consequuence3
~~" Low Economic Growth
es Seach Consequence.
High Economic Growth.
= 15%resavemagin ~~ Consequences
Lh AN a
LowEomomic Growth Cprernene6
High Economic Growth
17.5% reservemaryin ~~ Consequence?
“\\_ Low Economic Growth
eer Consequence8
High Economic Growth.
20% reserve margin ye Consequenced
~~ Low Economic Growth
—$§$———._ Consequencel0
For example, consider a decision tree for determining electrical capacity buildout over
time, where the timing and technology of buildout may be affected by the observed
(uncertain) electrical demand and fuel prices. This tree could be represented using a
successive series of choices as to whether to buildout and (if so) which technology to
employ. Using the altemate technique described in this section, the modeler can factor
the decision mule into two pieces.
e The system dynamics model can be used to represent the moment-to-moment
decision rule determining whether to build out given a certain desired reserve
margin. When a buildout is required, the implicit rule may further dictate that any
new expansion uses the cheapest available technology at the time of buildout.
e The decision tree can employ a (root) decision node for the choice of reserve
Margin. Chance nodes in the decision tree can dictate factors such as the rate of
economic growth, changes in fuel prices, and other factors.
By cleverly partitioning decision making between the system dynamics model and the
decision tree such techniques can offer substantial value in reducing the size of the space
of potential decision rules that must be evaluated. For the example cited here, this
reduces the problem from the selection of an optimal sequence of build-or-not-build
decisions (and choice of technologies) to the selection of a single initial decision of
reserve margin.
While very helpful for certain classes of problems, this approach does not escape from
the fundamental tradeoffs analyzed within this paper. In particular, such split-decision-
tule approaches exhibit several limitations, including two listed below:
¢ Difficulty of identifying the competitive (parameterized) set of decision rules
to be considered. In many complex policy choice problems, the subset of policy
options (decision rules) most likely to be competitive is not obvious a priori. As
a result, it may not be possible to parameterize such rules and encapsulate them
within the system dynamics model. In a rich decision tree that represents many
types of events and decisions, it may be highly challenging to identify the
desirable subset of decision rules to that should be implicitly specified. Even for
the sample example discussed here, the most appropriate choice of technology
may depend on the reserve margin due to economies of scale and differing length
of time required to commission a plant. Confining analysis to a very a small
subset of rules (eg. rules in which the technology which is currently least
expensive is employed for build-out) can mss opportunities for identifying such
relationships.
e Hidden re-evaluated substructure. While restricting the set of policy options
being considered can significantly reduce the size of a decision tree, shifting the
evaluations of decision mules to the decision tree obscures the common
substructure among different scenarios. While the split-decision-nile approach
will in some case offer a compelling advantage, the presence of hidden
substructure lessens the computational advantages of such approach. For the
example discussed above, the any of a wide range of operating margins may lead
to identical behavior in response to a variety of particular scenarios -— e.g. rates of
economic growth. When the choice to expand or not to expand is explicitly
represented in the decision tree, this substructure is exposed and can be exploited
through extensive form analysis. When the decision as to whether or not to
expand is represented only in the system dynamics model (parameterized just the
choice of an operating margin within the decision tree), this common substructure
remains hidden. The computational costs of hiding this substructure grow
considerably more apparent when one must consider other decisions, such as the
choice of sub-technology, the lumpiness of the investment, etc. While not shown
here, decision trees can be easily extended to handle an arbitrary number of types
of choices.
In summary, techniques that divide decision rules between the decision tree and the
system dynamics model present an attractive means of lessening the computational
burden of decision mule selection in those limited cases in which we can identify a highly
attractive subspace of decision mules a priori and when the common substructure among
the decision rules within this subspace is sufficiently limited to allow it to be re-evaluated.
for each decision mle without overriding costs.
5.5 Conclusions
This paper has provided a brief introduction to hybrid modeling techniques that combine
system dynamics with decision analysis. The use of decision analytic techniques in
conjunction with system dynamics offer a wide variety of advantages, including
modeling clarity, rapid sensitivity analysis on event probabilities, access to sophisticated
decision analytic tools, nimble interpolation between the results of examined policy
choices, and “offline” policy analysis for certain important classes of policies. This paper
has focused on the performance advantages of such hybrid modeling techniques for
policy selection. These advantages arise from the ability of hybrid models to take
advantage of the common substructure of decision niles by the use of dynamic
programming to select preferred decision niles. The extent of the performance
advantages is sufficiently large that hybrid techniques offer the opportunity to perfomm
substantially more detailed policy analysis than would be possible using system dynamics
alone. By partitioning decision rules between the decision tree and system dynamics
model, additional performance tuning is possible.
While realizing the integration between system dynamics and decision analysis is
currently tedious, we believe that a software framework that achieved this integration
would confer significant modeling advantages. Forthcoming papers will describe the
design and operation of such a framework as well as the generality and additional
advantages of the hybrid approach.
5.6 Acknowledgements
The author wishes to acknowledge several people who played an important role in
contributing related ideas and in making this research possible. Gordon Kaufman
deserves special thanks as an early and enthusiastic proponent of prospects for combining
decision analytic and system dynamics approaches, even in the face of pronounced
skepticism by other modelers. Gordon’s devotion of time and discussion to the topic was
seminal in encouraging the development of the methodology, and he was prescient in
inbuiting many important aspects of the framework that emerged. Andy Ford provided
very helpful comments, and was particularly helpful in highlighting out the relevance and
value of some previously overlooked past work that directly suggested the value of the
class of techniques discussed in Section 5.4. Finally the author is thankful to the
audience members of the System Dynamics Winter Conference and especially Itsung
Tsai and Charlie Lertpattarapong, who helped provide comments on closely related work.
References
[Osgood & Kaufman, 2005] Osgood, N. and Kaufman, G. Decision Analysis and System
Dynanics. In preparation for submission to System Dynamics Review.
Appendix — Proofs of Closed Form Solutions to
Recurrences
This appendix contains a proof of the formulas for the count of decision rules within the
tree and the count of runs required to evaluate those decision rules using normal form
analysis. In both cases, we assume a complete (full) tree of altemating layers of
decisions and event nodes, where the decision nodes (located at odd heights of the tree)
have d children and the event (choice) nodes (located at even heights of the tree at all
levels above height 0) have c children. Finally, the terminal nodes are considered to be at
height 0 of the complete tree.
The basic relations to be proven are that for the count of decision mules in the tree,
expressed as a function of the height h in the tree:
( gH
2) gmdth-t.2)
oth)=al et
and that for the count of runs required when the tree is evaluated using normal form
analysis. Once again, this is expressed as a function of the height of the node in the tree:
| [Re
H walla mana
mad? d
c-1
Because the formula for the count of required runs depends on the formula for the count
of decision rules, we will first prove the strategy formula, and then tum to the formula for
the count of runs.
Proof of Formula for Normal Form Strategy Count
Base case. Terminal nodes (h=0)
Recall: A terminal nod is treated as having a single (empty) strategy associated
withit.
Proof of formula:
EP oan (ath) (2)
s(h)=3(0)= dl Jag Jag ag'-1 oo
Inductive case:
Decision nodes:
Since decision nodes are assumed to lie at odd heights h+1 is odd (ie.
mod(h+1,2)=1). Also worth remarking is that || .|+|
Consider that formula is true for height < h, prove for h+1.
For the case of a decision node, a specific decision rule need only specify
exactly one possible outcome of this decision — but will also have to
specify particular decisions required by any decisions beneath this branch.
(but not under any other branch). The set of decision mules that are
possible for the subtree rooted at d is simply the union of the set of
decision rules that are possible for the subtrees rooted at each particular
child of d, with one twist: Each decision rule from child c is extended
with an extra element that specifies that the parent (decision) node has
made choice c. The count of particular decision rules associated with this
node is thus the same as the cardinality of the union of the set of decision
mules associated with the children of the decision node ie
s(d) = > s(c)
cechild(d)
Reasoning inductively, we have
Bees] (ihe) [Beal (lags
ct ce. cl. J cl cl
y d ‘= dd 4 = =d
iechild(d)
¢ = 1
=d }
—
But this is just d ’, which proves the inductive case for
decision nodes (i.e. the odd levels of the tree).
Event nodes:
Since even t nodes are assumed to lie at even heights h+1 is even (i.e.
mod(h+1,2)=09). Also worth remarking is — in contrast to the case for
decision nodes —| | - || -|2|+1.
For a subtree rooted at an event node, it is not possible for a decision
maker to determine which event will occur. As a result, a given decision
mule will have to independently specify whatever decisions require
specification for each possible outcome of this chance event. Because the
decision rule can independently specify any decisions required for each
event outcome, the set of possible decision rules required under node e is
the set of possible combinations (cartesian product) of the decision niles
possible for the subtrees associated with each possible outcome (child) of
the event. The size of that set is just the product of the number of possible
decision rules required for each child of the event node ie
( |B}a |
2)" _rmah-t.2
et
I] @
icchild(e)
Consider that formula is true for height < h, prove for h+1.
Wehaves(tht)= [] di
iechild(e)
ct cet
( als ( J3h+_ 0) ,
3)" gmoan-12) 30
be) )
TT} awacna)-1.2)
ot
a
e-1
=d =d o
This proves the inductive case for event nodes (i.e. the even levels of the
tree above level 0).
By virtue of having proven the formula for the base case and even and odd.
levels of the tree, we have proven the formula for strategy count for the
entire tree.
We now tum to proof of the formulas for the count of runs required for the normal form
analysis of the tree.
Proof of Formula for Normal Form Run Count
bh Ate )
ay {el
(veyed? ‘4 — |+meh,2)
h
Inductive proof of formulas forrun count; r(h) = dala
Base case (height 0)
For height 0, we have terminal nodes.
Each such node requires only ona single run in nonmal-form analysis >s(0)=1
The formula gives
Bal | 2 {_.]
(9 avesd? led > +002) vost a0 uke ¥(¢ 4
r(0)=c-/d ed Soh eed SKIES
_¢ tte) Flare acre gad g
Inductive case (height > 0)
We consider in tum. the case of decision nodes (located at odd heights) and event
nodes (located at event heights >0)
Decision nodes (odd heights)
For a decision node, a particular decision rule must specify one and only
one decision to be made. Because these decisions are mutually exclusive,
a given mn associated with a particular one of the decisions c (i.e. lying
within the subtree rooted at c) needs only to be executed for a subset of
decision rules — namely, decision mules that happen to specify that
particular decision c as the outcome of this decision node. As a result,
when we consider the number of runs required by the decision node as a
whole, there are no combinatorial effects between the count of runs
required by each child of that node. Specifically, the total number of runs
required for evaluating this decision node is just the sum of the number of
Tuns required to evaluate each possible decision of the decision node in
tum. Thusr(d= >° r(c)
cechild(d)
Assume that we are at a decision node at height h+1, and that the formula
has held for all heights < h. We now wish to demonstrate that the formula
holds for height h +1.
We note that because decision nodes are located at odd heights in the tree,
h+1 is odd and h is even (i.e. mod(h+1,2)=1 and mod(h,2)=0; moreover
mj-[3)).
Now
rih)= ¥ rth-
icdiaay
ny ( (2)
" avad? led My }-mod(h,2)
- > da
icchia(a)
a, (2) ,)
i aad? led cee 40
- > da
icchia(a)
ny (212) nay (Bl
| quad? eg ae let= ba wot hf at
- dig ‘ d3 d \ ) a
Thus the result matches the formula for r(h+1). We have thus proven the
correctness of the formula for odd heights of the tree.
Event nodes
For an event node a given decision rule must specify values for decisions
associated with all different possible chance eventualities associated with
the event. There is a combinatorial effect here in terms of the number of
quns required: If a given branch c of the event node itself requires m
distinct runs to evaluate when taken in isolation, each of these runs must
be executed for each of the possible decision rules that must be specified
for all other branches of the event node (note that in computing the
mumber of runs r(c) required under node c, we have already considered the
decision mules lying within the subtree rooted at c.) For a uniform tree
where h+ is the heightr(h+1)= >° r(h) TI s@.
)
ieqhild(e) bechild(e)&bec
We note that because event nodes are located at even heights in the tree,
h+1 is even and h is odd (i.e. mod(h+1,2)=0 and mod(h,2)=10; moreover
[e2|-[Bs)-(3)+0).
Assume that we are at a event node at height h+1, and that the formula has
held for all heights < h. We now wish to demonstrate that the formula
holds for height h +1. Now
rh+)= Y rh) [] sh
iechild(e) bechild(e)&bec
(jn \
dal _ man.) |
ct
iechild(e)
+mod(h,2) oer
d \ }
iechild(e)
Recognizing that there are c children of t he event node, we have
)
b
(adele 2 lid Bac
vey) jt “t ately
Lely aod? Wd?
aa
Thus the result matches the formula for r(h+), thereby proving the correctness of
the formula for even heights of the tree.
The formula for the number of runs in the tree is thereby proven for the base case and the
inductive case for even and odd heights of the tree. This proves it the fommula for all
heights of the complete tree.
Proof of Formula for Count of Extensive Form Runs
Base Case (Terminal nodes, h=0)
Here, we have only a single terminal node, which implies r;:(0)=1
Fommlacdd!=1 5
Inductive Case
Decision Nodes
We are at height h+1.
Here we have h+1 odd, heven =>|}| =, (2 4]
Assume true ete 7 prove true for height h+1
Here we count of runs required
r¢h) = 7: al. |. net. Gl _ CFT
ie xe dite
This proves the formula for odd layers of the decision tree.
Event Nodes
Weare at height h+1.
Here we have h+1 even, hodd =>| "=| 4.15 ma] a]
Assume true for heights <h, now prove true for height h+1
Here we have the count of runs required
y rhy= J ad eta? - gal dF
rs
icdila() iedila(c)
This proves the formula for even layers of the decision tree.
We have now proved the inductive formula for all heights of the tree greater than.
0.
We have proven the correctness of the formula for nm count for all heights of the tree.
The formula is thus correct.