System Dynamics Blog


Models, Maps, and Levels of Evidence

by | Dec 22, 2020

Anyone who has attended a System Dynamics conference in recent years, or has read past posts in the System Dynamics Society blog, is surely aware of differences of opinion on the value of qualitative maps as opposed to quantified simulations.

What you may not know is that this debate has been going on for a long time, stretching back to the early 1980s, when Eric Wolstenholme and others asserted that one might be able to infer dynamics from qualitative maps.  In 1990, Peter Senge’s book The Fifth Discipline upped the ante by suggesting that certain problems might be categorized according to System Archetypes, which some took to mean that one could go directly to solutions without first simulating.  This became known as Systems Thinking.

In 2000, Geoff Coyle wrote a paper (SDR 2000, 16:3) that pointedly asked whether there might be situations so uncertain that quantified modeling cannot tell us more than qualitative mapping alone.  Rogelio Oliva and I wrote a rejoinder (SDR 2001, 17:4) that challenged this idea and sought to reclaim Jay Forrester’s original view of simulation as the necessary test bed for any dynamic hypothesis that might conceivably lead to policy decisions.

Even now, 20 years later, the debate continues.  We see in our conferences increasing use of Group Model Building that goes no further than a qualitative map, and from which the authors claim to have derived dynamic insights.

Some have been dismayed by this development—which appears to be the further expansion of Systems Thinking—fearing it is diluting and dumbing down the fundamentals of our field.  The modelers accuse the mappers of lacking rigor, while the mappers say that good group process has a rigor of its own.

What can we do about this long-brewing pot of trouble?  Is there any way out of the impasse?

I’d like to suggest a possibility.  Several years ago, I published a paper (SDR 2014, 30:1-2) describing how a “levels of evidence” approach—a standard for classifying work in the biomedical sciences—might be applied in SD.  To achieve an “A” level of evidence, one would need both strong structural and behavioral evidence and the ability to reliably test one’s model.

Structural evidence comes from conversations with subject matter experts and focused studies of cause and effect.  Behavioral evidence comes from a comparison of model output with time series data and historical records.  Work with strong support for structure but not behavior, or behavior but not structure, would achieve at best a “B” rating.  Work with strong support for neither would get a “C” rating.

In the biomedical sciences, works that have “B” or “C” ratings can still be presented at conferences and even appear in prestigious journals.  A rating less than “A” does not mean poor quality but rather lack of full, iron-clad reliability for drawing conclusions and making decisions; that is, something more exploratory or tentative.  Its level of evidence is designated in the conference proceedings or at the top of the paper so that the audience knows what they are dealing with—a decisive work (one from which decisions can be made with confidence) or something less than that.

It seems to me we can apply the Levels of Evidence filter objectively across both simulation models and qualitative maps.  Let’s start with the simple fact that a simulation model can be tested formally, while a qualitative map cannot.  It follows that the best possible rating for a simulation model is “A”, while the best possible for a qualitative map is “B”.
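To make the rating rule concrete, here is a minimal sketch in Python of how the ceiling on a work's rating might be computed from the three criteria described above. The names `Work`, `evidence_level`, and the three boolean fields are hypothetical, chosen only for illustration; they do not come from the 2014 paper. Reducing "strong evidence" to a yes/no flag is also a simplifying assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Work:
    """A piece of SD work, described by three yes/no judgments (illustrative only)."""
    strong_structural_evidence: bool  # e.g. expert conversations, cause-and-effect studies
    strong_behavioral_evidence: bool  # e.g. model output compared with time series data
    formally_testable: bool           # True for a simulation model, False for a qualitative map


def evidence_level(work: Work) -> str:
    """Return the best rating the work could receive under the rule in the text."""
    strong_count = int(work.strong_structural_evidence) + int(work.strong_behavioral_evidence)
    # "A" needs both kinds of strong evidence AND the ability to test the model.
    if strong_count == 2 and work.formally_testable:
        return "A"
    # Strong support on at least one side caps the work at "B".
    if strong_count >= 1:
        return "B"
    # Strong support for neither structure nor behavior.
    return "C"


# A well-supported simulation model versus a well-supported qualitative map.
simulation = Work(True, True, formally_testable=True)
qualitative_map = Work(True, False, formally_testable=False)
print(evidence_level(simulation), evidence_level(qualitative_map))
```

Under these assumptions a qualitative map can never return "A", because `formally_testable` is always false for it, while a simulation model with weak evidence on both sides falls to "C".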

If we can agree on this much, then we may be able to find a way for simulation models and qualitative maps to coexist.  It would require acknowledgment from the advocates of qualitative maps that their work can never be considered decisive.  And, it would require acknowledgment from the advocates of simulation that a model lacking sufficient evidence may be no more reliable than a well-developed qualitative map.


2 thoughts on “Models, Maps, and Levels of Evidence”

  1. If one wants to compare a qualitative and a quantitative work, the first difference is not about the methodology but about the quality of the work. A quantitative work may be less credible than a qualitative one, and vice versa, depending on their respective quality. Once it is verified that their level of quality is approximately the same, one can start to discuss their methodology. As for the A, B, and C ratings, the idea is good, except that one cannot verify the outputs of a qualitative model, which makes the verification of the model itself more difficult. Hence it is more difficult for a qualitative model to get an A. But for a quantitative model, the outputs must be comparable to some kind of reality or judgment about it; otherwise the output will have neither utility nor quantitative quality. For me, what is important for a model, whether qualitative or quantitative, is its capacity to help people make good decisions, and to do so effectively.
    Besides that, it is true that one can find many qualitative studies that seem totally incredible, especially to someone used to building quantitative models. Building systems thinking models is easier than building SD models only in a formal sense. A model that is supported by a quantitative mechanism of verification is ultimately easier to build than a qualitative one that lacks this facility.
    The reality is that systems thinking attracts more people just because it is easier than SD and less costly for consultants or any practitioner who sells his work as an employee. Nobody will change this fact. It is also true that the SDS, by accepting both systems thinking and SD, reduces its specialization. It becomes more difficult to satisfy people who think more differently.
