Root Cause Analysis: Methods and Techniques

Various industries and contexts, including manufacturing, healthcare, information technology, and project management, widely use root cause analysis. Its effectiveness relies on a systematic approach, focusing on preventing future issues rather than just resolving the immediate problem. It is one of the many tools MKA Insights deploys to help our clients with risk mitigation.

There are multiple methods and approaches to conducting root cause analysis (“RCA”); each has its own strengths and weaknesses. The choice of an RCA technique depends upon the relevant issue we’re trying to solve.

Common Root Cause Analysis Techniques


The 5 Whys Technique: A simple yet powerful tool, this involves asking “why” repeatedly (typically five times) to drill down to a problem’s cause. It’s based on the idea that each question will uncover another layer of the issue. For example: A machine stops working. Why? The fuse blew. Why? It overloaded. Why? The machine wasn’t maintaining properly, and so forth. This technique’s strength lies in its simplicity, as it doesn’t require specialized tools. However, it may oversimplify complex issues and heavily depends on the expertise and perspective of those involved.

The Fishbone Diagram (also called Ishikawa or Cause and Effect Diagram): This method visually maps out the causes of a specific problem.  Identify a problem, brainstorm all possible causes, then categorize them into themes like Methods, Machines, People, Measurements, Materials and Environment. The diagram looks like a fishbone, with each “bone” representing a category of causes. This method is often used in manufacturing for quality control or in service industries for process improvement. This approach encourages thorough analysis and is visually intuitive. However, it can become unwieldy with complex problems and it may not identify all root causes.

Failure Mode and Effects Analysis (“FMEA”): FMEA is a systematic, proactive method for evaluating a process to identify where and how it might fail and assessing the relative impact of various failures. It’s used to identify potential failure points and prioritize them based on their severity, frequency, and detectability. Engineers and professionals employ this method in product design, manufacturing processes, and safety-critical systems. It is a systematic, proactive approach that is adaptable for risk management but it can be time-consuming and it requires detailed knowledge of processes.

Fault Tree Analysis (“FTA”): This is a top-down, deductive analytical method. Start with an undesirable event (top of the tree) and work backward, visualizing the pathways that lead to a failure using a decision tree diagram. Particularly useful for understanding complex systems, industries with high risks, such as aerospace and nuclear power, commonly employ it. FTA helps analyze complex systems and identifies both human and mechanical factors, but it requires expertise to construct and interpret, and can be time-intensive.

Pareto Analysis: Pareto Analysis, also known as the “80/20 Rule,” plays a role in RCA by identifying the most significant factors contributing to a problem. It is based on the principle that a small number of causes (20%) often lead to a large number (80%) of the problems. It uses data to identify and prioritize problems or causes based on their frequency or impact. Typically used in business management for resource allocation and in manufacturing for quality control, this technique helps focus efforts on the most impactful issues. Its limitation is that it assumes a specific distribution (80/20), so it may overlook less frequent but still critical issues.

Root Cause Confirmation: Once you identify potential root causes, it’s essential to confirm them. You can achieve this by testing whether removing or altering these causes eliminates or reduces the severity of the problem. This step accompanies any other RCA method as a validation step. It helps ensure that identified causes are truly the root causes, but it can be challenging if the problem is intermittent or rare.

Barrier Analysis: This method identifies barriers (physical, procedural, or human) that failed, were missing, or contributed to the issue. The analysis evaluates why these barriers did not work as intended. A typical use case is accident investigations, especially in safety-critical industries. Useful for understanding failures in safety mechanisms, this is more focused on symptoms than underlying systemic issues.

Change Analysis: This method compares a situation in which a problem occurred to one where it did not occur to identify key differences. This is recognized as troubleshooting or problem-solving in various industries. Effective for problems arising from changes in processes, equipment, or materials, it also requires a comparable situation where the problem did not occur.

Each of these methods has its strengths and is suitable for different types of problems and industries. Often, a combination of these methods is used to get a comprehensive understanding of the root causes of a problem. The selection of a particular method depends on the nature of the problem, the complexity of the system, the skills of the team conducting the analysis, and the resources available