Metrics

6 Steps to Fault Tree Analysis (FTA)


March 6, 2026
table of content

Navigating the complex landscape of system failures requires a methodical approach. Fault Tree Analysis (FTA) offers a systematic way to examine potential failure points, helping organizations preemptively mitigate risks.

In this guide, we will walk you through the steps to conduct a thorough FTA, from building a diverse team to developing risk mitigation strategies.

Step 1: Build a diverse team

When dealing with complex systems, you want different voices in the room. Experienced professionals in the field will be able to reference past experiences from their professional life. They will also be aware of the technical aspects of the system that impact them the most. Other team members with less technical knowledge can contribute by pitching out-of-the-box ideas and other helpful information.

Brainstorming sessions and meetings need a leader, someone who has experience in conducting FTA. Engineers of respective fields, industrial engineers, and system design specialists are required for any FTA team.

Step 2: Identify failure causes

FTA works from the top down. Start with the top event, then try to identify the various failures that could cause or contribute to it. If you keep digging to build off of each event, it will eventually lead you to the root causes (now that’s what we call getting your hands dirty!). You will be left with a beautiful fault tree.

Potential failures, their characteristics, duration, and different impacts of the failure have to be defined to start and complete the process. Take fire doors in a high traffic area or factory as an example. These doors are held open until the power fails or the fire alarm is triggered. If the fire alarm is faulty, there is an issue with the wiring, the backup batteries have run low, or someone has tampered with it. The alarm will trigger the doors to close when they are not supposed to.

Resulting in a low-level failure, but one that can cause massive frustration and interrupt the entire organization.

Step 3: Understand the inner workings of the system

The team performing FTA needs to have a deep understanding of the inner workings of the system. The engineers working at the system level will have a good idea of how everything works and what failures you will want to avoid. Other team members can then raise questions that result in an expanded list of failure causes worth exploring. Someone with knowledge and expertise of the system should be in charge of guiding the discussion.

The goal is to get a good grasp of the system’s requirements, connections, and dependencies. Your team should collect the schematics of the system, specifications of different components, and other available manufacturer information. If you’re using an CMMS like Limble, these asset specifications are available at the touch of a button.

Studying these materials should build an understanding of how each sub-system and component are connected to each other.

Step 4: Draw the FTA diagram

Once the team understands the system’s inner workings, the next step is to graphically present a functional map of the system using boolean logic. Using the fault tree symbols and structure above, your team can draw the graphical representation of the system and how they are all connected.

Step 5: Identify MCS, MPS, or CCF

After the fault trees are complete, your team can identify MCS, MPS, or CCF based on what they want to accomplish.

  • MCS or minimal cut sets are identified to know the most vulnerable parts of the system.
  • MPS or minimal path sets are determined to identify the core components and subsystems required to remain operational.
  • CCF identifies the components that cause the maximum number of failures.

Your reason for performing FTA in the first place will determine whether the team needs to find MCS, MPS, CCF, or a combination of the three.

Optional step: Assess the probability of failure

More often than not, you’ll find multiple pathways that can lead to the same failure event. For an extensive system, it would be nearly impossible to address all failure causes at once. To prioritize which events to address first, the team can calculate the probabilities of each failure for different critical sets. The critical set with the highest chance of failure should be given top priority. This is an optional but valuable step.

If you know the probability of each failure, it will be worth the time to use them!

Step 6: Develop risk mitigation strategies

Now it is time to use your Fault Tree Analysis to minimize your risk of failure.

  • High priority has to be given to protect MPS (the minimum set of components to keep the system operational).
  • Strict maintenance schedules have to be maintained for CCFs as they can cause a multitude of issues.

One potential risk mitigation strategy, especially for CCFs, is preventive maintenance.

A CMMS software like Limble can help you assure adherence to required maintenance schedules. This includes following the best practices for spare parts management, so the maintenance team always has replacement components in stock. This effort has to be put in to minimize the probability of failure.

Additional resources

As you can tell, a lot of research and expertise has gone into developing the Fault Tree Analysis process. If you would like to dive deeper into this subject, check out these additional resources:

related articles
10 Facility Management KPIs You Need to Start Tracking Today

Learn more
6 Steps to Fault Tree Analysis (FTA)

Learn more
A Quick Guide to Functional Failures: Definition, Examples & Common Causes

Learn more
An Introduction to Equipment Failure Patterns

Learn more
Analyzing Machine Downtime

Learn more
Calculating & Increasing the Efficiency of a Machine

Learn more
Calculating and Improving Mean Time to Repair

Learn more
Equipment Failure Causes, Risks, and How to Avoid Them

Learn more
Explaining The P-F Curve And The P-F Interval

Learn more
Failure Codes: A Guide to the Basics

Learn more
Fault Detection And Diagnostics In Equipment Maintenance

Learn more
First Pass Yield (FPY)

Learn more
First Time Fix Rate (FTFR)

Learn more
Guide to Understanding Failure Metrics

Learn more
How to Calculate and Improve MTBF

Learn more
How to Do a Root Cause Analysis

Learn more
How to Improve and Calculate MTTD

Learn more
How to Perform Criticality Analysis

Learn more
Idle Time

Learn more
Mean Time To Failure (MTTF)

Learn more
Mean Time To Repair (MTTR)

Learn more
OEE Calculation: Formulas, Examples, and Tools to Measure OEE

Learn more
Overall Equipment Effectiveness (OEE)

Learn more
Overall Operations Effectiveness (OOE)

Learn more
Performing FMEA and FMECA with Examples for Maintenance

Learn more
Planned Maintenance Percentage (PMP)

Learn more
Preventive Maintenance Compliance (PMC)

Learn more
Reliability, Availability, Maintainability (RAM) Concept Explained

Learn more
Replacement Asset Value (RAV)

Learn more
Reporting System Downtime: Strategies for Accurate Tracking of Production

Learn more
Schedule Compliance

Learn more
Six Big Losses in Manufacturing: How to Improve OEE

Learn more
The Basics of Field Failure Analysis

Learn more
The Simple Guide to Maintenance KPIs & Metrics

Learn more
Top 8 Asset Maintenance KPIs to Boost Performance

Learn more
Total Effective Equipment Performance (TEEP)

Learn more
What Is Event Tree Analysis?

Learn more
What is DFMEA Analysis?

Learn more
What is FMECA Analysis?

Learn more
What is FRACAS? Failure Reporting, Analysis, and Corrective Action System

Learn more
What is Maintenance Excellence Index & How Does It Drive Profit?

Learn more
What is PFMEA?

Learn more
Work Order Completion Rate

Learn more
Wrench Time

Learn more

Ready to learn more about Limble?

Schedule a demo or calculate your price right away.

Schedule demo