Root Cause Analysis in Workplace Safety: 5 Methods Every OHS Professional Needs
The Accident That Keeps Happening
A warehouse worker slips on a wet floor near the loading dock and sprains their wrist. The incident is recorded. The report reads: "Operator failed to observe wet floor." The corrective action: retrain the worker on housekeeping awareness. Box ticked. Case closed.
Six months later, a different worker slips in the same location and fractures their elbow.
This is the most common failure in workplace safety management: treating the symptom instead of the disease. The worker's inattention was the proximate cause — the immediate trigger. But the root cause was systemic: no drainage solution at the dock, no slip-resistant surface, no warning system for wet conditions. Those underlying factors were never identified, so they were never fixed.
Root cause analysis (RCA) is the structured process that prevents this pattern. It is what transforms a single incident report into lasting organizational change.
What Is Root Cause Analysis?
Root cause analysis is a systematic method for identifying the underlying causes of an incident or nonconformity — not just the immediate trigger — so that corrective actions address the actual source of the problem and prevent recurrence.
The distinction that matters:
| Level | What It Is | Example |
|---|---|---|
| Immediate cause | The direct trigger of the event | Worker slipped on wet floor |
| Contributing cause | Conditions that enabled the event | No warning sign was placed |
| Root cause | The systemic failure that allowed the problem to exist | No procedure for managing dock drainage after rain |
Addressing only the immediate cause produces compliance theater. Addressing the root cause produces actual prevention.
Why RCA Is a Legal and Regulatory Requirement
Root cause analysis is not optional in a formal OHS management system:
- ISO 45001 Clause 10.2 requires organizations to "evaluate the need for action to eliminate root causes of incidents and nonconformities, in order that they do not recur or occur elsewhere"
- ISO 45001 Clause 10.2.2 requires documented evidence of the investigation and its findings
- OSHA 29 CFR 1904 (US) and equivalent national legislation require incident investigation with corrective action
- ISO 45001-certified organizations must demonstrate RCA capability to pass Stage 2 and surveillance audits
Organizations seeking or maintaining ISO 45001 certification will find that a well-documented RCA process is one of the clearest demonstrations of Clause 10 compliance.
The 5 Levels of Causal Analysis
Before selecting a method, understand the depth you need to reach:
| Causal Level | Question | Example |
|---|---|---|
| Immediate cause | What happened? | Chemical splashed onto worker's face |
| Enabling cause | What allowed it to happen? | No splash guard on the container |
| Systemic cause | Why was no guard present? | Specification didn't require it |
| Management cause | Why was the specification inadequate? | No review process for equipment standards |
| Root cause | Why does that gap exist? | OHS management system lacks change-triggered review |
Most incident investigations stop at level 1 or 2. ISO 45001 requires reaching the systemic level — and documenting the path.
5 Root Cause Analysis Methods
Different incidents require different analytical tools. Here are the five methods most widely used in occupational safety.
Method 1: 5 Whys
What it is: Ask "why?" five times — each answer becomes the basis for the next question — until the root cause is reached.
Best for: Single-factor incidents; near misses; lower-severity events; quick investigations.
Real example (from our near miss reporting article):
A worker almost received a chemical splash while decanting a solvent.
- Why? → The container overflowed
- Why? → The worker couldn't see the fill level
- Why? → The container was opaque with no level indicator
- Why? → Container specification didn't require a level indicator
- Why? → The specification had not been reviewed since the solvent type changed
Root cause: No change-triggered review process for equipment specifications.
Corrective action: Update specification standard; require level-visible containers for all solvents; implement a change management procedure.
Limitation: 5 Whys follows one causal chain. When an incident has multiple independent contributing factors, it can miss important branches. For those situations, use Fishbone.
Method 2: Fishbone Diagram (Ishikawa / Cause-and-Effect Diagram)
What it is: A visual diagram that maps all possible contributing causes of an incident across six categories, branching from a central "spine" pointing to the incident (the fish head).
The 6M categories:
| Category | What It Covers |
|---|---|
| Man | Human factors: training, fatigue, distraction, behavior |
| Machine | Equipment condition, maintenance, design flaws |
| Method | Work procedures, instructions, processes |
| Material | Raw materials, chemicals, consumables, PPE |
| Measurement | Inspection practices, monitoring systems, metrics |
| Mother Nature (Environment) | Physical environment, lighting, noise, weather, layout |
Real example: A forklift struck a pedestrian in a warehouse aisle.
Man: Operator was not trained on pedestrian awareness; pedestrian was using an unmarked route. Machine: Reversing alarm had been faulty for two weeks; no proximity sensor installed. Method: No documented traffic management plan; no demarcated pedestrian routes. Material: No physical barriers between pedestrian and vehicle zones. Environment: Poor lighting in the rear section; congested layout after equipment reorganization. Management: Last safety audit was 14 months ago; a previous near miss had been reported but not investigated.
The Fishbone reveals that this was not a "driver error" incident — it was a systemic failure across five categories simultaneously.
Best for: Complex incidents with multiple potential contributing factors; high-severity incidents; ISO 45001 Clause 10 compliance documentation.
Limitation: Identifies where to look, but doesn't automatically identify which causes are root causes versus contributing factors. Combine with 5 Whys drilling into the most critical branches.
Method 3: Fault Tree Analysis (FTA)
What it is: A top-down, deductive method that works backward from an undesired event (the "top event") through logical AND/OR gates to identify the combination of failures that could cause it.
Structure:
- Top event: The undesired outcome (e.g., "Worker receives fatal electric shock")
- Intermediate events: System-level failures leading toward the top event
- Basic events: Individual component failures or human errors at the bottom of the tree
- AND gates: All inputs must occur simultaneously for the output to occur
- OR gates: Any single input is sufficient to produce the output
Example: For an electrical fatality:
- Top event: Worker receives fatal shock
- AND: Worker contacts live conductor
- AND: Conductor is energized
- OR: Lockout/tagout procedure not followed
- OR: LOTO procedure exists but inadequate
- AND: Procedure not reviewed after equipment change
- AND: No verification step in procedure
FTA reveals which failure combinations are most critical — AND gates represent higher-risk scenarios because multiple simultaneous failures are required, while OR gates reveal single points of failure.
Best for: Complex mechanical or electrical systems; high-consequence scenarios in energy, chemical, and heavy industry; proactive risk assessment (before an incident occurs).
Limitation: Requires specialized knowledge; time-intensive to construct; needs software tools for large systems.
Method 4: Bow-Tie Analysis
What it is: A visual risk model that places a hazard at the center (the "knot") and maps both threats (causes, on the left) and consequences (outcomes, on the right), with control barriers on each side preventing escalation.
[Threats] → |Barriers| → [HAZARD EVENT] → |Barriers| → [Consequences]
Why it is powerful for OHS:
The Bow-Tie integrates two analyses in one diagram:
- Left side (prevention): Why does an incident happen? What prevents it?
- Right side (mitigation): If it happens, what limits the harm?
The barriers correspond directly to the hierarchy of controls (elimination → substitution → engineering controls → administrative controls → PPE), making it ideal for ISO 45001 compliance documentation.
Example: Hazard = Chemical release in manufacturing
Left threats: pipe corrosion; operator overfill; valve failure; maintenance error Left barriers: inspection schedule; fill level alarms; pressure relief valves; LOTO procedures Right consequences: worker exposure; fire; environmental contamination Right barriers: PPE; emergency shutdown; spill containment; evacuation procedure
Best for: High-hazard process industries; demonstrating control adequacy to regulators and auditors; combining risk assessment and incident investigation; visualizing control gaps.
Limitation: Shows what controls exist, but not why they failed — combine with 5 Whys or Fishbone to investigate barrier failures after an incident.
Method 5: SCAT (Systematic Cause Analysis Technique)
What it is: A structured, checklist-driven method developed by the International Loss Control Institute (ILCI), aligned with Frank Bird's Safety Pyramid model. SCAT follows a standardized four-level chain:
- Contact → What was the nature of the incident? (energy transfer, hazardous substance, etc.)
- Immediate causes → Substandard acts and substandard conditions
- Basic causes → Personal factors (skill, motivation, physical/mental state) and job factors (leadership, standards, equipment)
- Lack of control → Management system gap that allowed the basic causes to exist
Why SCAT is valued in corporate OHS systems:
The SCAT checklist provides over 250 pre-defined cause categories across the four levels, making investigation consistent and systematic regardless of which investigator conducts the analysis. Large organizations and auditors can compare investigations across sites, time periods, and incident types.
The connection to Bird's Triangle (which underpins our near miss reporting framework) means SCAT data can also feed directly into leading indicator trend analysis.
Best for: Large organizations that need standardized, comparable investigation data across multiple sites; organizations with formal OHS management systems seeking audit-ready documentation.
Limitation: The checklist structure can create a checkbox mentality if investigators are not trained to think beyond the pre-defined categories.
Which Method Should You Use? Decision Guide
| Incident Type | Recommended Method | Why |
|---|---|---|
| Simple near miss or minor injury (single causal chain) | 5 Whys | Fast, practical, sufficient depth |
| Injury with multiple equipment/human factors | Fishbone (Ishikawa) | Maps all contributing categories |
| Complex mechanical or electrical system failure | Fault Tree Analysis | Logical rigor for multi-component systems |
| High-consequence or fatal incident | Bow-Tie Analysis | Visualizes all threat/consequence pathways |
| Corporate multi-site, standardized investigation | SCAT | Consistent, comparable, audit-ready |
| Any of the above + ISO 45001 Clause 10 documentation | Combine as needed | Use method + document findings formally |
For most OHS practitioners in construction, manufacturing, and logistics, Fishbone + 5 Whys drilling is the most practical combination: Fishbone maps the full causal landscape; 5 Whys drills into the highest-priority branches.
The 8-Step RCA Process
Regardless of which analytical method you choose, a rigorous investigation follows this sequence:
Step 1: Secure the scene and immediate response Preserve physical evidence before it is disturbed. Photograph the scene, equipment condition, and any environmental factors. Do not move or repair anything until documented.
Step 2: Gather evidence Collect witness statements (separately, not in groups), review CCTV footage, inspect equipment, obtain maintenance records, review training records for all involved workers.
Step 3: Build the timeline Reconstruct the sequence of events in chronological order. Include what was happening in the broader workplace before the incident — shift changes, concurrent activities, recent maintenance. Gaps in the timeline are often where the root causes hide.
Step 4: Assemble the investigation team Include the direct supervisor, a worker representative, a safety professional, and at least one person independent of the area involved. Independence matters — people who work in the area are prone to normalization bias.
Step 5: Apply the chosen RCA method Work systematically. For Fishbone: populate every category before drawing conclusions. For 5 Whys: commit to reaching level 5 before accepting a root cause. Avoid stopping when you reach "human error" — that is almost always a contributing cause, not a root cause.
Step 6: Validate the root cause Apply the validation test: "If this cause were eliminated, would this incident be prevented from recurring?" If the answer is yes, you have a root cause. If the incident could still occur through another mechanism, keep investigating.
Step 7: Define corrective and preventive actions For each root cause, define:
- Corrective action: What will fix this specific root cause?
- Preventive action: What broader systemic change prevents similar root causes elsewhere?
- Owner: Who is responsible?
- Due date: When will it be completed?
- Verification: How will closure be confirmed?
Step 8: Implement, verify, and close Assign actions to owners with clear deadlines. Verify implementation — not just that the action was completed, but that it was effective. Communicate findings and outcomes to the workforce. Close the investigation formally with documented evidence.
The Most Common RCA Mistakes
The "operator error" trap
Blaming an incident on worker carelessness or inattention is not root cause analysis — it is the stopping point that prevents root cause analysis. Human error is almost always a symptom of systemic failures: inadequate training, unclear procedures, poor equipment design, excessive workload, or organizational pressure to skip steps.
James Reason's Swiss Cheese Model reminds us: a single human error rarely causes a serious incident. It is the alignment of holes in multiple defensive layers — training, supervision, equipment, procedure, environment — that creates the pathway to harm. RCA must identify all the holes.
Stopping too early
Most investigators stop at level 2 (contributing causes). The test: ask "why?" one more time after every answer that feels like a conclusion. If a reasonable answer exists, the investigation is not finished.
Single-cause thinking
Complex incidents rarely have a single cause. The forklift example above had six simultaneous contributing factors. An investigation that identifies only one misses five ongoing hazards.
Findings without action
The most expensive RCA mistake is completing a thorough investigation, writing a detailed report, and filing it in a drawer. ISO 45001 Clause 10.2 requires that corrective actions be implemented and their effectiveness verified. Documented evidence of closure is required for certification audits.
How AI Tools Transform Incident Investigation
Traditional RCA is paper-intensive and slow. Modern AI-powered tools change what is possible:
AI-assisted photo analysis: Incident photographs are automatically analyzed for physical hazards, equipment conditions, and environmental factors — expanding the evidence set before the investigator arrives on-site.
Pattern matching across incidents: AI can identify whether the current incident resembles past incidents in other locations, surfacing previously identified root causes that may apply — reducing the risk of missing recurrent systemic issues.
Guided investigation workflows: AI can prompt investigators with the right questions based on the incident type, ensuring no causal category is overlooked — particularly useful for less experienced investigators.
Automated corrective action tracking: Every identified root cause is linked to a corrective action with an assigned owner, target date, and verification step. No manual spreadsheet management required.
Trend dashboards: Aggregating RCA findings across incidents reveals systemic patterns — the same root causes appearing across different locations or incident types — that a single-incident investigation would miss.
This is how FindRisk supports the complete incident investigation workflow: from AI-powered photo capture at the scene, through guided root cause analysis, to corrective action assignment and ISO 45001-ready documentation.
For a comprehensive look at how AI is reshaping every aspect of occupational health and safety, read our article on how AI is transforming occupational health and safety.
RCA and the Near Miss Connection
The most efficient use of root cause analysis is not after a serious accident — it is after a near miss. Near misses are identical in causal structure to serious incidents: they have the same systemic failures, the same control gaps, the same underlying management system weaknesses. The only difference is that the consequences have not materialized yet.
An organization that applies rigorous RCA to near misses is systematically dismantling the base of the Heinrich/Bird Safety Pyramid before it produces serious injuries. This is the highest-leverage use of RCA capability.
For a complete guide to building a near miss reporting culture, see our article on near miss reporting in the workplace.
Frequently Asked Questions
How long does a root cause analysis take?
It depends on the incident severity and the method used. A 5 Whys analysis for a minor near miss can be completed in 30–60 minutes. A Fishbone analysis for a lost-time injury with multiple factors typically requires 2–4 hours of investigation plus 24–48 hours for evidence gathering. A full Fault Tree Analysis for a serious incident in a complex system may take days to weeks. ISO 45001 does not specify a time requirement, but the standard does require that investigations be completed and corrective actions implemented in a timely manner.
Should we conduct RCA for every incident, including minor ones?
Yes — with proportionate depth. A minor near miss warrants a 5 Whys analysis documented in 10 minutes. A serious incident warrants a full multi-method investigation. The principle is consistent: every incident that is reported should have at least a basic causal analysis and a corrective action. Near misses that are systematically investigated prevent the serious incidents that follow.
What is the difference between 5 Whys and a Fishbone diagram?
5 Whys follows a single causal chain: one answer leads to the next why, in a linear sequence. It is fast and practical for simple incidents. Fishbone maps multiple causal categories simultaneously — Man, Machine, Method, Material, Measurement, Environment — allowing investigators to capture all contributing factors before identifying which are root causes. Use 5 Whys when the causal chain is straightforward; use Fishbone when you suspect multiple independent contributing factors.
Who should conduct a root cause analysis?
The investigation team should include: the area supervisor (accountability), a worker representative (frontline perspective), a safety professional (methodological guidance), and at least one person independent of the area (objective challenge to assumptions). For serious or fatal incidents, an external consultant or senior OHS professional should lead or review the investigation. Independence is critical — people who work in the area are prone to accepting "that's just how things are done" as an explanation.
How should RCA findings be documented for ISO 45001 compliance?
ISO 45001 Clause 10.2.2 requires documented information on: the nature of the incident; immediate causes; root causes identified; corrective actions taken; verification of effectiveness. The document must be retained as evidence. Many organizations use a standardized investigation report template that captures all required fields, links to the risk assessment register, and records the corrective action completion date and verification method. Digital tools that generate audit-ready reports automatically are increasingly used to ensure compliance without administrative burden.
Conclusion
Root cause analysis is the hinge between incident response and genuine prevention. Without it, the same hazards recur, the same workers get hurt, and safety management becomes a cycle of reactions rather than a system of controls.
The five methods covered in this guide — 5 Whys, Fishbone, Fault Tree Analysis, Bow-Tie, and SCAT — are not competing approaches. They are complementary tools, each suited to different incident types and investigative contexts. The most effective OHS professionals know which tool to reach for and when to combine them.
What they all share is the same underlying discipline: keep asking why until you reach the system, not the person. Fix the system, and the person no longer needs to be perfect.
Download FindRisk to enable your team to capture incident evidence instantly in the field — with AI-powered analysis, guided root cause investigation, and automatic corrective action tracking built into a complete ISO 45001-ready workflow.
Try FindRisk
Ready to modernize your safety workflow?
Conduct AI-powered risk assessments, generate reports instantly, and keep your team safe — anywhere, anytime.
