Skip to main content
Fidelis Associates

What is Reliability-Centered Maintenance (RCM)? A Complete Guide

Fidelis AssociatesPublished: 2026-02-26

What is Reliability-Centered Maintenance (RCM)? A Complete Guide

Author: Fidelis Associates | Published: 2026-02-26 | Last Updated: 2026-02-26

Meta Description: Reliability-Centered Maintenance (RCM) replaces time-based maintenance schedules with failure-mode analysis to reduce unplanned downtime by 25-50%. Learn RCM methodology and implementation.


Definition

Reliability-Centered Maintenance (RCM) is a structured methodology for determining the most effective maintenance strategy for physical assets based on their failure modes, failure consequences, and operating context. Rather than replacing or inspecting equipment on fixed time intervals, RCM analyzes how equipment can fail, what the consequences of each failure mode are, and then selects the most appropriate maintenance task to manage each failure — whether that is preventive, predictive, condition-based, or run-to-failure.

Developed originally for the commercial aviation industry in the 1960s-70s and formalized in SAE JA1011/JA1012, RCM has been widely adopted across petroleum refineries, petrochemical plants, chemical manufacturing facilities, LNG terminals, power generation facilities, and military applications. Organizations that implement RCM properly typically see 25-50% reductions in unplanned downtime and significant optimization of maintenance spending.


Table of Contents

  1. The Problem RCM Solves
  2. The Seven RCM Questions
  3. RCM vs. Traditional Maintenance
  4. Types of Maintenance Tasks
  5. How Do You Implement Reliability Centered Maintenance?
  6. Common Implementation Mistakes
  7. RCM Results and Metrics

The Problem RCM Solves

Traditional maintenance programs are often built on two assumptions:

  1. Equipment wears out predictably — so replacing components at fixed intervals prevents failures
  2. More maintenance is better — so increasing PM frequency improves reliability

Both assumptions are often wrong. Research from United Airlines (which led to RCM's development) found that only 11% of equipment failures follow an age-related pattern. The remaining 89% of failures are random — meaning time-based replacement doesn't prevent them and may introduce infant mortality failures from the maintenance activity itself.

RCM solves this by asking: "What is the most cost-effective way to manage each failure mode?" The answer varies by equipment, failure mode, and consequence — and it isn't always "do more maintenance."


The Seven RCM Questions

SAE JA1011 defines the RCM process through seven questions applied to each asset in its operating context:

1. What are the functions of the asset?

Define what the asset is expected to do in its specific operating context. Functions include both primary functions (what it's designed to do) and secondary functions (containment, safety, environmental, appearance).

2. How can it fail to fulfill its functions?

Identify functional failures — the ways in which the asset can fail to deliver its required performance. A pump's functional failure might be "fails to deliver required flow" or "delivers flow but with excessive vibration."

3. What causes each functional failure?

Identify failure modes — the specific physical events that cause each functional failure. A pump failing to deliver flow could be caused by: impeller wear, bearing failure, seal leak, coupling failure, motor failure, etc.

4. What happens when each failure occurs?

Describe the failure effects — what happens when each failure mode occurs, including the evidence of failure, safety impact, environmental impact, and operational impact.

5. In what way does each failure matter?

Classify the failure consequences:

  • Hidden — The failure is not evident to operators under normal conditions
  • Safety/Environmental — The failure could injure someone or cause environmental harm
  • Operational — The failure affects production output, quality, or customer service
  • Non-Operational — The failure only incurs repair costs

6. What should be done to predict or prevent each failure?

Select maintenance tasks based on failure mode characteristics and consequences:

  • Condition-based monitoring (predictive)
  • Scheduled restoration or replacement (preventive)
  • Failure-finding tasks (for hidden failures)
  • One-time changes (design, procedures, training)

7. What should be done if a suitable task cannot be found?

Default strategies when no proactive task is technically feasible or cost-effective:

  • Run-to-failure (for non-safety, low-consequence failures)
  • Redesign (for safety-critical failures with no effective task)

RCM vs. Traditional Maintenance

| Dimension | Traditional PM | Reliability-Centered Maintenance | | ----------------- | ---------------------------------------------- | --------------------------------------------- | | Basis | OEM recommendations and time intervals | Failure mode and consequence analysis | | Assumption | Equipment ages predictably | Most failures are random | | Strategy | One approach fits all equipment | Right strategy for each failure mode | | Scope | Often over-maintains non-critical equipment | Focuses effort where consequences matter | | Cost | Higher maintenance costs, not always effective | Optimized spending with better outcomes | | Downtime | PM-driven outages may be unnecessary | Maintenance only when justified by analysis | | Documentation | PM schedules and checklists | Living maintenance basis with traceable logic |


Types of Maintenance Tasks

RCM selects from a hierarchy of maintenance strategies:

Condition-Based Maintenance (Predictive)

Monitor equipment condition and intervene when degradation reaches a defined threshold. Examples: vibration analysis, oil analysis, thermography, ultrasonic testing, process parameter monitoring.

Best for: Failure modes with detectable degradation patterns, sufficient lead time between detection and failure, and consequences that justify monitoring costs.

Scheduled Restoration (Time-Based Preventive)

Restore equipment to original condition at fixed intervals. Examples: overhauls, re-builds, re-lining, re-coating.

Best for: The minority of failure modes that follow age-related patterns (wear-out mechanisms with consistent useful life).

Scheduled Replacement (Time-Based Preventive)

Replace components at fixed intervals regardless of condition. Examples: seal replacement, filter replacement, battery replacement.

Best for: Low-cost components with age-related failure patterns where the cost of replacement is less than the cost of condition monitoring.

Failure-Finding

Periodically test hidden functions to verify they work when needed. Examples: testing emergency shutdown systems, relief valve testing, backup power testing.

Best for: Hidden failures in protective systems that are not evident during normal operations but must function on demand.

Run-to-Failure

Allow equipment to operate until it fails, then repair or replace. This is a deliberate, analyzed decision — not neglect.

Best for: Non-critical equipment where failure has no safety or significant operational impact and the cost of prevention exceeds the cost of failure.

One-Time Changes

Modify the design, procedures, training, or operating context to eliminate or reduce the failure mode.

Best for: Failure modes where no maintenance task is effective, or where a design change eliminates the root cause permanently.


How Do You Implement Reliability Centered Maintenance?

Step 1: Select Systems and Assets

Prioritize systems for RCM analysis based on criticality, historical failure data, and business impact. In a petroleum refinery, this might mean starting with crude unit charge pumps and reactor feed compressors; in a petrochemical plant, it could be ethylene cracker rotating equipment. Not every asset needs full RCM analysis — focus on systems where failure consequences are highest.

Step 2: Assemble Cross-Functional Teams

RCM analysis requires input from operations, maintenance, engineering, and reliability. The people who operate and maintain the equipment understand failure modes and consequences better than anyone.

Step 3: Conduct the Analysis

Work through the seven RCM questions for each asset. Document functions, functional failures, failure modes, effects, consequences, and selected tasks. This produces the RCM maintenance basis — a traceable record of why each maintenance task exists.

Step 4: Implement Task Changes

Translate the RCM analysis into updated maintenance procedures, work orders, and condition monitoring programs. This may mean adding new predictive tasks, removing unnecessary PMs, changing intervals, or identifying design changes.

Step 5: Measure and Refine

Track reliability metrics before and after implementation. Refine the analysis based on actual operating experience and new failure data. RCM is a living process, not a one-time study.


Common Implementation Mistakes

1. Analyzing Everything

Trying to apply full RCM to every asset in the facility. RCM analysis is resource-intensive. Focus on critical systems where the investment in analysis is justified by the consequences of failure.

2. Skipping Consequences Analysis

Jumping from failure modes directly to maintenance tasks without properly evaluating consequences. The consequence classification determines which types of tasks are appropriate — without it, teams default to "prevent everything."

3. Not Involving Operators

Conducting RCM as an engineering exercise without operator input. Operators understand how equipment actually behaves, what failure looks like in practice, and which procedures are actually followed.

4. Treating Results as Static

Completing the RCM analysis and never revisiting it. Equipment ages, operating contexts change, and new failure modes emerge — a hydrogen production facility adding compression capacity or an LNG terminal modifying its liquefaction train will introduce failure modes not present in the original analysis. The maintenance basis must evolve with the facility.

5. Confusing RCM with FMEA

FMEA (Failure Modes and Effects Analysis) identifies failure modes and their effects. RCM goes further by selecting the most appropriate maintenance strategy for each failure mode based on its consequences. FMEA is an input to RCM, not a substitute.


RCM Results and Metrics

Typical Outcomes

Organizations that implement RCM properly report:

  • 25-50% reduction in unplanned downtime through targeted predictive maintenance
  • 10-40% reduction in maintenance costs by eliminating unnecessary PMs
  • Improved equipment availability through condition-based intervention before failure
  • Better maintenance workforce utilization by focusing effort where it matters

Key Metrics to Track

| Metric | Definition | RCM Impact | | ------------------------------------------ | ---------------------------------------------- | ------------------------------------------ | | MTBF | Mean Time Between Failures | Should increase | | MTTR | Mean Time to Repair | Should decrease (planned repairs) | | PM Compliance | Percentage of PMs completed on time | Should be high for RCM-justified tasks | | Unplanned Work % | Reactive vs. planned maintenance ratio | Should decrease below 20% | | Condition Monitoring Coverage | % of critical equipment monitored | Should increase for critical assets | | Maintenance Cost / Replacement Asset Value | Total maintenance spend relative to asset base | Should optimize (not necessarily decrease) |


Related Resources


How Fidelis Can Help

FidelisCheck — Free Maintenance & Reliability Diagnostic

Benchmark your maintenance program maturity across work planning, preventive/predictive balance, KPI alignment, and RCM utilization.

Start Free Reliability Diagnostic →

FidelisGap — Expert-Led Maintenance Assessment

Comprehensive evaluation of your maintenance strategy, reliability practices, and work processes by experienced professionals.

Request Gap Analysis →

FidelisCore — Maintenance & Reliability Consulting

Structured consulting programs for:

  • Maintenance maturity assessment and strategy development
  • RCM analysis facilitation and implementation
  • Work planning and scheduling optimization
  • KPI development and reliability dashboards
  • FMEA and failure analysis integration

Schedule a Discovery Call →


Fidelis Associates provides maintenance and reliability consulting through FidelisCore. Our team has led maintenance transformation programs at refineries, chemical plants, and energy facilities, including the Work Planning, Scheduling & Execution program at HF Sinclair.

Ready to strengthen your operations?

Talk to a Fidelis specialist about how we can help.

Related Resources