From Breakdown to Breakthrough: Rethinking Reliability in Centrifugal Pumps

Diego Cantor Alvarez

From Breakdown to Breakthrough: Rethinking Reliability in Centrifugal Pumps

For most engineers, the phrase “inherent reliability” is practically sacred: a machine leaves the factory with a baked-in limit to how reliable it can ever be. Maintenance and operating practices can help maintain that level, but they generally can’t raise it.

But what happens when a brand-new machine consistently fails long before its expected life? What if thirty newly installed centrifugal pumps — all API 610 VS6 vertical, multistage units — begin dropping out with mean times between failures (MTBF) as low as three months? That’s not disappointing; that’s catastrophic.

This case study follows what happened next. It’s a story of digging past assumptions, diagnosing the real culprits, and ultimately proving that when a machine’s original design is flawed or mismatched to its environment, redesign is not only possible — it’s the only way to restore the reliability the asset should have had from day one.

Instead of maintaining inherent reliability, the engineering team had to re-establish it.  

The Situation: Thirty New Pumps, Thirteen Sites, One Big Problem 

Across thirteen facilities, each operating a duty/standby pair of newly installed pumps, all units shared common specs:

# Stages Speed BEP MCSF Rated Flow Flow: Preferred Operating Region
(70% to 120% of BEP)
Flow: Allowable Operating Region
(40% to 120% of BEP)
          From To From To
3 stages 1488 rpm 827.2 331.8 673.3 579 992.6 331.8 992.6
4 stages 1488 rpm 836 335.7 673.3 585.2 1003.2 335.7 1003.2
Note: all flow units in US gpm

 

Almost immediately after commissioning, failures began showing up at a disturbing rate. Technicians found themselves pulling units after mere months — and in some cases even preemptively after only a few weeks — because degradation was so severe that major failures were imminent.
Early inspections painted the same picture across the fleet: critical rotor support components were wearing out far faster than any reasonable operating envelope should allow.   

What Was Failing — and How Badly 

Chart 1

As the teardown data accumulated, three components emerged as the biggest trouble spots. 

1. Bushings

Image Bushings Comparison

2. Sleeves

Sleeves Image

3. Impellar & Case Wear Rings

Impellar Image

Something wasn’t just slightly off — the failure modes pointed to systemic mechanical and hydraulic distress. 

Root Cause Analysis: Two Smoking Guns 

A comprehensive RCA revealed two primary mechanisms behind the short pump life. Both were serious. Together, they were devastating. 

Root Cause #1: Operation Far Outside the BEP

The data showed every pump was operating dramatically below its Best Efficiency Point — between 26% and 49% of BEP, averaging only 36%.
API 610's preferred window? 70%–120% of BEP.

Running that far off-curve created excessive hydraulic loads, which directly translated into:  

Line Chart Pump Performance

No surprise those components were wearing unevenly — they were being punished every minute they ran. 

Root Cause #2: Abrasive Solids in the Pumped Fluid 

The second factor only made the first worse: the fluid contained unexpected levels of abrasive solids coming from the reservoir. This was not anticipated during design.
Abrasive particles acted like microscopic grinding media inside every critical interface:

Accumulation Image

Nothing in the OEM material selection or geometry was built to survive this environment.
Together, these two root causes made premature failure essentially guaranteed.
 

Exploring the Options: What Could Be Done?

When all failed units are brand-new, replacing parts in-kind is just repeating the problem.
The team evaluated multiple alternatives using a multi-factor decision framework that weighed:

The conclusion was unavoidable: the pumps required a redesign of key rotor components to survive both the hydraulic loading and the abrasive solids. 

Proposed Corrective Actions Technically feasible? Worth doing (cost-effective)? Implementation is within our control? Will not generate a collateral damage? Fastest, simplest thing with the highest probability of success?
Increase the pump flow recirculation rate × × × ×
Changing pump operational philosophy from 24/7 to on-demand × ×
Increase the pump flow to reach 70% to 120% of the BEP × × ×
Introduce VFDs (Variable Frequency Drivers) × ×
Redesign the pump (de-staging, impellers replacement, etc.) × ×
Modify suction strainer and DP protections design ×
Redesign the critical pump components (rotor supports)

This wasn't "improving reliability through maintenance."

This was correcting a design mismatch so the pumps could achieve the reliability their process conditions demanded. 

 

The Redesign: Building Components That Could Survive Reality

Three major component categories were redesigned or upgraded.

1. New Bushings

The OEM bushings had a single helical groove and were made of PEEK — inadequate for both load and abrasive environment.

The fix:

Bushings Comparison

These changes addressed both hydraulic loading and solids tolerance. 

 2. New Sleeve Bearings

The base material remained A276-410, but the team applied a specialized surface hardening process (TSC-NCB-2020-60) supplied by a U.S. vendor.

This proprietary blend of chrome, boron, tungsten carbides, and nickel created a sleeve capable of resisting extreme abrasion. 

3. New Wear Rings 

Casing wear rings:

Impeller wear rings:

These changes created a much more durable, solids-resistant rotor support system. 

The Results: Reliability Restored — and Then Some

Two categories of upgraded components were installed and tracked:

Category I: Fully Upgraded Package

(bushings, sleeves, casing wear rings, impeller wear rings)

Category II: Partial Upgraded Package

(bushings + sleeves only)

 

Comparison of Mean Time Between Failure (MTBF) of centrifugal pumps using various categories of spare parts: Original spare parts (a), 

Upgraded spares Category I (b) and Upgraded spares Category II (c)

Spare parts category Mean Time Between Failure (MTBF) Difference with original MTBF
a) Original spare parts (reference)
  • Sleeve bearings: A276-410 Nitrocarburized
  • Bushings: PEEK
  • Casing wear ring: A426-CPCA15 Nitrocarburized
  • Impeller wear ring: A426-CPCA15
90 days (3 months) Not Apply
b) Upgraded spare parts Category I
  • Sleeve bearings: A276-410 + TSC-NCB-2020-60 coating
  • Bushings: six axial grooves, made of composite material with carbon fibers in a silicon carbide ceramic matrix
  • Casing wear rings: composite material with carbon fibers in a silicon carbide ceramic matrix
  • Impeller wear ring: A426-CPCA15 + TSC-NCB-2020-60 coating
1825 days (5 years) MTBF extended twenty times
c) Upgraded spare parts Category II
  • Sleeve bearings: A276-410 + TSC-NCB-2020-60 coating
  • Bushings: six axial grooves, made of composite material with carbon fibers in a silicon carbide ceramic matrix
1095 days (3 years) MTBF extended twelve times

Both solutions delivered massive leaps in availability, drastically reduced maintenance interventions, and lowered long-term operating costs. 

What This Case Proves About Reliability 

This experience reinforces a crucial but often misunderstood point:

Maintenance cannot increase a machine’s inherent reliability…

…but redesign absolutely can.

The pumps weren’t failing because they were poorly maintained.
They were failing because their original design assumptions were wrong:

By upgrading materials, geometry, and surface treatments, the engineering team didn’t simply “improve reliability.” They corrected the design so the pumps could finally achieve the reliability their operating conditions required.

In other words, they restored — and in some ways established for the first time — the reliability these pumps should have had the day they were commissioned.

Brand-new equipment failing early is frustrating, expensive, and often politically charged. But it’s also an opportunity.  

This case shows that when the operating environment doesn’t match design assumptions, no amount of preventive maintenance will save you. Reliability problems that come from design mismatches can only be solved through design intervention. By rethinking the internal architecture of key rotor components, the engineering team transformed a fleet of failure-prone pumps into long-life, high-availability assets — proving once again that reliability isn’t just something you maintain.

Sometimes, it’s something you have to rebuild.