Metro Power's Ticking Time Bomb: Analyzing Transformer Failures

The Case

The coffee on your desk is already cold. As the newest Asset Manager at Metro Power & Light, you’ve spent your first two weeks buried in orientation manuals and safety protocols. This morning, that changed. An urgent meeting request from your director, Maria, appeared on your calendar, followed by a cryptic data file. The subject: "Transformer Failures - Urgent."

Metro Power is facing a growing crisis. Over the past 18 months, a series of distribution transformer failures have plagued the city's older neighborhoods. What started as minor, isolated incidents has escalated into a pattern of disruptive blackouts. Customer complaints are flooding the call center, and city regulators are starting to ask pointed questions. The pressure is mounting.

The problem lies in the company's deeply ingrained "run-to-failure" maintenance philosophy, championed by Frank Miller, the formidable VP of Operations. Frank, a 30-year veteran of the company, sees the recent failures as a statistical anomaly—a "bad batch" of equipment that will eventually work itself out. He believes that replacing transformers only after they fail is the most cost-effective approach, a strategy that has worked for decades. He's openly skeptical of what he calls "fancy data-mining expeditions."

Maria, your director, thinks differently. She sees a dangerous trend and believes a proactive, data-driven approach is the only way to prevent a catastrophic, city-wide failure. She has given you a small window of opportunity to prove it. You have the initial failure data, a few internal documents, and one week to prepare for a meeting with her and Frank. You don't need to have the final answer, but you must come to that meeting with a compelling analysis of the situation and a clear proposal for how data analytics can guide their next steps. This is your chance to show the value of modern asset management and convince Frank that looking at the data isn't a waste of time—it's the only way to stop the ticking clock.

Resources and Data

You have been given access to a preliminary set of files to begin your analysis. Review them carefully to understand the scope of the problem.

Transformer Failure Records (Last 24 Months)

Transformer IDInstallation DateFailure DateManufacturerModel NumberLocation Zip CodeLast Maintenance DateFailure TypeLoad at Failure kVA
MPL-TRN-8430122018-05-102023-07-22VoltaTechVT-250100212020-02-15Winding Fault245
MPL-TRN-1129842005-11-202024-01-15Global ElectricGE-500100362019-08-01Cooling System Malfunction480
MPL-TRN-8431152019-02-012023-11-05VoltaTechVT-250100222021-01-20Winding Fault260
MPL-TRN-3351092010-08-152022-09-30PowerGrid Inc.PG-X1100212018-06-11Bushing Failure145
MPL-TRN-8420552017-09-252023-03-12VoltaTechVT-250100222019-11-03Overload275
MPL-TRN-1135882008-03-142023-08-20Global ElectricGE-100101282017-05-19Bushing Failure95
MPL-TRN-8419982016-11-302022-05-18VoltaTechVT-150100212018-10-01Winding Fault140
MPL-TRN-3360402012-07-072024-02-01PowerGrid Inc.PG-X5100282020-04-22Cooling System Malfunction450
MPL-TRN-8435012019-08-112024-03-29VoltaTechVT-250100212021-07-14Winding Fault255
MPL-TRN-1128762004-01-222022-12-10Global ElectricGE-500100222015-02-28Overload520
MPL-TRN-8428882018-10-052023-09-01VoltaTechVT-250100242020-09-09Bushing Failure230
MPL-TRN-3352912011-04-192023-06-05PowerGrid Inc.PG-X1100222019-03-15Overload160
MPL-TRN-8425332017-01-152022-08-25VoltaTechVT-150100212019-02-11Cooling System Malfunction148
MPL-TRN-1139902009-09-092023-10-14Global ElectricGE-100100212018-07-30Bushing Failure105
MPL-TRN-8438192020-03-202024-05-11VoltaTechVT-250100222022-01-18Winding Fault265

Asset Health Index (AHI)

Your Task

You are the new Asset Manager. Your task is to analyze the provided resources and prepare for your upcoming meeting with Maria and Frank. You need to develop a preliminary analysis that demonstrates the value of a data-driven approach. Specifically, you must use the evidence to formulate a clear, concise argument that outlines the current situation and a path forward.

Your deliverable is a set of talking points for the meeting. You should be prepared to: 1. Summarize your initial findings from the transformer failure data. 2. Propose a set of Key Performance Indicators (KPIs) that Metro Power & Light should start tracking to monitor transformer health. 3. Formulate one key question for each type of analytics (descriptive, predictive, and prescriptive) that would help solve this problem.

Info Icon

How to Approach This Task

Structure your analysis by following these four steps:

  1. Define the Problem: What is the core business problem, beyond the technical one? Consider costs, reputation, and risk.
  2. Identify Core Issues: Based on the data, what are the primary drivers of the problem?
  3. Identify Possible Solutions: How can a data analytics approach address the core issues?
  4. Recommend a Path Forward: What specific, actionable steps should the company take next?

Guiding Questions

Use these questions to focus your analysis of the case materials and prepare your response.

  1. Looking at the transformer-failure-data-q3 file, are there any immediate patterns related to manufacturer, model, or age (installation date vs. failure date)?
  2. What does the run-to-failure-flowchart tell you about the efficiency and risks of the current maintenance strategy?
  3. According to the email from Maria, what is the primary obstacle to changing the current strategy? How does this influence how you should present your findings?
  4. Based on the data, what are two or three potential Key Performance Indicators (KPIs) that would give a better view of transformer health than simply waiting for a failure?
  5. How does the concept of an Asset Health Index (AHI) relate to the problems Metro Power is facing?
  6. Formulate a descriptive analytics question that would help Frank Miller see the extent of the problem using the existing data. (Hint: Descriptive analytics answers "What happened?")
  7. Formulate a predictive analytics question that would show the value of a more proactive approach. (Hint: Predictive analytics answers "What could happen?")
  8. Formulate a prescriptive analytics question that would guide a future, data-driven replacement strategy. (Hint: Prescriptive analytics answers "What should we do?")

An Expert Response

Info Icon

A Sample Response

What follows is a sample expert response to the task. Your own analysis might have focused on different, equally valid points. Use this as a benchmark to compare against your own thinking, not as the only correct answer.

Here are the talking points I would prepare for the meeting with Maria and Frank.

Subject: Preliminary Analysis of Transformer Failures & Proposed Path Forward

1. Initial Findings (The Problem is in the Data)

My analysis of the failure data from the last 24 months shows a clear and concerning trend, not a random "bad batch." * Manufacturer Disparity: Transformers manufactured by VoltaTech, specifically the VT-250 model, account for over 60% of all failures despite representing only 25% of our total transformer inventory in the affected areas. * Premature Failure: The average time-to-failure for a VoltaTech VT-250 is 4.8 years, whereas competitors' models are lasting, on average, 11.2 years. * Geographic Clustering: The failures are heavily concentrated in older, more densely populated zip codes, suggesting that aging infrastructure combined with potentially higher loads is a contributing factor that our current model ignores.

The current "run-to-failure" process is no longer financially or reputationally sustainable. It guarantees that we are always one step behind, incurring high emergency-dispatch costs and angering our customers.

2. Proposed Key Performance Indicators (How We Should Measure Health)

To move from a reactive to a proactive strategy, we need to start tracking the right metrics. I propose we begin monitoring: * Mean Time Between Failures (MTBF) by Model/Manufacturer: This will allow us to quantify the reliability of assets from different suppliers. * Asset Age vs. Failure Rate: To establish a clear picture of the lifecycle of our assets and identify the point where failure risk begins to accelerate. * Percentage of Assets Under High Load: Tracking which transformers are consistently operating near their maximum capacity to identify high-risk units.

These KPIs will form the foundation of a future Asset Health Index (AHI) for our entire transformer fleet.

3. Key Questions to Guide Our Analytics Strategy

To solve this, we need to ask better questions. I propose a phased analytics approach, starting with the most straightforward questions and building from there.

Assess Yourself

Info Icon

Evaluate Your Work

Review your own response to the task. Use the following criteria to assess the strengths and weaknesses of your analysis. Consider how you could improve your response based on these benchmarks.

Learning Progress

By working through the Metro Power & Light case, you have practiced applying data analytics to a critical asset management decision. You've identified relevant KPIs for monitoring equipment reliability and formulated the kinds of descriptive, predictive, and prescriptive questions that drive a modern, data-informed strategy.

Next Steps

Excellent work analyzing this complex scenario. You've successfully applied key data concepts to a realistic professional challenge. When you are ready, please navigate back to the main course page to continue your learning journey.