Written by: Monserrat Raya
The Temptation of Simple Numbers
nAt some point, almost every engineering leader hears the same question. nn“How do you measure performance?” nnThe moment is usually loaded. Year-end reviews are approaching. Promotions need justification. Leadership above wants clarity. Ideally, something simple. Something defensible. nnThe easiest answer arrives quickly. nnCommits. Tickets closed. Velocity. Story points delivered. Hours logged. nnEveryone in the room knows these numbers are incomplete. Most people also know they are flawed. Still, they feel safe. They are visible. They fit neatly into spreadsheets. They create the impression of objectivity. nnAnd under pressure, impression often wins over accuracy. nnWhat starts as a convenience slowly hardens into a framework. Engineers begin to feel reduced to counters. Leaders find themselves defending metrics they do not fully believe in. Performance conversations shift from curiosity to self-protection. nnThis is not because leaders are careless. It is because measuring performance is genuinely hard, and simplicity is tempting when stakes are high. nnThe problem is not that activity metrics exist. The problem is when they become the conversation, instead of a small input into it.
Why Activity Metrics Feel Safe (But Aren’t)
n
Activity metrics persist for a reason. They offer relief in uncomfortable moments.
n
The Appeal of Activity Metrics
n
They feel safe because they are:
n
- n
- Visible. Everyone can see commits, tickets, and throughput.
- Comparable. Numbers line up nicely across teams and individuals.
- Low-friction. They reduce the need for nuanced judgment.
- Defensible upward. Leaders can point to charts instead of narratives.
n
n
n
n
n
In organizations under pressure to “simplify” performance measurement, these traits are attractive. They create the sense that performance is being managed, not debated.
n
The Hidden Cost
n
The downside is subtle but significant.
n
Activity metrics measure motion, not contribution.
n
They tell you something happened, not whether it mattered. They capture effort, not impact. Over time, they reward visibility over value and busyness over effectiveness.
n
This is not a new insight. Even Harvard Business Review has repeatedly warned that performance metrics, when misapplied, distort behavior rather than clarify it, especially in knowledge work where output quality varies widely. When leaders rely too heavily on activity metrics, they gain short-term clarity and long-term confusion. The numbers go up, but understanding goes down.
The Behaviors These Metrics Actually Create
nMetrics do more than measure performance. They shape it. Once activity metrics become meaningful for evaluation, engineers adapt. Not maliciously. Rationally. nn
What Optimizing for Activity Looks Like
nnOver time, teams begin to exhibit familiar patterns: n
- n
- More commits, smaller commits, noisier repositories
- Work sliced unnaturally thin to increase visible throughput
- Preference for tasks that show progress quickly
- Reluctance to take on deep, ambiguous, or preventative work
nn
nn
nn
n
nRefactoring, mentoring, documentation, and incident prevention suffer first. These activities are critical to long-term outcomes, but they rarely show up cleanly in dashboards. nnEngineers notice. Quietly. nnThey learn which work is valued and which work is invisible. The system teaches them what “good performance” looks like, regardless of what leaders say out loud. nnThis is where trust begins to erode. When engineers feel evaluated on metrics that misrepresent their contribution, performance conversations become defensive. Leaders lose credibility, not because they lack intent, but because the measurement system feels disconnected from reality. nnMetrics do not just observe behavior. They incentivize it.
What “Outcomes” Actually Mean in Engineering
nAt this point, many leaders nod and say, “We should focus on outcomes instead.” nThat phrase sounds right, but it often remains vague. nOutcomes are not abstract aspirations. They are concrete, observable effects over time. nn
Outcomes, Grounded in Reality
nnIn engineering, outcomes often show up as: n
- n
- Improved reliability, fewer incidents, faster recovery when things break
- Predictable delivery, with fewer last-minute surprises
- Systems that are easier to change six months later, not harder
- Teams that unblock others, not just ship their own backlog
- Reduced cognitive load, making good decisions easier under pressure
nn
nn
nn
nn
n
nNone of these map cleanly to a single number. That is precisely the point. nnOutcomes require interpretation. They demand context. They force leaders to engage with the work, not just the artifacts of it. nnThis does not make performance measurement weaker. It makes it more honest. nn
Using Metrics as Inputs, Not Verdicts
n
This is the heart of healthier performance conversations.
Metrics are not the enemy. Treating them as verdicts is.
n
Where Metrics Actually Help
n
Used well, metrics act as signals. They prompt questions rather than answer them.
n
A drop in commits might indicate:
n
- n
- Work moved into deeper problem-solving
- Increased review or mentoring responsibility
- Hidden bottlenecks or external dependencies
n
n
n
n
A spike in throughput might signal:
n
- n
- Healthy momentum
- Superficial work being prioritized
- Short-term optimization at long-term cost
n
n
n
n
Strong leaders do not outsource judgment to dashboards. They use data to guide inquiry, not to end discussion.
n
This approach aligns with how Scio frames trust and collaboration in distributed environments. In Building Trust Across Screens: Human Capital Insights from Nearshore Software Culture, performance is treated as something understood through patterns and relationships, not isolated metrics.
Removing judgment from performance reviews does not make them fairer. It makes them emptier.
Where Activity Metrics Fall Short (and What Outcomes Reveal)
nn
Activity vs Outcome Signals in Practice
What’s Measured | n What It Tells You | n What It Misses | n
|---|---|---|
| Number of commits | nLevel of visible activity | nQuality, complexity, or downstream impact | n
| Tickets closed | nThroughput over time | nWhether the right problems were solved | n
| Velocity / story points | nShort-term delivery pace | nSustainability and hidden trade-offs | n
| Hours logged | nTime spent | nEffectiveness of decisions | n
| Fewer incidents | nSurface stability | nPreventative work that avoided incidents | n
| Easier future changes | nSystem health | nIndividual heroics that masked fragility | n
This table is not an argument to discard metrics. It is a reminder that activity and outcomes answer different questions. Confusing them leads to confident conclusions built on partial truth.
How Experienced Leaders Run Performance Conversations
nLeaders who have run reviews for years tend to converge on similar practices, not because they follow a framework, but because experience teaches them what breaks.n
What Changes with Experience
nSeasoned engineering leaders tend to:n
- n
- Look at patterns over time, not snapshots
- Ask “what changed?” instead of “how much did you produce?”
- Consider constraints and trade-offs, not just results
- Value work that prevented problems, even when nothing “happened”
nn
nn
nn
n
nThese conversations take longer. They require trust. They cannot be fully automated.nThey also produce better outcomes.nEngineers leave these discussions feeling seen, even when feedback is hard. Leaders leave with a clearer understanding of impact, not just activity.nnThis perspective often emerges after leaders see how much performance is shaped by communication quality, not just individual output. In How I Learned the Importance of Communication and Collaboration in Software Projects, Scio explores how delivery outcomes improve when expectations, feedback, and ownership are clearly shared across teams. That same clarity is what makes performance conversations more accurate and less adversarial.
Why This Matters More Than Fairness n
nnMost debates about performance metrics eventually land on fairness. nnFairness matters. But it is not the highest stake. nn
The Real Cost of Shallow Measurement
nnWhen performance systems feel disconnected from reality: n
- n
- Trust erodes quietly
- Engineers disengage without drama
- High performers stop investing emotionally
- The best people leave without making noise
nn
nn
nn
n
nThis is not a tooling problem. It is a leadership problem. nHealthy measurement systems are retention systems. They signal what the organization values, even more than compensation does. nScio partners with engineering leaders who care about outcomes over optics. By embedding high-performing nearshore teams that integrate into existing ownership models and decision-making processes, Scio helps leaders focus on real impact instead of superficial productivity signals. nnThis is not about control. It is about clarity.
Measure to Learn, Not to Control
nnThe goal of performance measurement is not to rank engineers. nIt is to understand impact. Activity is easy to count. Outcomes require judgment. Judgment requires leadership. nWhen organizations choose outcomes-first thinking, performance conversations become less defensive and more constructive. Alignment improves. Trust deepens. Teams optimize for results that matter, not numbers that impress. nnMeasuring well takes more effort. It also builds stronger teams.
FAQ: Engineering Performance Measurement
n- n
- n n nnnn
Because they are easy to collect, easy to compare, and easy to defend from an administrative standpoint. However, they often fail to reflect real impact because they prioritize volume over value.
n nn - n n nnnn
No. Metrics are valuable inputs, but they should serve as conversation starters that prompt questions rather than as final judgments. Context is always required to understand what the numbers actually represent.
n nn - n n nnnn
The primary risk is eroding trust. When engineers feel that their contributions are misunderstood or oversimplified by flawed metrics, engagement drops, morale fades, and talent retention suffers significantly.
n nn - n n nnnn
They align evaluation with real impact, encourage healthier collaboration behavior, and support the long-term health of both the system and the team by rewarding quality and architectural integrity.
n n