Story points often create more confusion than clarity. Discover better alternatives for estimating work and measuring engineering productivity.
Jay Derinbogaz
Founder

Story points have become the de facto standard for estimating work in agile development. Walk into any engineering team's planning meeting, and you'll likely hear debates about whether a task is a 3 or a 5, or why that "simple" feature somehow became an 8.
But here's the uncomfortable truth: story points often create more problems than they solve. After years of watching teams struggle with estimation, it's time to explore better alternatives that actually help engineering teams deliver value.
Story points promise relative estimation without the pressure of time-based commitments. In theory, a 5-point story should take roughly 2.5 times longer than a 2-point story. In practice, this relationship rarely holds.
Consider this scenario: Your team estimates a user authentication feature at 5 points. Later, you estimate a payment integration at 5 points. Are these really equivalent in complexity? The authentication might involve straightforward CRUD operations, while the payment integration requires third-party API integration, security compliance, and error handling.
Once story points become a metric that matters, teams inevitably game the system. Developers inflate estimates to make velocity look better. Managers pressure teams to increase velocity without understanding that story points are relative, not absolute.
This creates "velocity theater" – the illusion of progress measurement while actually obscuring real productivity insights. A team that completes 50 points this sprint versus 40 points last sprint hasn't necessarily improved; they might have simply re-calibrated their estimates.
Planning poker sessions, while engaging, often become exercises in false consensus. The loudest voice wins, or teams converge on estimates to avoid conflict rather than genuinely assess complexity.
Worse, these sessions consume significant time. A typical planning session might spend 30 minutes debating whether a feature is 3 or 5 points – time that could be spent on actual problem-solving or breaking down work into smaller, more manageable pieces.
Instead of numeric story points, use t-shirt sizes (XS, S, M, L, XL) with explicit, team-specific definitions:
This approach maintains the benefits of relative sizing while avoiding false precision. Teams naturally understand that an XL item needs to be broken down, preventing the "8-point story that takes three sprints" problem.
Focus on measuring actual delivery times rather than estimated complexity:
| Metric | Definition | Use Case |
|---|---|---|
| Cycle Time | Time from first commit to production | Identifying bottlenecks in development process |
| Lead Time | Time from request to delivery | Understanding total customer wait time |
| Time to First Review | Time from PR creation to first review | Measuring code review efficiency |
These metrics provide concrete data about your delivery process without the subjectivity of estimation. They highlight real bottlenecks: slow code reviews, lengthy QA cycles, or deployment friction.
Instead of estimating individual items, track how many items your team completes per time period. This approach focuses on flow rather than estimation accuracy.
For example, if your team typically completes 8-12 small items or 2-3 large items per sprint, plan accordingly. This method acknowledges that estimation is inherently uncertain while still enabling predictable planning.
Use historical data to create probabilistic forecasts. If your team has completed similar features in 3-8 days over the past six months, you can forecast with confidence ranges:
This approach embraces uncertainty rather than hiding behind false precision.
Don't abandon story points overnight. Instead, run parallel experiments:
Compare the accuracy and usefulness of each approach for your specific context.
Regardless of your estimation method, the goal should be continuous improvement in delivery capability. Regular retrospectives should focus on:
Instead of velocity, track metrics that directly correlate with business value:
These DORA metrics provide actionable insights into engineering effectiveness without the overhead of story point estimation.
Story points aren't universally bad. They can work well for:
The key is using them as a tool for conversation and planning, not as a precise measurement system.
Story points promised to solve the estimation problem, but they often create new issues: false precision, gaming, and time-consuming planning rituals. The alternatives – t-shirt sizing, cycle time metrics, throughput planning, and probabilistic forecasting – offer more practical approaches to planning and measuring engineering work.
The best estimation system is the one your team actually finds useful for making decisions and improving delivery. Focus on continuous improvement, measure what matters, and remember that the goal isn't perfect estimation – it's delivering value to customers efficiently and predictably.
Start small, experiment with different approaches, and choose the methods that help your team ship better software faster. Your future self (and your team) will thank you for moving beyond the story point trap.
Start measuring developer productivity with AI-powered PR analysis. Free for open source projects.
Try GitRank Free
Learn proven strategies to reduce development cycle time while maintaining code quality. Optimize your team's delivery speed with actionable insights.

Master DORA metrics to transform your engineering team's performance. Learn deployment frequency, lead time, and failure recovery strategies.

Discover the key metrics that truly measure engineering team effectiveness beyond vanity numbers. Learn actionable insights for better team performance.