‘Pretty Crazy’ Token Usage Is Testing Bosses’ Bet on AI

ARCHIVE This story is marked as archive content due to its age and may not reflect the current state of events.

Summary: A Silicon Valley software maker and an ecommerce company reveal to WIRED how they are navigating the emerging challenge of “tokenomics.”

As artificial intelligence becomes an increasingly important tool for software development, a new challenge is emerging for businesses deploying advanced coding models at scale: the cost of computation. While AI coding assistants can dramatically accelerate development workflows, the resources required to power these systems are creating new financial and operational considerations for organizations seeking to integrate them into daily engineering activities.

The issue has become particularly visible with advanced coding models capable of handling large codebases, extended reasoning chains, and complex software engineering tasks. Unlike traditional chatbots that generate relatively short responses, coding-focused AI systems often process massive amounts of context, analyze multiple files simultaneously, generate extensive code outputs, and execute long sequences of reasoning steps. Each of these operations consumes significant computational resources and increases the number of tokens processed during an interaction.

For many organizations, the cost of AI adoption is no longer measured solely by subscription fees. Every prompt, code review, refactoring task, architectural analysis, or automated debugging session requires compute resources that ultimately translate into infrastructure expenses. As companies scale AI usage across hundreds or thousands of developers, these costs can grow rapidly, prompting executives to examine whether productivity gains justify the additional spending.

The economics of AI coding tools are closely tied to tokens, the units of text that language models read and generate. Large coding tasks frequently consume millions of tokens over the course of a project, particularly when models are asked to understand entire repositories, maintain context across multiple files, or execute extended development workflows. As models become more capable, they often require greater computational investment to deliver those capabilities.
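To make the arithmetic concrete, here is a minimal Python sketch of how token counts translate into spending. The prices, session sizes, and headcount are illustrative placeholders, not figures from any provider or from the companies in this story, and the names `TokenPrice` and `estimate_cost` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPrice:
    """Hypothetical prices in dollars per million tokens (placeholders, not real rates)."""
    input_per_million: float
    output_per_million: float


def estimate_cost(input_tokens: int, output_tokens: int, price: TokenPrice) -> float:
    """Return the dollar cost of one interaction under a simple per-token price model."""
    total = input_tokens * price.input_per_million + output_tokens * price.output_per_million
    return total / 1_000_000


# Assumed numbers, chosen only to show how the total scales.
price = TokenPrice(input_per_million=2.0, output_per_million=10.0)

# One long agentic coding session: a large repository context in, generated code out.
per_session = estimate_cost(input_tokens=400_000, output_tokens=50_000, price=price)

# Scale the same session across an engineering organization for a month.
sessions_per_day = 5
workdays_per_month = 20
developers = 500
monthly = per_session * sessions_per_day * workdays_per_month * developers

print(f"Per session: ${per_session:.2f}")
print(f"Per month:   ${monthly:,.0f}")
```

Under these assumed inputs, a session that costs a little over a dollar becomes a five-figure monthly bill once it is multiplied across hundreds of developers, which is the scaling effect described above.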

This dynamic is creating a new balancing act for organizations. More powerful AI systems can solve increasingly sophisticated problems, but they also consume more compute resources. Engineering teams must decide when it makes sense to use frontier models for complex tasks and when smaller, less expensive models may be sufficient. The goal is to maximize productivity without allowing AI operating costs to grow beyond sustainable levels.

The challenge is especially relevant as AI agents evolve beyond simple code completion tools. Modern systems are increasingly expected to perform end-to-end software engineering tasks, including planning, code generation, testing, debugging, documentation, deployment assistance, and repository-wide analysis. These activities require significantly more computation than traditional autocomplete features and can generate substantial infrastructure demands when used continuously.

At the same time, AI providers are investing heavily in larger context windows, longer reasoning chains, and more autonomous agentic workflows. While these improvements expand what coding assistants can accomplish, they also increase the amount of processing required for each interaction. As a result, the industry’s focus is gradually shifting from raw model capability to efficiency and return on investment.

Some organizations are responding by implementing governance policies around AI usage, tracking token consumption, and evaluating which workflows produce the greatest productivity benefits. Others are experimenting with hybrid approaches that combine smaller models for routine tasks and more powerful systems for complex engineering challenges. The objective is to optimize both performance and cost in an environment where compute has become a strategic resource.
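A hybrid approach of this kind might be sketched in Python as follows. This is an illustration under stated assumptions rather than any company's actual policy: the model names, the list of routine task types, the context threshold, and the `TokenLedger` class are all hypothetical.

```python
from collections import defaultdict

# Hypothetical model tiers and task categories, used only for illustration.
SMALL_MODEL = "small-model"
FRONTIER_MODEL = "frontier-model"
ROUTINE_TASKS = {"autocomplete", "docstring", "rename"}


def choose_model(task_type: str, context_tokens: int, small_context_limit: int = 16_000) -> str:
    """Send routine, small-context tasks to the cheaper model and everything else to the frontier model."""
    if task_type in ROUTINE_TASKS and context_tokens <= small_context_limit:
        return SMALL_MODEL
    return FRONTIER_MODEL


class TokenLedger:
    """Track token consumption per team against an assumed monthly token budget."""

    def __init__(self, monthly_budget_tokens: int) -> None:
        self.monthly_budget_tokens = monthly_budget_tokens
        self.used = defaultdict(int)

    def record(self, team: str, tokens: int) -> None:
        """Add the tokens consumed by one request to the team's running total."""
        self.used[team] += tokens

    def over_budget(self, team: str) -> bool:
        """Report whether the team has exceeded its monthly allowance."""
        return self.used[team] > self.monthly_budget_tokens


ledger = TokenLedger(monthly_budget_tokens=1_000_000)
requests = [
    ("payments", "autocomplete", 2_000),
    ("payments", "refactor", 600_000),
    ("payments", "debugging", 500_000),
]
for team, task_type, tokens in requests:
    model = choose_model(task_type, context_tokens=tokens)
    ledger.record(team, tokens)
    print(f"{team}: {task_type} -> {model} ({tokens:,} tokens)")

print("payments over budget:", ledger.over_budget("payments"))
```

The routing rule here is deliberately crude. A real policy would likely weigh task difficulty and measured productivity gains, not just task type and context size.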

The discussion reflects a broader transformation occurring across the AI industry. For years, the primary focus was on building larger and more capable models. Increasingly, however, attention is turning toward economic sustainability. Businesses want AI systems that are not only powerful but also efficient enough to deploy at scale without creating unsustainable operational expenses.

As AI becomes a standard component of software development, understanding the relationship between model capability, token usage, and compute costs will be critical for both vendors and customers. The organizations that successfully balance these factors may gain significant competitive advantages, while those that overlook the economics of AI adoption could find that technological progress comes with a larger price tag than expected.

Ultimately, the future of AI-assisted software engineering may depend as much on efficiency as on intelligence. The next generation of coding systems will likely be judged not only by what they can build, but also by how economically they can deliver those results at scale.
