What GitHub has actually changed
In April 2026, GitHub quietly paused new sign-ups for Copilot Pro and Copilot Pro+. The move was a signal — rather than an anomaly — that the company is transitioning its AI coding assistant from flat-rate subscriptions to a consumption-based billing model. The changeover date is June 1, 2026. From that point, charges will be tied to how much of Copilot your team actually uses, rather than a fixed per-seat fee that is the same regardless of engagement.
To understand why this matters, consider the current pricing baseline. Copilot Pro costs £10 per month for individual developers. Copilot Business, the tier aimed at organisations, costs £19 per user per month and adds features such as organisation-level policy controls, audit logs, and the ability to exclude specified files from AI suggestions. These are flat rates — you pay the same whether an engineer uses Copilot constantly throughout the day or barely opens it.
Usage-based billing breaks that relationship. Costs will now scale with consumption. That is good news for teams where Copilot usage is uneven — you stop paying full price for seats that barely touch the product. It is potentially bad news for teams where power users are clocking very high completion volumes, and where the per-unit rate, when multiplied across a busy sprint, exceeds what the old flat fee would have been.
GitHub has not yet published the precise per-completion or per-token rate for the new model at the time of writing. That rate is expected to be announced before June 1, giving organisations a short window to model their expected spend. Do not wait for the announcement to start pulling your usage data — do it now.
The industry context: why this was inevitable
GitHub's move is not an isolated pricing experiment. It mirrors decisions made by virtually every other major AI infrastructure provider over the past two years. AWS Bedrock, Azure OpenAI, and Anthropic's API have all operated on consumption pricing from the outset. The pattern is consistent: flat subscriptions are a useful acquisition tool during early adoption, but as usage patterns diverge sharply across customers, consumption billing aligns revenue with actual delivered value far more accurately.
For the AI coding assistant category specifically, this transition is now well underway across the competitive landscape. Cursor, which has grown rapidly in 2025 and into 2026, charges on a combination of seat fee and usage-based "fast requests" that reset monthly. Claude Code from Anthropic, which integrates directly into the terminal and editor environment, bills through the Anthropic API on token consumption. Both models reflect a market that has moved past the introductory phase and into one where pricing granularity matters.
The shift also reflects what has happened to AI coding assistant capability. Early Copilot was primarily an inline autocomplete tool — context-limited, single-file, straightforward in what it consumed. Modern Copilot handles multi-file edits, agent-mode tasks, PR reviews, and extended context windows. The resource consumption footprint per session is dramatically higher, and flat-rate pricing no longer captures the cost structure on GitHub's side either.
Set a spending alert in your GitHub organisation billing settings today. If GitHub Copilot is provisioned through a GitHub Enterprise Cloud agreement, contact your account manager to understand how consumption billing integrates with your enterprise agreement terms before June 1.
Forecasting your team's bill: a practical method
The most important thing engineering leaders can do right now is establish a usage baseline. GitHub provides Copilot usage data through its REST API (the GET /orgs/{org}/copilot/usage endpoint) as well as through the organisation billing and Copilot settings pages in the GitHub web interface. Pull at least 60 days of data — ideally 90 — to get a picture that spans both quiet periods and high-intensity sprints.
When you examine that data, look for three things. First, the distribution of usage across your team: how many engineers are heavy users (multiple hours of active Copilot engagement per day), how many are moderate users, and how many have low or near-zero engagement. Second, the per-engineer daily completion volume — most teams average between 200 and 400 completions per developer per day, but the range is wide, with power users sometimes exceeding 800. Third, look at the temporal pattern: does usage spike during certain project phases, or is it relatively flat week-on-week?
Once you have that breakdown, you can model cost scenarios. The table below illustrates how costs might compare between the current flat-rate model and hypothetical usage-based rates at different consumption levels. Note that the per-unit rates in the usage-based column are illustrative — use the rates GitHub publishes before June 1 to replace these figures with actuals.
| Usage level | Completions/day per engineer | Current flat rate (Business) | Estimated usage-based (illustrative) | Delta |
|---|---|---|---|---|
| Low | 50–100 | £19/user/month | ~£8–12/user/month | Saving of ~£7–11 |
| Moderate | 200–300 | £19/user/month | ~£16–22/user/month | Broadly neutral |
| Heavy | 400–600 | £19/user/month | ~£28–38/user/month | Cost increase of ~£9–19 |
| Power user | 800+ | £19/user/month | ~£50–70/user/month | Potentially 2–3x flat rate |
The key insight from this model is that usage-based billing is not uniformly good or bad — it depends almost entirely on where your team sits on the consumption distribution. A team of 20 engineers where 15 are low-to-moderate users and 5 are power users will see a net spend reduction. A team of 20 where the majority are heavy Copilot users during a product launch sprint could see costs spike materially above the old flat rate for that month.
Build your forecast around the 90th percentile of your observed usage, not the average. That gives you a conservative upper bound and protects you from an unexpectedly large bill during your next high-intensity period.
The competitive landscape: is now the moment to switch?
The billing transition creates a natural evaluation point. If you are re-examining Copilot's pricing, you should simultaneously re-examine the product's positioning relative to alternatives — because the category has moved substantially since most teams last did a thorough comparison.
Cursor has become a serious contender for teams that have adopted agent-first development workflows. Its Composer mode handles multi-file, multi-step changes with a high degree of reliability, and the .cursorrules configuration file lets teams embed project-specific context — coding conventions, architecture constraints, preferred patterns — directly into every agent session. Teams that have invested in well-crafted .cursorrules configuration report significant quality gains, with some citing approximately 70% reduction in PR review comments on AI-generated code. For a deeper comparison of how these agent modes perform in production, see our analysis of Cursor Composer 2 vs Claude Code: which agent ships your PRs faster?
Claude Code from Anthropic operates differently to both Copilot and Cursor. It is a terminal-first, agentic coding tool that works across your entire repository rather than within a single editor window. Its strength is in long-context, multi-file tasks — architectural refactors, understanding large codebases, and sustained back-and-forth during complex debugging sessions. It bills through Anthropic's standard API pricing, which means costs are token-based and transparent from the outset. There is no flat-fee alternative, which some teams find easier to budget and others find less predictable than a capped monthly fee.
Codeium offers a free tier for individual developers and competitive business pricing that remains flat-rate. If the primary concern driving your Copilot reassessment is cost predictability rather than capability, Codeium deserves a genuine evaluation. Tabnine similarly offers flat-rate business plans and has positioned strongly on privacy — its enterprise tier supports fully on-premises deployment, which matters to teams with strict data residency requirements, including a number of regulated-sector organisations in the UK and India.
The broader agent question is also worth considering. As Cursor 3 explores parallel agents as a new IDE surface, the boundary between "AI coding assistant" and "AI coding agent" is dissolving. Teams that invest in understanding the agent paradigm now — rather than treating AI tools as autocomplete on steroids — are likely to get substantially more value from whichever tool they use. See also GitHub's gh skill: a package manager for AI coding agents for a look at how GitHub itself is evolving the agent tooling ecosystem beyond Copilot.
Before you switch tools, run a two-week trial on your actual production codebase — not a toy project. Measure completion acceptance rate, the rate at which AI-suggested changes pass your CI pipeline without modification, and the subjective time-savings reported by your engineers. Benchmark numbers from other teams are directionally useful but never a substitute for your own data on your own stack.
Understanding Copilot benchmarks in context
Any decision about AI coding tools should be grounded in a clear-eyed reading of the benchmark data — which, in this category, is easy to misread. The SWE-bench family of evaluations is now the standard reference for comparing AI coding agents on real-world software engineering tasks. Understanding the difference between the two main variants matters before you draw conclusions.
SWE-bench Verified focuses on a human-curated subset of tasks that have been confirmed as solvable and correctly specified. Claude Opus 4.7, for example, scores 87.6% on SWE-bench Verified — a figure that looks impressive but applies to a filtered, tractable set of problems. SWE-bench Pro, by contrast, uses harder, less filtered tasks and produces substantially lower scores across all models, with top performers typically landing in the 55–65% range. The gap between 88% and 64% on seemingly similar benchmarks is not a measurement error; it reflects genuinely different task distributions.
For teams evaluating Copilot against alternatives, the practical implication is that headline benchmark figures are a starting point, not a conclusion. A tool that scores higher on SWE-bench Verified does not automatically outperform on your specific codebase, your language stack, or your PR pattern. For a detailed examination of why the two benchmarks diverge so significantly, SWE-bench Verified vs Pro: why one says 88% and the other says 64% is essential reading before you make capability-based purchasing decisions.
What the usage-based shift means for UK and India engineering teams specifically
For teams based in India, the currency exposure introduces an additional planning variable. Copilot is billed in USD or GBP depending on how your GitHub subscription is structured. If your organisation's GitHub billing runs in USD and you are accounting in INR, you carry exchange rate risk on your AI tooling costs — the same risk that applies to cloud compute and SaaS spend more broadly. Build a currency buffer of at least 10% into your consumption-based forecast, particularly given the INR/USD volatility of the past 18 months.
For UK teams, the £10 / £19 flat-rate baseline is denominated in sterling, which simplifies the exchange rate picture but does not eliminate the variability concern. Under consumption billing, the total monthly charge will fluctuate with team activity patterns in a way that flat-rate billing did not. Engineering managers who have historically treated Copilot as a fixed line item in their tooling budget will need to reclassify it as a variable expense — and finance teams will need to be briefed accordingly.
Indian product teams at scale-ups and enterprises should also consider whether Copilot's GitHub-native positioning is a genuine advantage. If your organisation is deeply integrated with GitHub — repositories, Actions, Projects, and Packages — then Copilot's tight coupling with that ecosystem has real workflow value that alternatives cannot easily replicate. If you are on GitLab, Bitbucket, or a self-hosted Git server, that advantage diminishes and the usage-based cost increase becomes harder to justify relative to alternatives.
Our Verified AI Builders include engineering leaders who have navigated AI tooling procurement at scale. If you are building a framework for evaluating AI coding tools across your organisation, add your profile to connect with others working through the same decisions.
A pre-June 1 action checklist
Given the short timeline, here is the practical sequence engineering leaders should follow between now and June 1.
Pull 90 days of Copilot usage data from your GitHub organisation dashboard or via the Copilot usage API. Segment by user to identify your usage distribution — the gap between your median and 90th-percentile users is the number that matters most for worst-case forecasting. Calculate your current per-seat cost and compare it to a projected consumption-based cost using the actual rates when GitHub publishes them. Identify which of your engineers are power users whose consumption patterns would push costs materially above the old flat rate, and have a conversation with them about usage — not to restrict it, but to understand whether their usage patterns are intentional and productive, or partially habitual.
Set a hard monthly spending cap and billing alert in your GitHub organisation settings as soon as the consumption-based interface is available. If you are on a GitHub Enterprise Cloud contract, escalate to your account manager immediately — enterprise pricing terms may insulate you from some of the consumption-based exposure, or may have different cap and alert mechanics than the standard organisational billing interface.
Finally, run a competitive assessment in parallel. Even if you intend to stay on Copilot, knowing what Cursor, Claude Code, Codeium, and Tabnine would cost for your usage patterns gives you negotiating context and a fallback plan if June costs come in higher than modelled.
Usage-based AI tooling costs are not a Copilot-specific story. AWS, Azure OpenAI, and virtually every inference API already operate on consumption pricing. GitHub Copilot is the last major AI developer tool to make the transition from flat subscription to metered billing. Engineering leaders who build the internal discipline now — usage tracking, per-team cost attribution, alert thresholds, and regular spend reviews — will be better positioned as AI tooling costs become a material and variable line in every engineering budget.
What to watch for in the official announcement
When GitHub publishes the usage-based pricing details before June 1, there are four specific things to look for beyond the headline per-unit rate. First, whether there is a minimum monthly commitment or baseline included price that effectively functions as a floor — this would mean low-usage teams do not benefit as much as the pure consumption model might suggest. Second, whether different Copilot features (inline completions, agent mode, PR review, Copilot Chat) are priced differently or pooled into a single consumption metre. Third, how spending caps and overages work in practice — specifically, whether hitting a cap cuts off service, queues requests, or continues billing with an alert. Fourth, what the enterprise agreement implications are for organisations on GitHub Enterprise Cloud contracts, which may have negotiated terms that differ from standard organisational billing.
The AI News section at AI Tech Connect will carry analysis of the official pricing announcement as soon as it is published. For ongoing coverage of product developments across the AI coding tools landscape, see our Product news section.