Table of Contents
Executive Summary
Most pharma and biotech leaders working through their AI portfolio do not lack candidate use cases. They lack a defensible mechanism for choosing among the candidates. The prioritization matrix we use with clients addresses that gap by scoring each candidate against five axes — strategic value, technical feasibility, data readiness, regulatory risk, and time-to-value — and surfacing the sequencing implications in a form that survives executive scrutiny.
This article walks through the matrix as we use it: the five axes and what each measures, how scoring is calibrated to avoid the typical overscoring problem, how the axes are weighted to produce a total score, and how the resulting matrix is used in board-level investment conversations. We close with the common mistakes clients make and the maintenance discipline that keeps the matrix useful across the portfolio over time.
Why Prioritization Is the Binding Constraint
In our client engagements over the past two years, the most common diagnostic finding is not that the AI portfolio lacks ambition. It is that the portfolio lacks discipline. Most pharma and biotech leaders we work with have between 12 and 40 candidate AI use cases at any given moment, surfaced by IT, R&D, commercial, manufacturing, regulatory, and other functions. The capacity to actually deliver these is typically constrained to 3-6 use cases per year at meaningful depth. The arithmetic of the gap is obvious; the discipline to act on it is rare.
The failure mode is consistent. Without an explicit prioritization mechanism, the use cases that get funded are the ones whose sponsors are loudest, most senior, or most persistent — not the ones with the best risk-adjusted return on the AI investment. This is not a hypothetical concern. The widely cited BCG analysis of value capture in AI found that only about 25% of organizations are capturing significant value from their AI investments, and the gap between leaders and laggards is largely about portfolio discipline rather than technology choice.
The prioritization matrix is the operational artifact that makes the discipline real. It does three things: it forces an explicit articulation of the criteria that matter, it produces a defensible ranking that can be reviewed and revised, and it provides a vocabulary for the cross-functional conversations through which AI investment decisions actually get made. Without the matrix, those conversations default to advocacy. With the matrix, they become structured.
A useful way to frame this with executive sponsors: the prioritization matrix is not the answer; it is the structure within which the answer becomes visible. The matrix does not eliminate judgment, but it disciplines the judgment by making the criteria and trade-offs explicit. This framing tends to land well with executive sponsors who have seen too many AI initiatives funded on the strength of presentation rather than substance.
The Five Scoring Axes
The matrix uses five axes. We have tried matrices with more axes (seven, nine) and matrices with fewer (three, four), and converged on five as the right balance between fidelity and usability. More axes produce false precision; fewer axes lose meaningful signal. The five we use:
Strategic value. The contribution to enterprise strategic priorities if the use case succeeds. Not the size of the addressable benefit (that comes in time-to-value), but the alignment with what leadership has already committed to. A use case that advances a top-three strategic priority scores high; a use case that advances a tactical efficiency goal scores lower.
Technical feasibility. The likelihood that the use case can be technically delivered with available capabilities (in-house or vendor) within a reasonable timeline. This is the axis where AI advocates most often overscore; healthy skepticism is appropriate. A use case requiring novel research scores low; a use case using a well-understood architecture scores higher.
Data readiness. The state of the data the use case requires — availability, quality, governance, accessibility. As McKinsey’s State of AI research has consistently shown, data readiness is the most frequently cited blocker for enterprise AI value capture, more than algorithmic sophistication or talent. This axis surfaces use cases that look attractive on other axes but require data preparation work that exceeds the use case itself.
Regulatory risk. The regulatory exposure the use case creates. For pharma and biotech clients, this is unavoidable rather than incidental. A use case affecting safety, quality, or efficacy carries different regulatory weight than a use case in marketing or HR. The scoring is inverted: lower regulatory risk scores higher in this axis, and higher regulatory risk requires the corresponding investment in governance discipline.
Time-to-value. The expected payback period and the magnitude of value at the end. Quick payback at meaningful scale scores high; long payback or small scale scores lower. The axis combines speed and magnitude because either alone is insufficient: a fast use case with no value is not worth doing, and a high-value use case with no realistic payback path is not actionable.
Each axis is scored on a 1-5 scale with anchored definitions for each score level. The anchored definitions matter; without them, the scoring drifts toward the middle, and the matrix loses its discriminative power.
Calibrating the Scoring Discipline
Anchored scoring definitions are what keep the matrix useful. The table below summarizes the anchors we use, refined through dozens of client applications.
| Axis | Score 1 | Score 3 | Score 5 |
|---|---|---|---|
| Strategic value | Tactical efficiency; not on leadership’s stated priority list | Supports a stated strategic priority but is not the primary mechanism | Directly advances a top-three strategic priority |
| Technical feasibility | Requires novel research or unproven architecture | Uses proven architecture but requires meaningful customization | Uses well-understood architecture with available capability |
| Data readiness | Required data does not exist or requires major collection effort | Required data exists but needs significant preparation | Required data is available, governed, and accessible |
| Regulatory risk (inverted) | High-impact AI directly affecting safety, quality, or efficacy | Moderate exposure with established validation pathway | Low or no regulatory exposure |
| Time-to-value | Multi-year payback or unclear value mechanism | 12-18 months to material value | Less than 12 months to material value at meaningful scale |
Calibration sessions are the discipline that makes anchored definitions work. We run a calibration session with the scoring team — typically the AI governance committee or a designated cross-functional group — before any use case is scored. The session walks through 3-4 example use cases, scores them collectively, and surfaces the implicit assumptions different team members are bringing to the scoring. Without calibration, the scoring varies by who happens to be in the room, and the matrix’s defensibility erodes.
The most common calibration finding: technical advocates systematically overscore technical feasibility, and business advocates systematically overscore strategic value. The cross-functional scoring team and the anchored definitions both work against these biases, but neither eliminates them. Sustained attention to the calibration is a continuing discipline, not a one-time setup.
Weighting and the Total Score
The five axes do not always carry equal weight. The default weighting we start with assigns 25% to strategic value, 20% to technical feasibility, 20% to data readiness, 15% to regulatory risk, and 20% to time-to-value. These weights are starting points, not prescriptions, and we adjust them based on the client’s specific context.
The most common adjustments. Clients with explicit board-level pressure on revenue growth often increase the weight on time-to-value to 25-30%. Clients in early regulatory engagement cycles often increase the weight on regulatory risk to 20-25%. Clients with significant data infrastructure gaps often increase the weight on data readiness to 25%, reflecting that data work is the binding constraint for their portfolio. The point of weighting is to make the strategic priorities visible in the scoring; weights that produce results inconsistent with the strategic priorities reveal a mismatch worth surfacing.
The total score is the weighted sum, producing a number on a 1-5 scale. We have found that converting the total to a tier (A: 4.0+; B: 3.0-3.9; C: 2.0-2.9; D: below 2.0) is more useful than reporting the precise number. Tiers create natural cut lines for portfolio decisions; precise numbers invite false confidence.
Using the Matrix in Board Conversations
The matrix’s value is largely in the conversations it enables, not in the numbers it produces. The most productive way we have found to use the matrix in board-level conversations is as a structured input, not as a recommendation.
The pattern that has worked across clients. The AI governance committee presents the scored matrix, including the weighting rationale, the anchored definitions, and the tiered output. The committee then articulates a recommended portfolio (typically 3-6 use cases for the next 12 months) drawn from the A and B tiers, with explicit notes on the trade-offs being made. The board reviews the matrix, probes the assumptions, and either ratifies the recommendation or requests adjustments. This pattern produces decisions that survive scrutiny because the structure of the analysis is visible alongside the recommendation.
The alternative — presenting only the recommendation without the underlying matrix — consistently produces worse outcomes. Board members ask questions the recommendation cannot answer, the conversation drifts to advocacy, and decisions get made on the basis of which presenter was most persuasive rather than which use case represented the best risk-adjusted return.
The Deloitte State of AI in the Enterprise research has consistently found that the organizations capturing the most value from their AI investments are those with formalized governance and portfolio mechanisms. The prioritization matrix is one of the most concrete operational expressions of that governance discipline.
Common Mistakes Clients Make
Across client engagements, five mistakes recur often enough to be worth naming.
Scoring without anchored definitions. The fastest way to produce a useless matrix. Without anchors, scoring drifts toward the middle, the discriminative power collapses, and the matrix becomes a ritual rather than a tool.
Scoring by individual rather than by cross-functional group. Single-scorer matrices reflect individual biases. Cross-functional scoring with calibration produces materially more defensible results, even when it takes longer.
Treating the matrix as static. The portfolio changes, capabilities mature, regulatory expectations evolve. A matrix scored once and treated as authoritative for two years ages badly. The matrix should be refreshed at least quarterly.
Letting strategic value swallow the other axes. A use case that is strategically aligned but technically infeasible or regulatorily indefensible is not a viable portfolio investment. The other axes exist precisely to discipline the strategic enthusiasm that otherwise drives funding decisions.
Failing to surface trade-offs. The matrix produces a ranked list, but the operational decision is which combination of use cases the portfolio can actually deliver. The trade-off conversation — between a high-scoring use case with significant data work and a moderately-scoring use case ready to start — is where real decisions get made.
Gartner’s research on AI program risk reinforces these patterns. As reported in Gartner’s July 2024 generative AI program risk analysis, the convergence of unclear business value, poor data quality, and inadequate risk controls explains the bulk of AI program abandonment. The prioritization matrix addresses all three at the gating step, before the investment is committed.
Maintaining the Matrix Over Time
The matrix is most useful when it is maintained as a living artifact rather than a one-time deliverable. The maintenance discipline we recommend.
Quarterly refresh. Each quarter, the AI governance committee re-scores the active portfolio against the matrix, adds new candidate use cases, and reviews whether the anchored definitions or weights need adjustment. Quarterly is the right cadence: more frequent produces churn, less frequent allows the portfolio to drift from the strategic priorities.
Anchored definition review annually. Once a year, the cross-functional group reviews whether the anchored definitions still reflect the operational reality. As technical capabilities mature, score 5 on technical feasibility may no longer represent the same level of difficulty. As data infrastructure matures, score 5 on data readiness may no longer be aspirational. The definitions should evolve.
Weighting review during strategic planning. Once a year, in line with strategic planning, the weights are reviewed against current priorities. If the strategic priorities have shifted, the weights should shift accordingly. If they have not shifted, the weighting confirms continuity.
Outcome tracking. The matrix’s predictive power should be tested over time. Use cases that scored high but failed to deliver, or use cases that scored low but succeeded, should be reviewed to understand why. This is the mechanism through which the matrix itself improves; without outcome tracking, the matrix never gets better than the assumptions baked into the initial design.
The compounding value of the matrix is in its second and third year. The first year produces a defensible portfolio. The second year produces refined definitions, validated weights, and the beginning of empirical evidence about which axes most strongly predict success. The third year produces a matrix that is meaningfully calibrated to the specific client’s operational environment, with predictive power that generic frameworks cannot match.
Adapting the matrix for early-stage clients
An adaptation worth surfacing: early-stage biotech clients (Series A or B) operate under different constraints than established pharma, and the matrix needs calibration to reflect that. For early-stage clients, the time-to-value axis becomes more heavily weighted (often 30%+), because runway is the binding strategic constraint. The strategic value axis collapses toward platform-defining versus operational, because the strategic question is whether the use case advances the platform thesis or merely improves operational metrics. The regulatory risk axis becomes more nuanced, because the regulatory profile of the company itself is still being established, and use cases that contribute to that profile have value beyond their direct output.
The adaptations are not invalidations. The core five axes remain right; the weighting and the anchored definitions adjust to the early-stage context. Clients who try to apply the established-pharma version of the matrix to an early-stage context consistently produce results that look defensible but do not survive operational implementation. The calibration to context is part of the discipline.
How the matrix interacts with the GxP compliance posture
For pharma and biotech clients, the matrix interacts directly with the GxP compliance posture in ways worth understanding. The regulatory risk axis surfaces the use cases that will require the most validation work, and the prioritization decision implicitly commits the organization to that validation work for the use cases that get funded. Quality and regulatory leadership should be part of the scoring team for this reason; their input on regulatory risk and on the realistic validation timeline materially affects which use cases are viable.
The integration with the GxP framework also matters for the portfolio’s longer-term defensibility. Use cases that score well across the five axes but that the GxP framework cannot accommodate produce a brittle portfolio that fails under inspection pressure. Conversely, use cases that score moderately on technical feasibility but that the GxP framework readily accommodates often produce better long-term outcomes than the technical scoring alone would suggest. The matrix and the GxP framework should be designed to interoperate, and the cross-functional scoring discipline is where that interoperation gets enforced.
What the matrix does not address
An honest scope statement: the matrix does not address everything that matters in AI portfolio management. It does not address the talent question (whether the organization has the people to deliver the prioritized use cases). It does not address the change management question (whether the organization can absorb the operational changes the prioritized use cases require). And it does not address the financial structure question (whether the prioritized use cases produce the cash flow profile the business needs).
These three questions are addressed elsewhere in the broader AI strategy framework. The matrix focuses on the use case selection problem, and is most useful when treated as one input to the broader strategy rather than as the strategy itself. Clients who expect the matrix to answer questions outside its scope are disappointed; clients who use it as the disciplined surface for the use case prioritization decision are consistently better positioned than peers who skip the discipline entirely.
References & Sources
For Further Reading
References & Sources
- Gartner Predicts 30% of Generative AI Projects Will Be Abandoned After Proof of Concept By End of 2025 — Gartner Press Release. Source for the abandonment statistic and the structural drivers (data quality, risk controls, costs, unclear value) that prioritization discipline addresses.
- Where’s the Value in AI? — Boston Consulting Group. Analysis of value capture in enterprise AI, including the gap between leaders and laggards being largely about portfolio discipline rather than technology choice.
- The state of AI — McKinsey QuantumBlack. McKinsey’s annual State of AI research documenting data readiness as the most frequently cited blocker for enterprise AI value capture.
- State of AI in the Enterprise — Deloitte Insights. Research on the governance and portfolio mechanisms that distinguish organizations capturing meaningful value from AI investments.
- Managing the Risks of Generative AI — Harvard Business Review. Framework analysis on prioritizing AI investments under uncertainty, including risk-weighted approaches to portfolio selection.
- Expanding AI’s Impact With Organizational Learning — MIT Sloan Management Review and BCG. Research on the organizational disciplines, including portfolio governance, that translate AI investment into operational value.








Your perspective matters—join the conversation.