Game Concept Validation: A 5-Pillar Scorecard for Indie Studios
Game concept validation is the structured process of testing whether a game idea has commercial viability, audience demand, and production feasibility before significant capital is committed. Unlike playtesting, which checks whether a built game is fun, concept validation answers a more fundamental question: should this game exist at all? The discipline pulls evidence from market data, comparable post-mortems, audience research, and monetization benchmarks. The output is a defensible verdict: proceed, pivot, or kill. For an indie studio, the difference between a 12-month build that ships into demand and a 36-month build that ships into silence is almost always a concept-level question that was knowable before a single line of code was written. This guide breaks down the 5-pillar scorecard used to assess any game concept, with named case studies from games that worked, and one launch that very publicly didn't.
Key takeaways
-
The 5 pillars of game concept validation are Market Opportunity, Core Hook & Differentiation, Monetization & Economy, Execution Feasibility, and Scalability & Live Ops.
-
Each pillar is scored 0 to 100. The composite score maps to one of four verdicts: PROCEED, PROCEED-CONDITIONAL, PIVOT, or KILL.
-
Concept validation sits upstream of paper prototyping and playtesting. It tests the idea, not the execution.
-
The median indie game on Steam earns roughly $1,136 over its lifetime, based on analysis of every paid 2019 release. Most of those games trace their commercial outcome back to concept-level decisions made before development began. Source: How To Market A Game: The median indie game does not earn a whole lot.
-
A KILL verdict is not a judgment. It is the cheapest version of the truth. The alternative is finding out 18 months and several hundred thousand dollars later.
Why most failed games failed before they were built
Across three decades of working with studios, from carrier-era mobile distribution at Player X to mentoring early-stage founders at Block Dojo, the same pattern keeps showing up in post-mortems. The team built well. The art was strong. The mechanics held up. But the concept was wrong.
The expensive part of game development is not building. The expensive part is building the wrong thing. Specifically:
-
Wrong for the moment: timing, genre saturation, or an audience shift the studio missed.
-
Wrong for the audience: concept built for the developer's tastes, not a documented player segment.
-
Wrong for the team: scope exceeds the runway, technical risk exceeds the team's track record.
Concept validation exists to make the wrong-thing problem cheap. A scored pre-production assessment takes hours. The same answer, discovered post-launch, costs the studio.
The 5 pillars of game concept validation
Pillar 1: Market Opportunity
This pillar asks whether the genre, sub-genre, and player audience the concept targets has room for a new entrant, and whether the timing favors entry. It looks at three things:
-
Genre saturation: how many comparable games shipped in the last 24 months, and how they performed.
-
Regional total addressable market: where demand actually sits, and how localization affects it.
-
Demand timing: whether the genre is rising, plateaued, or in decline.
A useful case study here is Stardew Valley. When Eric Barone (ConcernedApe) began work in 2012, the Western farming-sim space was effectively empty. Harvest Moon was a Japanese-published niche franchise. FarmVille was a Facebook spin-off in decline. The audience for farming sims existed and expressed demand consistently on forums, but had no contemporary product. Released in 2016 after four years of solo development, the game has since sold over 41 million copies. Source: Game Developer: Stardew Valley sales grew to over 41 million by 2024's end.
The market opportunity scored high here, not because farming sims were trending. It scored high because the audience was real and the supply was zero.
A concept that scores well on this pillar typically has at least one of three signals:
-
An underserved genre with documented audience demand.
-
A platform shift creating a new market window (Steam Deck unlocking PC handheld, the early app-store era, console subscription services opening genres that didn't fit boxed retail).
-
A specific regional market where the concept fits local taste better than the incumbent products do.
A concept that scores poorly is usually competing for shelf space in a saturated genre against established incumbents whose marketing budgets the studio cannot match.
Pillar 2: Core Hook & Differentiation
The Hook pillar asks whether the concept can be explained in one sentence that a player would recognize as new, and whether the differentiation is something competitors cannot easily copy. A weak hook produces "different but not better" positioning, which is one of the most common patterns in failed-game post-mortems.
The case study here is Vampire Survivors by Poncle, released into Early Access in October 2021. It launched with one mechanic that no major competitor was executing at commercial scale: a player who attacks automatically while moving through escalating waves of enemies. The hook was specific enough that players could describe it to a friend in one sentence. The execution was tight enough that imitators have struggled to differentiate inside the genre it created. Vampire Survivors went on to win Best Game, Game Design, and British Game at the 19th British Academy Games Awards in March 2023, a rare clean sweep for an indie title against AAA competition.
A strong-hook concept passes three tests:
-
The single-sentence test: can the hook be explained without analogy?
-
The why-now test: what makes this hook executable today that was not executable five years ago?
-
The defensibility test: what stops a better-resourced competitor from cloning the hook within six months?
The third test is where most concepts fail. A novel mechanic is not a moat. A novel mechanic paired with a content-creation pipeline, an active community, or a steep player skill ceiling is.
Pillar 3: Monetization & Economy
This pillar is where most pre-production assessments get sloppy. The questions are concrete:
-
What is the monetization model, and does it fit the session shape?
-
What are the comparable ARPDAU and conversion benchmarks for that model inside the target genre?
-
Does the projected economy survive sensitivity testing against realistic player behavior distributions?
The reference benchmark for the modern collectible-battler and digital-TCG space is Marvel Snap by Second Dinner, launched in October 2022. Sensor Tower's first-year reporting on Marvel Snap placed mobile revenue in the nine-figure range, driven by a season-pass model paired with a deeply tuned card-acquisition curve. Source: Sensor Tower mobile gaming insights.
The monetization design was not invented from scratch. It adapted established free-to-play card-game patterns into a faster session loop. That is how the Monetization pillar scores in practice. It does not score on creativity. It scores on whether the concept's projected revenue per player falls inside a verifiable benchmark range for the chosen model and genre.
Patterns I have seen across studios that score well on this pillar:
-
A monetization mechanic that fits the session shape. Long-session games support different IAP curves than short-session games.
-
A projected ARPDAU range grounded in named comparable titles, not in invented numbers.
-
An economy stress test that does not collapse under realistic whale-to-minnow distributions.
Patterns that score poorly:
-
Monetization bolted on after the design is locked, with no benchmark backing the projections.
-
Models borrowed from a different genre without testing whether the session shape supports the same conversion behavior.
-
Pricing that assumes the studio's preferred ARPDAU rather than testing it against actuals.
Pillar 4: Execution Feasibility
This pillar asks whether the team that will build the concept can actually ship it within the available runway. It scores three dimensions:
-
Team-skill fit against the technical requirements the concept implies.
-
Scope realism against the studio's track record on prior projects.
-
Technical risk against the platforms targeted.
The public case study for what happens when this pillar is under-weighted is No Man's Sky by Hello Games, launched in August 2016. The launch version shipped meaningfully smaller than what was communicated pre-launch, and the gap between expectation and reality produced one of the most documented marketing failures in modern games. Hello Games subsequently rebuilt trust through years of free major updates, including Foundation, Pathfinder, NEXT, Beyond, Origins, and many more. The full update history is published on Hello Games' update log.
The redemption arc is real. The team was talented, and the post-launch work was extraordinary. The lesson is not about talent. The lesson is that pre-launch scope communication wrote checks that the production runway could not cash on the launch date itself.
Patterns I have seen across studios that score well on Execution Feasibility:
-
Scope explicitly bounded against the team's available person-months, not aspirational targets.
-
Named risk areas with documented mitigation strategies.
-
A publicly committed feature set that is a strict subset of the internally committed feature set.
The rule of thumb I share with founders at Block Dojo: if you cannot describe the launch version in one paragraph without using the word "and," the scope is too wide for the runway.
Pillar 5: Scalability & Live Ops
The final pillar asks what happens after launch. Specifically, whether the concept has the structural capacity to support ongoing content, monetization, and community engagement at a cadence the team can sustain.
The indie reference point is Dead Cells by Motion Twin, with Evil Empire taking over update production from 2019 onward. The base game launched in 2018 and through 2024 received multiple major free updates plus paid DLCs including Bad Seed, Fatal Falls, The Queen and the Sea, and Return to Castlevania. The Castlevania partnership in particular brought a major IP into the game within a structure that already supported drop-in content. Source: Dead Cells official site and update history.
The studio sustained a multi-year live-ops cadence because the base game's run-based structure was designed to accept new content as drop-in modules. The architectural decisions that made post-launch updates cheap were made during pre-production, not after.
A concept that scores well on Scalability has documented answers to three questions:
-
What is the structural unit of new content? (Levels, characters, modes, seasons.)
-
What is the unit cost to produce, given the chosen art and engineering pipelines?
-
What cadence can the team realistically sustain without burning out.
A concept that scores poorly usually treats live ops as a future problem. This becomes the current problem in month three of post-launch.
The 4 verdicts: what each one means and what to do next
Every concept assessment resolves to one of four verdicts. Each verdict has a specific action attached. None of them are judgments about the team or the idea.
| Verdict | What it means | What to do next |
|---|---|---|
| PROCEED | All five pillars score above their threshold. The concept is defensible across market, hook, economy, execution, and scalability. | Move to prototype. The pre-production assessment becomes a baseline for measuring concept drift during production. |
| PROCEED-CONDITIONAL | Four pillars pass, one is borderline. The concept is viable but has a specific, named risk. | Address the named risk first, with additional research, a scoped prototype, or a scope adjustment. Then re-score. Do not proceed to full production without closing the conditional. |
| PIVOT | Two or more pillars fail, but the core hook or market opportunity is strong. The current concept is not viable, but a recognizable variant of it might be. | Re-scope against the failing pillars. Common pivots: smaller genre target, different platform, different monetization model, narrower launch feature set. |
| KILL | The composite score is below the threshold, and no realistic pivot recovers it. The concept does not have a defensible path to a successful launch on the current team and runway. | Stop development. Document the failure modes. The cost of stopping now is the cheapest possible version of the lesson. |
A KILL verdict is not a failure of the team. It is the function of the pre-production assessment to work correctly. The alternative is the same answer at month 18 of development, after the runway has been spent.
A worked example: how the Monetization pillar pulls its benchmark
To illustrate how scoring traces back to specific evidence, here is the citation chain for a Monetization assessment on a hypothetical mobile collectible-battler concept.
-
Identify the comparable set. The concept is a session-based collectible card game targeting mobile, with seasonal content drops. The closest public comparables are Marvel Snap (Second Dinner), Hearthstone (Blizzard Entertainment), and Legends of Runeterra (Riot Games).
-
Pull the benchmark range. Sensor Tower and AppMagic both publish quarterly mobile-genre revenue reports. For the digital-TCG segment, the ARPDAU benchmark band is derived from the comparables' published mobile revenue divided by their disclosed or reasonably estimated daily active user counts.
-
Score the concept against the band. The concept's projected ARPDAU is calculated from its monetization mix (pack purchases, season pass, cosmetic IAP) and tested against the benchmark band.
-
Stress-test the score. A Monte Carlo simulation runs the concept's economy through 10,000 player-behavior distributions. The simulation surfaces whether the model holds up under realistic whale-to-minnow ratios or collapses at the extremes.
-
Trace the score back to its sources. The final Monetization pillar score links back to the specific Sensor Tower report, the specific comparable titles, and the simulation parameters used. No black box. Every score is independently verifiable.
That citation chain is what separates pre-production assessment from gut instinct. The output is a number. But the number is anchored in evidence that a publisher, an investor, or a co-founder can independently verify.
Frequently asked questions
What is the difference between game concept validation and playtesting?
Game concept validation tests the idea before the game exists. Playtesting tests the execution once a playable build exists. Concept validation answers questions like "is there a market for this?" and "does the monetization model fit the genre?" Playtesting answers questions like "is this mechanic fun?" and "Where do players get stuck?" The two disciplines are sequential: validation upstream of playtesting. They are not substitutes for each other.
How long does game concept validation take?
A focused single-concept assessment using the structured 5-pillar process typically takes between four and twelve working hours, depending on how much comparable-title research already exists for the genre. The Market and Monetization pillars carry the most research overhead. Execution and Scalability are usually faster because the team's own track record is the primary evidence base.
Can a game concept be validated without a prototype?
Yes. That is the central premise of pre-production assessment. The pillars test market-level, audience-level, and structural questions that do not require a playable build. Paper prototyping and digital prototyping are useful tools for testing execution-level questions (does this mechanic feel good?), but they are not substitutes for concept-level questions (should we build this game at all?). The two activities are complementary.
When should an indie studio start validating its game concept?
Before the first sprint of production, concept validation is most valuable when it can still change the decision, which means before significant capital has been committed to art, engineering, or marketing. Studios that run validation after pre-production has locked tend to use the output as confirmation rather than guidance. That defeats the purpose.
What happens if the concept gets a KILL verdict?
A KILL verdict means the structured assessment did not find a defensible path to a successful launch on the current team and runway. The studio's next action is usually one of three things: pivot the concept against the failing pillars and re-score, shelve the concept and run a fresh assessment on a different idea, or address the specific weaknesses with additional research before re-scoring. A KILL verdict is the cheapest possible version of a hard truth.
How to use this in your own pre-production process
The 5-pillar game concept validation scorecard works whether the assessment is run on paper, in a spreadsheet, or inside a dedicated platform. The discipline matters more than the tooling. Score each pillar against the questions in this guide, document the evidence behind each score, and let the composite verdict drive the next decision.
