Platform21 April 2026

Structure Before Speed: Why Better Decisions Start With the Right Question

By Forenta Team

Most bad decisions are not made by people who lacked information. They are made by people who had enough to feel certain, and not enough structure to test whether that certainty was warranted. There is a second failure mode that shows up even earlier: asking the wrong question before any deliberation begins. A rigorous answer to the wrong question is still the wrong answer.

This is not a productivity problem. It is a judgment problem. As the cost of building drops, it becomes a more expensive one. When a team can move from idea to working prototype in days, the constraint shifts. The question is no longer whether something can be built quickly. It is whether the right thing is being built, whether the right question was asked, and whether the reasoning behind that choice has been genuinely challenged before work begins.

The structural weakness of single-source analysis

A single analysis, whether from a colleague, a consultant, or an AI assistant, carries a hidden risk that has nothing to do with how capable the source is. It is coherent by construction. One thinker organises information into a logical narrative, and that narrative will always appear more persuasive than the uncertainty it has smoothed over. The gaps are invisible because they were already resolved, or quietly dismissed, by the author.

The cognitive mechanism behind this is well-documented. Tversky and Kahneman showed in a 1974 study in Science that when people estimate, they anchor to an initial value and adjust insufficiently, even when that initial value is arbitrary.¹ When the first analysis in a discussion establishes a frame, subsequent thinking adjusts around that frame rather than generating independent views. This is not a sign of limited intelligence; it is a documented property of how human judgment operates under uncertainty.

At the group level, the effect compounds. Irving Janis's analysis of high-stakes policy failures, from the Bay of Pigs to Pearl Harbour, identified a pattern he called groupthink: cohesive teams under time pressure suppress dissenting views, converge prematurely on shared conclusions, and stop stress-testing their assumptions.² The mechanism is not poor individual judgment. It is the social pressure that emerges when groups need to feel certain about a decision before committing to it.

Single analysis vs independent frames
Single analysis
Initial frame established
All thinking adjusts to it
Blind spots stay hidden
Blind spot persists
Independent frames
Multiple independent views
Each surfaces different gaps
Gaps become visible
Gaps visible, correctable

Concept: Tversky & Kahneman (1974); Janis (1982)

What the research shows about collective judgment

Research on collective intelligence points toward a different model. In a 2010 study in Science, Woolley and colleagues measured group performance across a range of cognitive tasks and found a general factor, a 'c factor,' that predicted group performance better than any individual member's IQ.³ This factor correlated with how evenly participation was distributed and how socially perceptive group members were. It did not correlate strongly with having the most intelligent person in the room.

Francis Galton's 1907 observation at a county fair provides a simpler illustration.⁴ When 800 visitors independently estimated the weight of an ox, the median of all guesses was accurate to within 0.8 percent of the actual value, more accurate than any individual expert. Galton recognised the operative condition: independence. Correlated guesses converge toward shared errors. Independent guesses, carefully aggregated, converge toward accuracy.

Shared opinion vs independent estimates

Concept: Galton (1907). Vox populi. Nature.

Six decades of structured deliberation

In the early 1950s, the RAND Corporation faced a concrete problem: how to get reliable forecasts from expert groups without distortion by social hierarchy, dominant personalities, or premature consensus. The solution, developed by Dalkey and Helmer and published in Management Science in 1963, became known as the Delphi method.⁵ Experts submit assessments independently and anonymously, receive controlled feedback about the distribution of responses, and revise their views based on the aggregate, not based on who said what.

Roger Cooke's foundational work on structured expert elicitation formalised this further: reliable expert judgment requires calibrating experts against known quantities, weighting contributions by track record, and being explicit about where genuine uncertainty remains rather than forcing false consensus.⁶ The output is not the average of all opinions. It is a calibrated summary of what the available evidence actually supports, including where that evidence runs out.

Gary Klein introduced a complementary technique: the premortem. Instead of asking a team to evaluate whether a plan might fail, you ask them to imagine it has already failed, then identify what went wrong.⁷ Research found that imagining failure as accomplished fact increases the ability to correctly identify failure causes by 30% compared to imagining it as a possibility. The technique works because it legitimises dissent. People who would not openly criticise a plan will freely describe its failure.

The Delphi method in four stages
1
Submit independently
No cross-visibility
2
See aggregate
Distribution, not names
3
Revise with evidence
Based on data
4
Calibrated output
What the evidence supports

Source: Dalkey & Helmer (1963). Management Science.

Why blind rounds improve judgment quality

In a controlled experiment at the 2017 WSDM conference, Tomkins, Zhang, and Heavlin tested what happens when reviewers can see the identity of the authors they are evaluating.¹⁰ Single-blind reviewing produced a significant advantage for papers from prestigious institutions, a bias that disappeared entirely under double-blind conditions. When reviewers knew who wrote a paper, they evaluated the conclusion through the lens of the author's affiliation rather than the quality of the argument itself.

Research on devil's advocacy adds an important nuance. Schwenk's 1990 meta-analysis found that structured dissent, whether devil's advocacy or dialectical inquiry, consistently produces higher-quality decisions than consensus approaches.⁸ But Nemeth and colleagues showed in 2001 that authentic minority dissent is more valuable than any form of role-played devil's advocacy: when a dissenter genuinely holds a different view, it stimulates more divergent thinking than when dissent is assigned as a role.⁹ The goal is not to add a critical voice as a formality. It is to structure the process so that genuine disagreement can surface and survive.

The insight from six decades of expert elicitation research is not that more opinions produce better decisions. It is that independent opinions, collected before group influence takes hold, structured to surface disagreement, and synthesised with explicit attention to uncertainty, produce better calibrated judgment.

The second failure mode: generic deliberation

Structured deliberation produces better-calibrated judgment than consensus-first thinking. But a structured process applied to the wrong question, in the wrong format, for the wrong audience, still wastes effort. A validation question deserves different rigor than a go/no-go decision. A codebase under review is a different object than an early-stage idea. An artefact for investors requires a different shape than an internal founder memo. Treating every decision as the same problem is a subtle version of the single-source failure mode: one frame imposed on heterogeneous situations.

This is why Forenta's Council Deliberation begins with intake rather than analysis. Before any advisor is activated, the system classifies five things: the object (idea, product, codebase, website, document, or strategic question), the stage (concept, prototype, MVP, live, or scaling), the goal (validation, go/no-go, investor materials, review, launch plan, roadmap, build instructions, or strategic direction), the audience (founders, investors, developers, or customers), and the deliverables that will actually be used. Only then is the council configured, with advisor weights tuned to the use case and outputs matched to what the user confirmed.

From research to product: Forenta Council Deliberation

Council Deliberation runs in three stages. First, a five-question intake that typically takes under two minutes. If the initial message already contains enough context, the intake compresses to a confirmation prompt. No advisor brief is generated until object, stage, goal, audience, and deliverables are confirmed.

Second, ten advisor roles perform blind, parallel analysis. Each role has a mandate that counters a specific cognitive bias. The Risk Sentinel falsifies and lists kill criteria. The Ground Truth Architect separates fact from assumption with explicit evidence-strength labels. The Opportunity Cartographer maps adjacent paths and ten-x scenarios. The Pattern Scout applies reference-class reasoning. The Delivery Marshal sequences the critical path with dated milestones. The Automation Strategist maps AI leverage and cost. The Experience Composer audits UX and visual hierarchy. The Capital Steward models commercial reality and runway. The GTM and Adoption Strategist designs distribution and adoption friction. The Codebase Reliability Architect assesses production readiness. No advisor sees the others' output during the initial round.

Forenta Council Deliberation process
1
Intake & classify
5 questions: object, stage, goal, audience, deliverable
2
10 advisors, blind round
Weighted per use case. No cross-visibility.
3
Peer review, rebuttal, 10th-Man
Steelman gate. Concede 1, strengthen 1, revise 1.
4
Chair synthesis
Verdict + falsifier + adaptive deliverable

After the blind round, an anonymised peer-review phase asks each advisor to steelman their peers' strongest point before critiquing argument quality. A rebuttal pass follows: each advisor concedes one weakness, strengthens one point, and revises one claim. When six or more advisors converge on a major judgment, a tenth-man memo is triggered and one advisor is appointed to argue the opposite case, specifically to prevent false consensus.

Third, a separate Council Chair synthesises the record. The chair does not average and does not count votes. It picks the strongest case or declares irreducible uncertainty with explicit bet terms, preserves dissent where substantively stronger, quotes the single most uncomfortable point raised in the council, and states a falsifier: the specific observable that would prove the verdict wrong, with a timeframe.

Adaptive deliverables, not one-size output

The output changes shape with the use case. An idea-validation run produces a founder memo with stress-tested assumptions and kill criteria. A codebase review produces a production-readiness audit. An investor-materials run produces a structured memo with falsifier. A go-to-market run produces a thirty-sixty-ninety plan with distribution channels and adoption friction mapped. A build-execution run produces concrete implementation instructions. In every case, a short synthesis document with verdict, confidence, falsifier, most uncomfortable point, and top three actions is written first. Further deliverables are generated only when confirmed during intake.

No output is produced that was not confirmed. If a pitch deck was not requested, no pitch deck is produced. If a build plan was not confirmed, no build plan is produced. Structured output that nobody uses is the same waste as unstructured output nobody reads.

What this does not cover

Council Deliberation is designed for decisions that benefit from multiple analytical frames before a commitment is made. It is less useful for decisions that are genuinely reversible at low cost, for execution questions where speed matters more than validation, or for problems requiring deep domain expertise the system's advisors do not have. The quality of any deliberation is bounded by the quality of the input: a vague brief produces a less useful output than a specific one.

There is a second honest limitation. The ten advisor roles share one underlying language model. They are given different mandates, different biases to counter, and different required artifacts, which produces genuinely divergent reasoning, but shared-model bias is real and cannot be fully eliminated, only discounted. The chair applies a one-tier confidence downgrade on positive findings about options the system itself might favour. Independent human judgment on real decisions is the rigorous test. Building that evaluation is part of the work ahead and will be reported as it happens.

Judgment is a process, not a moment

The research finding is precise. Independent analysis collected before group influence takes hold, structured to surface disagreement, and synthesised with explicit attention to uncertainty, produces better-calibrated judgment than consensus-first approaches. Intake before analysis sharpens the same principle: the right question, asked of the right role, for the right audience, produces a decision artefact someone can actually use.

The practical implication for founders and builders is unchanged. The point before execution is the cheapest moment to surface a weak assumption, a weak question, or a weak plan. Every week of work built on an unexamined premise adds cost to the eventual correction. Structured deliberation with intake is not overhead. It is what prevents both weak reasoning and the wrong problem from compounding into expensive problems.

Back to Journal