Probably Raises $9 Million to Build a System to Prevent AI Hallucinations in Production Use

Organizations using large language systems across internal operations often require strict structure in outputs. These systems feed into document generation, reporting tools, customer communication systems, and financial workflows.

By Donna Joseph
June 17, 2026 9:40 PM • Updated June 17, 2026

Summary

Probably has raised $9 million in seed funding to build systems that detect and block errors in large language system outputs before they reach users, focusing on reliability in production deployments.
The company is building a validation layer between model output and application delivery that checks structure, consistency, and rule compliance before responses enter operational workflows.
Probably’s first product targets structured data use cases, using deterministic checks, audit trails, and reprocessing loops to reduce errors while lowering reliance on larger models.

SAN FRANCISCO, Calif., June 16, 2026 — Probably has raised nine million dollars in seed funding from Andreessen Horowitz to develop systems that detect and block errors in large language system outputs before they reach users. The funding supports early product development and engineering work focused on reliability in production deployments of artificial intelligence systems.

The company is working on infrastructure that sits between the language system output and final application delivery. Rather than building another general-purpose generator, it focuses on verification and control of generated responses before they enter operational workflows.

Founder Peter Elias describes the work as building guardrails that prevent incorrect or inconsistent outputs from reaching end users. The target is higher reliability in environments where accuracy requirements are strict and errors carry an operational cost.

Factual Errors in Production Workflows

Large language systems can produce fluent responses that still contain factual errors or inconsistent information. These issues appear even in advanced systems and remain difficult to eliminate through generation alone.

In production use, these errors become more visible because outputs are often used directly in business workflows such as reporting, data extraction, or automated decision processes. Even small inconsistencies can disrupt downstream systems that rely on structured inputs.

Existing detection methods often depend on post hoc review or separate validation scripts. These methods vary widely across applications, so the same kind of generated content can be handled inconsistently from one system to the next.

Validation Layer Between Generation and Delivery

Probably builds a validation layer that evaluates outputs before they are accepted into downstream systems. This layer checks structure, consistency, and alignment with predefined rules.

Outputs are not treated as final until they pass validation checks. If a response does not match the required structure or contains inconsistencies, it is rejected or reprocessed. This creates a controlled pathway between generation and usage.
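The article does not describe how Probably implements this gate. As a minimal sketch of the general idea only, a structural check might parse a response and accept or reject it before anything downstream sees it. The schema, field names, and the `validate_structure` function below are hypothetical and are not taken from the company's product.

```python
import json
from dataclasses import dataclass, field
from typing import List

@dataclass
class ValidationResult:
    accepted: bool
    errors: List[str] = field(default_factory=list)

# Hypothetical schema for illustration: field name -> expected Python type.
INVOICE_SCHEMA = {"invoice_id": str, "total": float, "currency": str}

def validate_structure(raw_output: str, schema: dict) -> ValidationResult:
    """Accept a model response only if it is a JSON object matching the schema."""
    try:
        data = json.loads(raw_output)
    except json.JSONDecodeError as exc:
        return ValidationResult(False, [f"not valid JSON: {exc.msg}"])
    if not isinstance(data, dict):
        return ValidationResult(False, ["top-level value must be an object"])

    errors = []
    for name, expected_type in schema.items():
        if name not in data:
            errors.append(f"missing field: {name}")
        elif not isinstance(data[name], expected_type):
            errors.append(f"wrong type for field: {name}")
    # An empty error list means the response may proceed downstream.
    return ValidationResult(not errors, errors)
```

In this sketch a rejected result carries its error list, which a caller could use either to discard the response or to request a new one.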

The system is designed to reduce reliance on manual correction or application-specific parsing logic. Instead of each workflow handling errors independently, validation rules can be defined once and applied across multiple use cases.

This setup allows generated content to be constrained by deterministic rules that verify structure and correctness before delivery into production systems.
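One way to picture rules that are defined once and applied across multiple use cases is a shared registry that each workflow references by name. The sketch below is an assumption about how such a registry could look, not a description of Probably's design; the rule names, workflow names, and `failed_rules` helper are all invented for illustration.

```python
from typing import Callable, Dict, List

Rule = Callable[[dict], bool]

# Hypothetical shared rules, each defined exactly once.
RULES: Dict[str, Rule] = {
    "has_id": lambda rec: isinstance(rec.get("id"), str) and rec["id"] != "",
    "amount_non_negative": lambda rec: (
        isinstance(rec.get("amount"), (int, float)) and rec["amount"] >= 0
    ),
    "currency_is_known": lambda rec: rec.get("currency") in {"USD", "EUR", "GBP"},
}

# Hypothetical workflows that reuse the shared rules by name.
WORKFLOWS: Dict[str, List[str]] = {
    "reporting": ["has_id", "amount_non_negative"],
    "invoicing": ["has_id", "amount_non_negative", "currency_is_known"],
}

def failed_rules(workflow: str, record: dict) -> List[str]:
    """Return the names of the rules this record fails for the given workflow."""
    return [name for name in WORKFLOWS[workflow] if not RULES[name](record)]
```

The same record can then pass one workflow and fail another without either workflow carrying its own parsing logic.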

Data Science Tool and Structured Output Verification

The company’s first product is a data science tool designed to produce answers from structured datasets. Each output includes citations and an audit trail that shows how the result was derived from source data.
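The article gives no detail on the format of these citations or audit trails. As a rough illustration under that caveat, an answer computed from a dataset could carry the identifiers of the rows it used and a list of the steps taken. The `AuditedAnswer` type, the `row_id` field, and the `average_with_audit` function are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class AuditedAnswer:
    value: float
    citations: List[str]                             # ids of the source rows used
    trail: List[str] = field(default_factory=list)   # human-readable derivation steps

def average_with_audit(rows: List[dict], column: str) -> AuditedAnswer:
    """Average a column while recording which rows were used and how."""
    used = [row for row in rows if row.get(column) is not None]
    if not used:
        raise ValueError(f"no rows with a value for '{column}'")
    total = sum(row[column] for row in used)
    value = total / len(used)
    trail = [
        f"selected {len(used)} of {len(rows)} rows with a non-null '{column}'",
        f"summed '{column}' to get {total}",
        f"divided by {len(used)} to get {value}",
    ]
    return AuditedAnswer(value, [row["row_id"] for row in used], trail)
```

A reviewer can then trace the reported figure back to specific rows rather than trusting the number on its own.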

To reduce errors, outputs pass through a validation system that checks whether results match dataset constraints. If mismatches occur, the system rejects the output and reprocesses it until it satisfies the required rules.
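The reject-and-reprocess cycle described above can be sketched as a loop that recomputes the expected value from the dataset and asks for a new answer when the two disagree. This is an illustrative assumption, not the company's implementation: the `generate` callable stands in for a model call, the attempt cap is an added safeguard the article does not mention, and all names are hypothetical.

```python
from typing import Callable, List, Optional, Tuple

def check_against_dataset(answer: dict, rows: List[dict]) -> List[str]:
    """Return constraint violations for a claimed column total."""
    column = answer.get("column")
    if not rows or column not in rows[0]:
        return [f"unknown column: {column}"]
    expected = sum(row[column] for row in rows)
    if answer.get("total") != expected:
        return [f"total {answer.get('total')} does not match dataset sum {expected}"]
    return []

def generate_until_valid(
    generate: Callable[[List[str]], dict],
    rows: List[dict],
    max_attempts: int = 3,  # assumed cap so the loop always terminates
) -> Tuple[Optional[dict], int]:
    """Call the generator, feeding back violations, until an answer passes."""
    feedback: List[str] = []
    for attempt in range(1, max_attempts + 1):
        answer = generate(feedback)
        feedback = check_against_dataset(answer, rows)
        if not feedback:
            return answer, attempt
    # Every attempt failed: nothing is released downstream.
    return None, max_attempts
```

Integer values are used here so the equality check is exact; a real check on floating-point data would need a tolerance.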

This method is described internally as a “data science mech suit,” referring to the way validation rules wrap around model outputs and constrain behavior. The system also aligns generation behavior with the same rules used for verification.

The company reports that stronger validation can reduce dependence on highly capable models. In some configurations, smaller models can be used while still maintaining reliable output because validation handles correctness enforcement.

According to the company, this also reduces compute requirements and lowers token costs, since smaller models can run on local hardware rather than on large cloud infrastructure.

Enterprise Use Cases and Structured Reliability

The same validation architecture is intended for use beyond data science workflows. Potential applications include accounting systems, compliance tools, and healthcare workflows where structured outputs and traceability are required.

In these settings, even minor errors can lead to operational disruption. Validation systems reduce that risk by checking outputs before they are accepted into downstream processes.

Probably treats model output as untrusted until it passes verification rules. Each response must satisfy structure and consistency checks before it enters production systems.

Elias has said that major AI providers have not prioritized this type of constraint-based reliability system, noting that their incentives often favor repeated usage over fewer correction cycles.

That untrusted-by-default principle guides how the product evaluates, filters, and standardizes responses before they reach downstream systems.
