Technology

Navigating the Challenges of Non-Deterministic AI in Enterprise Software

Published

1 July, 2025

As generative AI technologies become increasingly embedded in software products and workflows, these systems start to mirror the characteristics of large language models (LLMs) themselves. This shift has led to concerns about reliability, as these models are inherently non-deterministic, often producing varied responses to identical inputs. This characteristic is both a feature and a challenge, particularly in enterprise environments where consistency and accuracy are paramount.

The non-deterministic nature of LLMs means that errors can propagate, especially when reasoning models and AI agents are involved. According to Dan Lines, COO of LinearB, “Ultimately, any kind of probabilistic model is sometimes going to be wrong. These kinds of inconsistencies that are drawn from the absence of a well-structured world model are always going to be present at the core of a lot of the systems that we’re working with and systems that we’re reasoning about.”

Understanding the Non-Determinism of LLMs

LLMs are designed to be “dream machines,” capable of generating novel and unexpected outputs. However, when these outputs are factually incorrect, they become problematic. In enterprise software, where reliability is crucial, understanding and mitigating these errors is essential. Daniel Loreto, CEO of Jetify, highlights the difficulty in predicting LLM behavior, emphasizing the need for tools and processes to ensure desired system performance.

Enterprise applications rely heavily on trust, which is built on authorized access, high availability, and idempotency. For GenAI processes, accuracy is an additional critical factor. Tariq Shaukat, CEO of Sonar, notes, “A lot of the real success stories that I hear about are apps that have relatively little downside if it goes down for a couple of minutes or there’s a minor security breach or something like that.”

Addressing Hallucinations and Enhancing Accuracy

One common issue with LLMs is “hallucinations,” where the model generates inaccurate or irrelevant information. Retrieval-augmented generation (RAG) is a typical approach to grounding responses in factual data, yet even RAG systems can falter. Amr Awadallah, CEO of Vectara, points out, “Even when you ground LLMs, 1 out of every 20 tokens coming out might be completely wrong, completely off topic, or not true.”

To mitigate these issues, additional guardrails on prompts and responses are necessary. Maryam Ashoori, Head of Product at watsonx.ai, IBM, explains the importance of filtering on both input and output sides to prevent harmful or inappropriate content from being generated.

Implementing Observability and Monitoring

Observability in LLMs is crucial for understanding and rectifying errors. Abby Kearns, CTO of Alembic, highlights the need for reinventing traditional tooling for machine workloads. While standard software metrics like logs and stack traces provide insights into system performance, LLMs require more nuanced approaches to measure hallucination rates, factual consistency, and bias.

Mark Doble, CEO of Alexi, suggests using multiple models to evaluate outputs, akin to a “LLM-as-judge” approach. This method ensures more reliable outputs by leveraging the collective intelligence of various models.

Ensuring Reliability in AI Workflows

Incorporating determinism into AI workflows is essential for enterprise applications. Jeremy Edberg, CEO of DBOS, emphasizes the importance of durable execution technologies that save progress within workflows, thereby preventing costly failures. “We’ve always had a cost to downtime, right? Now, though, it’s getting much more important because AI is non-deterministic,” he says.

Qian Li, cofounder of DBOS, advocates for checkpointing applications to ensure progress is saved, reducing the need for repeated prompts and minimizing the risk of varied responses.

Ultimately, while LLMs offer powerful capabilities, they also introduce complexity and risk. For personal projects, the non-determinism of AI can be intriguing and even delightful. However, for enterprise software, reliability and trust are non-negotiable. As Raj Patel, AI transformation lead at Holistic AI, aptly puts it, “Trust is key. I think trust takes years to build, seconds to break, and then a fair bit to recover.”

In this article:Dan Lines, LinearB

Lifestyle

Wall Street Zen Upgrades Amerant Bancorp to “Buy” Rating

Shares of **Amerant Bancorp** (NYSE:AMTB) received an upgrade from Wall Street Zen on March 10, 2024, transitioning from a hold rating to a buy...

Editorial30 July, 2025

Sports

UFC Abu Dhabi: Steven Nguyen Sets Knockdown Record Amid Health Concerns for Yahya

The UFC event in Abu Dhabi on July 26, 2025, featured a record-breaking performance from Steven Nguyen, who achieved an unprecedented feat by knocking...

Editorial26 July, 2025

Sports

Aces Dominate Sparks 89-74 with Stellar Performances

The Las Vegas Aces secured a convincing victory over the Los Angeles Sparks, defeating them 89-74 on March 12, 2024, at Crypto.com Arena. This...

Editorial30 July, 2025

Sports

Top Tight Ends to Watch in Fantasy Football 2025

As the 2025 NFL season approaches, fantasy football enthusiasts are gearing up for their drafts, particularly focusing on tight ends. With players like Brock...

Editorial24 July, 2025

California Defies Federal Order to Ban Transgender Athletes in School Sports

California has taken a stand against a federal directive from the Trump administration demanding the exclusion of transgender athletes from girls’ and women’s sports....

Editorial8 July, 2025

Affordable Motorcycle Helmets Under ₹1000: Essential Safety Now

URGENT UPDATE: Affordable motorcycle helmets under ₹1000 are now available for safety-conscious riders across India. With road safety becoming a pressing issue, these helmets...

Editorial17 July, 2025

Tech Giants Invest $41M in Pioneering Carbon Removal with Arbor’s BECCS

Frontier, a coalition of technology leaders including Google and Meta, has announced a landmark investment in Arbor, a cutting-edge startup specializing in bioenergy with...

Editorial9 July, 2025

Entertainment

Olivia Munn Opens Up About Trichotillomania Triggered by Public Scrutiny

Olivia Munn, the acclaimed actress, recently shared an intimate revelation about her personal struggles with trichotillomania, a disorder that compels individuals to pull out...

Editorial30 June, 2025

Sports

HBO Max Unveils Exciting Trailer for Season 2 of ‘Peacemaker’

HBO Max has released the official trailer for the highly anticipated second season of Peacemaker, featuring John Cena in the lead role. The unveiling...

Editorial27 July, 2025

Health

Study Reveals Diet Soda Linked to 38% Higher Diabetes Risk

A recent study led by researchers at Monash University indicates that consuming just one can of diet soda daily increases the risk of developing...

Editorial30 July, 2025

Entertainment

JP Saxe Cancels Fall Tour Due to Low Ticket Sales

Singer-songwriter JP Saxe has announced the cancellation of his upcoming fall tour, citing insufficient ticket sales as the primary reason. The tour was intended...

Editorial6 days ago

Entertainment

Hulk Hogan’s Estrangement from Daughter Brooke Sparks Controversy

Complications in the relationship between wrestling legend Hulk Hogan and his daughter Brooke Hogan have surfaced, with revelations about Hogan’s apparent disinterest in reconnecting....

Editorial5 days ago

Understanding the Non-Determinism of LLMs

Addressing Hallucinations and Enhancing Accuracy

Implementing Observability and Monitoring

Ensuring Reliability in AI Workflows

Trending

Lifestyle

Wall Street Zen Upgrades Amerant Bancorp to “Buy” Rating

Sports

UFC Abu Dhabi: Steven Nguyen Sets Knockdown Record Amid Health Concerns for Yahya

Sports

Aces Dominate Sparks 89-74 with Stellar Performances

Sports

Top Tight Ends to Watch in Fantasy Football 2025

Top Stories

California Defies Federal Order to Ban Transgender Athletes in School Sports

Top Stories

Affordable Motorcycle Helmets Under ₹1000: Essential Safety Now

Top Stories

Tech Giants Invest $41M in Pioneering Carbon Removal with Arbor’s BECCS

You May Also Like

Lifestyle

Wall Street Zen Upgrades Amerant Bancorp to “Buy” Rating

Sports

UFC Abu Dhabi: Steven Nguyen Sets Knockdown Record Amid Health Concerns for Yahya

Sports

Aces Dominate Sparks 89-74 with Stellar Performances

Sports

Top Tight Ends to Watch in Fantasy Football 2025

Top Stories

California Defies Federal Order to Ban Transgender Athletes in School Sports

Top Stories

Affordable Motorcycle Helmets Under ₹1000: Essential Safety Now

Top Stories

Tech Giants Invest $41M in Pioneering Carbon Removal with Arbor’s BECCS

Entertainment

Olivia Munn Opens Up About Trichotillomania Triggered by Public Scrutiny

Sports

HBO Max Unveils Exciting Trailer for Season 2 of ‘Peacemaker’

Health

Study Reveals Diet Soda Linked to 38% Higher Diabetes Risk

Entertainment

JP Saxe Cancels Fall Tour Due to Low Ticket Sales

Entertainment

Hulk Hogan’s Estrangement from Daughter Brooke Sparks Controversy