Neurosymbolic AI May Be the Answer to Hallucination in Large Language Models

5 June 2025

The main problem with big tech’s experiment with artificial intelligence is not that it could take over humanity. It’s that large language models (LLMs) like OpenAI’s ChatGPT, Google’s Gemini, and Meta’s Llama continue to get things wrong, and the problem is intractable.

These errors are known as hallucinations. Perhaps the most prominent example was the case of US law professor Jonathan Turley, who was falsely accused of sexual harassment by ChatGPT in 2023.

OpenAI’s solution seems to have been to basically “disappear” Turley by programming ChatGPT to say it can’t respond to questions about him, which is clearly not a fair or satisfactory solution. Trying to solve hallucinations after the event and case by case is clearly not the way to go.

The same can be said of LLMs amplifying stereotypes or giving western-centric answers. There’s also a total lack of accountability in the face of this widespread misinformation, since it’s difficult to ascertain how the LLM reached a given conclusion in the first place.

We saw a fierce debate about these problems after the 2023 release of GPT-4, the most recent major paradigm in OpenAI’s LLM development. Arguably the debate has cooled since then, though without justification.

The EU passed its AI Act in record time in 2024, for instance, in a bid to be world leader in overseeing this field. But the act relies heavily on AI companies regulating themselves without really addressing the issues in question. It hasn’t stopped tech companies from releasing LLMs worldwide to hundreds of millions of users and collecting their data without proper scrutiny.

Meanwhile, the latest tests indicate that even the most sophisticated LLMs remain unreliable. Despite this, the leading AI companies still resist taking responsibility for errors.

Unfortunately, LLMs’ tendencies to misinform and reproduce bias can’t be solved with gradual improvements over time. And with the advent of agentic AI, where users will soon be able to assign projects to an LLM such as, say, booking their holiday or optimizing the payment of all their bills each month, the potential for trouble is set to multiply.

The emerging field of neurosymbolic AI could solve these issues, while also reducing the enormous amounts of data required for training LLMs. So what is neurosymbolic AI and how does it work?

The LLM Problem

LLMs work using a technique called deep learning, where they’re given vast amounts of text data and use advanced statistics to infer patterns that determine what the next word or phrase in any given response should be. Each model, along with all the patterns it has learned, is a neural network that is stored and run on arrays of powerful computers in large data centers.
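To make the idea of predicting the next word from statistics concrete, here is a deliberately tiny sketch. It is not how a real LLM is built: real models use neural networks over tokens rather than a lookup table of word counts, and the corpus and function names below are invented for illustration. It only shows that a purely statistical predictor returns whatever followed most often in its training text, with no notion of whether the result is true.

```python
from collections import Counter, defaultdict

def train_bigram(text):
    """Count how often each word is followed by each other word."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Hypothetical toy corpus, chosen only for this example.
corpus = "the cat sat on the mat . the cat ate the fish . the cat slept"
model = train_bigram(corpus)

print(predict_next(model, "the"))  # the follower seen most often after "the"
print(predict_next(model, "dog"))  # a word the model has never seen
```

The predictor answers confidently for any word it has seen and has nothing to say about one it has not, which is the gap between probability and understanding that the article goes on to describe.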

LLMs can appear to reason using a process called chain-of-thought, where they generate multi-step responses that mimic how humans might logically arrive at a conclusion, based on patterns seen in the training data.

Undoubtedly, LLMs are a great engineering achievement. They’re impressive at summarizing text and translating, and may improve the productivity of those diligent and knowledgeable enough to spot their mistakes. Nevertheless, they have great potential to mislead because their conclusions are always based on probabilities, not understanding.

A popular workaround is called human-in-the-loop: making sure that humans using AIs still make the final decisions. However, apportioning blame to humans doesn’t solve the problem. They’ll still often be misled by misinformation.

LLMs now need so much training data to advance that we’re having to feed them synthetic data, meaning data created by LLMs. This data can copy and amplify existing errors from its own source data, such that new models inherit the weaknesses of old ones. As a result, the cost of programming AI models to be more accurate after their training, known as post-hoc model alignment, is skyrocketing.

It also becomes increasingly difficult for programmers to see what’s going wrong because the number of steps in the model’s thought process becomes ever larger, making it harder and harder to correct for errors.

Neurosymbolic AI combines the predictive learning of neural networks with teaching the AI a series of formal rules that humans learn in order to deliberate more reliably. These include logic rules, like “if a then b”, which, for example, would help an algorithm learn that “if it’s raining then everything outside is normally wet”; mathematical rules, like “if a = b and b = c then a = c”; and the agreed upon meanings of things like words, diagrams, and symbols. Some of these will be inputted directly into the AI system, while it will deduce others itself by analyzing its training data and performing “knowledge extraction.”
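The two kinds of rule quoted above can be written down in a few lines. The sketch below is only an illustration of symbolic rule application in general, not the mechanism of any particular neurosymbolic system; the fact names, the second rule about slippery pavement, and both function names are assumptions made for the example.

```python
def forward_chain(facts, rules):
    """Apply "if premises then conclusion" rules until no new facts appear."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if set(premises) <= known and conclusion not in known:
                known.add(conclusion)
                changed = True
    return known

def are_equal(equalities, x, y):
    """Decide whether x = y follows from stated equalities by symmetry and transitivity."""
    reachable = {x}
    frontier = [x]
    while frontier:
        node = frontier.pop()
        for a, b in equalities:
            for src, dst in ((a, b), (b, a)):
                if src == node and dst not in reachable:
                    reachable.add(dst)
                    frontier.append(dst)
    return y in reachable

# Logic rules of the form "if a then b".
rules = [
    (("raining",), "outside_is_wet"),
    (("outside_is_wet",), "pavement_is_slippery"),  # hypothetical extra rule
]
derived = forward_chain({"raining"}, rules)
print(sorted(derived))

# Mathematical rule: if a = b and b = c then a = c.
print(are_equal([("a", "b"), ("b", "c")], "a", "c"))
print(are_equal([("a", "b"), ("b", "c")], "a", "d"))
```

Unlike a statistical guess, every derived fact here can be traced back to the rules and starting facts that produced it.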

This should create an AI that will never hallucinate and will learn faster and smarter by organizing its knowledge into clear, reusable parts. For example, if the AI has a rule about things being wet outside when it rains, there’s no need for it to retain every example of the things that might be wet outside. The rule can be applied to any new object, even one it has never seen before.

During model development, neurosymbolic AI also integrates learning and formal reasoning using a process known as the neurosymbolic cycle. This involves a partially trained AI extracting rules from its training data, then instilling this consolidated knowledge back into the network before further training with data.
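As a rough picture of that loop of learning, extracting rules, and instilling them, consider the toy sketch below. It makes large simplifying assumptions: the “network” is just a table of counts, “extraction” promotes any pattern above a hypothetical support and confidence threshold to a rule, and “instilling” means the rule is consulted before the statistics rather than being trained into network weights. All names and thresholds are invented for illustration.

```python
from collections import Counter, defaultdict

def learn(counts, observations):
    """Statistical step: update counts from (condition, outcome) pairs."""
    for condition, outcome in observations:
        counts[condition][outcome] += 1

def extract_rules(counts, min_support=5, min_confidence=0.9):
    """Symbolic step: promote highly reliable patterns to explicit if-then rules."""
    rules = {}
    for condition, outcomes in counts.items():
        total = sum(outcomes.values())
        outcome, hits = outcomes.most_common(1)[0]
        if total >= min_support and hits / total >= min_confidence:
            rules[condition] = outcome
    return rules

def predict(counts, rules, condition):
    """Prefer an explicit rule; otherwise fall back to the most frequent outcome."""
    if condition in rules:
        return rules[condition], "rule"
    outcomes = counts.get(condition)
    if outcomes:
        return outcomes.most_common(1)[0][0], "statistics"
    return None, "unknown"

counts = defaultdict(Counter)
rules = {}
batches = [
    [("raining", "wet")] * 9 + [("raining", "dry")],    # reliable pattern
    [("cloudy", "wet")] * 2 + [("cloudy", "dry")] * 3,  # no reliable pattern
]
for batch in batches:
    learn(counts, batch)                 # train on a batch of data
    rules.update(extract_rules(counts))  # extract rules, keep them for the next round

print(predict(counts, rules, "raining"))
print(predict(counts, rules, "cloudy"))
print(predict(counts, rules, "snowing"))
```

Each prediction reports whether it came from an explicit rule or from raw statistics, which hints at why such a system could be easier to audit.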

This is more energy efficient because the AI needn’t store as much data, while the AI is more accountable because it’s easier for a user to control how it reaches particular conclusions and improves over time. It’s also fairer because it can be made to follow pre-existing rules, such as: “For any decision made by the AI, the outcome must not depend on a person’s race or gender.”
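One way to read the fairness rule quoted above is as a test that can be run against any decision procedure: change only the protected attributes and check that the outcome stays the same. The sketch below assumes a hypothetical applicant record, placeholder attribute values, and two made-up decision functions. It illustrates the rule as a check and is not a description of how any deployed system enforces it.

```python
from itertools import product

# Hypothetical protected attributes and placeholder values, for illustration only.
PROTECTED = {"race": ["A", "B"], "gender": ["F", "M"]}

def violates_rule(decide, applicant):
    """Return True if changing only protected attributes changes the decision."""
    keys = list(PROTECTED)
    outcomes = set()
    for values in product(*(PROTECTED[k] for k in keys)):
        variant = {**applicant, **dict(zip(keys, values))}
        outcomes.add(decide(variant))
    return len(outcomes) > 1

def fair_decide(applicant):
    """Made-up decision that ignores protected attributes."""
    return applicant["income"] >= 30000

def biased_decide(applicant):
    """Made-up decision that wrongly depends on gender."""
    return applicant["income"] >= 30000 and applicant["gender"] == "M"

applicant = {"income": 40000, "race": "A", "gender": "F"}
print(violates_rule(fair_decide, applicant))
print(violates_rule(biased_decide, applicant))
```

A check like this only covers direct dependence on the listed attributes; it would not catch a decision that relies on a proxy such as a postcode.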

The Third Wave

The first wave of AI in the 1980s, known as symbolic AI, was actually based on teaching computers formal rules that they could then apply to new information. Deep learning followed as the second wave in the 2010s, and many see neurosymbolic AI as the third.

It’s easiest to apply neurosymbolic principles to AI in niche areas, because the rules can be clearly defined. So, it’s no surprise that we’ve seen it first emerge in Google’s AlphaFold, which predicts protein structures to help with drug discovery; and AlphaGeometry, which solves complex geometry problems.

For more broad-based AI models, China’s DeepSeek uses a learning technique called “distillation,” which is a step in the same direction. But to make neurosymbolic AI fully feasible for general models, there still needs to be more research to refine their ability to discern general rules and perform knowledge extraction.

It’s unclear to what extent LLM makers are working on this already. They certainly sound like they’re heading in the direction of trying to teach their models to think more cleverly, but they also seem wedded to the need to scale up with ever larger amounts of data.

The reality is that if AI is going to keep advancing, we will need systems that adapt to novelty from only a few examples, that check their understanding, that can multitask and reuse knowledge to improve data efficiency, and that can reason reliably in sophisticated ways.

This way, well-designed digital technology could potentially even offer an alternative to regulation, because the checks and balances would be built into the architecture and perhaps standardized across the industry. There’s a long way to go, but at least there’s a path ahead.

This article is republished from The Conversation under a Creative Commons license. Read the original article.