
GTO Wizard AI Surpasses GPT-5.3 and Grok 4 in Poker Benchmark

By Mrinal Gujare

15 Apr 2026


GTO Wizard AI outperforms both legacy poker bots and general-purpose AI models.
Originally developed as Ruse AI, it adapts its decisions in real time.
Its strength is demonstrated by a high win rate, measured with the variance-reducing AIVAT system.

Image Credit: GTO Wizard

GTO Wizard AI has emerged as the leading poker intelligence model, outperforming both legacy bots and top general AI systems.

In the fast-moving intersection of poker and artificial intelligence, one question continues to dominate: when will machines consistently outperform human players at scale? A new benchmark suggests that moment is no longer theoretical.

The journey began in 2019 when Pluribus defeated elite human professionals, marking the first time an AI system outperformed humans in complex multiplayer poker. Since then, competition has intensified.

In a large-scale AI showdown involving nearly 4,000 hands, multiple models competed, with OpenAI o3 emerging as the top performer while others struggled to maintain profitability.

Today, that benchmark has been surpassed.

GTO Wizard AI represents a shift from traditional poker bots to adaptive, real-time decision engines. Originally developed as Ruse AI by Canadian programmers Marc-Antoine Provost and Philippe Beardsell, the technology was acquired by GTO Wizard in 2023 and refined into a high-performance poker agent.

Unlike earlier systems such as Slumbot, which relied on pre-computed strategies, GTO Wizard AI learns dynamically. It was trained through hundreds of millions of self-play hands, optimizing decisions based on expected value rather than fixed strategy libraries.

Using deep reinforcement learning, the model evaluates each situation in real time, solving complex poker scenarios within seconds rather than relying on static solutions.

Head-to-Head: GTO Wizard AI vs Slumbot

To validate its performance, GTO Wizard AI faced Slumbot in a controlled 150,000-hand match. The results were decisive.

GTO Wizard AI delivered a win rate of 19.4 bb per 100 hands. For perspective, elite human professionals typically target around 5 bb per 100 hands.

At $50/$100 blinds and 200 hands per hour, this translates to roughly $19.40 in profit per hand, or an hourly win rate of about $3,880. The gap highlights not just superiority but a structural advantage in decision-making efficiency.
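The conversion from big blinds to dollars can be checked with a short calculation. This is a minimal sketch, assuming the quoted stakes mean $50/$100 blinds (a $100 big blind); the function name is illustrative and not taken from any GTO Wizard tool.

```python
def win_rate_to_dollars(bb_per_100: float, big_blind: float, hands_per_hour: int):
    """Convert a win rate in big blinds per 100 hands into dollar figures."""
    per_hand = bb_per_100 / 100 * big_blind   # dollars won per hand on average
    per_hour = per_hand * hands_per_hour      # dollars won per hour on average
    return per_hand, per_hour


# Figures from the article: 19.4 bb/100, assumed $100 big blind, 200 hands/hour.
per_hand, per_hour = win_rate_to_dollars(19.4, 100.0, 200)
print(f"${per_hand:.2f} per hand, ${per_hour:,.0f} per hour")
```

The same function applied to 5 bb per 100 hands, the figure cited for elite professionals, gives a direct comparison at identical stakes.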

Benchmarking Against General AI Models

Beyond legacy poker bots, GTO Wizard AI was tested against leading general-purpose AI models to evaluate whether broad reasoning systems could compete in specialized domains.

The results were clear. While general models have advanced significantly, they remain outmatched in high-level poker strategy.

GPT-5.3: -16.0 bb per 100 hands
Claude Opus 4.6: -20.4 bb per 100 hands
Gemini 3.1 Pro: -30.8 bb per 100 hands
Grok 4: -60 bb per 100 hands

Despite strong reasoning capabilities, these models lack the domain-specific optimization required to compete with specialized agents.

Eliminating Variance with AIVAT

Poker outcomes are heavily influenced by variance, making short-term results unreliable. To address this, the benchmark used AIVAT, a variance-reduction system that adjusts for luck.

AIVAT reduces the number of hands required for statistically meaningful conclusions by up to a factor of ten. This allows faster and more accurate evaluation of true performance, so that results reflect skill rather than short-term luck.
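Why lower variance means fewer hands follows from the standard sample-size formula for a confidence interval: the hands required grow with the square of the per-hand standard deviation. The sketch below is not AIVAT itself. It only illustrates that scaling, and the standard deviation of 10 bb per hand is a hypothetical placeholder rather than a measured value.

```python
import math


def hands_needed(std_bb_per_hand: float, margin_bb_per_100: float, z: float = 1.96) -> int:
    """Hands needed so the confidence interval on the win rate has the given
    half-width (in bb per 100 hands), using n = (z * sigma / margin)^2."""
    margin_per_hand = margin_bb_per_100 / 100
    return math.ceil((z * std_bb_per_hand / margin_per_hand) ** 2)


raw_std = 10.0                           # hypothetical per-hand std dev, in bb
reduced_std = raw_std / math.sqrt(10)    # variance cut tenfold (std cut by sqrt(10))

raw_hands = hands_needed(raw_std, margin_bb_per_100=5.0)
reduced_hands = hands_needed(reduced_std, margin_bb_per_100=5.0)
print(raw_hands, reduced_hands, round(raw_hands / reduced_hands, 2))
```

Under these assumptions, a tenfold cut in variance produces roughly a tenfold cut in the hands needed for the same precision.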

Open Access: API for Competitive Evaluation

To promote transparency and competition, GTO Wizard has launched API access for developers and researchers. This enables external models to be tested directly against its system under standardized conditions.

Participants must play a minimum of 2,500 hands of Heads-Up No-Limit Hold'em using 200 big blind stacks that reset each hand.

Usage is capped at 100,000 hands per month, ensuring controlled and fair benchmarking. The API allows simulation and result retrieval while keeping the solver’s internal logic protected.
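The participation limits can be expressed as a simple pre-flight check. This is a hypothetical local helper, not part of the GTO Wizard API: the class and field names are invented for illustration, and it assumes the 2,500-hand minimum applies to each evaluation run.

```python
from dataclasses import dataclass

# Limits quoted in the article. Stacks are 200 big blinds and reset each hand.
MIN_HANDS = 2_500        # minimum hands for an evaluation
MONTHLY_CAP = 100_000    # maximum hands per month


@dataclass
class EvaluationPlan:
    """Hypothetical local record of a planned benchmark run (not an API type)."""
    hands_planned: int
    hands_already_played_this_month: int = 0

    def problems(self) -> list[str]:
        """Return the rule violations; an empty list means the plan fits."""
        issues = []
        if self.hands_planned < MIN_HANDS:
            issues.append(f"needs at least {MIN_HANDS} hands")
        total = self.hands_already_played_this_month + self.hands_planned
        if total > MONTHLY_CAP:
            issues.append(f"exceeds the monthly cap of {MONTHLY_CAP} hands")
        return issues


print(EvaluationPlan(hands_planned=1_000).problems())
print(EvaluationPlan(hands_planned=5_000, hands_already_played_this_month=98_000).problems())
print(EvaluationPlan(hands_planned=5_000).problems())
```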

What Comes Next for AI in Poker

The next phase of benchmarking is expected to expand into Heads-Up Pot-Limit Omaha, a more complex variant that introduces additional strategic depth.

The broader implication is clear. General AI models are improving rapidly, but specialized systems still dominate in environments that demand precision, adaptation, and deep strategic optimization.

The era of unverified claims in AI performance is fading. Measurable results, standardized benchmarks, and transparent competition are now defining leadership in the field.
