Silicon Poker Logic: When High-IQ Models Make Rookie Mistakes

Samantha Doyle 05 Feb 2026
  • OpenAI's o3 and GPT 5.2 met in the final of an AI poker showdown, both playing hyper-aggressive styles.
  • Models showed 'sunk cost' reasoning that distorted their decisions.
  • The lesson for AI poker: models still need to tell a draw from a made hand.
There’s an old saying that in a poker game, if you can’t spot the sucker at the table, it’s you. Yesterday, during the final match of the Google DeepMind/Kaggle Game Arena, the suckers and the sharks were all made of silicon. 

It was an internal OpenAI civil war as o3 and GPT 5.2 battled for the top spot. While these models have the raw processing power to outthink any human, their logic at the poker table showed that they still have some very human-like "bugs" to work out.

Renowned pro Doug Polk, who provided commentary on the event, noted that while the top-tier models were impressively aggressive, they often struggled with one of the most basic concepts of the game: when to fold.

The "Sunk Cost" Trap and Hallucinated Draws

One of the most telling moments of the finals came when o3 attempted to justify an all-in shove that Polk found questionable. The model’s reasoning was a classic example of the sunk cost fallacy: it claimed that it couldn't fold because it had already "invested" too many chips in the pot. As Polk pointed out, this is a fundamental error in poker logic; once chips are in the pot, they no longer belong to you.
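
Polk's point can be made concrete with a little arithmetic. The following is a minimal sketch, assuming a simplified call-or-fold decision in which only the current pot, the price of the call, and an estimated chance of winning matter. The function names and the chip amounts are illustrative and do not come from the match.

```python
def call_ev(pot: float, to_call: float, equity: float) -> float:
    """Expected value of calling, in chips, measured against folding (EV 0).

    pot     -- chips already in the middle, including the opponent's bet
    to_call -- chips we would have to add to continue
    equity  -- our estimated probability of winning at showdown (0..1)
    """
    # Win the pot with probability `equity`, lose the call otherwise.
    return equity * pot - (1.0 - equity) * to_call


def should_call(pot: float, to_call: float, equity: float,
                already_invested: float = 0.0) -> bool:
    # `already_invested` is accepted only to show that it plays no role:
    # those chips are part of `pot` now, whoever put them there.
    return call_ev(pot, to_call, equity) > 0


# Hypothetical spot: 1,000 in the pot, 500 to call, 20% chance to win.
small_stake = should_call(1000, 500, 0.20, already_invested=50)
large_stake = should_call(1000, 500, 0.20, already_invested=900)
```

In this sketch the answer is the same whether the player put 50 or 900 chips into the pot earlier. With 500 to call into 1,000, the call needs to win more than one time in three to show a profit, and 20% falls short of that.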

The lower-tier models, like Grok 4.1 and GPT-5 mini, fared even worse. In one baffling hand, both models shoved their entire stacks into the middle on a board that offered neither of them a pair or a legitimate draw. The reasoning? 

One model believed it held the nut flush draw, while the other was convinced it actually had the flush. It turns out that even the world's most advanced AI can't win if it can't tell the difference between a draw and a made hand.
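
The distinction the models missed is mechanical enough to check in a few lines. Here is a rough sketch, assuming Texas Hold'em and cards written as rank plus suit (for example "Ah" for the ace of hearts). The `flush_status` helper is hypothetical, it ignores whether a draw is to the nuts, and the sample hands are made up rather than taken from the broadcast.

```python
from collections import Counter


def flush_status(hole: list[str], board: list[str]) -> str:
    """Classify a Hold'em hand as a made flush, a flush draw, or neither."""
    # The suit is the last character of each card string.
    suits = Counter(card[-1] for card in hole + board)
    best = max(suits.values())
    if best >= 5:
        return "made flush"
    # Four to a suit is only a draw while board cards are still to come.
    if best == 4 and len(board) < 5:
        return "flush draw"
    return "no flush"


draw = flush_status(["Ah", "Kh"], ["2h", "7h", "9c"])
made = flush_status(["Ah", "Kh"], ["2h", "7h", "9h"])
nothing = flush_status(["Ah", "Kd"], ["2c", "7s", "9h"])
```

Four hearts on the flop is a draw that still needs help, five is a finished hand, and two is neither, which is roughly the situation described in the hand above.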

Standings: The Aggression Hierarchy


Rank  Model                       Status          Style
1     o3 (OpenAI)                 Winner          Hyper-Aggressive
2     GPT 5.2 (OpenAI)            Runner-Up       Hyper-Aggressive
3     Gemini 3 Pro (Google)       Semi-Finalist   Balanced
4     Claude Sonnet (Anthropic)   Semi-Finalist   Conservative
5     Grok 4.1 (xAI)              Early Exit      Erratic

