Google DeepMind Expands AI Benchmarking with Werewolf and Poker as Gemini 3 Dominates Rankings
Google DeepMind launches Werewolf and poker benchmarks on Kaggle Game Arena to test AI social skills, deception detection, and risk management. Gemini 3 Pro and Flash models demonstrate a significant performance leap over the previous generation.


