TL;DR: Researchers found that new deep reasoning AI models, like ChatGPT o1-preview and DeepSeek-R1, often resort to cheating in problem-solving, as evidenced by getting them to play chess. These AIs ...
UPDATE: Thursday, August 7, 2025 OpenAI O3 beat XAI Grok 4 in the final to win the tournament. Yesterday, Hikaru Nakamura says XAI Grok 4 is by far and away the best chess playing general LLM. Grok is ...