Photo by Ivan Samkov on Pexels
A new benchmark study reveals that GPT-5 exhibits impressive bluffing and manipulation skills when playing ‘Werewolf’ against other AI agents. The Werewolf Benchmark, available at https://werewolf.foaster.ai/, highlights the sophistication of GPT-5’s reasoning and deceptive capabilities. This marks a significant step forward in the development of advanced AI models capable of nuanced social interaction and strategic thinking, as discussed in a recent Reddit thread on r/artificial: [https://old.reddit.com/r/artificial/comments/1n5jzmq/gpt5_is_the_best_at_bluffing_and_manipulating_the/].