From 5127ede356ab0f8d0881b7fb14604d3a4fd8914a Mon Sep 17 00:00:00 2001 From: stephanebonnevay <stephane.bonnevay@lizeo-group.com> Date: Thu, 5 Jun 2025 07:39:13 +0200 Subject: [PATCH] Readme --- README.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 825f4bd..dfac790 100644 --- a/README.md +++ b/README.md @@ -429,10 +429,9 @@ For our experiments, we consider two simple models for the opponent where: We evaluate the models' ability to identify these behavioural patterns by calculating the average number of points earned per round. -Figures present the average points earned per round and the 95% confidence interval for each LLM against the two opponent behavior -models in the matching pennies game, whether the LLM generates a strategy or one-shot actions. +Figures present the average points earned and prediction per round (95% confidence interval) for each LLM against the two opponent behavior (constant and alternate) models in the matching pennies game. -... +Against Constant behavior, <tt>GPT-4.5</tt> and <tt>Qwen3</tt> ...   -- GitLab