Readme

5127ede3 · stephanebonnevay · 88756456 · 5127ede3
Commit 5127ede3 authored 2 months ago by stephanebonnevay
--- a/README.md
+++ b/README.md
@@ -429,10 +429,9 @@ For our experiments, we consider two simple models for the opponent where:

 We evaluate the models' ability to identify these behavioural patterns by calculating the average number of points earned per round.

-Figures present the average points earned per round and the   95% confidence interval for each LLM against the two opponent behavior  
-models in the matching pennies game, whether the LLM generates a strategy or one-shot actions. 
+Figures present the average points earned and prediction per round (95% confidence interval) for each LLM against the two opponent behavior (constant and alternate) models in the matching pennies game. 

-...
+Against Constant behavior, <tt>GPT-4.5</tt> and <tt>Qwen3</tt> ...

 ![Prediction Accuracy per Round by Actions Against Constant Behaviour (with 95% Confidence Interval)](figures/mp/mp_prediction_ConstHT.svg)
 ![Points Earned per Round by Actions Against Constant Behaviour (with 95% Confidence Interval)](figures/mp/mp_payoff_ConstHT.svg)