diff --git a/README.md b/README.md index b6e082a415505fe4afe512d10daff146da61c12c..97f914d8d2b33d2b8e64a08da46108f5b9841a32 100644 --- a/README.md +++ b/README.md @@ -147,8 +147,6 @@ The results indicate how well each model performs under each belief type. | llama3 | 1.00 | 0.90 | 0.17 | | deepseek-r1 | 0.83 | 0.57 | 0.60 | -Here’s a refined version of your text: - GPT-4.5 achieves a perfect score across all belief types, demonstrating an exceptional ability to take rational decisions, even in the implicit belief condition. Mistral-Small consistently outperforms the other open-weight models across all belief types.