LLM4AAMAS: Add AGENTBENCH

Commit 74cd9dbf authored 5 months ago by Maxime MORGE
--- a/README.md
+++ b/README.md
@@ -178,6 +178,15 @@ Many models are available at the following URLs:
   homo silicus?](https://www.nber.org/papers/w31122)** Horton, J. J. (2023).
   National Bureau of Economic Research.   

+- **[AgentBench: Evaluating LLMs as
+  Agents](https://openreview.net/forum?id=zAdUB0aCTQ)**. Xiao Liu et al. Poster.
+  Proc. of 12th International Conference on Learning Representations (ICLR),
+  Vienna, Austria, May 7-11, 2024.
+
+    AGENTBENCH is a systematically designed, multi-dimensional, evolving
+    benchmark for evaluating LLMs as agents. It reveals a significant performance
+    gap between top commercial LLMs and their open-source (OSS) competitors.
+

 ### Generative Autonomous Agents on the shelf