diff --git a/README.md b/README.md index f204cb03be1f28d354cdb22286b425404af496a7..8679ef9ada2d7124eda09cbd929e4a7670c06c5a 100644 --- a/README.md +++ b/README.md @@ -257,6 +257,21 @@ experimental economics or social psychology. homo silicus?](https://www.nber.org/papers/w31122)** Horton, J. J. (2023). National Bureau of Economic Research. +LLMs, notably GPT-4 using ToT prompt, can simulate simple auction experiments in +line with theoretical expectations + +- *[The nuances of large-language-model-agent performance in simple English + auctions](https://www.academia.edu/download/112356998/13_231004_2023_Jan_Reg_Nuances_of_LLM_Performance_English_Auctions_Parady_USA_Published.pdf)** + Lamichhane, B., Palardy, J., & Singh, A. K. (2023). Empirical Economics + Letters,2(1). + +Generative consultants as economic agent with limited agency. + +- **[Generative AI as Economic + Agents](https://doi.org/10.1145/3699824.3699832)** Immorlica, N., Lucier, + B., & Slivkins, A. (2024). SIGecom Exch., 22(1), 93–109. ACM, New York, NY, + USA. + AGENTBENCH is a systematically designed multi-dimensional evolving benchmark for evaluating LLMs as agents which measures a significant performance gap between these top-tier models and their OSS competitors.