-
Maxime MORGE authored78876330
LLM4AAMAS
Generative Autonomous Agents and Multi-Agent Systems (AAMAS) offer promising opportunities for solving problems in open environments and simulating complex social dynamics.
This repository contains a collection of papers and ressources related to generative AAMAS. This list is a work in progress and will be regularly updated with new resources.
Artificial Intelligence
-
Intelligence artificielle : une approche moderne (4e édition) Stuart Russell, Peter Norvig, Fabrice Popineau, Laurent Miclet, Claire Cadet (2021) Publisher: Pearson France
-
Apprentissage artificiel - 3e édition : Deep learning, concepts et algorithmes Antoine Cornuéjols, Laurent Miclet, Vincent Barra (2018) Publisher: Eyrolles
Neural networks (RNN, Transformers)
-
Learning representations by back-propagating errors David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams (1986) Published in Nature
-
ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton (2012) Presented at NeurIPS
Large Language Models
-
A Survey of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, et al. (2024) Published on arXiv
-
Large Language Model based Multi-Agents: A Survey of Progress and Challenges Taicheng Guo et al. (2024) Published on arXiv arXiv:2402.01680 [cs.CL]
-
Improving language understanding by generative pre-training Alec Radford, Karthik Narasimhan, Tim Salimans, Ilya Sutskever (2018) Published by OpenAI
-
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova (2019) Presented at NAACL-HLT
-
Sequence to Sequence Learning with Neural Networks Ilya Sutskever, Oriol Vinyals, Quoc V. Le (2014) Published on arXiv
-
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, et al. (2014) Published on arXiv
Tuning
Instruction tuning
-
LoRA: Low-Rank Adaptation of Large Language Models Edward J. Hu, Yelong Shen, Phillip Wallis, et al. (2021) Published on arXiv
-
Language Models are Few-Shot Learners Tom Brown, Benjamin Mann, Nick Ryder, et al. (2020) Presented at NeurIPS
Alignement tuning
- Training language models to follow instructions with human feedback Long Ouyang, Jeffrey Wu, Xu Jiang, et al. (2022) Presented at NeurIPS
Existing LLMs
Many models are available at the following URLs: https://ollama.com, https://www.nomic.ai/gpt4all and https://huggingface.co/models.
-
GPT-4 Technical Report OpenAI Team (2024) Published on arXiv
-
The Llama 3 Herd of Models Meta Team (2024) Published on arXiv
-
Stanford Alpaca: An Instruction-Following LLaMa Model Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, Yann Dubois, et al. (2023) Published on GitHub
-
Gemma 2: Improving Open Language Models at a Practical Size Google AI Team (2024) Published on arXiv
-
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning DeepSeek-AI (2025) Published on arXiv
-
Mixtral of Experts Mistral AI team (2024) Published on arXiv
-
Mistral 7B Mistral AI team (2023) Published on arXiv
-
The Lucie-7B LLM and the Lucie Training Dataset: Open Resources for Multilingual Language Generation Olivier Gouvert, Julie Hunter, Jérôme Louradour, Evan Dufraisse, Yaya Sy, Pierre-Carl Langlais, Anastasia Stasenko, Laura Rivière, Christophe Cerisara, Jean-Pierre Lorré (2025)
Prompt engineering
ICL
- A Survey on In-context Learning Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui (2024) Presented at the Conference on Empirical Methods in Natural Language Processing (EMNLP) Location: Miami, Florida, USA Published by: Association for Computational Linguistics
CoT
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, et al. (2022) Presented at NeurIPS
RAG
- Retrieval-Augmented Generation for Large Language Models: A Survey Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang (2024) Published on arXiv
Generative Autonomous Agents
-
A Survey on Large Language Model Based Autonomous Agents Lei Wang, Chen Ma, Xueyang Feng, Zeyu Zhang, Hao Yang, Jingsen Zhang, Zhiyuan Chen, Jiakai Tang, Xu Chen, Yankai Lin, Wayne Xin Zhao, Zhewei Wei, Jirong Wen (2024)* Published in Frontiers of Computer Science (Volume 18, Issue 6, Pages 186345) Publisher: Springer
-
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang (2023) Presented at Advances in Neural Information Processing Systems (NeurIPS) Pages: 38154–38180 Publisher: Curran Associates, Inc. Volume: 36
-
Toolformer: Language Models Can Teach Themselves to Use Tools Timo Schick, Jane Dwivedi-Yu, Roberto Dessi, Roberta Raileanu, et al. (2023) Presented at NeurIPS
-
Cognitive Architectures for Language Agents Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths (2024) Published on arXiv
Generative Autonomous Agents on the shelf
-
LangChain is an open-source framework for designing prompts for LLMs. It can be used to define high-level reasoning sequences, conversational agents, RAGs (Retrieval-Augmented Generation), document summaries, or even the generation of synthetic data.
-
LangGraph is a low-level library for the design of cognitive architecture for autonomous agents, whose reasoning engine is an LLM.
-
AutoGPT is a platform for the creation, deployment, and management of generative agents.
-
WorkGPT is similar to AutoGPT
Generative MAS
-
Large language models empowered agent-based modeling and simulation: A survey and perspectives *Chen Gao, Xiaochong Lan, Nian Li, Yuan Yuan, Jingtao Ding, Zhilun Zhou, Fengli Xu, Yong Li (2024) Published in Humanities and Social Sciences Communications, Volume 11, Issue 1, Pages 1–24. The repository.
-
Social Simulacra: Creating Populated Prototypes for Social Computing Systems Joon Sung Park, Lindsay Popowski, Carrie Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein (2022) Published in Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology Articleno: 74, Pages: 18, Location: Bend, OR, USA
-
Generative Agents: Interactive Simulacra of Human Behavior Joon Sung Park, Joseph O'Brien, Carrie Jun Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein (2023) Published in Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology Articleno: 2, Pages: 22, Location: San Francisco, CA, USA, Series: UIST '23
-
Agentverse: Facilitating multi-agent collaboration and exploring emergent behaviors Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, et al. (2023) Published in The Twelfth International Conference on Learning Representations (ICLR 2023)
-
Training socially aligned language models on simulated social interactions Ruibo Liu, Ruixin Yang, Chenyan Jia, Ge Zhang, Denny Zhou, Andrew M. Dai, Diyi Yang, Soroush Vosoughi (2023) Published on arXiv arXiv:2305.16960
-
S3: Social-network Simulation System with Large Language Model-Empowered Agents Chen Gao, Xiaochong Lan, Zhihong Lu, Jinzhu Mao, Jinghua Piao, Huandong Wang, Depeng Jin, Yong Li (2023) Published on arXiv arXiv:2307.14984