This repo records my reading history. I will also jot down occasional thoughts on the papers.
- [Chain-of-code]
- CogAgent: A Visual Language Model for GUI Agents (Tsinghua, multi-modal agent)
- WebArena: A Realistic Web Environment for Building Autonomous Agents
- [A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition]
- Tool-Augmented Reward Modeling (teaches the reward model to use tools)
- ViperGPT: Visual Inference via Python Execution for Reasoning (CV)
- Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum (AAAI 2024)