Feature/multi agent framework #329

Keyu-He · 2025-08-24T07:12:26Z

📑 Description

Add scenario examples from NegotiationArena and implement comprehensive multi-agent support for 3+ agents.

Multi-agent core implementation (MultiAgentSotopiaEnv, MultiAgentBackground)
Enhanced UniformSampler for 3+ agent detection
NegotiationArena scenarios integration

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed
Branch name follows type/descript (e.g. feature/add-llm-agents)
Ready for code review

ℹ Additional Information

Test Coverage: 3-agent auction scenario included in examples/experimental/multi_agent_tests/

Refactored negotiation arena example scripts to use more explicit typing, improved agent profile retrieval logic, and removed unused OpenAI API key loading. Updated .gitignore to exclude negotiation_arena redis-data directory.

XuhuiZhou

Good job! Overall, I think we need to condense things a bit and make the changes easier to track.

See my comments.

XuhuiZhou · 2025-08-25T18:31:21Z

sotopia/envs/evaluators.py

        model_name: str,
-        response_format_class: type[EvaluationForTwoAgents[T_eval_dim]],
+        response_format_class: Union[
+            type[EvaluationForTwoAgents[T_eval_dim]],


Could we merge EvaluationForTwoAgents and EvaluationForMultipleAgents? The two classes should serve essentially the same purpose.
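One way to picture the merge suggested above is a single container that maps each agent's name to its evaluation, so the two-agent case is just a dictionary with two entries. This is a minimal sketch, not the code in this PR: the class name EvaluationForAgents, its evaluations field, and the use of a plain dataclass are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Generic, TypeVar

T_eval_dim = TypeVar("T_eval_dim")


@dataclass
class EvaluationForAgents(Generic[T_eval_dim]):
    """Hypothetical unified container: one evaluation per agent, keyed by name."""

    evaluations: dict[str, T_eval_dim] = field(default_factory=dict)

    def agent_names(self) -> list[str]:
        # Dicts preserve insertion order, so this mirrors the order agents were added.
        return list(self.evaluations)


# Two-agent and three-agent episodes use the same class.
two = EvaluationForAgents[int]({"agent_1": 7, "agent_2": 5})
three = EvaluationForAgents[int]({"agent_1": 7, "agent_2": 5, "agent_3": 9})
```

With this shape, code that consumes evaluations iterates over the dictionary and never needs to branch on the number of agents.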

XuhuiZhou · 2025-08-25T18:43:21Z

sotopia/envs/multi_agent_parallel.py

+from sotopia.envs.evaluators import unweighted_aggregate_evaluate
+
+
+class MultiAgentSotopiaEnv(ParallelSotopiaEnv):


Same here: let's merge ParallelSotopiaEnv and MultiAgentSotopiaEnv. If there is logic that we think needs to stay separate, we can keep it separate. However, instead of rewriting a function, we should call super() and extend it.
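The super() pattern mentioned above can be sketched as follows. The classes here are stand-ins, not the real ParallelSotopiaEnv or MultiAgentSotopiaEnv; the names ParallelEnv, MultiAgentEnv, and the reset() behaviour are assumptions chosen only to show a subclass extending a parent method rather than copying it.

```python
class ParallelEnv:
    """Stand-in base class (hypothetical, not the real ParallelSotopiaEnv)."""

    def __init__(self, agent_names: list[str]) -> None:
        self.agent_names = agent_names
        self.turn = 0

    def reset(self) -> dict[str, str]:
        # Shared setup that every environment needs.
        self.turn = 0
        return {name: f"background for {name}" for name in self.agent_names}


class MultiAgentEnv(ParallelEnv):
    """Adds only what differs; everything else is inherited."""

    def reset(self) -> dict[str, str]:
        # Reuse the base logic instead of duplicating it, then extend the result.
        observations = super().reset()
        roster = ", ".join(self.agent_names)
        return {
            name: f"{obs} (participants: {roster})"
            for name, obs in observations.items()
        }


env = MultiAgentEnv(["alice", "bob", "carol"])
obs = env.reset()
```

Any later fix to the base reset() is then picked up by the subclass automatically, which is the main argument for extending over rewriting.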

XuhuiZhou · 2025-08-25T18:43:42Z

sotopia/messages/message_classes.py

            )


+class MultiAgentBackground(ScriptBackground):


Ok done! Thanks for pointing this out!

Remove separation between 2-agent and multi-agent cases by consolidating duplicate classes and logic paths into unified implementations

XuhuiZhou

Great job!

I left some nitpick comments.

One thing to think carefully about is how to unify different scenarios.

Right now the negotiation arena seems to have its own logic for handling data.

Rather than using its logic, we should think carefully and design our own approach.

XuhuiZhou · 2025-08-28T03:00:49Z

.gitignore


 sotopia/cli/install/redis-data/*
 redis-stack-server-*/
+examples/experimental/negotiation_arena/redis-data/*


What's this? Do we need to add this line?

I added redis-data to that folder for testing the negotiation scenarios (see /Users/keyuhe/sotopia/examples/experimental/negotiation_arena/README.md), but I assume redis-data should not be pushed.

XuhuiZhou · 2025-08-28T03:05:08Z

examples/experimental/multi_agent_tests/.gitignore

If we have a folder-level ignore, we might not need the global-level gitignore entry?

Oh, I will keep the global-level gitignore entry and remove the folder-level ones.

XuhuiZhou · 2025-08-28T03:08:32Z

examples/experimental/multi_agent_tests/__init__.py

This is not really aligned with the repo-level setup?

I will move this to tests/experimental and update it accordingly.

XuhuiZhou · 2025-08-28T03:10:31Z

sotopia/envs/evaluators.py

                break
        stale_too_long = stale_count > self.max_stale_turn
-        terminated = conversation_too_long or p1_leaving or p2_leaving or stale_too_long
+        terminated = conversation_too_long or players_leaving or stale_too_long


The logic here should be that at least two agents remain in the playground, I guess? (In werewolf games, some agents could leave or die.)

Changed to too_few_agents (number of agents < 2). In the future, though, this may depend heavily on the game or scenario. In werewolf, for example, the game can be considered over even while 5 agents are still in play (god + 2 villagers + seer + witch).
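The termination rule discussed in this thread can be sketched as a small standalone function. This is an illustration under assumptions, not the evaluator code from the PR: the function name is_terminated, its parameters, and the turn-limit comparison are hypothetical, while the stale-turn comparison and the "fewer than two agents" rule follow the diff and the reply above.

```python
def is_terminated(
    turn_number: int,
    max_turn_number: int,
    stale_count: int,
    max_stale_turn: int,
    leaving_agents: set[str],
    all_agents: list[str],
) -> bool:
    """Return True when the episode should end (hypothetical helper)."""
    # Assumed form of the turn limit; the real condition is not shown in the diff.
    conversation_too_long = turn_number >= max_turn_number
    # Matches the comparison visible in the diff.
    stale_too_long = stale_count > max_stale_turn
    # An interaction needs at least two agents who have not left.
    remaining = [agent for agent in all_agents if agent not in leaving_agents]
    too_few_agents = len(remaining) < 2
    return conversation_too_long or too_few_agents or stale_too_long


agents = ["alice", "bob", "carol"]
one_left = is_terminated(3, 20, 0, 2, {"carol"}, agents)
two_left = is_terminated(3, 20, 0, 2, {"bob", "carol"}, agents)
```

Here one_left is False because two agents remain, and two_left is True because only one does. A scenario-specific rule such as a werewolf win condition would need a separate check on top of this.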

XuhuiZhou · 2025-08-28T03:12:37Z

sotopia/messages/message_classes.py

+        agent_goals: list[str],
+    ) -> "ScriptBackground":
+        """Create a ScriptBackground for multi-agent scenarios."""
+        return cls(


This is not really multi-agent?

We should change the ScriptBackground dataclass itself.
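Changing the dataclass itself, rather than adding a multi-agent constructor, could look roughly like the sketch below. It is not the real ScriptBackground: the class name Background and the fields scenario, agent_names, agent_backgrounds, and agent_goals are assumptions (only agent_goals appears in the diff), and a plain dataclass stands in for whatever base the project uses.

```python
from dataclasses import dataclass


@dataclass
class Background:
    """Hypothetical list-based background; not the real ScriptBackground."""

    scenario: str
    agent_names: list[str]
    agent_backgrounds: list[str]
    agent_goals: list[str]

    def __post_init__(self) -> None:
        # Per-agent lists must line up, and a scenario needs at least two agents.
        count = len(self.agent_names)
        if count < 2:
            raise ValueError("a scenario needs at least two agents")
        if len(self.agent_backgrounds) != count or len(self.agent_goals) != count:
            raise ValueError("names, backgrounds and goals must have the same length")

    def goal_of(self, name: str) -> str:
        # Look up an agent's goal by position in the parallel lists.
        return self.agent_goals[self.agent_names.index(name)]


auction = Background(
    scenario="three-way auction",
    agent_names=["alice", "bob", "carol"],
    agent_backgrounds=["collector", "dealer", "tourist"],
    agent_goals=["win the vase", "raise the price", "spend under budget"],
)
```

Because the fields are lists, the same class covers two agents and any larger number, with no separate multi-agent variant to maintain.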

resolve comments

also deleted temperature (to support gpt-5)

ProKil · 2025-09-16T00:27:57Z

I will begin reviewing this PR when tests are successful.

…LMEvaluator

Keyu-He · 2025-09-16T01:29:23Z

@ProKil Hi! The tests now pass.

sotopia/samplers/uniform_sampler.py

sotopia/generation_utils/generate.py

sotopia/messages/message_classes.py

ProKil

LGTM now

Keyu-He and others added 8 commits July 7, 2025 02:33

Add scenarios from NegotiationArena

4fcb008

[autofix.ci] apply automated fixes

0f90dd3

Refactor negotiation arena examples and update .gitignore

a55b978

Refactored negotiation arena example scripts to use more explicit typing, improved agent profile retrieval logic, and removed unused OpenAI API key loading. Updated .gitignore to exclude negotiation_arena redis-data directory.

Fix Mypy type errors in negotiation arena files

3b46d13

Modify error handling and update readme

c6c0b9f

Add multi-agent support for 3+ agents

aab3deb

Add multi-agent test suite with documentation and examples

8047421

Fix unused variable in evaluators

ed3fedc

XuhuiZhou requested changes Aug 25, 2025

Unify classes for all agent scenarios

9dc799b

Remove separation between 2-agent and multi-agent cases by consolidating duplicate classes and logic paths into unified implementations

XuhuiZhou requested changes Aug 28, 2025

Keyu-He added 2 commits September 2, 2025 20:39

minor edits

c2680a6

resolve comments

minor modifications

b74ab0d

Bekaboo mentioned this pull request Sep 3, 2025

Feat support private messages #330

Merged

6 tasks

support multi-agent scenarios

9c8d5e8

also deleted temperature (to support gpt-5)

XuhuiZhou requested a review from ProKil September 10, 2025 20:55

Keyu-He added 2 commits September 10, 2025 23:32

fix: update tests for multi-agent compatibility

482d59b

Update evaluators.py

dde5961

fix(evaluators): enforce per-agent structured evaluations in EpisodeL…

860e899

…LMEvaluator

ProKil requested changes Sep 16, 2025

sotopia/samplers/uniform_sampler.py (outdated, resolved)

sotopia/generation_utils/generate.py (resolved)

sotopia/messages/message_classes.py (resolved)

Keyu-He added 2 commits September 18, 2025 01:14

Add temperature back

fe86408

Update test_generation.py

9d25036

Keyu-He requested review from ProKil and XuhuiZhou September 21, 2025 00:13

ProKil approved these changes Sep 21, 2025

XuhuiZhou approved these changes Sep 21, 2025

XuhuiZhou merged commit ac5cfa3 into sotopia-lab:main Sep 21, 2025
7 checks passed


3 participants