Skip to end of metadata
Go to start of metadata

Attendees

  • Umberto, Mert, Antti K., Fabio, Lukas, Sami

Goals

  • longer update from Antti K.

Discussion items

TimeItemWhoNotes
Antti K.

Multi-agent research

In meeting

Antti K.: currently working on 3 projects:

  1. assist agents by observing only
  2. improving ad-hoc teammate performances by modeling other agents
  3. improving multiagents cooperation by optimizing beliefs over memory


The longer update focuses on (2). There a 2 agents and a set of items (several middle relevance, 1 huge relevance) to be collected from a gridworld. The 2 agents are fully cooperating. Currently the idea is to employ Meta Learning and CTDE (Centralized Training Decentralized Execution). It looks like ad-hoc cooperation provides better result with respect to the standard approach.

Action items

  • Mert to provide a longer update next week.

0 Comments

You are not logged in. Any changes you make will be marked as anonymous. You may want to Log In if you already have an account.