The GIFs in this post are absolutely fascinating and quite entertaining! I do not think I would’ve thought of the “box-surfing” strategy...
In their environment, agents play a team-based hide-and-seek game. Hiders (blue) are tasked with avoiding line-of-sight from the seekers (red), and seekers are tasked with keeping vision of the hiders.
There are objects scattered throughout the environment that hiders and seekers can grab and lock in place, as well as randomly generated immovable rooms and walls that agents must learn to navigate.
Before the game begins, hiders are given a preparation phase where seekers are immobilized to give hiders a chance to run away or change their environment.
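To make the rules above concrete, here is a minimal sketch of the two mechanics described: a line-of-sight test and a preparation phase during which seekers cannot move. It is only a toy stand-in on a 2D grid, not the actual environment from the post. The grid representation, the function names, and the `prep_steps` parameter are all assumptions made for illustration.

```python
from typing import Iterable, Iterator, Set, Tuple

Cell = Tuple[int, int]  # hypothetical (x, y) grid coordinate


def cells_between(a: Cell, b: Cell) -> Iterator[Cell]:
    """Yield the grid cells strictly between a and b on a straight line (Bresenham)."""
    (x0, y0), (x1, y1) = a, b
    dx, dy = abs(x1 - x0), abs(y1 - y0)
    sx = 1 if x0 < x1 else -1
    sy = 1 if y0 < y1 else -1
    err = dx - dy
    x, y = x0, y0
    while (x, y) != (x1, y1):
        e2 = 2 * err
        if e2 > -dy:
            err -= dy
            x += sx
        if e2 < dx:
            err += dx
            y += sy
        if (x, y) != (x1, y1):
            yield (x, y)


def has_line_of_sight(a: Cell, b: Cell, walls: Set[Cell]) -> bool:
    """True if no wall cell lies on the straight line between a and b."""
    return all(cell not in walls for cell in cells_between(a, b))


def any_hider_seen(hiders: Iterable[Cell], seekers: Iterable[Cell],
                   walls: Set[Cell]) -> bool:
    """True if at least one seeker has an unobstructed line to at least one hider."""
    seekers = list(seekers)
    return any(has_line_of_sight(s, h, walls) for h in hiders for s in seekers)


def seekers_can_move(step: int, prep_steps: int) -> bool:
    """Seekers are immobilized for the first prep_steps steps (assumed parameter)."""
    return step >= prep_steps
```

In this sketch, a hider at `(4, 0)` is hidden from a seeker at `(0, 0)` as soon as a wall or locked box occupies `(2, 0)`, which is the kind of change hiders can make during the preparation phase.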
Read more: https://openai.com/blog/emergent-tool-use/