MinAtar Space Invaders

Usage

Note that the MinAtar suite is provided as a separate extension for Pgx (pgx-minatar). To use the MinAtar suite in Pgx, install the extension in addition to Pgx itself:

pip install pgx-minatar

Then, you can use the environment as follows:

import pgx

env = pgx.make("minatar-space_invaders")

Description

MinAtar was originally proposed by [Young&Tian+19]. The Pgx implementation is intended to be an exact JAX reimplementation of the original MinAtar. The Space Invaders environment is described as follows:

The player controls a cannon at the bottom of the screen and can shoot bullets upward at a cluster of aliens above. The aliens move across the screen until one of them hits the edge, at which point they all move down and switch directions. The current alien direction is indicated by 2 channels (one for left and one for right) one of which is active at the location of each alien. A reward of +1 is given each time an alien is shot, and that alien is also removed. The aliens will also shoot bullets back at the player. When few aliens are left, alien speed will begin to increase. When only one alien is left, it will move at one cell per frame. When a wave of aliens is fully cleared a new one will spawn which moves at a slightly faster speed than the last. Termination occurs when an alien or bullet hits the player.

github.com/kenjyoung/MinAtar - space_invaders.py
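The alien movement rule in the description can be illustrated with a small toy sketch. This is not the MinAtar or Pgx source: `move_aliens` and `GRID` are hypothetical names, aliens are modeled as a set of `(row, col)` cells on a 10x10 grid, and bullets, speed changes, and termination are deliberately left out.

```python
# Toy illustration of the "march sideways, then drop and reverse" rule.
# NOT the MinAtar/Pgx implementation; all names here are hypothetical.
GRID = 10  # matches the 10x10 observation grid


def move_aliens(aliens, direction):
    """Advance a set of (row, col) alien cells by one move.

    direction is -1 (left) or +1 (right).
    Returns (new_aliens, new_direction).
    """
    at_left_edge = direction < 0 and any(c == 0 for _, c in aliens)
    at_right_edge = direction > 0 and any(c == GRID - 1 for _, c in aliens)
    if at_left_edge or at_right_edge:
        # An alien touches the wall it is heading toward:
        # every alien drops one row and the direction flips.
        return {(r + 1, c) for r, c in aliens}, -direction
    # Otherwise the whole cluster shifts one column sideways.
    return {(r, c + direction) for r, c in aliens}, direction


aliens, direction = {(0, 7), (0, 8), (1, 8)}, +1
for _ in range(3):
    aliens, direction = move_aliens(aliens, direction)
    print(sorted(aliens), direction)
```

In this run the cluster moves right once, drops a row and reverses when it reaches column 9, then moves left.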

Specs

Name	Value
Version	`v1`
Number of players	`1`
Number of actions	`4`
Observation shape	`(10, 10, 6)`
Observation type	`bool`
Rewards	`{0, 1}`

Observation

Index	Channel
`[:, :, 0]`	Cannon
`[:, :, 1]`	Alien
`[:, :, 2]`	Alien left
`[:, :, 3]`	Alien right
`[:, :, 4]`	Friendly bullet
`[:, :, 5]`	Enemy bullet
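As a rough aid for inspecting observations, the channel layout above can be turned into a text rendering. The sketch below is only an illustration: `render` and `SYMBOLS` are hypothetical helpers, and it assumes the first axis is the row (top to bottom) and the second is the column. It uses a hand-built nested list so that it runs without Pgx; the same indexing should also work on an array of shape `(10, 10, 6)`.

```python
# Channel indices copied from the table above.
CANNON, ALIEN, ALIEN_LEFT, ALIEN_RIGHT, FRIENDLY_BULLET, ENEMY_BULLET = range(6)

# Hypothetical display symbols; the two direction channels are not drawn.
SYMBOLS = {CANNON: "A", ALIEN: "M", FRIENDLY_BULLET: "^", ENEMY_BULLET: "v"}


def render(obs):
    """Return a text grid for an observation indexed as obs[row][col][channel]."""
    lines = []
    for row in obs:
        chars = []
        for cell in row:
            # The first active drawable channel wins; empty cells become ".".
            chars.append(next((s for k, s in SYMBOLS.items() if cell[k]), "."))
        lines.append("".join(chars))
    return "\n".join(lines)


# Hand-built example: a cannon on the bottom row, one right-moving alien
# on the top row, and a friendly bullet in flight.
obs = [[[False] * 6 for _ in range(10)] for _ in range(10)]
obs[9][5][CANNON] = True
obs[0][2][ALIEN] = True
obs[0][2][ALIEN_RIGHT] = True
obs[4][5][FRIENDLY_BULLET] = True
print(render(obs))
```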

Action

No op (0), left (1), right (2), or fire (3).
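To avoid magic numbers in agent code, the four action indices can be given names. The `Action` enum below is a hypothetical convenience, not part of Pgx; only the integer values come from the list above.

```python
from enum import IntEnum


class Action(IntEnum):
    """Named action indices (hypothetical helper, values from the docs)."""

    NOOP = 0
    LEFT = 1
    RIGHT = 2
    FIRE = 3


# IntEnum members behave like plain ints, so they can be used
# anywhere an integer action index is expected.
for action in Action:
    print(int(action), action.name)
```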

Version History

v1: Specify rng key explicitly (API v2) by @sotetsuk in #1058 (v2.0.0)
v0: Initial release (v1.0.0)

Training example

For MinAtar environments, we provide a PPO training example, which takes only 1 min to train on a single GPU.

Baseline models

We provide a baseline model for the MinAtar Space Invaders environment that plays the game reasonably well.

model = pgx.make_baseline_model("minatar-space_invaders_v0")

# `state` is an environment state, e.g. one returned by env.init or env.step
logits, value = model(state.observation)

We trained the model with PPO for 20M steps. See the wandb report for details of the training.

Reference

[Young&Tian+19] "MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments" arXiv:1903.03176

LICENSE

Pgx is provided under the Apache 2.0 License, but the original MinAtar suite is licensed under the GPL 3.0 License. Therefore, please note that the separate MinAtar extension for Pgx is also licensed under the GPL 3.0 License.