T

dotaclient

public

28 stars

7 forks

11 issues

Commits

List of commits on branch master.

Unverified

8615b90b7d5b61005f51ba8e73cab206db1d1731

Change hp and mana rewards

TTimZaman committed 6 years ago

Unverified

22a950e6f32ac6adac3421e3237781ee3467748a

Add log statements

TTimZaman committed 6 years ago

Unverified

ada855a09d1c73a28cbe13817a6f2bd85b9ab0e6

Typo in logger

TTimZaman committed 6 years ago

Unverified

62977fa9b2ee006b83a49132a425a382243b9cb1

More debugging statements in agents, fix typos

TTimZaman committed 6 years ago

Unverified

e11115af89236e3a506b181f95507f07287fde1c

Add mana to reward function.

TTimZaman committed 6 years ago

Unverified

a0238e7b76c243128b8fe7ba8269c016ec4e2bed

Disable agents gradient entirely, and handle dota crashes better.\n\nFixes #41

TTimZaman committed 6 years ago

README

The README file for this repository.

DotaClient on K8s

DotaClient is a reinforcement learning system to train RL agents to play Dota 2 through self-play.

Video: (Youtube) 1v1 self play, 9 Mar 2019, uses fountain for regen!.
Video: (Youtube) 1v1 self play, 29 jan 2019.

This is built upon the DotaService project, that exposes the game of Dota2 as a (grpc) service for synchronous play.

Distributed Agents self-playing Dota 2.
Experience/Model Broker (rmq).
Distributed Optimizer (PyTorch)

Prerequisites

Kubeflow's PyTorch Operator
Kubernetes Cluster (e.g. GKE).
Build the dota docker image
Build the dotaservice docker image
Build the rabbitmq docker image
Install ksonnet

Launch distributed dota training

cd ks-app
ks show default  # Shows the full manifest
ks param list  # Lists all parameters
ks apply default  # Launches everything you need

Note: A typical job has 40 agents per optimizer. One optimizer does around 1000 steps/s.