T

dotaclient

public

28 stars

7 forks

11 issues

Commits

List of commits on branch master.

Unverified

c30c1d9c8e7407f13593fdd830b363659edbee1e

Only mk logdir when not exists

TTimZaman committed 6 years ago

Unverified

57d4efa1d4f63ca53b7c33717e20282ae14405cb

Reduce model uploads and recreate tb event files

TTimZaman committed 6 years ago

Unverified

49c98be9fc5bec10eecf8c63aeb71cdb7b02dc90

Fixes to validator ks orchestration

TTimZaman committed 6 years ago

Unverified

2340554162fd37bc658d177c874ed5f66f9c3f19

Extend optimizer timing logging

TTimZaman committed 6 years ago

Unverified

253d479962eb268b3714d55740c05cf51c5af6df

Add per-team validation agents, instead of random team.

TTimZaman committed 6 years ago

Unverified

531b593342b8cd723105e5be6203a1d1e984abb5

Move ability availability to staticmethod

TTimZaman committed 6 years ago

README

The README file for this repository.

DotaClient on K8s

DotaClient is a reinforcement learning system to train RL agents to play Dota 2 through self-play.

Video: (Youtube) 1v1 self play, 9 Mar 2019, uses fountain for regen!.
Video: (Youtube) 1v1 self play, 29 jan 2019.

This is built upon the DotaService project, that exposes the game of Dota2 as a (grpc) service for synchronous play.

Distributed Agents self-playing Dota 2.
Experience/Model Broker (rmq).
Distributed Optimizer (PyTorch)

Prerequisites

Kubeflow's PyTorch Operator
Kubernetes Cluster (e.g. GKE).
Build the dota docker image
Build the dotaservice docker image
Build the rabbitmq docker image
Install ksonnet

Launch distributed dota training

cd ks-app
ks show default  # Shows the full manifest
ks param list  # Lists all parameters
ks apply default  # Launches everything you need

Note: A typical job has 40 agents per optimizer. One optimizer does around 1000 steps/s.