GitXplorerGitXplorer
T

dotaclient

public
28 stars
7 forks
11 issues

Commits

List of commits on branch master.
Unverified
c30c1d9c8e7407f13593fdd830b363659edbee1e

Only mk logdir when not exists

TTimZaman committed 6 years ago
Unverified
57d4efa1d4f63ca53b7c33717e20282ae14405cb

Reduce model uploads and recreate tb event files

TTimZaman committed 6 years ago
Unverified
49c98be9fc5bec10eecf8c63aeb71cdb7b02dc90

Fixes to validator ks orchestration

TTimZaman committed 6 years ago
Unverified
2340554162fd37bc658d177c874ed5f66f9c3f19

Extend optimizer timing logging

TTimZaman committed 6 years ago
Unverified
253d479962eb268b3714d55740c05cf51c5af6df

Add per-team validation agents, instead of random team.

TTimZaman committed 6 years ago
Unverified
531b593342b8cd723105e5be6203a1d1e984abb5

Move ability availability to staticmethod

TTimZaman committed 6 years ago

README

The README file for this repository.

DotaClient on K8s


DotaClient is a reinforcement learning system to train RL agents to play Dota 2 through self-play.

This is built upon the DotaService project, that exposes the game of Dota2 as a (grpc) service for synchronous play.

dotaclient schema

  • Distributed Agents self-playing Dota 2.
  • Experience/Model Broker (rmq).
  • Distributed Optimizer (PyTorch)

Prerequisites

Launch distributed dota training

cd ks-app
ks show default  # Shows the full manifest
ks param list  # Lists all parameters
ks apply default  # Launches everything you need

Note: A typical job has 40 agents per optimizer. One optimizer does around 1000 steps/s.