
dotaclient

public
28 stars
7 forks
11 issues

Commits

List of commits on branch master.
8615b90b7d5b61005f51ba8e73cab206db1d1731

Change hp and mana rewards

TimZaman committed 6 years ago
22a950e6f32ac6adac3421e3237781ee3467748a

Add log statements

TimZaman committed 6 years ago
ada855a09d1c73a28cbe13817a6f2bd85b9ab0e6

Typo in logger

TimZaman committed 6 years ago
62977fa9b2ee006b83a49132a425a382243b9cb1

More debugging statements in agents, fix typos

TimZaman committed 6 years ago
e11115af89236e3a506b181f95507f07287fde1c

Add mana to reward function.

TimZaman committed 6 years ago
a0238e7b76c243128b8fe7ba8269c016ec4e2bed

Disable agents gradient entirely, and handle dota crashes better.

Fixes #41

TimZaman committed 6 years ago

README


DotaClient on K8s


DotaClient is a reinforcement learning system to train RL agents to play Dota 2 through self-play.

It is built on the DotaService project, which exposes Dota 2 as a gRPC service for synchronous play.

[Figure: dotaclient schema]

  • Distributed agents that play Dota 2 against themselves (self-play).
  • Experience/model broker (RabbitMQ, abbreviated rmq).
  • Distributed optimizer (PyTorch).
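
The data flow between these three components can be sketched in a few lines of Python. This is only an illustration under loud assumptions: an in-process queue.Queue stands in for the rmq broker, a dictionary stands in for the PyTorch model, and random numbers stand in for Dota 2 rollouts. The names agent, optimizer and run are hypothetical and are not taken from the dotaclient code.

```python
import queue
import random
import threading


def agent(agent_id, episodes, experience, model, lock):
    """Play episodes with the latest model and publish each rollout."""
    for _ in range(episodes):
        with lock:
            version = model["version"]  # read the newest published model
        # Fake rollout: random rewards instead of real Dota 2 observations.
        rewards = [random.random() for _ in range(5)]
        experience.put({"agent": agent_id, "version": version, "rewards": rewards})


def optimizer(total_rollouts, batch_size, experience, model, lock):
    """Consume rollouts in batches and publish one model update per batch.

    total_rollouts must be a multiple of batch_size, otherwise the last
    get() would block forever in this simplified sketch.
    """
    consumed = 0
    while consumed < total_rollouts:
        batch = [experience.get() for _ in range(batch_size)]
        consumed += len(batch)
        mean_return = sum(sum(r["rewards"]) for r in batch) / len(batch)
        with lock:
            model["weight"] += 0.01 * mean_return  # placeholder for a gradient step
            model["version"] += 1
    return consumed


def run(num_agents=4, episodes=3, batch_size=2):
    experience = queue.Queue()              # stand-in for the experience broker
    model = {"version": 0, "weight": 0.0}   # stand-in for the model weights
    lock = threading.Lock()
    threads = [
        threading.Thread(target=agent, args=(i, episodes, experience, model, lock))
        for i in range(num_agents)
    ]
    for t in threads:
        t.start()
    consumed = optimizer(num_agents * episodes, batch_size, experience, model, lock)
    for t in threads:
        t.join()
    return consumed, model["version"]


if __name__ == "__main__":
    print(run())
```

In the real system the agents and the optimizer run as separate processes and talk only through the broker, which is what allows each side to be scaled independently.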

Prerequisites

Launch distributed dota training

cd ks-app
ks show default  # Shows the full manifest
ks param list  # Lists all parameters
ks apply default  # Launches everything you need

Note: A typical job has 40 agents per optimizer. One optimizer does around 1000 steps/s.
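
As a rough sizing aid, the figures in the note can be turned into a small calculation. This sketch assumes that the 40:1 ratio and the 1000 steps/s figure scale linearly with the number of optimizers, and that the agents produce experience at the rate the optimizer consumes it. Neither assumption is stated in the README, and the function names are hypothetical.

```python
AGENTS_PER_OPTIMIZER = 40            # from the note above
STEPS_PER_SEC_PER_OPTIMIZER = 1000   # approximate, from the note above


def job_size(num_optimizers):
    """Return (agents needed, aggregate steps/s), assuming linear scaling."""
    agents = num_optimizers * AGENTS_PER_OPTIMIZER
    steps_per_sec = num_optimizers * STEPS_PER_SEC_PER_OPTIMIZER
    return agents, steps_per_sec


def steps_per_agent():
    """Steps/s each agent must supply if producers and consumer are balanced."""
    return STEPS_PER_SEC_PER_OPTIMIZER / AGENTS_PER_OPTIMIZER


if __name__ == "__main__":
    print(job_size(2))        # (80, 2000)
    print(steps_per_agent())  # 25.0
```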