pytorch-xla-transformer-language-model
Commits

List of commits on branch master:

- 76941518d479e3275d17b86df0e7c322687f65e6 (unverified): "Merge branch 'master' of github.com:bmccann/pytorch-xla-transformer-language-model" (bmccann, 6 years ago)
- ab149786de7c6166c3f99c63738673a2aec2462a (unverified): "adding data, train, and model" (bmccann, 6 years ago)
- 4079c93b9f783a36141bbcd4dc5d5f65af598a1d (verified): "Update README.md" (bmccann, 6 years ago)
- 31fdffae2ab8d8d78ab6a444b742e911aad65914 (unverified): "update README" (bmccann, 6 years ago)
- 6e8144290ba4f0f437930467a984cff8294fc807 (unverified): "adding transformer and train code" (bmccann, 6 years ago)
- 5f3dac1ffa3ca4cfae86ce81f7f921fa44439c76 (verified): "Update README.md" (bmccann, 6 years ago)

README


pytorch-xla-transformer-language-model

This repository is an open source test case for pytorch/xla that runs a minimal training loop for a Transformer language model on a single TPU device.

This code is intended as a reference for testing how XLA compiles the model; it is not intended for training a competitive language model. During initial runs, this code triggered recompilation far too often, but those issues have since been resolved.
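XLA compiles a new graph whenever it sees a new tensor shape, so keeping input batches shape-stable is the usual way to avoid repeated recompilation. A minimal sketch of the idea (illustrative only, not code from this repository; `pad_batch` and `pad_id` are hypothetical names):

```python
def pad_batch(sequences, fixed_len, pad_id=0):
    """Pad or truncate every sequence to fixed_len so every batch has
    the same shape, letting XLA compile the graph only once."""
    batch = []
    for seq in sequences:
        seq = seq[:fixed_len]                       # truncate long sequences
        batch.append(seq + [pad_id] * (fixed_len - len(seq)))  # pad short ones
    return batch

batch = pad_batch([[5, 3, 9], [7]], fixed_len=4)
# every row now has length 4: [[5, 3, 9, 0], [7, 0, 0, 0]]
```

The same principle applies to any dynamic dimension (batch size, sequence length): fixing it up front trades a little wasted compute on padding for a stable compiled graph.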

Depends on the Docker image gcr.io/tpu-pytorch/xla:r0.1.

export TPU_IP=  # YOU MUST SET YOUR TPU IP
export XRT_TPU_CONFIG="tpu_worker;0;$TPU_IP"  # point the XRT runtime at the TPU worker
export XLA_USE_32BIT_LONG=1  # map PyTorch long tensors to 32-bit integers on TPU
export XLA_IR_DEBUG=1   # annotate the XLA IR with Python source information
export XLA_HLO_DEBUG=1  # propagate that source information into the HLO for debugging
python3 train.py

Output is in run.log.
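With XLA_IR_DEBUG and XLA_HLO_DEBUG enabled, one rough way to gauge how often compilation happens is to count compile-related lines in run.log. A sketch, assuming the log mentions compilation in some form (the pattern is a guess and may need adjusting to the actual log format):

```python
import re

def count_compile_lines(log_text, pattern=r"compil"):
    """Count log lines that mention compilation (case-insensitive)."""
    return sum(
        1
        for line in log_text.splitlines()
        if re.search(pattern, line, re.IGNORECASE)
    )

# Example on a synthetic log excerpt (not real torch_xla output):
sample = "step 1\nCompiling graph...\nstep 2\ncompilation cache hit\n"
print(count_compile_lines(sample))  # prints 2
```

If the count keeps growing past the first few steps, the model is likely still triggering recompilation on shape changes.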