GitXplorerGitXplorer
s

LLaMA3-for-Code-Generation

public
0 stars
0 forks
0 issues

Commits

List of commits on branch main.

No commits found

There are no commits on branch main.

README

The README file for this repository.

LLAMA3 AND UNSLOTH AI POWERED CODE GENERATOR

This project involves building a code generator using the code-search-net dataset from Hugging Face. The model used for training is the llama-3-8b-bnb-4bit, a quantized model that improves training speed by 5x. We also use Wandb for logging the model training process.

Table of Contents

  1. Project Overview
  2. Dependencies
  3. Installation

Project Overview

This project focuses on creating an efficient code generation model. The key steps include installing necessary dependencies, importing required libraries, loading the model and dataset, training the model, and miscellaneous operations for optimizing the process.

Dependencies

The project requires the following dependencies:

  • Unsloth
  • Xformers (Flash Attention)
  • TRL
  • PEFT
  • Accelerate
  • Bitsandbytes
  • Triton (OpenAI's open-source library for GPU Programming)

Installation

To install the necessary dependencies, run the following commands:

pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
pip install --no-deps xformers==0.0.25.post1 "trl<0.9.0" peft accelerate bitsandbytes
pip install triton