GitXplorerGitXplorer
s

LLaMA3-for-Code-Generation

public
0 stars
0 forks
0 issues

Commits

List of commits on branch main.
Unverified
0f9df5927ece8a9cc92c6d0fd44d0ddc75582672

ipnb file: refined the notebook

ssarthakforwet committed 6 months ago
Unverified
ea0f94ae62441a0a2014dfa2c567907284cc6be9

added Readme file

ssarthakforwet committed 6 months ago
Unverified
88f6cfec82851eb2c9b462cd0d9d79146c02e918

added pipeline

ssarthakforwet committed 6 months ago

README

The README file for this repository.

LLAMA3 AND UNSLOTH AI POWERED CODE GENERATOR

This project involves building a code generator using the code-search-net dataset from Hugging Face. The model used for training is the llama-3-8b-bnb-4bit, a quantized model that improves training speed by 5x. We also use Wandb for logging the model training process.

Table of Contents

  1. Project Overview
  2. Dependencies
  3. Installation

Project Overview

This project focuses on creating an efficient code generation model. The key steps include installing necessary dependencies, importing required libraries, loading the model and dataset, training the model, and miscellaneous operations for optimizing the process.

Dependencies

The project requires the following dependencies:

  • Unsloth
  • Xformers (Flash Attention)
  • TRL
  • PEFT
  • Accelerate
  • Bitsandbytes
  • Triton (OpenAI's open-source library for GPU Programming)

Installation

To install the necessary dependencies, run the following commands:

pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
pip install --no-deps xformers==0.0.25.post1 "trl<0.9.0" peft accelerate bitsandbytes
pip install triton