Back to Projects
Open Source LLMs Finished

Local LLM Fine-Tuning Pipeline

An end-to-end pipeline for fine-tuning open-source Large Language Models (like Llama 3 and Mistral) on custom domain-specific datasets.

2024 AI Engineer

Overview

Developed a comprehensive fine-tuning pipeline for customizing open-source Large Language Models. The system utilizes Parameter-Efficient Fine-Tuning (PEFT) techniques such as QLoRA to train large models on consumer-grade hardware. It includes automated data preprocessing, model 4-bit quantization, distributed training using Hugging Face Accelerate, and a customized evaluation framework to assess task-specific performance.

Key Features

  • QLoRA fine-tuning for low VRAM usage
  • Automated dataset formatting and tokenization
  • Support for Llama 3, Mistral, and other open weights
  • Custom evaluation metrics for domain-specific tasks