Back to Projects
Open Source LLMs
Finished
Local LLM Fine-Tuning Pipeline
An end-to-end pipeline for fine-tuning open-source Large Language Models (like Llama 3 and Mistral) on custom domain-specific datasets.
NO_IMAGE // LLM_PIPELINE
Overview
Developed a comprehensive fine-tuning pipeline for customizing open-source Large Language Models. The system utilizes Parameter-Efficient Fine-Tuning (PEFT) techniques such as QLoRA to train large models on consumer-grade hardware. It includes automated data preprocessing, model 4-bit quantization, distributed training using Hugging Face Accelerate, and a customized evaluation framework to assess task-specific performance.
Key Features
- QLoRA fine-tuning for low VRAM usage
- Automated dataset formatting and tokenization
- Support for Llama 3, Mistral, and other open weights
- Custom evaluation metrics for domain-specific tasks