Get in Touch

Course Outline

Introduction to Open-Source LLMs

  • Understanding open-weight models and their significance.
  • An overview of LLaMA, Mistral, Qwen, and other community-driven models.
  • Application scenarios for private, on-premise, or secure deployments.

Environment Setup and Tools

  • Installing and configuring the Transformers, Datasets, and PEFT libraries.
  • Selecting appropriate hardware configurations for fine-tuning.
  • Loading pre-trained models from Hugging Face or other repositories.

Data Preparation and Preprocessing

  • Dataset formats, including instruction tuning, chat data, and text-only inputs.
  • Tokenization techniques and sequence management.
  • Creating custom datasets and data loaders.

Fine-Tuning Techniques

  • Comparing standard full fine-tuning with parameter-efficient methods.
  • Applying LoRA and QLoRA for efficient fine-tuning.
  • Utilizing the Trainer API for rapid experimentation.

Model Evaluation and Optimization

  • Assessing fine-tuned models using generation capabilities and accuracy metrics.
  • Managing overfitting, generalization, and validation sets.
  • Performance tuning tips and effective logging practices.

Deployment and Private Use

  • Saving and loading models for inference tasks.
  • Deploying fine-tuned models within secure enterprise environments.
  • Strategies for on-premise versus cloud deployment.

Case Studies and Use Cases

  • Examples of enterprise utilization of LLaMA, Mistral, and Qwen.
  • Handling multilingual and domain-specific fine-tuning challenges.
  • Discussion: Evaluating the trade-offs between open and closed models.

Summary and Next Steps

Requirements

  • A solid understanding of large language models (LLMs) and their underlying architectures.
  • Hands-on experience with Python and PyTorch.
  • Basic familiarity with the Hugging Face ecosystem.

Audience

  • Machine learning practitioners.
  • AI developers.
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories