AI/LLM Expert & Instructor – Enterprise RAG, LoRA & Fine-Tuning Training (Remote)
Rozpočet: $60.0 - $500.0
HOURLY / PART_TIME
⭐ 5.00 (10)
United States
python, machine-learning, artificial-neural-networks, artificial-intelligence
Project Overview
We are looking for an experienced AI/LLM Engineer and Technical Trainer to deliver an advanced training program for our engineering team focused on Enterprise LLMs, Retrieval-Augmented Generation (RAG), LoRA, and Efficient Fine-Tuning.
The goal is to transform our technical team into experts capable of designing, deploying, and maintaining enterprise AI solutions using open-source LLMs.
This engagement combines technical instruction, architecture guidance, hands-on labs, and mentoring.
Training Scope
Block 1 – Enterprise LLM & RAG Fundamentals
Duration: 1–2 weeks
Module 1: Embeddings & Vector Databases
Topics include:
How embeddings represent documents as vectors
Semantic search concepts
Vector similarity search
Vector database architectures
Storage options and tradeoffs
Pinecone
Weaviate
Milvus
Qdrant
Indexing strategies
Chunking techniques
Metadata filtering
Module 2: Retrieval & Generation Evaluation
Teach best practices for evaluating RAG systems, including:
Retrieval metrics
NDCG
MRR
Recall
Precision
Generation quality metrics
BLEU
ROUGE
Hallucination detection
RAG benchmarking
Evaluation pipelines
Ground-truth datasets
Module 3: Build an Internal Enterprise Chatbot (POC)
Guide the team through building a production-style proof of concept:
Enterprise documentation chatbot
Private knowledge base
Source citation and traceability
Document ingestion pipeline
RAG architecture
Deployment within existing infrastructure
Module 4: RAG Integration Platform
Explain how to orchestrate RAG pipelines using a centralized platform, including:
Model orchestration
Prompt pipelines
Retrieval workflows
API integration
Enterprise architecture
Observability
Block 2 – LoRA & Efficient Fine-Tuning
Duration: 2–3 weeks
Module 1: LoRA & QLoRA Fundamentals
Topics include:
Parameter-efficient fine-tuning
LoRA architecture
QLoRA
Adapter-based training
Low-rank matrices
GPU memory optimization
VRAM reduction
Cost comparison:
LoRA vs Full Fine-Tuning
Performance tradeoffs
Module 2: Dataset Preparation
Teach best practices for:
Dataset collection
Cleaning
Labeling
Validation
Balancing
Support conversations
FAQs
Domain-specific documentation
Synthetic data generation
Module 3: Domain-Specific Adapter Training
Hands-on training covering:
Training LoRA adapters
Financial domain
Telecommunications
Legal
Healthcare
Model evaluation
Comparison against base models
Module 4: Reusable Adapter Catalog
Teach how to build reusable adapter libraries:
Modular adapters
Domain-specific adapters
Adapter versioning
Combining LoRA with RAG
Multi-domain architecture
Production deployment
Desired Deliverables
The selected expert will:
Deliver live remote training sessions
Prepare presentation materials
Create hands-on labs
Provide code examples
Assist with architecture discussions
Review participant exercises
Answer technical questions
Help build a production-quality Proof of Concept (POC)
Required Skills
We are looking for someone with strong experience in:
LLMs
Llama
Mistral
Qwen
Gemma
DeepSeek
Open-source language models
RAG
LangChain
LlamaIndex
Haystack
Hybrid Search
Vector databases
Embeddings
Vector Databases
Experience with one or more:
Pinecone
Weaviate
Milvus
Qdrant
Fine-Tuning
LoRA
QLoRA
PEFT
Hugging Face Transformers
TRL
Unsloth (preferred)
Axolotl (nice to have)
ML Frameworks
PyTorch
Hugging Face
Accelerate
BitsAndBytes
Deployment
FastAPI
vLLM
TGI
Docker
Kubernetes (preferred)
REST/gRPC APIs
Evaluation
Experience with:
RAG evaluation
BLEU
ROUGE
NDCG
MRR
Hallucination detection
Benchmarking
Nice to Have
Enterprise AI architecture experience
Production LLM deployment
Multi-tenant AI platforms
GPU optimization
Model serving at scale
Experience teaching engineers or delivering corporate training
English fluency (Spanish is a strong plus)
Engagement
Remote
Part-time / Contract
Approximately 3–5 weeks
Live sessions plus office hours
Flexible scheduling
To Apply
Please include:
A brief introduction about your experience.
Examples of enterprise RAG or LLM projects you've built.
Experience with LoRA/QLoRA fine-tuning.
Experience teaching or mentoring technical teams.
Links to GitHub, technical blog posts, publications, or conference talks (if available).
Your proposed hourly rate and availability.
Otevřít na Upwork