Senior AI Backend Engineer Needed to Build a Production AI Platform (RAG, LLMs, Django/FastAPI)
Budget: $110.0
FIXED /
⭐ 5.00 (2)
Tunisia
python, django-framework, api, machine-learning, artificial-intelligence
We're building an AI-powered platform that combines document intelligence, Retrieval-Augmented Generation (RAG), LLM workflows, and scalable backend infrastructure.
This is not a simple ChatGPT wrapper or weekend AI demo.
The platform is intended to support thousands of users, multiple AI providers, document processing pipelines, background workers, and production-grade APIs. We're looking for an experienced engineer who has already built and shipped real AI systems.
You'll work directly with the founder to build the first production version of the platform and establish the technical foundation for future development.
Important: Personalized Loom Video Required
Please include a 5–10 minute Loom video created specifically for this job.
Generic introduction videos will be ignored.
In your Loom, we'd like you to walk through a relevant project you've built and explain:
The overall architecture
Your role in the project
How the AI pipeline worked
Which LLMs and embedding models you used
How documents were processed
How retrieval was implemented
How background jobs were handled
Any scaling or production challenges you solved
What you would prioritize when building a production AI backend
Please hide any confidential client information.
About the Project
The platform allows users to upload documents, create AI assistants, and interact with their own knowledge through natural language.
Core features include:
Document upload and ingestion
OCR support
Embedding generation
Vector search
RAG pipelines
Conversation history
AI provider integrations
Background processing
User authentication
Admin dashboard
Usage tracking
API access
The goal is to build a backend that is clean, scalable, and production-ready rather than simply making something that works.
Core Responsibilities
Design backend architecture
Build scalable APIs
Implement RAG pipelines
Build document ingestion workflows
Integrate OpenAI, Gemini, Claude, or similar models
Optimize vector search performance
Build background job pipelines
Improve logging and monitoring
Design database schemas
Write clean, maintainable code
Help define technical direction
Current Technology
Python
Django / Django REST Framework
FastAPI
PostgreSQL
Redis
Celery
Docker
OpenAI
Gemini
LangChain (optional)
Milvus / Qdrant / Weaviate
GitHub Actions
Experience Required
We're looking for someone who has actually built production AI systems.
Ideal experience includes:
Production RAG systems
Vector databases
Embedding pipelines
Multi-document search
FastAPI or Django
PostgreSQL optimization
Redis and Celery
Docker
AI API integrations
Authentication and RBAC
Production deployments
Nice to Have
OCR
Whisper
Multi-modal LLMs
Computer Vision
Multi-agent systems
Kubernetes
AWS or GCP
CI/CD pipelines
WebSockets
Multi-tenant SaaS architecture
Not a Good Fit If
You've only built chatbot demos.
Most of your work is prompt engineering without backend ownership.
You rely heavily on low-code/no-code AI tools.
You haven't deployed AI systems to production.
You need detailed step-by-step instructions for every task.
First Milestone (Fixed Price – $120)
We'll begin with a paid milestone focused on building one production-ready backend component.
Depending on your expertise, this may involve:
Building part of the document ingestion pipeline
Implementing a RAG workflow
Creating vector search APIs
Integrating an LLM provider
Building authentication and RBAC
Setting up background workers
Successful completion will lead to additional milestones and a long-term collaboration.
Please Include in Your Proposal
A personalized Loom video
A brief introduction
One production AI project you've built
Your role in that project
The technologies you used
Which vector database you've worked with
Which LLM providers you've integrated
Links to your GitHub or portfolio (if available)
Your estimated availability
Openen op Upwork