Python AI Engineer for LLM Deployment
Presupuesto: $50.0 - $80.0
HOURLY / PART_TIME
⭐ 0.00 (0)
United States
restful-api, python, artificial-intelligence, amazon-web-services, git, docker, django-framework
We're not just another AI shop – we're a people-first business on a mission to solve real-world problems, from removing CO₂ from the atmosphere to building tools that actually matter. And we want you to grow with us – not for 7 days, but for years.
This gig starts as a part‑time 7‑10 day project, but if you're the right fit, we're keeping you long‑term. We need a Python AI Engineer who wants to build something epic, have fun, and be part of a team that actually cares.
Still with us? Good. Here's what we're building:
---
PYTHON AI ENGINEER – PART-TIME (7–10 DAY PROJECT)
Long-term role for the right person.
We are deploying a full AI ecosystem with LLMs, memory systems, search, video generation, and creative tools. We need a Python AI Engineer to deploy the AI core on production infrastructure.
This is a PRODUCTION deployment role – NOT research.
---
WHAT YOU'LL DO:
· Deploy vLLM inference servers on RunPod GPU pods (Llama 3.3 70B, Qwen 2.5 Coder 32B, DeepSeek R1 32B, BGE-large)
· Set up OpenAI-compatible API endpoints, VRAM management, Flash Attention, health checks
· Build Memory Service (FastAPI) – embeddings with BGE, vector storage with Qdrant, similarity search
· Build Brain Gateway (FastAPI) – router for LLM requests, orchestration, context retrieval, session management
· Build Scraping Worker – web scraping, text chunking, async processing with NATS
· Set up RAG pipeline with Qdrant + Typesense (hybrid search)
· Deploy FLUX / Playground inference + Whisper.cpp for subtitles + Video Engine API
· Containerise all services with Docker, integrate with Kong API Gateway
· Set up logging (Loki), metrics (Prometheus), tracing (Tempo)
· Write API docs (Swagger) and deployment runbook
· Train the Full Stack team on using the AI APIs
---
REQUIREMENTS (Must Have):
· Python (5+ years professional)
· FastAPI or Flask (expert)
· vLLM or TensorRT-LLM (production deployment)
· RunPod, AWS SageMaker, or similar GPU cloud
· Docker & Docker Compose (advanced)
· Vector DBs (Qdrant, Pinecone, or Weaviate)
· PostgreSQL with pgvector
· Redis (caching + queue)
· Git & CI/CD
· Linux admin (Ubuntu, shell scripting)
Nice to have:
Llama/Qwen/DeepSeek experience, AWQ/GPTQ quantisation, NATS, Kong, Vault, Prometheus/Grafana/Loki/Tempo, MinIO/S3, Whisper, FLUX/Stable Diffusion.
---
HOW TO APPLY:
Send your application to
Include:
· CV/Resume highlighting Python AI production experience
· GitHub/Portfolio with AI deployment projects
· Brief intro video
Subject Line: "Python AI Engineer - [Your Name]"
Abrir en Upwork