Senior AI Engineer: Production RAG/LLM Pipelines
Budget: $65.0 - $95.0
HOURLY / FULL_TIME
⭐ 0.00 (0)
United States
api-integration, python, postgresql, machine-learning
A19 Projects is hiring a Senior AI Engineer (contract) to build production AI features for our client, a venture-backed fintech operating across 20+ countries. You'll ship LLM-powered features that process real financial data, working alongside the client's backend, data, and product teams. This is a build role, not research: you take AI from prototype to production.
What you'll build:
• Production LLM pipelines: RAG, prompt engineering, output parsing, chain orchestration
• AI features such as spend categorization, document extraction, anomaly detection, financial Q&A, and reconciliation
• Vector search (Pinecone, Weaviate, or pgvector): ingestion, chunking, embedding selection, re-ranking
• Output validation, fallback handling, and confidence scoring for financial-grade reliability
• Model serving with latency SLOs, monitoring, drift detection, and retraining
• Clean backend integration: API contracts, circuit breakers, graceful degradation, and human-in-the-loop review
Required:
• 5+ years software engineering, 3+ in production AI/ML
• Shipped LLM apps on OpenAI, Anthropic, or Cohere APIs in production
• RAG design: chunking, embeddings, and vector databases
• Strong Python and an orchestration framework (LangChain or LlamaIndex)
• Model serving (REST/gRPC), validation, latency budgeting, and monitoring
• Backend fundamentals: REST, PostgreSQL, async, and cloud (AWS, GCP, or Azure)
• Observability: logging, tracing, and dashboards
Nice to have:
• Fintech or other regulated-industry experience
• Prompt evaluation and A/B testing of AI outputs
• ML lifecycle tools (MLflow, Weights & Biases, Vertex AI, or SageMaker)
Engagement:
• Fully remote; Latin America or US-timezone overlap preferred
• Full-time (30+ hrs/week), Part-time (15+ hrs/week), ongoing (6+ months)
• Start: ASAP
To apply, answer the screening questions and include 1-2 examples of production RAG or LLM systems you've shipped (links, repos, or a short writeup).
Openen op Upwork