Senior AI Agent Engineer — RAG, LLM Orchestration & Production Voice/SMS Agents

Budget: $25 - $60 hourly, part-time | ⭐ 4.99 (265) | United States

typescript, node.js, python, websockets, api, orchestration

We're an established custom software studio shipping production-grade AI agents for real businesses: receptionists, schedulers, recruiters, and executive assistants that handle live calls and messaging at volume. We live in the agentic stack: RAG pipelines, tool-calling, multi-agent orchestration, and tight telephony integration. You'll work alongside a team that knows the difference between a slick demo and a system that survives contact with real production traffic.

We're looking for an engineer to help extend and harden our agent platform. Depending on fit, the work may include:

- Designing and tuning RAG pipelines (chunking strategy, embeddings, hybrid search, re-ranking) over pgvector (or Qdrant/Pinecone)
- Building agentic workflows with tool-calling and structured outputs (LangGraph, LlamaIndex, or custom orchestration)
- Wiring LLMs into real-time voice and SMS via Twilio, including latency-sensitive turn-taking
- Writing evals so we measure quality instead of guessing at it
- Driving down hallucination and improving grounding and citation accuracy

You should have:

- Real production RAG experience, not a tutorial chatbot. You can talk about retrieval quality, eval metrics, and why your chunking choices mattered.
- Strong command of modern LLM tooling: function/tool calling, embeddings, vector DBs, prompt and context engineering
- Node.js and/or Python; PostgreSQL a plus
- Bonus: voice agents, Twilio, MCP, fine-tuning/distillation

To apply: skip the generic pitch. Tell me about one RAG or agent system you actually shipped: what broke, how you measured quality, and what you'd change. A short Loom or a code link beats a long cover letter.
