TypeScript Engineer — AI/LLM Pipeline & RAG

Buget: $15.0 - $25.0 HOURLY / FULL_TIME ⭐ 5.00 (1) United States

postgresql, typescript, amazon-web-services, node.js

TypeScript Engineer — AI/LLM Pipeline & RAG (Education SMS Assistant) We run an AI-powered SMS assistant for a fast-growing online school. It's the primary way thousands of parents talk to us — asking about enrollment, programs, their child's progress, and more. You'll be a core engineer on that assistant: improving how it understands messages, retrieves the right information, and replies accurately and on-brand. This is a production system, not a prototype, and you'll work directly with the lead engineer. TWO HARD REQUIREMENTS (please don't apply if you don't meet both): 1. NATIVE ENGLISH FLUENCY. You'll read and reason about real customer conversations every day, judge tone, and write customer-facing copy. Near-native is not enough for this role. 2. CLOSE OVERLAP WITH AUSTIN, TEXAS HOURS (US CENTRAL TIME). This is required, not a preference. You must be able to work a large daily block that overlaps US Central business hours (roughly 9 am–5pm CT). State your timezone and your real overlap in your application. WHAT YOU'LL DO - Improve and debug our retrieval (RAG) pipeline: chunking, embeddings, hybrid (dense + keyword) search, and figuring out WHY the right info did or didn't reach the model - Own our knowledge base system end-to-end: take human-flagged issues, make the fix, reseed embeddings, confirm retrieval, and lock it in with regression tests - Write and maintain evaluation fixtures so quality improvements are provable and don't regress - Work in our multi-agent reply pipeline (routing, prompt composition, structured outputs) - Patch bugs across an async, event-driven backend (queues, webhooks, idempotency) - Read traces and production conversations to diagnose real failures MUST-HAVE SKILLS - Strong TypeScript / Node.js (this is a TypeScript codebase top to bottom — no Python required) - Shipped at least one LLM-powered feature to production (an agent framework, or hand-rolled with the OpenAI / Anthropic / OpenRouter SDKs) - Real RAG experience — and specifically experience DEBUGGING retrieval, not just wiring up "embed docs + top-k" - Comfortable with PostgreSQL / SQL and an ORM (we use Drizzle) - An eval / test-driven mindset — you reach for a regression test, not a manual spot-check - Clear written English and the judgment to evaluate tone and quality in customer-facing replies NICE TO HAVE - Hybrid search / reranking experience (BM25 + dense, RRF) - Experience working closely with a small, fast-moving team - Experience with vector databases, embeddings, etc

Deschide pe Upwork