← Állások

AI Engineer — LLM Features for Team Chat (Summarization, Smart Replies, AI Assistant)

Költségvetés: $15.0 - $25.0 HOURLY / FULL_TIME ⭐ 0.00 (0) Japan

natural-language-processing, python

We are building TeamChat, a workspace-based team collaboration platform (similar to Slack). This role owns the LLM-powered product features: thread summarization, smart replies, and an in-app AI assistant with tool use. We have a detailed scope document ready to share with shortlisted candidates. This is one of two AI roles we are hiring; strong performance leads to ongoing, long-term collaboration. CORE RESPONSIBILITIES & SCOPE OF WORK 1. Thread & Channel Summarization: On-demand and scheduled digests ("catch me up") with map-reduce summarization for long threads, multilingual content handling (EN/JA), and cost-aware model routing (cheap model first, escalate when needed). 2. Smart Replies & Drafting: Context-aware reply suggestions and message drafting that respect channel tone; perceived latency under ~2s via streaming. 3. In-App AI Assistant: Conversational assistant with tool use / function calling (search workspace via the RAG service, summarize, draft, set reminders), human-approval steps for sensitive actions, structured outputs. 4. LLM Orchestration Layer: Provider-agnostic abstraction over Anthropic Claude / OpenAI APIs with retries, fallbacks, token budgeting, per-workspace rate limits, and full cost tracking per feature. 5. Evaluation & Safety: Offline eval sets for summarization quality, prompt-injection hardening for user-generated content, PII-aware logging. 6. Delivery: Python service with documented internal APIs; tests + eval harness included. REQUIRED TECH STACK - Python 3.11+, FastAPI - LLM APIs: Anthropic Claude, OpenAI (function calling / tool use, streaming, structured outputs) - Orchestration: thin hand-rolled layer preferred (LangChain experience fine) - Redis/Celery workers, PostgreSQL PROJECT DETAILS - Engagement: Hourly, $15–$25/hr depending on experience. ~30 hrs/week, initial 3 months, ongoing long-term for the right person. - Process: Daily async standup (English, text), code review via GitHub PRs, 2-week sprints. At least 3–4 hours of overlap with JST (UTC+9). - IP & Code: All code delivered in our GitHub org from day one; full source ownership by us. - Language: English required. Urdu-speaking developers welcome. WHO SHOULD APPLY Please do NOT apply if your experience is limited to basic chatbot demos, simple OpenAI API wrappers, or tutorial-level LangChain projects. We will ask about production metrics (cost, latency, output quality). QUESTIONS TO ANSWER IN YOUR PROPOSAL 1. Describe an LLM-powered product feature you shipped to real users. What did it cost per user per month, and how did you reduce that cost? 2. Describe an agent/tool-use implementation you built: which tools, how you validated tool inputs, and how you handled failures. 3. What is your approach to evaluating summarization quality beyond "it looks good"? 4. GitHub/portfolio links, timezone, weekly availability, proposed rate. 5. Start your proposal with the word TEAMCHAT.
Megnyitás Upworkön