AI Knowledge Graph Engineer (Backend, ML, AWS)

Presupuesto: - HOURLY / FULL_TIME ⭐ 5.00 (5) United States

python, machine-learning, data-science, amazon-web-services, neo4j

We’re building an AI-powered knowledge graph platform that transforms unstructured documents into structured, query able intelligence. We have a defined ontology, a working prototype, and a clear roadmap. Now we’re hiring an engineer who can take this foundation and turn it into a scalable, production-grade system. This role is ideal for someone who enjoys building end-to-end data and ML systems, working with LLMs, and designing clean, reliable backend pipelines. What You’ll Work On You will own the development of a complete AI-driven knowledge acquisition workflow, including: ● Batch ingestion of documents and data from public sources ● Text parsing and normalization for downstream ML tasks ● Multi-pass LLM extraction pipelines (entities, events, relationships) ● Normalization/resolution of extracted data into structured, ontology-aligned objects ● Integration of human-in-the-loop review workflows and LLM Finetuning to improve extraction accuracy over time ● Construction and maintenance of the knowledge graph (AWS Neptune / Neo4j) ● Backend services and APIs that make the graph queryable What You’ll Bring We’re looking for someone with hands-on experience in several of the following areas: ● Graph databases (AWS Neptune, Neo4j, or similar) ● Knowledge graph construction or ontology-driven data systems ● LLM-based extraction (prompting, multi-pass pipelines, structured extraction) ● LLM fine-tuning or distillation (improve model efficiency) ● Deploying AI/ML workloads on AWS (SageMaker, Bedrock) ● Backend engineering in Python (Pydantic, structured data modeling, ETL frameworks) ● Web scraping or automated document ingestion (Proxies, Crawl4Ai, Scrapling) ● Familiarity with AWS cloud infrastructure, observability, and CI/CD Who You Are ● You’re a strong backend engineer with an interest in LLMs and structured knowledge systems. ● You enjoy designing multi-step pipelines and understanding how data moves end to end. ● You’re pragmatic — you choose what works best, whether that’s fine-tuned models, external services, or standard AWS components. ● You like refining schemas and ontologies when new patterns emerge. ● You care about reliability, versioning, and not breaking production systems. ● You’re comfortable being the primary owner of a sophisticated backend/ML pipeline. Nice to Haves ● Experience modeling temporal events or lifecycle chains ● Prior work with ontology evolution or schema refinement ● Insights into building human-in-the-loop review cycles ● Interest in improving extraction accuracy through ML techniques over time Why This Role Is Interesting ● You’ll be the first engineer shaping a system that turns messy, real-world documents into structured intelligence. ● You’ll work with a founder who already built a working prototype and a detailed ontology. ● The system is technically deep: LLMs, graph databases, ETL, AWS cloud, and structured knowledge representation. ● Plenty of room to grow into an architect-level role as the platform scales.

Abrir en Upwork