AI/RAG Developer & Data Engineer — Document Extraction Engine
Budżet: $7000.0
FIXED /
⭐ 0.00 (0)
France
machine-learning, python, artificial-intelligence, natural-language-processing, chatbot-development, api, deep-learning
Tech startup looking for an AI developer / data engineer to improve and scale the extraction engine of our existing AI document-processing software, DataXtrak.
Important: the software already exists and works. We are not starting from scratch ,there is a functional prototype in place. We need someone to build on top of it, improve it and make it production-ready, not rebuild it from zero.
Scope:
-Enhance the existing document data-extraction pipeline (data engineering): make it handle large volumes reliably.
-Improve / extend the AI / RAG system (retrieval-augmented generation) for more accurate, intelligent extraction.
-Add strong data cleaning and structuring (handle messy, inconsistent or duplicate data and turn it into clean, reliable output — structured exports / spreadsheets).
-Add an optimized database search system (fast, efficient querying and indexing across large datasets).
-Set up the server architecture and deployment to move the existing software from desktop to a scalable, secure SaaS (servers, storage, APIs).
-Ensure strong data security and confidentiality at every layer (we are a European company and must follow GDPR / European data-protection standards).
Profile:
Strong in Python, data engineering and document processing.
Solid experience with LLMs / RAG systems.
Experience in data cleaning and data quality (normalizing messy real-world data).
Comfortable with databases and query optimization.
Experience deploying secure, scalable server / cloud (SaaS) infrastructure.
Mindful of data security and GDPR compliance.
Able to work on and improve an existing codebase (not just greenfield projects).
Engagement: remote, hourly.
Goal: production-ready version before the end of August.
Please share examples of similar AI / RAG and data-engineering projects you've built.
Otwórz na Upwork