Search Infrastructure Architect – Elasticsearch, PostGIS, & Uber H3 Expert (Consultant/Lead)
Buget: $500.0
FIXED /
⭐ 4.92 (2)
United Kingdom
elasticsearch, python, software-architecture, postgresql, database, database-architecture
We are building a next-generation business directory platform. We are bootstrapping our database with a backlog of 5 million core business listings.
We have an internal team of 2-3 capable MERN stack developers to handle frontend React components, standard API endpoints, and general Node.js routing. However, we need a specialized Search & Geolocation Infrastructure Architect to act as a consultant/lead. You will design our data structures, configure our servers, and guide our MERN team on how to interface with the core search cluster correctly.
Your Mission:
To design and deploy a high-speed, enterprise-grade hybrid search engine (Lexical + AI Vector Semantic Search) that processes complex geographic filters and dynamic ranking rules in under 50 milliseconds across 5 million rows.
Key Responsibilities:
Cluster Mapping: Architect and configure our Elasticsearch instance, writing production-ready JSON mapping templates, synonym filters, and autocomplete completion suggesters.
Geospatial Infrastructure: Set up our spatial database strategy using PostgreSQL/PostGIS. Integrate Uber’s H3 spatial index framework to handle high-speed radius lookups.
AI Vector Engineering: Define the vector generation pipeline for unstructured text data and implement kNN (k-Nearest Neighbors) semantic searches fused with keyword queries via Reciprocal Rank Fusion (RRF).
Algorithmic Ranking Logic: Write custom script_score structures inside Elasticsearch to combine text relevance with platform business rules (Bayesian review metrics, profile completeness scoring, and daily randomization fairness shuffles).
Synchronization Strategy: Design a Change-Data-Capture (CDC) pipeline (e.g., using PGsync or equivalent WAL streaming loops) to batch-sync millions of relational database rows out to Elasticsearch without thread pool exhaustion.
Team Guidance: Conduct architecture reviews and provide clear documentation/code boilerplates for our MERN developers so they can build standard API integrations flawlessly.
Required Technical Stack Expertise:
Search Engine: Elasticsearch / OpenSearch (Advanced level: Script scoring, Custom tokenizers, Hybrid search, RRF).
Spatial Database: PostgreSQL with PostGIS extension (ST_Contains, ST_Distance).
Spatial Hashing: Uber H3 Grid System (Experience calculating expanding hexagon grid disks natively in-database or via Node.js).
Geocoding Pipelines: Practical knowledge of open-source, self-hosted geocoding engines (Pelias or Nominatim/OpenStreetMap).
AI/ML Integration: Vector embeddings generation (text-embedding-3-small or HuggingFace open-source libraries) and vector search tuning.
Deschide pe Upwork