
Web Scraping Expert: Dynamic Regional Data Collection for Childcare Platform

Budget: $1250.0 FIXED / ⭐ 0.00 (0) United States

data-scraping, data-extraction, python

JOB DESCRIPTION: We are seeking a precise Data Extraction Specialist to build out our seed database for a localized summer and year-round school camp aggregator platform (GoCampCrew) targeting the Raleigh-Durham-Cary, NC market. You will extract programmatic metadata from 500+ regional camp provider domains, city recreation portals, and multi-branch sites (e.g., local YMCAs).

KEY DELIVERABLES:

Structured Data Pipelines: Build resilient scripts to extract data fields including: Provider/Brand name, physical facility branch locations, exact operational hours, price brackets, age groups, and comprehensive scheduling arrays.

Schema Mapping & Data Normalization: Format all extracted outputs directly into our pre-defined relational CSV/Airtable template. Text fields must be parsed cleanly; age limits and prices must be converted strictly to standardized numbers (integers), and categorical targets (e.g., traditional calendar vs. year-round school tracks 1, 2, 3, 4) must be bucketed into clean multi-select tags.

API Enrichment: Match and verify all physical addresses against the Google Places API to append precise Lat/Long coordinates and master Google Place IDs to every unique location row.

Your proposal should start with the word "Camp" so I know you read this thoroughly.

REQUIRED EXPERIENCE: Proven expertise with modern extraction tools and LLM-driven browser agents (e.g., Apify, BeautifulSoup, Puppeteer, Scrapy). Deep understanding of relational data structures, data normalization, and working with Python or Node.js scripts. Experience integrating and cleaning data using the Google Places/Maps API.
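To illustrate the kind of normalization the Schema Mapping deliverable asks for, here is a minimal Python sketch that turns free-text price, age, and calendar strings into integers and multi-select tags. The function names, the tag labels ("Traditional", "Track 1" and so on), and the input phrasings are assumptions made for illustration; the actual CSV/Airtable template is not shown in this posting and may require different columns or labels.

```python
import re


def parse_price(text):
    """Return the first dollar amount in `text` as whole dollars (cents dropped), or None."""
    match = re.search(r"\$\s*(\d[\d,]*)", text)
    if not match:
        return None
    return int(match.group(1).replace(",", ""))


def parse_age_range(text):
    """Return (min_age, max_age) as integers from text like 'Ages 5-12', or None."""
    match = re.search(r"(\d{1,2})\s*(?:-|–|to)\s*(\d{1,2})", text.lower())
    if not match:
        return None
    return int(match.group(1)), int(match.group(2))


def tag_calendars(text):
    """Bucket a calendar description into a list of multi-select tags.

    The tag vocabulary here is hypothetical, not the client's actual template.
    """
    lowered = text.lower()
    tags = []
    if "traditional" in lowered:
        tags.append("Traditional")
    # Capture the run of track numbers after "track"/"tracks", e.g. "1, 2 and 4".
    match = re.search(r"tracks?\s*((?:[1-4][\s,&/]*|and\s*)+)", lowered)
    if match:
        for digit in sorted(set(re.findall(r"[1-4]", match.group(1)))):
            tags.append(f"Track {digit}")
    return tags


if __name__ == "__main__":
    print(parse_price("$1,250.00 per week"))
    print(parse_age_range("Ages 5–12"))
    print(tag_calendars("Year-round tracks 1, 2 and 4"))
```

Returning None for unparseable values, rather than guessing, keeps ambiguous rows visible for manual review before they reach the template.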