
Data Scraping & Research Specialist

Budget: $650.0 FIXED / ⭐ 0.00 (0) United States

data-scraping, marketing-research, contact-lists

Job Description

We are seeking an experienced data scraping and research professional to build a comprehensive statewide database of California Regional Center service providers. The goal is to identify, organize, and analyze all vendorized providers across California’s 21 Regional Centers.

This project requires web scraping, data extraction, data cleaning, deduplication, and spreadsheet/database organization.

Scope of Work

Research and collect provider/vendor information from all 21 California Regional Centers.

Extract and organize:
* Provider/Vendor Name
* Vendor Number
* Service Code(s)
* Service Description
* Regional Center(s) Served
* Contact Name (if available)
* Email Address (if available)
* Phone Number (if available)
* Website (if available)
* Physical Address (if available)

Required Deliverables

Deliverable 1 – Master Vendor Database

Excel workbook containing:
* All vendorized providers statewide
* All associated service codes
* Regional center association
* Contact information

Deliverable 2 – Service Code Analysis

Create a report showing:
* Number of vendors by service code
* Number of vendors by regional center
* Number of vendors statewide
* Top service categories
* Vendors with multiple service codes

Deliverable 3 – Marketing Lists

Separate Excel sheets containing email lists grouped by:
* Independent Living Services (ILS)
* Supported Living Services (SLS)
* Respite Services
* Transportation Services
* Day Programs
* Residential Services
* Employment Services
* Behavioral Services

Each list should indicate:
* Company Name
* Email
* Regional Center
* Service Code

Deliverable 4 – Deduplication

Many providers may appear in multiple directories. Please:
* Identify duplicate providers
* Merge records where appropriate
* Create a unique statewide vendor count

Technical Requirements

Preferred experience with:
* Python
* BeautifulSoup
* Scrapy
* Playwright
* Selenium
* Pandas
* Excel Data Analysis

Experience scraping government, healthcare, or provider directories is highly preferred.

Project Success Criteria

The final deliverable should allow us to answer questions such as:
* How many ILS providers exist statewide?
* How many SLS providers exist statewide?
* How many providers exist by service code?
* Which regional centers have the highest concentration of providers?
* What contact information is available for providers by service category?

Proposal Requirements

Please include:
1. Examples of similar scraping or database projects.
2. Estimated completion timeline.
3. Estimated accuracy rate.
4. Whether you can automate updates in the future.
5. Total fixed-price bid.

Budget

Open to proposals. Preference will be given to applicants who can demonstrate large-scale scraping, data cleaning, and deduplication experience.
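
Illustrative sketch (not part of the client's requirements): one possible way to approach the deduplication and vendor-count steps described under Deliverables 2 and 4, using only the Python standard library. The sample rows, field names, vendor numbers, regional center labels ("RC-A", "RC-B"), and service code labels are all hypothetical placeholders, and matching on a normalized company name is an assumed heuristic. A real merge would likely also need to compare addresses, phone numbers, or vendor numbers before treating two records as the same provider.

```python
import re
from collections import Counter

# Hypothetical sample rows, as if collected from two regional center directories.
# None of these names, numbers, or codes come from a real directory.
RAW_ROWS = [
    {"vendor_name": "Sunrise Living, Inc.", "vendor_number": "V001",
     "service_code": "ILS", "regional_center": "RC-A"},
    {"vendor_name": "SUNRISE LIVING INC", "vendor_number": "V002",
     "service_code": "SLS", "regional_center": "RC-B"},
    {"vendor_name": "Harbor Transit LLC", "vendor_number": "V003",
     "service_code": "TRANSPORT", "regional_center": "RC-A"},
    {"vendor_name": "Harbor Transit LLC", "vendor_number": "V003",
     "service_code": "TRANSPORT", "regional_center": "RC-A"},  # exact duplicate
]

# Legal suffixes ignored when comparing names (an assumed, incomplete list).
_SUFFIXES = {"inc", "llc", "corp", "co", "ltd"}


def normalize_name(name: str) -> str:
    """Lowercase, drop punctuation and legal suffixes, collapse whitespace."""
    tokens = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    return " ".join(t for t in tokens if t not in _SUFFIXES)


def merge_vendors(rows):
    """Group rows by normalized name, collecting codes, centers, and numbers."""
    merged = {}
    for row in rows:
        key = normalize_name(row["vendor_name"])
        record = merged.setdefault(key, {
            "vendor_name": row["vendor_name"],  # keep first spelling seen
            "vendor_numbers": set(),
            "service_codes": set(),
            "regional_centers": set(),
        })
        record["vendor_numbers"].add(row["vendor_number"])
        record["service_codes"].add(row["service_code"])
        record["regional_centers"].add(row["regional_center"])
    return merged


def count_by(merged, field):
    """Count unique vendors per value of a set-valued field."""
    counts = Counter()
    for record in merged.values():
        counts.update(record[field])
    return counts


merged = merge_vendors(RAW_ROWS)
unique_vendor_count = len(merged)
vendors_by_code = count_by(merged, "service_codes")
vendors_by_center = count_by(merged, "regional_centers")
multi_code_vendors = sorted(
    key for key, record in merged.items() if len(record["service_codes"]) > 1
)
```

In this sketch, `unique_vendor_count` corresponds to the unique statewide vendor count, `vendors_by_code` and `vendors_by_center` to the per-code and per-center tallies, and `multi_code_vendors` to vendors holding more than one service code. Name-only matching can wrongly merge distinct providers that share a name, so any merged record would still need a manual or rule-based review.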