Data Scraping & Research Specialist
Budget: $650 (fixed price)
United States
data-scraping, marketing-research, contact-lists
Job Description
We are seeking an experienced data scraping and research professional to build a comprehensive statewide database of California Regional Center service providers.
The goal is to identify, organize, and analyze all vendorized providers across California’s 21 Regional Centers.
This project requires web scraping, data extraction, data cleaning, deduplication, and spreadsheet/database organization.
Scope of Work
Research and collect provider/vendor information from all 21 California Regional Centers.
For each provider, extract and organize the following fields:
* Provider/Vendor Name
* Vendor Number
* Service Code(s)
* Service Description
* Regional Center(s) Served
* Contact Name (if available)
* Email Address (if available)
* Phone Number (if available)
* Website (if available)
* Physical Address (if available)
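As an illustration of how the fields above might be held in code before export to Excel, here is a minimal sketch of a record type. The class name, field names, and sample values are assumptions made for this example; the posting does not prescribe a schema.

```python
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class VendorRecord:
    """One provider entry. Field names are illustrative, not mandated by the posting."""
    vendor_name: str
    vendor_number: str
    service_codes: List[str] = field(default_factory=list)
    service_description: str = ""
    regional_centers: List[str] = field(default_factory=list)
    # The contact fields are optional because the posting marks them "if available".
    contact_name: Optional[str] = None
    email: Optional[str] = None
    phone: Optional[str] = None
    website: Optional[str] = None
    address: Optional[str] = None


# Made-up sample data, used only to show the shape of a record.
example = VendorRecord(
    vendor_name="Example Provider LLC",
    vendor_number="X00000",
    service_codes=["CODE_A"],
    regional_centers=["Example Regional Center"],
)

# asdict() gives a flat mapping that can be written out as one spreadsheet row.
row = asdict(example)
```

Keeping service codes and Regional Centers as lists lets one vendor carry several of each, which matters for the later analysis and deduplication steps.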
Required Deliverables
Deliverable 1 – Master Vendor Database
Excel workbook containing:
* All vendorized providers statewide
* All associated service codes
* Regional center association
* Contact information
Deliverable 2 – Service Code Analysis
Create a report showing:
* Number of vendors by service code
* Number of vendors by regional center
* Number of vendors statewide
* Top service categories
* Vendors with multiple service codes
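The counts requested above could be computed along these lines. This is a sketch using only the Python standard library, assuming the data has already been flattened to one row per vendor, service code, and Regional Center; the row layout, code labels, and center names are placeholders, not real values.

```python
from collections import Counter, defaultdict

# Placeholder rows: one per (vendor, service code, regional center) combination.
rows = [
    {"vendor_number": "V1", "service_code": "CODE_A", "regional_center": "RC North"},
    {"vendor_number": "V1", "service_code": "CODE_B", "regional_center": "RC North"},
    {"vendor_number": "V2", "service_code": "CODE_A", "regional_center": "RC South"},
    {"vendor_number": "V3", "service_code": "CODE_A", "regional_center": "RC North"},
]


def distinct_vendors_by(rows, key):
    """Count distinct vendor numbers for each value of `key`."""
    groups = defaultdict(set)
    for r in rows:
        groups[r[key]].add(r["vendor_number"])
    return {value: len(vendors) for value, vendors in groups.items()}


vendors_by_code = distinct_vendors_by(rows, "service_code")
vendors_by_center = distinct_vendors_by(rows, "regional_center")
statewide_vendors = len({r["vendor_number"] for r in rows})

# Vendors that hold more than one service code.
codes_per_vendor = defaultdict(set)
for r in rows:
    codes_per_vendor[r["vendor_number"]].add(r["service_code"])
multi_code_vendors = sorted(v for v, codes in codes_per_vendor.items() if len(codes) > 1)

# Service codes ranked by number of distinct vendors.
top_codes = Counter(vendors_by_code).most_common(3)
```

Counting distinct vendor numbers rather than rows avoids inflating the totals when a vendor is listed under several codes or centers.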
Deliverable 3 – Marketing Lists
Separate Excel sheets containing email lists, one sheet per service category:
* Independent Living Services (ILS)
* Supported Living Services (SLS)
* Respite Services
* Transportation Services
* Day Programs
* Residential Services
* Employment Services
* Behavioral Services
Each list should include:
* Company Name
* Email
* Regional Center
* Service Code
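One possible way to build these lists is sketched below. It assumes a lookup table from service code to category exists; the `CATEGORY_BY_CODE` mapping, the code labels, and the sample records are hypothetical placeholders, and the real mapping would have to come from the collected service code descriptions.

```python
from collections import defaultdict

# Hypothetical mapping from service code to marketing category (placeholder codes).
CATEGORY_BY_CODE = {
    "CODE_ILS": "Independent Living Services (ILS)",
    "CODE_SLS": "Supported Living Services (SLS)",
}

# Made-up sample records.
records = [
    {"company": "Alpha Care", "email": "Info@Alpha.example ",
     "regional_center": "RC North", "service_code": "CODE_ILS"},
    {"company": "Beta Homes", "email": None,
     "regional_center": "RC South", "service_code": "CODE_SLS"},
    {"company": "Gamma Living", "email": "hello@gamma.example",
     "regional_center": "RC South", "service_code": "CODE_SLS"},
]


def build_marketing_lists(records, category_by_code):
    """Group records that have an email address by service category."""
    lists = defaultdict(list)
    for rec in records:
        category = category_by_code.get(rec["service_code"])
        email = (rec.get("email") or "").strip().lower()
        if category is None or not email:
            continue  # skip unmapped codes and records without an email
        lists[category].append({
            "Company Name": rec["company"],
            "Email": email,
            "Regional Center": rec["regional_center"],
            "Service Code": rec["service_code"],
        })
    return dict(lists)


marketing_lists = build_marketing_lists(records, CATEGORY_BY_CODE)
```

Each key of `marketing_lists` would then correspond to one sheet in the workbook, with the four requested columns per row.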
Deliverable 4 – Deduplication
Many providers are likely to appear in more than one directory.
Please:
* Identify duplicate providers
* Merge records where appropriate
* Create a unique statewide vendor count
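The deduplication steps above can be sketched as follows. This is a simple heuristic, not a prescribed method: it assumes two entries are the same provider when their names match after lowercasing and removing punctuation and common company suffixes. The suffix list and sample records are assumptions, and a real project would likely also compare vendor numbers, addresses, or phone numbers before merging.

```python
import re

# Assumed list of suffixes to ignore when comparing names.
SUFFIXES = {"inc", "llc", "corp", "co", "ltd"}


def name_key(name):
    """Normalize a provider name into a comparison key."""
    tokens = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    return " ".join(t for t in tokens if t not in SUFFIXES)


def merge_duplicates(records):
    """Merge records sharing a name key, unioning codes and regional centers."""
    merged = {}
    for rec in records:
        key = name_key(rec["vendor_name"])
        if key not in merged:
            merged[key] = {
                "vendor_name": rec["vendor_name"],
                "service_codes": set(rec["service_codes"]),
                "regional_centers": set(rec["regional_centers"]),
                "email": rec.get("email"),
            }
        else:
            target = merged[key]
            target["service_codes"] |= set(rec["service_codes"])
            target["regional_centers"] |= set(rec["regional_centers"])
            # Keep the first email found; fill it in if it was missing.
            target["email"] = target["email"] or rec.get("email")
    return list(merged.values())


# Made-up sample data.
records = [
    {"vendor_name": "Example Care, Inc.", "service_codes": ["CODE_A"],
     "regional_centers": ["RC North"], "email": None},
    {"vendor_name": "EXAMPLE CARE INC", "service_codes": ["CODE_B"],
     "regional_centers": ["RC South"], "email": "info@examplecare.example"},
    {"vendor_name": "Sample Transit LLC", "service_codes": ["CODE_C"],
     "regional_centers": ["RC North"], "email": None},
]

unique_vendors = merge_duplicates(records)
unique_vendor_count = len(unique_vendors)
```

Name-only matching can wrongly merge unrelated providers with similar names, so merged pairs are worth flagging for manual review before the final statewide count is reported.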
Technical Requirements
Preferred experience with:
* Python
* BeautifulSoup
* Scrapy
* Playwright
* Selenium
* Pandas
* Excel Data Analysis
Experience scraping government, healthcare, or provider directories is highly preferred.
Project Success Criteria
The final deliverable should allow us to answer questions such as:
* How many ILS providers exist statewide?
* How many SLS providers exist statewide?
* How many providers exist by service code?
* Which regional centers have the highest concentration of providers?
* What contact information is available for providers by service category?
Proposal Requirements
Please include:
1. Examples of similar scraping or database projects.
2. Estimated completion timeline.
3. Estimated accuracy rate of the collected data.
4. Whether you can automate future updates to the database.
5. Total fixed-price bid.
Budget
Open to proposals.
Preference will be given to applicants who can demonstrate large-scale scraping, data cleaning, and deduplication experience.