← Jobs

Reddit data scraper — academic research project

Budget: $600.0 FIXED / ⭐ 0.00 (0) Canada

scrapy-framework, automation, python, selenium, data-scraping, crawlers, scripts-and-utilities, xpath, etl, pandas

Description I am a university researcher and need a professional Reddit scraper for an academic data collection project. The task is well-defined and I have sample output you can use as a template. The task: Scrape posts and comments from a list of 15 specified subreddits Date range: January 2020 – December 2026 Keep only posts where the original post is an image Collect all comments on each qualifying post Output must match a specified set of columns (template provided) Maximum total volume: 500,000 posts Deliverable 1 (milestone 1): Scrape 2020–2021 only, then pause and report back with the post count and a revised cost and timeline estimate for completing the full 2020–2026 range. I will review and confirm before you proceed. Deliverable 2 (milestone 2): Full scrape of 2022–2026 to agreed scope, delivered as clean CSV(s) matching the sample output template. What I will provide: Full list of 15 subreddits Sample output CSV to use as column template Detailed data collection spec document In your proposal, please include: - Your estimated total cost for the full 2020–2026 scrape - Your estimated timeline - Brief description of your approach (which tools/APIs you plan to use) Note: the estimate will be confirmed or adjusted after Milestone 1 based on actual data volume.
Auf Upwork öffnen