Reddit data scraper — academic research project
Budget: $600.0 (fixed price)
⭐ 0.00 (0)
Canada
scrapy-framework, automation, python, selenium, data-scraping, crawlers, scripts-and-utilities, xpath, etl, pandas
Description
I am a university researcher and need a professional Reddit scraper for an academic data collection project. The task is well-defined and I have sample output you can use as a template.
The task:
- Scrape posts and comments from a list of 15 specified subreddits
- Date range: January 2020 – December 2026
- Keep only posts where the original post is an image
- Collect all comments on each qualifying post
- Output must match a specified set of columns (template provided)
- Maximum total volume: 500,000 posts
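To make the filtering rules above concrete, here is a minimal sketch of how the date range, image-only, and volume-cap criteria might be applied to posts that have already been fetched. It assumes each post is a dict shaped like Reddit's listing JSON, with `created_utc`, `url`, and optionally `post_hint` fields; the function names and the extension list are hypothetical, and the real definition of "image post" should follow the spec document.

```python
from datetime import datetime, timezone

# Date window from the task description (upper bound is exclusive).
START = datetime(2020, 1, 1, tzinfo=timezone.utc)
END = datetime(2027, 1, 1, tzinfo=timezone.utc)  # i.e. through December 2026
MAX_POSTS = 500_000

# Assumption: a direct link ending in one of these counts as an image.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png", ".gif", ".webp")


def is_image_post(post):
    """Return True if the post looks like an image submission."""
    if post.get("post_hint") == "image":
        return True
    # Drop any query string before checking the file extension.
    url = (post.get("url") or "").lower().split("?", 1)[0]
    return url.endswith(IMAGE_EXTENSIONS)


def in_date_range(post, start=START, end=END):
    """Return True if the post was created inside [start, end)."""
    created = datetime.fromtimestamp(post["created_utc"], tz=timezone.utc)
    return start <= created < end


def select_posts(posts, max_posts=MAX_POSTS):
    """Keep qualifying posts in input order, stopping at the volume cap."""
    kept = []
    for post in posts:
        if len(kept) >= max_posts:
            break
        if in_date_range(post) and is_image_post(post):
            kept.append(post)
    return kept
```

Comments would then be collected only for the posts that `select_posts` returns, which keeps the comment crawl proportional to the qualifying set.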
Deliverable 1 (milestone 1):
Scrape 2020–2021 only, then pause and report back with the post count and a revised cost and timeline estimate for completing the full 2020–2026 range. I will review and confirm before you proceed.
Deliverable 2 (milestone 2):
Full scrape of 2022–2026 to agreed scope, delivered as clean CSV(s) matching the sample output template.
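As an illustration of "matching the sample output template", one possible approach is to read the header row from the sample CSV and use it as the fixed column list for every output file. The sketch below uses only the standard `csv` module; the function names are hypothetical, and it assumes the template's first row is the header.

```python
import csv


def read_template_columns(template_file):
    """Return the column names from the first row of the template CSV."""
    return next(csv.reader(template_file))


def write_matching_csv(columns, rows, out_file):
    """Write dict rows using exactly the template's columns, in order.

    Keys not in the template are dropped; missing keys become empty cells.
    """
    writer = csv.DictWriter(
        out_file, fieldnames=columns, restval="", extrasaction="ignore"
    )
    writer.writeheader()
    writer.writerows(rows)
```

When working with real files, opening them with `newline=""` (as the `csv` module documentation recommends) avoids blank lines between rows on Windows.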
What I will provide:
- Full list of 15 subreddits
- Sample output CSV to use as column template
- Detailed data collection spec document
In your proposal, please include:
- Your estimated total cost for the full 2020–2026 scrape
- Your estimated timeline
- Brief description of your approach (which tools/APIs you plan to use)
Note: the estimate will be confirmed or adjusted after Milestone 1 based on actual data volume.