This is Pushshift data from before Reddit nuked it in March. You can find this torrent (called "Reddit comments/submissions 2005-06 to 2022-12") and others, including 2023-01 and 2023-02, on https://academictorrents.com, uploaded by user Watchful1.
this post was submitted on 26 Jun 2023
What's the context and background here? It would be nice to know what's in some of these 4GB compressed files before downloading them.
It has JSON files with every submission and comment posted to Reddit in that date range.
Agreed, what’s in these? Raw text? Image metadata?
Nothin' but JSON compressed with zstd. You can also grab individual subreddits at https://the-eye.eu/redarcs/
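If you want to peek inside one of the files without unpacking the whole thing, something like this rough Python sketch should do it. It assumes each file is newline-delimited JSON (one object per line) and that you have the third-party `zstandard` package installed. The function names, the file name, and the large `max_window_size` setting are my own assumptions, not anything stated in the torrent description.

```python
import io
import json


def iter_json_lines(text_stream):
    """Yield one parsed object per non-empty line of a text stream."""
    for line in text_stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def open_zst_text(path):
    """Open a .zst file as a streaming UTF-8 text reader.

    Needs the third-party package: pip install zstandard
    """
    import zstandard  # imported here so the parser above works without it

    raw = open(path, "rb")
    # Assumption: the dumps were compressed with a large window, so the
    # decompressor's default limit is raised to 2 GiB.
    decompressor = zstandard.ZstdDecompressor(max_window_size=2**31)
    return io.TextIOWrapper(decompressor.stream_reader(raw), encoding="utf-8")


def peek(path, count=5):
    """Return the first `count` records without decompressing the whole file."""
    records = []
    with open_zst_text(path) as stream:
        for record in iter_json_lines(stream):
            records.append(record)
            if len(records) >= count:
                break
    return records
```

Calling `peek("RC_2022-12.zst")` (a hypothetical file name) would hand back the first few records as dicts, so you can check the field names before committing to a full download or a longer processing run. It streams, so memory use stays small even on the multi-gigabyte files.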