this post was submitted on 23 Jul 2023
17 points (100.0% liked)

Programming

17319 readers
153 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

  • Follow the programming.dev instance rules
  • Keep content related to programming in some way
  • If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev



founded 1 year ago
MODERATORS
 

Is anyone aware of an existing project that can do something like this:

  • Access an RSS feed.
  • Parse the contents of the items in the feed, and fetch linked images.
  • Take the new feed elements and add them to previously fetched elements.
  • Store all of the content in a merged RSS/XML file, or something like a SQLite DB.

Context: I'd like to archive Mastodon posts of an account automatically. I'd prefer it to be a script/binary I could run on Linux as I'd likely throw it in a GitHub action and save the resulting output in the git repo.

I could probably whip something together but I'm lazy and I'd prefer to use something that already exists.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] abhibeckert@lemmy.world 3 points 1 year ago* (last edited 1 year ago) (1 children)

I don't know of a project that does this, but if I was to tackle it I would convert the RSS to the Activity Streams standard - https://www.w3.org/TR/activitystreams-core/.

Activity Streams are basically the new RSS and it's a lot better than RSS.

Mastodon is built on Activity Pub, which is built on Activity Streams - so you shouldn't even need to touch RSS at all. The AS already exists. You can access it via the API.

Under European laws all services are required to give you a copy of all data associated with your account if you ask for it. And Mastodon being a European product is of course fully compliant. Just go to your profile and hit the "Request your Archive" button. You could do that once a month or something.

[โ€“] bogo@sh.itjust.works 1 points 1 year ago

Yes, the "Request Archive" method may be the "don't over engineer this stupid" option I go with.