this post was submitted on 21 Jul 2023
39 points (97.6% liked)

sh.itjust.works Main Community

Home of the sh.itjust.works instance.

submitted 1 year ago* (last edited 1 year ago) by Master@lemm.ee to c/main@sh.itjust.works
 

Going to post this here. Assuming their server doesn't have to work to post this to the federated section on this server...

I know the .ml instances are having issues because of the government taking over the .ml domain. But what is going on with sh.itjust.works?

top 17 comments
[–] TheDude@sh.itjust.works 78 points 1 year ago (3 children)

Hey all,

There have been a few issues over the last couple of weeks. Here's a rundown of some of them.

  • Broken Images: A few weeks ago we had issues with broken images. This was due to me migrating our local image storage to object storage (like AWS S3). When I made the switch, it broke all old images, as they needed to be migrated. Pictrs at that time did not support concurrent uploads, which means the migration would have taken days or weeks to complete, during which time the image service would have been offline. Instead I waited for a newer version that supported concurrent connections and did the migration in about 30-40 minutes one evening.

  • 5xx server errors: Some of you may have experienced a lemmy page with an error code on it. This was due to me trying to implement an additional proxy to shield the instance and mitigate future risks. While rolling this out I hit a few blockers that caused downtime as we worked to rectify them. I'm glad to say that as of today the proxy is in place.
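
To illustrate why concurrent uploads shorten a migration like the one in the first bullet, here is a minimal Python sketch. It is not Pictrs code and not what was actually run on this instance; `migrate_all`, `upload_one`, and `max_workers` are hypothetical names chosen for the example, and `upload_one` stands in for whatever call copies a single image to object storage.

```python
from concurrent.futures import ThreadPoolExecutor


def _try_upload(upload_one, path):
    """Run one upload and report success instead of raising."""
    try:
        upload_one(path)
        return path, True
    except Exception:
        return path, False


def migrate_all(paths, upload_one, max_workers=8):
    """Upload every path with a thread pool.

    `upload_one` is a hypothetical callable that copies a single file
    to object storage. Returns (succeeded, failed) lists in input order.
    """
    succeeded, failed = [], []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as `paths`,
        # while the uploads themselves overlap in time.
        results = pool.map(lambda p: _try_upload(upload_one, p), paths)
        for path, ok in results:
            (succeeded if ok else failed).append(path)
    return succeeded, failed
```

With uploads that mostly wait on the network, running several at once cuts the wall-clock time roughly in proportion to the worker count, until bandwidth or the storage backend becomes the limit. The failed list can be fed back into `migrate_all` for a retry pass.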

In addition to the above, the lemmyverse, and especially this instance, has been under bot attack almost daily. These bot attacks are eating resources and causing query floods.

Lemmy is still very young and in its early phases. In time these issues will slowly go away.

P.S. You don't have to worry about me leaving you guys hanging.

[–] eestileib@sh.itjust.works 22 points 1 year ago

The Dude Abides.

[–] Zaphodquixote@sh.itjust.works 15 points 1 year ago

S'alright. You're the man, the dude

[–] thelsim@sh.itjust.works 12 points 1 year ago

Thanks for the update, and thank you for all the hard work.

[–] rarely@sh.itjust.works 17 points 1 year ago (1 children)

They were down but aren’t anymore. This is going to happen from time to time for various reasons, but most importantly (and this is not an advert or endorsement for centralized services like reddit):

  • These instances are run by small teams, maybe even one person per instance. By “run by” I mean the admins who can actually host and support the hosting environment of the instance, not moderators, though that’s an important task too.
  • At reddit or other for-profit companies, multiple teams of people monitor multiple data centers’ worth of servers. They have 24/7 tech support crews, dashboards, alarms, alerts, escalation procedures drafted by other teams, and people they can escalate problems to, usually including a decent-sized team at the physical datacenter because of how many servers they buy. They can afford all that from advertising income because the site is popular enough, which is why it’s much rarer to see these services go down.

But so many things can and do fail, including:

  • updates (dependencies, breaking updates, “this should just have worked but it didn’t, why?!”)
  • server issues (too many memes and now the disk has runeth over)
  • one server that gets overloaded or is in a data center that has a network failure, or a hardware failure on the server where the virtual server is hosted
  • account got hacked
  • 0 day exploit targeted directly at this server
  • DoS or DDoS attack
  • Admin has a day job that they need to do to keep the lights on at home and at the lemmy instance and has to do their day job work.

Speaking from experience, but not with lemmy in particular.

[–] xylene@sh.itjust.works 5 points 1 year ago (1 children)

I had a new one this morning!

  • Dog bumps into server cabinet, pushing it against the wall, kinking the fiber optic cable that the DNS server uses.

[–] Master@sh.itjust.works 11 points 1 year ago

Cool it's back up again and my post from lemm.ee is here.

I didn't expect that a message posted to a local instance on another server while the main server was down would make it back to the target server once it came back up. I guess it just shows that this system does work as intended.
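
For anyone curious how a post can arrive after the target comes back, here is a toy Python sketch of the general store-and-retry idea. It is not Lemmy's actual federation code; the `Outbox` class and the `send` callable are hypothetical and exist only for this illustration.

```python
from collections import deque


class Outbox:
    """Toy delivery queue: keeps activities until the remote accepts them."""

    def __init__(self, send):
        # `send` is a hypothetical callable that returns True when the
        # remote server accepted the activity and False otherwise.
        self._send = send
        self._pending = deque()

    def publish(self, activity):
        """Queue an activity and try to deliver right away."""
        self._pending.append(activity)
        self.flush()

    def flush(self):
        """Deliver queued activities in order; stop at the first failure."""
        delivered = 0
        while self._pending:
            if not self._send(self._pending[0]):
                break  # remote still down, keep everything queued
            self._pending.popleft()
            delivered += 1
        return delivered

    def pending(self):
        return len(self._pending)
```

The sending server keeps the post locally and calls something like `flush()` again later, so nothing is lost while the target is unreachable. A real implementation would add backoff between attempts and eventually give up on instances that never return.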

[–] merde@sh.itjust.works 9 points 1 year ago (1 children)
[–] Master@lemm.ee 1 points 1 year ago (1 children)

It's still down for me.

https://sh.itjust.works/

But the pre-federated section of the local server still works.

[–] PrincessLeiasCat@lemmy.world 1 points 1 year ago* (last edited 1 year ago) (2 children)

I know it’s 3 days later, but I’m currently having issues as well and I can’t log into my account on that server, which is why I’m using this one.

Wondering if it’s just me.

[–] Master@lemm.ee 2 points 1 year ago

Right now I can connect just fine. But it's been sporadic all weekend.

[–] Master@sh.itjust.works 1 points 1 year ago

Up for my account here too

[–] OhNoMyInstanceIsDown@lemm.ee 7 points 1 year ago (1 children)
[–] Master@lemm.ee 3 points 1 year ago
[–] loaExMachina@sh.itjust.works 6 points 1 year ago

Sometimes, shit just doesn't work.

[–] exapsy@sh.itjust.works 5 points 1 year ago

we're here bro, we ain't leaving you.