this post was submitted on 07 Jun 2024
528 points (98.4% liked)

Technology

59381 readers
2796 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

A user on the online forum 4chan has leaked a massive 270GB of data purportedly belonging to The New York Times. This leak includes what is claimed to be the source code for the newspaper’s digital operations.

top 50 comments
sorted by: hot top controversial new old
[–] RagingHungryPanda@lemm.ee 178 points 5 months ago (5 children)
[–] wreckedcarzz@lemmy.world 120 points 5 months ago

"send nodes"

[–] SpaceNoodle@lemmy.world 26 points 5 months ago (3 children)
[–] CaptainSpaceman@lemmy.world 41 points 5 months ago (5 children)

Node has been around longer than web3

NPM nightmares intensify

load more comments (5 replies)
load more comments (2 replies)
[–] General_Effort@lemmy.world 22 points 5 months ago (3 children)

270GB of mostly node modules?

[–] SpaceNoodle@lemmy.world 52 points 5 months ago

You're right, it would be bigger if it was node

[–] asdfasdfasdf@lemmy.world 8 points 5 months ago

Sounds pretty average

load more comments (1 replies)
load more comments (2 replies)
[–] lurch@sh.itjust.works 98 points 5 months ago (2 children)

reminds me of the time someone said "Who is this 4chan?" on tv and it became a meme. good times

[–] setsneedtofeed@lemmy.world 7 points 5 months ago

He can't keep getting away with it.

[–] merthyr1831@lemmy.world 55 points 5 months ago* (last edited 5 months ago)

270GB feels insane for the source code of a single organisation. Is there media assets or backups in there too?

EDIT: yep, multiple subsidiaries and slack Comms which could inflate it by a lot. we post a whole lot of uncompressed shit on our slack

[–] daddy32@lemmy.world 53 points 5 months ago

NY Times has a freaking great data visualisations, they are (were?) employing a wizard in this space, doing custom extensions on d3.js.

[–] DudeImMacGyver@sh.itjust.works 47 points 5 months ago (2 children)

Source code... for a website?

[–] Potatos_are_not_friends@lemmy.world 113 points 5 months ago (1 children)

Subscription software. Tracking software. Ad tools. Promotion tools. Tools for journalists.

The website is just what you see.

[–] DudeImMacGyver@sh.itjust.works 19 points 5 months ago (1 children)

Yeah, I guess I didn't consider all the other operational shit that goes into providing content and funding for the website.

[–] aStonedSanta@lemm.ee 22 points 5 months ago (3 children)

It’s why our PCs have gotten insanely fast but websites still load like fucking trash. All the back end spying shit takes up a ton of cpu cycles. If you don’t already have em run ublock origin and no script and the internet is so fucking speedy 😆

[–] DudeImMacGyver@sh.itjust.works 7 points 5 months ago (1 children)

I hadn't noticed but then again I run Ublock Origin on Firefox.

[–] aStonedSanta@lemm.ee 5 points 5 months ago (1 children)

Yeah. You got yourself covered no script helps with JavaScript being pesky. But breaks a lot of shit tbh.

load more comments (1 replies)
load more comments (2 replies)
[–] MacNCheezus@lemmy.today 66 points 5 months ago* (last edited 5 months ago)

Anything more complicated than a static website is going to have a significant amount of server-side code.

Also, the article explains that it's not just the website, but ALL of their repos, which would include their smartphone apps, backend tools, etc.

[–] Dark_Arc@social.packetloss.gg 40 points 5 months ago* (last edited 5 months ago) (1 children)

I doubt this will affect much ... that's a lot more source code than I'd expect though, dang.

Presumably a lot of it is for internal operations (custom editing software or something of that ilk).

[–] tal@lemmy.today 15 points 5 months ago

It sounds like it's not all source code, from the article.

[–] jonne@infosec.pub 39 points 5 months ago (2 children)

Now everyone will get to run Wordle!

[–] General_Effort@lemmy.world 8 points 5 months ago* (last edited 5 months ago)

In case anyone missed the hubbub: [ETA: This is from March 2024; unconnected to this hack/leak]

https://apnews.com/article/new-york-times-wordle-clones-takedown-dmca-35d32b7548f7312ea74a2065b2cd31a6

The Times has filed several Digital Millennium Copyright Act, or DMCA, takedown notices to developers of Wordle-inspired games, which cited infringement on the Times’ ownership of the Wordle name, as well as its look and feel — such as the layout and color scheme of green, gray and yellow tiles.

Numerous impacted developers have also taken to social media to share their frustrations. Many said that their games, which range from Wordle-like offerings in other languages to more guessing games, would be taken down as a result.

Still, Brauneis said he believes the Times’ arguments for Wordle copyright infringement are on “a little bit shaky ground” for several reasons. Rules of a game, for example, are not covered by copyright — and that can include the layout of the game itself, he said.

load more comments (1 replies)
[–] autonomoususer@lemmy.world 23 points 5 months ago* (last edited 5 months ago) (3 children)

We still have no legal right to use, change and share its source code, control it both ourselves and in groups. It's still anti-libre software.

[–] seathru@lemmy.sdf.org 59 points 5 months ago* (last edited 5 months ago) (4 children)

Anything that may help develop better adblockers/paywall bypasses or exposes how/what of our personal information is collected is a win in my book. And this may very well be none of those things.

load more comments (4 replies)

Just seeing how something is approached helps.

I sometimes rebuild software from one language to another for practice.

load more comments (1 replies)
[–] Dogyote 16 points 5 months ago

Did this leak happen before or after NYT published an investigation detailing how Israeli forces were raping and torturing defenseless Palestinian detainees brought in from the Gaza Strip?

[–] skymtf@pricefield.org 14 points 5 months ago (2 children)

I have not read the news in a really long time just cause paywalls are annoying as frick.

[–] Dark_Arc@social.packetloss.gg 8 points 5 months ago (2 children)

Consider paying for the news...?

[–] Serinus@lemmy.world 16 points 5 months ago (1 children)

I'd only do that if you want independent news.

[–] Dark_Arc@social.packetloss.gg 5 points 5 months ago (3 children)

I'm not sure what you're saying here ...

[–] Serinus@lemmy.world 9 points 5 months ago (1 children)

Pay for news if you want it to be independent, and not beholden to sponsors.

I'd go as far as to say that paying for news (if you have the means to do so comfortably), is your duty as a commitment to democracy.

[–] Dark_Arc@social.packetloss.gg 5 points 5 months ago (3 children)

Ahh, yes I agree on all points; thanks for the clarification!

load more comments (3 replies)
[–] PrivateNoob@sopuli.xyz 4 points 5 months ago (2 children)

He probably means one of these (or both):

  1. New York Times is a huge corporation. The commenter would only support a site which is run by one creator, or with a genuine small team, which is transparent and not an asshole.

  2. New York Times is biased politically or accepting bribery attempts from other corpos to make them look in a better light.

[–] Serinus@lemmy.world 7 points 5 months ago (5 children)

Jesus Christ, no. It's almost like you're trying to sow distrust in the news and facts.

The NYT isn't perfect, but it's some of the most reliable news the world has.

As of March 2023, The New York Times Company employs 5,800 individuals,[101] including 1,700 journalists according to deputy managing editor Sam Dolnick.[122] Journalists for The New York Times may not run for public office, provide financial support to political candidates or causes, endorse candidates, or demonstrate public support for causes or movements.[123] Journalists are subject to the guidelines established in "Ethical Journalism" and "Guidelines on Integrity".[124] According to the former, Times journalists must abstain from using sources with a personal relationship to them and must not accept reimbursements or inducements from individuals who may be written about in The New York Times, with exceptions for gifts of nominal value.[125] The latter requires attribution and exact quotations, though exceptions are made for linguistic anomalies. Staff writers are expected to ensure the veracity of all written claims, but may delegate researching obscure facts to the research desk.[126] In March 2021, the Times established a committee to avoid journalistic conflicts of interest with work written for The New York Times, following columnist David Brooks's resignation from the Aspen Institute for his undisclosed work on the initiative Weave.[127]

load more comments (5 replies)
[–] TheBat@lemmy.world 7 points 5 months ago
  1. New York Times is a huge corporation. The commenter would only support a site which is run by one creator, or with a genuine small team, which is transparent and not an asshole.

Yeah but good luck chasing multiple stories across the world as a small team.

load more comments (1 replies)
load more comments (1 replies)
[–] Objection@lemmy.ml 5 points 5 months ago

You can go to archive.is and put in the url of a news story you want to read in the second box and it will usually let you bypass the paywall.

[–] JoMiran@lemmy.ml 10 points 5 months ago (1 children)

I expect that paywall to be fully useless soon.

[–] Dark_Arc@social.packetloss.gg 34 points 5 months ago (2 children)

That's a really silly take ... a Paywall is just an authorization mechanism.

That's like saying the source code of lemmy leaks and you expect your account to be compromised any second.

[–] example@reddthat.com 23 points 5 months ago (2 children)

I can sell you a copy of lemmys source code, are you interested?

[–] DudeImMacGyver@sh.itjust.works 6 points 5 months ago

I'll sell it for cheaper!

[–] wabafee@lemmy.world 6 points 5 months ago

I can give you 25 schmeckles.

load more comments (1 replies)
[–] LodeMike@lemmy.today 10 points 5 months ago (1 children)

Oh. They stopped seeding the torrent at 85%...

[–] reddithalation@sopuli.xyz 14 points 5 months ago (1 children)

but then made another torrent that is fully seeded

load more comments
view more: next ›