this post was submitted on 25 May 2024
132 points (94.6% liked)

Technology

58133 readers
4819 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Along with @maciejwolczyk we've been training a neural network that learns how to play NetHack, an old roguelike game, that looks like in the screenshot. Recently, something unexpected happened.

you are viewing a single comment's thread
view the rest of the comments
[–] ArbitraryValue@sh.itjust.works 51 points 3 months ago* (last edited 3 months ago) (2 children)

Their problem:

So apparently NetHack has a mechanic that slightly changes how the game plays every time it's full moon according to your system clock

The model wasn't trained on a full moon. They had a system to set up the environment for replicable results but it didn't include modifying the system time.

It reminds me of another bug with the system time, which a friend of mine encountered. He was working on hardware and he was getting a lot of units that worked fine at the factory, immediately failed at the client's location, and then worked again when they were returned to the factory. It turned out that when these machines were turned on, their embedded OS automatically queried some server to update the current time. The client's internet connection had such high latency that the server's response only came back after the machine was already in use. This generated a huge delta-t value that triggered the sanity checks and shut the machine down. The factory had a much lower-latency connection and so the race condition could never be replicated there.

As for the weirdest bug I ever encountered myself: a compiler generating bad machine code. I have often said that the worst part of programming is that the computer always does exactly what you tell it to, but that was the one and only time in twenty years that the computer actually didn't.

[–] tja@sh.itjust.works 45 points 3 months ago (1 children)
[–] doubletwist@lemmy.world 2 points 3 months ago

That was the first thing I thought of.

[–] tsonfeir@lemmy.world 7 points 3 months ago

Their problem was not understanding the game ;)