this post was submitted on 29 Jun 2024
133 points (91.8% liked)
ChatGPT
Unofficial ChatGPT community to discuss anything ChatGPT
founded 1 year ago
I did Google that, fwiw, and the answer I got was that sparse autoencoders work by checking that the output aligns with the input.
If it's unknowable whether the input is correct, won't it still be prone to outputting confidently incorrect information?
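To make the "output aligns with the input" part concrete, here is a minimal sketch of a toy sparse autoencoder loss. Everything in it (the `sae_loss` name, the identity weights, the `l1` penalty value) is a made-up illustration, not how any particular model or library does it. As far as I understand it, the loss only compares the reconstruction against the input, plus a penalty that keeps the hidden code sparse. Nothing in it looks at whether the input itself is correct.

```python
# Toy sparse autoencoder loss (illustrative only; names and weights are assumptions).

def sae_loss(x, w_enc, w_dec, l1=0.01):
    """Return (loss, reconstruction) for a single input vector x."""
    # Encode: hidden code h = ReLU(w_enc @ x)
    h = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w_enc]
    # Decode: reconstruction x_hat = w_dec @ h
    x_hat = [sum(w * hi for w, hi in zip(row, h)) for row in w_dec]
    # Reconstruction term: how far the output is from the input
    recon = sum((a - b) ** 2 for a, b in zip(x, x_hat))
    # Sparsity term: L1 penalty on the hidden code
    sparsity = l1 * sum(abs(hi) for hi in h)
    return recon + sparsity, x_hat

# Hypothetical 2x2 identity weights, so the toy model copies its input.
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]

# Two different inputs; the loss has no notion of which one is "true".
loss_a, out_a = sae_loss([1.0, 0.0], IDENTITY, IDENTITY, l1=0.0)
loss_b, out_b = sae_loss([0.0, 1.0], IDENTITY, IDENTITY, l1=0.0)
```

Both inputs are reconstructed perfectly here, so both get the same reconstruction loss no matter which one represents something accurate. That seems to be the crux of the question.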