this post was submitted on 09 Sep 2024
56 points (86.8% liked)
Technology
59197 readers
2797 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Don't many game engines kinda sorta do this?
Yes, but game engines also hold the entire world inside themselves. There's no guessing, no estimating, no making sure that what it's looking at is actually a human or a bush - it already knows that.
The problem with computer vision being lazy is that it can't ignore something without understanding what it's looking at, and it can't understand what it's looking at without analyzing the data. It's a circular problem, and will be ridiculously hard to solve - the crux of the issue is that we as people are analyzing that same data, we just don't realize it.
Humans are bad at it, too. If you've ever ridden a bike or motorcycle, you quickly learn that car and truck drivers simply aren't looking for 2 wheelers. And therefore they don't see them. (I think this reinforces your point).
They are, but we've mostly got our subconscious doing it still, it's not that we're always doing big tasks we just have dedicated processes for it, so maybe that's one way to tackle the problem, specialised processes for sorting data types that engages the main process to do the processing of said data.