this post was submitted on 23 Oct 2024
39 points (100.0% liked)

Technology

37734 readers
380 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
 

Some days, continuing to read the news can be stressful.

top 6 comments
sorted by: hot top controversial new old
[–] Deceptichum@quokk.au 22 points 1 month ago* (last edited 1 month ago)

Oh sick. Now this is the stuff I’m most excited with AI lately. Apples doing an implementation as well.

Now you could say a command such as “close the window” or “click the picture of a puppy”. It’s an amazing accessibility tool. So much better then those eye tracking or screen grid coordinate systems we had prior.

Or issuing a command such as “go to this website, add this to my cart, and check out” sure my Alexa or Home can do it with their predefined stores, but this opens up any site or program that a human can operate. So it’s useful for everyone at the end of the day.

[–] Moonrise2473@feddit.it 8 points 4 weeks ago (1 children)

The fact that suddenly it went to watch a leisure website during work... did they use stolen screen recordings from human activity for training? Like if some corporation allowed them to record all the activity of their employees for training

[–] mosscap 10 points 4 weeks ago

You mean like Microsoft Recall?

[–] Kissaki@beehaw.org 6 points 4 weeks ago (1 children)

IIRC Windows has an accessibility feature where the cursor jumps to the primary default action in opening dialogs.


Doing it screenshot based seems inefficient if y du could iterate through windows and controls.

[–] flashgnash@lemm.ee 3 points 4 weeks ago (1 children)

Makes it work universally, even if the gui isn't made with a standard toolkit

Also it's ai they don't care about efficiency

[–] Toribor@corndog.social 2 points 4 weeks ago

Yeah this is one of those things where accessibility settings can probably get you 90% there but screenshots and machine learning can probably close the gap somewhat reliably (even if it's much less efficient).