156
‘Embarrassingly simple’ probe finds AI in medical image diagnosis ‘worse than random’
(venturebeat.com)
Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.
We have models that are specifically made to be good at these kinds of tasks. Why would you choose the ones that aren't and then make generalizing claims about how AI sucks in this domain?
Yeah this is probably just straight up misinformation. By no means is a diagnosis going to be made by a generalist multimodal LLM. Diagnosis is a literally a binary classification (although that is an oversimplification) and on medical CV you are optimizing on that directly.
They did not use a LLM.
You've quoted them stating they used LLMs while claiming they did not use a LLM? What am I missing here?
"L" "M" "M"
Which in this context just means multimodal LLM, correct?
Correct.
large language models (LLM) vs. large multi-modal models (LMM)
Regardless, they both use an LLM as the main driver. Multi modal just means that the LLM is interfaced with generative and/or predictive AIs for other types of content like images, sound, video, etc.
This is using a generalist tool for a specialized job. I'd expect the limit for LMMs is telling you if your picture is a heart or a kidney... Maybe. With low accuracy. Diagnosing? lol, hell no.
What a joke, a few generic LLMs making a judgement call about all AI models.
They used one to create the dataset for their experiments:
It seems like this is a case of "they just aren't using AI right, if they used it right it works" when it sure looks like they are using the models intended for these specific medical tasks.
Those are not the sort of model anybody in the field would use (medical CV with deep learning based analysis is a vibrant field with many breakthroughs in recent years). These are the sort of models tech bros are trying to sell to the public as general AI. There is a world of difference.