GPT-4 performance comparable with physicians on official medical board residency examinations. Model performance near or above official passing rate in all medical specialties tested

cyu@sh.itjust.works · 6 months ago

GPT-4 performance comparable with physicians on official medical board residency examinations. Model performance near or above official passing rate in all medical specialties tested

loathesome dongeater@lemmygrad.ml · 6 months ago

This research has been done a lot of a times but I don’t see the point of it. Exams are something I would expect LLMs, especially the higher end ones, to do well because of their nature. But it says next to nothing about how reliable the LLM as an actual doctor.

beardown@lemm.ee · 6 months ago

But it says next to nothing about how reliable the LLM as an actual doctor.

Yet these tests say anything about how a human would be as an actual doctor?

loathesome dongeater@lemmygrad.ml · 6 months ago

It says as much as it does for an LLM but doctors have to have a lot of field experience after passing these tests before they get certified as doctors.

beardown@lemm.ee · 6 months ago

Then we should remove such tests and, if anything, increase such field experience

loathesome dongeater@lemmygrad.ml · 6 months ago

Why?

beardown@lemm.ee · 6 months ago

Because clearly passing such tests doesn’t matter. If it did matter then it would be noteworthy and have implications for the labor value of doctors that gpt could pass the tests to a better extent than many of them

loathesome dongeater@lemmygrad.ml · 6 months ago

Tests are meant to gatekeep who gets to get the field training required to become a doctor. Sending every jabroni into residency willy-nilly is probably gonna collapse the healthcare system completely.

gregorum@lemm.ee · edit-2 6 months ago

Even those who do well in testing of wrote knowledge can perform poorly in practical exercises. That’s why medical doctors have to train and qualify through several years of supervised residency before being allowed to practice even basic medicine.

GPT-4 can’t do even that.

GPT-4 performance comparable with physicians on official medical board residency examinations. Model performance near or above official passing rate in all medical specialties tested

GPT-4 performance comparable with physicians on official medical board residency examinations. Model performance near or above official passing rate in all medical specialties tested

Just a moment...