This research has been done a lot of a times but I don’t see the point of it. Exams are something I would expect LLMs, especially the higher end ones, to do well because of their nature. But it says next to nothing about how reliable the LLM as an actual doctor.
It says as much as it does for an LLM but doctors have to have a lot of field experience after passing these tests before they get certified as doctors.
Because clearly passing such tests doesn’t matter. If it did matter then it would be noteworthy and have implications for the labor value of doctors that gpt could pass the tests to a better extent than many of them
Tests are meant to gatekeep who gets to get the field training required to become a doctor. Sending every jabroni into residency willy-nilly is probably gonna collapse the healthcare system completely.
Even those who do well in testing of wrote knowledge can perform poorly in practical exercises. That’s why medical doctors have to train and qualify through several years of supervised residency before being allowed to practice even basic medicine.
This research has been done a lot of a times but I don’t see the point of it. Exams are something I would expect LLMs, especially the higher end ones, to do well because of their nature. But it says next to nothing about how reliable the LLM as an actual doctor.
Yet these tests say anything about how a human would be as an actual doctor?
It says as much as it does for an LLM but doctors have to have a lot of field experience after passing these tests before they get certified as doctors.
Then we should remove such tests and, if anything, increase such field experience
Why?
Because clearly passing such tests doesn’t matter. If it did matter then it would be noteworthy and have implications for the labor value of doctors that gpt could pass the tests to a better extent than many of them
Tests are meant to gatekeep who gets to get the field training required to become a doctor. Sending every jabroni into residency willy-nilly is probably gonna collapse the healthcare system completely.
Even those who do well in testing of wrote knowledge can perform poorly in practical exercises. That’s why medical doctors have to train and qualify through several years of supervised residency before being allowed to practice even basic medicine.
GPT-4 can’t do even that.