OpenAI says its latest models outperform doctors in medical benchmark

Pro@programming.dev · 2 months ago

Bezier@suppo.fi · 2 months ago

Tl;dr: After performing poorly on benchmarks, OpenAI created their own. OpenAI products perform much better on OpenAI benchmark.

Opinionhaver@feddit.uk · 2 months ago

The bar exam isn’t created by OpenAI, yet the outdated GPT-4 model still ranked in the 90th percentile on it.

orclev@lemmy.world · 2 months ago

Wake me up when someone besides OpenAI says they’re the best at something. When a company releases a benchmark it designed itself, and its own tool, generally regarded as not very good, is suddenly the best at it, that’s not news. At best that’s PR, at worst propaganda. This reeks of “we investigated ourselves and found we did nothing wrong”.

banghida@lemm.ee · 2 months ago

Sure thing

taladar@sh.itjust.works · 2 months ago

So they created a test so broken and warped that no actual professional can understand it but their AI performs well on it?

etchinghillside@reddthat.com · 2 months ago

US Healthcare will now be affordable!

Buffalox@lemmy.world · 2 months ago

I almost feel sad for IBM, this was supposed to be their thing.

etchinghillside@reddthat.com · 2 months ago

Had forgotten we’ve been promised this before.

TrendigOsthyvel@lemmy.world · 2 months ago

Much wow.