Devin, the obviously fake "AI Developer", turns out to be fake

V0ldek@awful.systems · edit-2 10 months ago

aio@awful.systems · 10 months ago

the task the AI solves is writing test cases for finding the Least Common Multiple modulo a number.

Looking at the image of the prompt, it looks more like a CRT computation to me.

It’s famously much easier to verify modulo arithmetic than it is to actually compute it.

It’s not particularly difficult to compute CRT, though it is definitely trivial to verify the result afterwards. I’m not sure I’d agree that that’s a general fact about modular arithmetic computations though.
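To make the compute-versus-verify comparison concrete, here is a minimal Python sketch of CRT for pairwise coprime moduli. The function names `crt` and `verify_crt` are made up for illustration and are not taken from the prompt in the image; the three-argument `pow` with a negative exponent needs Python 3.8 or later.

```python
from math import prod


def crt(residues, moduli):
    """Solve x = r_i (mod m_i) for pairwise coprime moduli.

    Returns the unique solution in the range [0, product of moduli).
    """
    total = prod(moduli)
    x = 0
    for r, m in zip(residues, moduli):
        partial = total // m
        # pow(partial, -1, m) is the inverse of partial modulo m;
        # it raises ValueError if the moduli are not coprime.
        x += r * partial * pow(partial, -1, m)
    return x % total


def verify_crt(x, residues, moduli):
    """Check a candidate solution: one remainder per congruence."""
    return all(x % m == r % m for r, m in zip(residues, moduli))
```

Computing is short, but the check is shorter still: it only needs one `%` per congruence and no inverses at all.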

V0ldek@awful.systems · 10 months ago

It’s provably easier to verify that a claimed multiplicative inverse of a modulo m is correct than it is to actually find one. And non-provably, but rather obviously, verification takes much less code and effort.
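A rough Python sketch of the difference, assuming the extended Euclidean algorithm as the way to find the inverse (other methods exist). The names `egcd`, `mod_inverse` and `is_inverse` are hypothetical helpers written for this comparison.

```python
def egcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) and a*s + b*t == g."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    return old_r, old_s, old_t


def mod_inverse(a, m):
    """Find x with a*x = 1 (mod m); needs a loop of division steps."""
    g, s, _ = egcd(a % m, m)
    if g != 1:
        raise ValueError("a has no inverse modulo m")
    return s % m


def is_inverse(a, x, m):
    """Verify a claimed inverse: a single multiplication and reduction."""
    return (a * x) % m == 1
```

Finding the inverse takes a loop whose length grows with the size of the inputs, while checking a candidate is one multiply and one remainder.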

archive.is