I've been saying for a while now: AI demos really well, but when you actually need it to do a thing it often fails spectacularly.
It's a verisimilitude engine: It tries to make something that looks like it should be right, rather than actually trying to be right. Sometimes the easiest answer is the right answer so it gives that, but when you start asking it harder questions, it'll just make something up that looks right but isn't.