Since new OpenAI releases are usually followed by nonsense commentary and bubble inflation, a note:
Evaluating a large statistical model with a handful of manual tests ("it solved this riddle!") is like assuming the home remedy you took cured your cold. Methods like these lead only to superstition.
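To put a rough number on how little a handful of tests tells you, here is a minimal sketch that computes a Wilson score interval for a pass rate. The function name `wilson_interval`, the 95% level (`z = 1.96`), and the sample counts are illustrative assumptions, and the interval itself assumes the test cases are independent draws from the tasks you care about, which hand-picked riddles rarely are.

```python
import math


def wilson_interval(successes: int, trials: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial pass rate.

    z = 1.96 corresponds to roughly 95% confidence (an assumption chosen
    for illustration; pick whatever level suits your evaluation).
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= successes <= trials:
        raise ValueError("successes must be between 0 and trials")

    p_hat = successes / trials
    denom = 1 + z**2 / trials
    center = (p_hat + z**2 / (2 * trials)) / denom
    half_width = (
        z * math.sqrt(p_hat * (1 - p_hat) / trials + z**2 / (4 * trials**2)) / denom
    )
    # Clamp to [0, 1] to absorb floating-point overshoot at the edges.
    return max(0.0, center - half_width), min(1.0, center + half_width)


# Hypothetical counts: the same perfect score, at two very different sample sizes.
small_low, small_high = wilson_interval(5, 5)
large_low, large_high = wilson_interval(500, 500)

print(f"5/5 passed:     true pass rate plausibly in [{small_low:.3f}, {small_high:.3f}]")
print(f"500/500 passed: true pass rate plausibly in [{large_low:.3f}, {large_high:.3f}]")
```

Under these assumptions, five passes out of five are still compatible with a true pass rate of about 0.57, while five hundred out of five hundred push the lower bound above 0.99. The arithmetic is the easy part; the harder part is choosing test cases that were not cherry-picked in the first place.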