Polish mathematician Bartosz Naskrecki created a complex math problem for AI testing. The problem took nearly 20 years to design and spanned 13 pages of detailed reasoning. GPT-5.4 solved the problem correctly once in 11 attempts, surprising the mathematician.