Your small guide to: AI Safety
Even when given a very simple goal, an AI can act unpredictably. This phenomenon is known as “specification gaming” or “reward hacking”—achieving the goal that was technically specified, but failing to achieve the actual intended goal. It happens frequently, because it is really hard to explain to a computer what you actually intend for it to do.
Read more