Because it's convergently implied by almost any of the strange, inscrutable things that they might end up wanting as a result of gradient descent on these " thumbs up" and " thumbs down" things internally.
因为它们想要的这些奇怪、难以捉摸的东西通向了一个共同的结果,来自 AI 内部的“赞”和“踩”得出的梯度下降。
Gradient descent is a first-order optimization algorithm. To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient (or of the approximate gradient) of the function at the current point. If instead one takes steps proportional to the positive of the gradient, one approaches a local maximum of that function; the procedure is then known as gradient ascent.