There’s a lot of research in this regard, like reinforcement learning by human feedback.
j previous speech k next speech