What's today? Monday? Ahhh, yeh, that's still true: RLHF is better than RLAIF. Uncle Sam (Altman) is still calling you to fine-tune. So get your thumbs ready. ππ
Share this post
Humans Still Outperform AI in Reinforcementβ¦
Share this post
What's today? Monday? Ahhh, yeh, that's still true: RLHF is better than RLAIF. Uncle Sam (Altman) is still calling you to fine-tune. So get your thumbs ready. ππ