Discussion about this post

User's avatar
Daniel Popescu / ⧉ Pluralisk's avatar

The brownie experiment absolutely nailed the RL concept; what if our future AI models started demanding actual rewards for optimal perfomance, imagine the compute costs.

Expand full comment

No posts