Rlhf Questions

Can you tell the difference between SFT DPO and PPO models that had the same base model and are identical up to the algorithm? How much access do you need to make this feasible? What about in a verifiable computing context where the model provider helps by providing “proof”?

Gaia Prime

Explorer

Rlhf Questions

Backlinks

Graph View