OpenRLHF is a high-performance RLHF framework built on Ray, DeepSpeed and HF Transformers: data = { "prompt": xxx, "query": xxx, "label": json.dumps({ 'uuid': uuid ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results