TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R - 模力方舟(Gitee AI)