THUDM/LongReward-llama3.1-8b-DPO
Tags: Text generation, Transformers, Safetensors, Chinese, English, AutoTrain Compatible, text-generation-inference, custom_code
3 contributors
File                               Size             Last commit                           Updated
.gitattributes                     -                initial commit                        1 month ago
README.md                          -                Update README.md                      23 days ago
config.json                        -                Upload folder using huggingface_hub   1 month ago
configuration.json                 -                Upload folder using huggingface_hub   1 month ago
generation_config.json             -                Upload folder using huggingface_hub   1 month ago
model-00000-of-00005.safetensors   4.06 GB (LFS)    Upload folder using huggingface_hub   1 month ago
model-00001-of-00005.safetensors   4.06 GB (LFS)    Upload folder using huggingface_hub   1 month ago
model-00002-of-00005.safetensors   4.06 GB (LFS)    Upload folder using huggingface_hub   1 month ago
model-00003-of-00005.safetensors   832.03 MB (LFS)  Upload folder using huggingface_hub   1 month ago
model-00004-of-00005.safetensors   1.96 GB (LFS)    Upload folder using huggingface_hub   1 month ago
model.safetensors.index.json       -                Upload folder using huggingface_hub   1 month ago
modeling_llama.py                  -                Upload folder using huggingface_hub   1 month ago
tiktoken_tokenizer.py              -                Upload folder using huggingface_hub   1 month ago
tokenizer.tiktoken                 -                Upload folder using huggingface_hub   1 month ago
tokenizer_config.json              -                Upload folder using huggingface_hub   1 month ago
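
The weights are split across five safetensors shards, with model.safetensors.index.json alongside them. As a rough sketch of how such an index is typically used, and assuming this file follows the usual Hugging Face sharded-checkpoint layout (a "weight_map" object mapping each tensor name to the shard that stores it), the snippet below groups tensors by shard and reports which shard files are still missing locally. The tensor names in EXAMPLE_INDEX and the helper names tensors_by_shard and missing_shards are hypothetical and chosen for illustration; they were not read from this repository.

```python
import json
from collections import defaultdict

# Hypothetical, heavily abbreviated index. The tensor names are illustrative
# placeholders; only the shard file names come from the listing above.
EXAMPLE_INDEX = json.loads("""
{
  "metadata": {},
  "weight_map": {
    "model.embed_tokens.weight": "model-00000-of-00005.safetensors",
    "model.layers.0.self_attn.q_proj.weight": "model-00000-of-00005.safetensors",
    "model.layers.31.mlp.down_proj.weight": "model-00003-of-00005.safetensors",
    "lm_head.weight": "model-00004-of-00005.safetensors"
  }
}
""")


def tensors_by_shard(index):
    """Group tensor names by the shard file that the index assigns them to."""
    shards = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(tensor_name)
    # Sort both shard names and tensor names for stable, readable output.
    return {shard: sorted(names) for shard, names in sorted(shards.items())}


def missing_shards(index, available_files):
    """Return shard files referenced by the index but absent from available_files."""
    needed = set(index["weight_map"].values())
    return sorted(needed - set(available_files))


if __name__ == "__main__":
    for shard, names in tensors_by_shard(EXAMPLE_INDEX).items():
        print(shard, len(names), "tensor(s)")
    print("missing:", missing_shards(EXAMPLE_INDEX, ["model-00000-of-00005.safetensors"]))
```

In practice the index would be read from the downloaded model.safetensors.index.json with json.load rather than from an inline string. A check like missing_shards can be a cheap way to catch a partial download before attempting to load the model.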