Junhui's Journal 2
Home
Tags
Categories
实例 GRPO in TRL
训练 Tokenizer
Tokenizer Pass_model Post Process
Transformer 库中的 models
Transformer 库中的 tokenizer
Transformers Big Pic
A2C
Concept
DQN
Policy Bases Methods