Junhui's Journal 2
LLM
Embedding in the Transformer Library
Concepts
Top-k, Top-p, Temperature
Example: GRPO
Example: GRPO Fine-Tune Models
Example: Unsloth
BPE Tokenizer
Dataset Library
RL in LLMs
Example: GRPO in TRL