Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO

ARC Score: N/A
HellaSwag Score: N/A
MMLU Score: N/A

**Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO**
Extrinsic Performance (LLM Leaderboard)
Rank	N/A
Average Score	0.33
Intrinsic Architecture
Architecture	Qwen2ForCausalLM
Hidden Layers	28
Attention Heads	28
Vocab Size	152064
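
The architecture fields above can be read from a model configuration programmatically. The following is a minimal sketch, assuming the fields correspond to the `architectures`, `num_hidden_layers`, `num_attention_heads`, and `vocab_size` keys of a Hugging Face style `config.json`. The `config` dictionary is a hand-written stand-in populated with the values listed on this page, not a file downloaded from the model repository, and `summarize_architecture` is a hypothetical helper name.

```python
# Hand-written stand-in for a config.json; values are copied from this page.
# Assumption: key names follow the Hugging Face config.json convention.
config = {
    "architectures": ["Qwen2ForCausalLM"],
    "num_hidden_layers": 28,
    "num_attention_heads": 28,
    "vocab_size": 152064,
}


def summarize_architecture(cfg: dict) -> dict:
    """Map config keys to the field labels used on this page (hypothetical helper)."""
    archs = cfg.get("architectures") or ["unknown"]
    return {
        "Architecture": archs[0],
        "Hidden Layers": cfg.get("num_hidden_layers"),
        "Attention Heads": cfg.get("num_attention_heads"),
        "Vocab Size": cfg.get("vocab_size"),
    }


if __name__ == "__main__":
    # Print one tab-separated row per field, matching the layout above.
    for name, value in summarize_architecture(config).items():
        print(f"{name}\t{value}")
```

Missing keys come back as `None` (or `"unknown"` for the architecture) rather than raising, which suits pages like this one where several fields are recorded as N/A.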

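Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO is a causal language model built on the Qwen2ForCausalLM architecture, with 28 hidden layers, 28 attention heads, and a vocabulary size of 152064. It has a recorded average score of 0.33 on the Open LLM Leaderboard. No leaderboard rank and no individual ARC, HellaSwag, or MMLU scores are recorded.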
