Dongwei DeepSeek-R1-Distill-Qwen-7B-GRPO

From OODA WIKI
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO

Extrinsic Performance (LLM Leaderboard)

  • Rank: N/A
  • Average Score: 0.33

Intrinsic Architecture

  • Architecture: Qwen2ForCausalLM
  • Hidden Layers: 28
  • Attention Heads: 28
  • Vocab Size: 152064


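Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO is a causal language model built on the Qwen2ForCausalLM architecture, with 28 hidden layers, 28 attention heads, and a vocabulary of 152064 tokens. It has a recorded average score of 0.33 on the Open LLM Leaderboard; no leaderboard rank is recorded.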
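
The architecture fields on this page look like values that would normally come from a model's configuration file. The sketch below shows one way such fields could be read and mapped to the labels used here. It assumes a Hugging Face-style config with the keys architectures, num_hidden_layers, num_attention_heads, and vocab_size; the embedded JSON is a hypothetical excerpt holding only the four values listed on this page, and summarize_architecture is an illustrative helper, not the method the Almanac Ingestor bot is known to use.

```python
import json

# Hypothetical excerpt of a Hugging Face-style config.json.
# Only the four values shown on this page are filled in; a real
# config file contains many more fields.
CONFIG_JSON = """
{
  "architectures": ["Qwen2ForCausalLM"],
  "num_hidden_layers": 28,
  "num_attention_heads": 28,
  "vocab_size": 152064
}
"""


def summarize_architecture(config: dict) -> dict:
    """Map config keys to the labels used on this page.

    Missing keys fall back to "N/A", matching how the page
    displays unavailable values.
    """
    architectures = config.get("architectures") or ["N/A"]
    return {
        "Architecture": architectures[0],
        "Hidden Layers": config.get("num_hidden_layers", "N/A"),
        "Attention Heads": config.get("num_attention_heads", "N/A"),
        "Vocab Size": config.get("vocab_size", "N/A"),
    }


summary = summarize_architecture(json.loads(CONFIG_JSON))
for label, value in summary.items():
    print(f"{label}: {value}")
```

Falling back to "N/A" for absent keys keeps the output consistent with the way this page reports unavailable metrics.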

Performance Metrics

  • ARC Score: N/A
  • HellaSwag Score: N/A
  • MMLU Score: N/A

This page was last updated automatically by the Almanac Ingestor bot.