Migtissera Tess-3-Mistral-Nemo-12B

From OODA WIKI
Jump to navigation Jump to search


migtissera/Tess-3-Mistral-Nemo-12B is a deep learning model with a recorded average score of 0.45 on the Open LLM Leaderboard.

migtissera/Tess-3-Mistral-Nemo-12B
Extrinsic Performance (LLM Leaderboard)
Rank N/A
Average Score 0.45
Intrinsic Architecture
Architecture MistralForCausalLM
Hidden Layers 40
Attention Heads 32
Vocab Size 131075

Performance Metrics

  • ARC Score: N/A
  • HellaSwag Score: N/A
  • MMLU Score: N/A

This page was last updated automatically by the Almanac Ingestor bot.