Llama 3.1

ベースモデル

開発者

技術仕様

アーキテクチャ

Optimized Decoder-only Transformer (全モデルGQA)

パラメータバリエーション

Llama 3.1 8B(8B)

HuggingFace

軽量ベースモデル。128Kコンテキスト、多言語対応、ツール使用機能。1.46M GPU時間で学習。

GGUFファイルは登録されていません

Llama 3.1 8B Instruct(8B)

HuggingFace

指示追従最適化版。HumanEval 72.6%達成。

GGUFファイルは登録されていません

Llama 3.1 70B(70B)

HuggingFace

大規模ベースモデル。128Kコンテキスト、GQA採用。7.0M GPU時間で学習。

GGUFファイルは登録されていません

Llama 3.1 70B Instruct(70B)

HuggingFace

指示追従最適化版。

GGUFファイルは登録されていません

Llama 3.1 405B(405B)

HuggingFace

史上最大のオープンウェイトモデル。MMLU 5-shot 87.3%達成。30.84M GPU時間で学習。FP8量子化でシングルノード実行可能。

GGUFファイルは登録されていません

Llama 3.1 405B Instruct(405B)

HuggingFace

指示追従最適化版。HumanEval 89.0%達成。合成データ生成・蒸留用の親モデルとして推奨。

GGUFファイルは登録されていません

家系図

現在のモデル: Llama 3.1

ベース

派生

表示中

Llama 3.1

技術仕様

アーキテクチャ

パラメータバリエーション

Llama 3.1 8B(8B)

Llama 3.1 8B Instruct(8B)

Llama 3.1 70B(70B)

Llama 3.1 70B Instruct(70B)

Llama 3.1 405B(405B)

Llama 3.1 405B Instruct(405B)

関連モデル

LLaMA 1

Llama 2

Code Llama

Llama Guard 1

Spirit-LM

Swallow (Llama 2)

Llama 3

Llama Guard 2

Swallow (Llama 3)

ELYZA Japanese

Llama Guard 3

Swallow (Llama 3.1)

DeepSeek-R1-Distill-Llama

Llama 3.2

Llama 3.3

Swallow (Llama 3.3)

Llama 4

Llama Guard 4

家系図