Gemini / gemma3

multimodal

gemma3GemmaGoogle
0 0 0 更新于 2026-05-06 22:07

概述

Gemma 3

Gemma 是 Google 基于 Gemini 技术构建的轻量级模型系列。Gemma 3 模型是多模态模型(可处理文本和图像),并具有 128K 上下文窗口,支持超过 140 种语言。Gemma 3 提供 270M、1B、4B、12B 和 27B 的参数大小,在问答、摘要和推理等任务中表现出色,其紧凑的设计使其能够在资源有限的设备上部署。目前,Gemma3模型是单个 GPU 上运行功能最强大的模型。

模型

名称尺寸上下文输入Ollama 下载命令
gemma3:latest3.3GB128KText, Imageollama run gemma3:latest
gemma3:270m292MB32KTextollama run gemma3:270m
gemma3:1b815MB32KTextollama run gemma3:1b
gemma3:4b3.3GB128KText, Imageollama run gemma3:4b
gemma3:12b8.1GB128KText, Imageollama run gemma3:12b
gemma3:27b17GB128KText, Imageollama run gemma3:27b
gemma3:270m-it-qat241MB32KTextollama run gemma3:270m-it-qat
gemma3:270m-it-q8_0292MB32KTextollama run gemma3:270m-it-q8_0
gemma3:270m-it-fp16543MB32KTextollama run gemma3:270m-it-fp16
gemma3:270m-it-bf16543MB32KTextollama run gemma3:270m-it-bf16
gemma3:1b-it-qat1.0GB32KTextollama run gemma3:1b-it-qat
gemma3:1b-it-q4_K_M815MB32KTextollama run gemma3:1b-it-q4_K_M
gemma3:1b-it-q8_01.1GB32KTextollama run gemma3:1b-it-q8_0
gemma3:1b-it-fp162.0GB32KTextollama run gemma3:1b-it-fp16
gemma3:4b-it-qat4.0GB128KText, Imageollama run gemma3:4b-it-qat
gemma3:4b-it-q4_K_M3.3GB128KText, Imageollama run gemma3:4b-it-q4_K_M
gemma3:4b-it-q8_05.0GB128KText, Imageollama run gemma3:4b-it-q8_0
gemma3:4b-it-fp168.6GB128KText, Imageollama run gemma3:4b-it-fp16
gemma3:12b-it-qat8.9GB128KText, Imageollama run gemma3:12b-it-qat
gemma3:12b-it-q4_K_M8.1GB128KText, Imageollama run gemma3:12b-it-q4_K_M
gemma3:12b-it-q8_013GB128KText, Imageollama run gemma3:12b-it-q8_0
gemma3:12b-it-fp1624GB128KText, Imageollama run gemma3:12b-it-fp16
gemma3:27b-it-qat18GB128KText, Imageollama run gemma3:27b-it-qat
gemma3:27b-it-q4_K_M17GB128KText, Imageollama run gemma3:27b-it-q4_K_M
gemma3:27b-it-q8_030GB128KText, Imageollama run gemma3:27b-it-q8_0
gemma3:27b-it-fp1655GB128KText, Imageollama run gemma3:27b-it-fp16

评估

image

Gemma 3 270M

Benchmarkn-shotGemma 3 270m instruction tuned
HellaSwag0-shot37.7
PIQA0-shot66.2
ARC-C0-shot28.2
WinoGrande0-shot52.3
BIG-Bench Hardfew-shot26.7
IF Eval0-shot51.2

这些模型针对大量不同的数据集和指标进行了评估,以涵盖文本生成的不同方面:

推理、逻辑和代码能力

BenchmarkMetricGemma 3 PT 1BGemma 3 PT 4BGemma 3 PT 12BGemma 3 PT 27B
HellaSwag10-shot62.377.284.285.6
BoolQ0-shot63.272.378.882.4
PIQA0-shot73.879.681.883.3
SocialIQa0-shot48.951.953.454.9
TriviaQA5-shot39.865.878.285.5
Natural Questions5-shot9.4820.031.436.1
ARC-C25-shot38.456.268.970.6
ARC-e0-shot73.082.488.389.0
WinoGrande5-shot58.264.774.378.8
BIG-Bench Hard28.450.972.677.7
DROP3-shot, F142.460.172.277.2
AGIEval3–5-shot22.242.157.466.2
MMLU5-shot, top-126.559.674.578.6
MATH4-shot24.243.350.0
GSM8K5-shot, maj@11.3638.471.082.6
GPQA9.3815.025.424.3
MMLU (Pro)5-shot11.223.740.843.9
MBPP3-shot9.8046.060.465.6
HumanEvalpass@16.1036.045.748.8
MMLU (Pro COT)5-shot9.7NaNNaNNaN

多语言能力

文件 1:多语言 / 文本任务

BenchmarkGemma 3 PT 1BGemma 3 PT 4BGemma 3 PT 12BGemma 3 PT 27B
MGSM2.0434.764.374.3
Global-MMLU-Lite24.957.069.475.7
Belebele26.659.478.0
WMT24± (ChrF)36.748.453.955.7
FloRes29.539.246.048.8
XL-Sum4.828.5512.214.9
XQuAD (all)43.968.074.576.8

多式联运能力

BenchmarkGemma 3 PT 4BGemma 3 PT 12BGemma 3 PT 27B
COCO Captions (CIDEr)102111116
DocVQA (val)72.882.385.6
InfoVQA (val)44.154.859.4
MMMU (pt)39.250.356.1
TextVQA (val)58.966.568.6
RealWorldQA45.552.253.9
ReMl27.338.544.8
AI2D63.275.279.0
ChartQA45.460.963.8
ChartQA (augmented)81.888.588.7
VQAv2
BLINK38.035.939.6
OKVQA51.058.760.2
TallyQA42.551.854.3
SpatialSense VQA50.960.059.4
CountBenchQA26.117.868.0