That comment got me interested so I'm pulling a laptop focused model (gemma3n:e4b) on one of my old thinkpads to see how it performs. From just a quick test it's running okayish on my i7-8650U (no gpu on this laptop). Definitely slower than my desktop. The gemma3n:e2b was much more usable speed wise.
I benched the laptop with both the small n3 models and then benched my desktop with the usual model I run (gemma3:12b) and the largest of the n3 models. Here's those reports.
# Laptop
-------Linux----------
No NVIDIA GPU detected.
rocminfo failed: [Errno 2] No such file or directory: 'rocminfo'
Total memory size : 15.49 GB
cpu_info: Intel(R) Core(TM) i7-8650U CPU @ 1.90GHz
gpu_info: no_gpu
os_version: "NixOS 25.05 (Warbler)"
ollama_version: 0.11.10
----------
running custom benchmark from models_file_path: benchmark.yaml
Disabling sendinfo for custom benchmark
LLM models file path:benchmark.yaml
Checking and pulling the following LLM models
gemma3n:e4b
gemma3n:e2b
----------
Running custom-model
model_name = gemma3n:e4b
prompt = Summarize the key differences between classical and operant conditioning in psychology.
eval rate: 7.11 tokens/s
prompt = Translate the following English paragraph into Chinese and elaborate more -> Artificial intelligence is transforming various industries by enhancing efficiency and enabling new capabilities.
eval rate: 6.68 tokens/s
prompt = What are the main causes of the American Civil War?
eval rate: 6.63 tokens/s
prompt = How does photosynthesis contribute to the carbon cycle?
eval rate: 6.64 tokens/s
prompt = Develop a python function that solves the following problem, sudoku game.
eval rate: 6.56 tokens/s
--------------------
Average of eval rate: 6.724 tokens/s
----------------------------------------
model_name = gemma3n:e2b
prompt = Summarize the key differences between classical and operant conditioning in psychology.
eval rate: 11.24 tokens/s
prompt = Translate the following English paragraph into Chinese and elaborate more -> Artificial intelligence is transforming various industries by enhancing efficiency and enabling new capabilities.
eval rate: 11.23 tokens/s
prompt = What are the main causes of the American Civil War?
eval rate: 11.24 tokens/s
prompt = How does photosynthesis contribute to the carbon cycle?
eval rate: 11.25 tokens/s
prompt = Develop a python function that solves the following problem, sudoku game.
eval rate: 11.12 tokens/s
--------------------
Average of eval rate: 11.216 tokens/s
----------------------------------------
# Desktop
-------Linux----------
No NVIDIA GPU detected.
Total memory size : 30.46 GB
cpu_info: AMD Ryzen 5 7600 6-Core Processor
gpu_info: AMD Ryzen 5 7600 6-Core Processor
AMD Radeon RX 7800 XT
AMD Radeon Graphics
os_version: "NixOS 25.05 (Warbler)"
ollama_version: 0.11.10
----------
running custom benchmark from models_file_path: benchmark.yaml
Disabling sendinfo for custom benchmark
LLM models file path:benchmark.yaml
Checking and pulling the following LLM models
gemma3:12b
gemma3n:e4b
----------
Running custom-model
model_name = gemma3:12b
prompt = Summarize the key differences between classical and operant conditioning in psychology.
eval rate: 35.54 tokens/s
prompt = Translate the following English paragraph into Chinese and elaborate more -> Artificial intelligence is transforming various industries by enhancing efficiency and enabling new capabilities.
eval rate: 34.07 tokens/s
prompt = What are the main causes of the American Civil War?
eval rate: 33.89 tokens/s
prompt = How does photosynthesis contribute to the carbon cycle?
eval rate: 36.10 tokens/s
prompt = Develop a python function that solves the following problem, sudoku game.
eval rate: 33.89 tokens/s
--------------------
Average of eval rate: 34.698 tokens/s
----------------------------------------
model_name = gemma3n:e4b
prompt = Summarize the key differences between classical and operant conditioning in psychology.
eval rate: 46.62 tokens/s
prompt = Translate the following English paragraph into Chinese and elaborate more -> Artificial intelligence is transforming various industries by enhancing efficiency and enabling new capabilities.
eval rate: 46.55 tokens/s
prompt = What are the main causes of the American Civil War?
eval rate: 46.42 tokens/s
prompt = How does photosynthesis contribute to the carbon cycle?
eval rate: 46.14 tokens/s
prompt = Develop a python function that solves the following problem, sudoku game.
eval rate: 45.68 tokens/s
--------------------
Average of eval rate: 46.282 tokens/s
----------------------------------------