Size (B)Speed (T/s)ModelTypeQuantSpec Dec (B)Spec Quant
1.5282qwen 2.5MLX4--
1.576qwen 2.5MLX8--
770qwen 2.5GUFFQ4_K_M--
7101qwen 2.5MLX4--
758qwen 2.5MLX8--
1235wayfarerGUFFQ6_K--
1265wayfarerMLX4--
1245wayfarerMLX6--
1236wayfarerMLX8--
1436qwen 2.5GUFFQ4_K_M--
1452qwen 2.5MLX4--
1455qwen 2.5MLX41.54
1430qwen 2.5MLX8--
2435mistral small 3MLX4--
3218qwen 2.5GUFFQ4_K_M--
3223qwen 2.5MLX4--
3230qwen 2.5MLX41.54
3230qwen 2.5MLX41.54
3234qwen 2.5MLX41.58
3226qwen 2.5 r1MLX41.54
3233qwen 2.5 coderMLX41.54
3231qwen 2.5 coderMLX434
3225qwqMLX3--
3224qwqMLX4--
3218qwqMLX41.54
3222qwqMLX41.58
3216qwqMLX474
3216qwqMLX478
3216qwqMLX6--
3216qwqMLX61.54
3216qwqMLX61.58
7012wayfarer largeGUFFQ2_K_S--
7015wayfarer largeMLX3--
30 - A393qwen 3MLX4--
30 - A376qwen 3MLX41.74
30 - A381qwen 3MLX6--
30 - A370qwen 3MLX61.74
30 - A370qwen 3MLX8--
3222qwen 3MLX4--
3226qwen 3MLX41.74
2418Devstral Small 2507MLX8--

mlx convert and upload to huggingface

https://huggingface.co/docs/hub/en/mlx

https://huggingface.co/mlx-community

git clone [email protected]:NexVeridian/NexVeridian-web.git

just uv

just mlx_create "Qwen/QwQ-32B" "4 6 8" "/Users/elijahmcmorris/.cache/lm-studio/models" "mlx-community" fasle false
# or
uv venv
uv pip install huggingface_hub hf_transfer mlx_lm
uv run huggingface-cli login

uv run mlx_lm.convert --hf-path Qwen/QwQ-32B -q --q-bits 4 --upload-repo mlx-community/QwQ-32B-4bit --mlx-path /Users/elijahmcmorris/.cache/lm-studio/models/mlx-community/QwQ-32B-4bit

or use https://huggingface.co/spaces/mlx-community/mlx-my-repo

LLM Settings.md

Qwen 3

TempMin PTop PTop KRepeat P
0.60.000.9520-

Qwen 3 /no_think

TempMin PTop PTop KRepeat P
0.70.000.80201.5