
At around 4.6 GiB, the new Qwen3-8B quantized to 4-bit should fit comfortably in 16 GiB of memory: https://huggingface.co/mlx-community/Qwen3-8B-4bit
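A back-of-envelope check of that figure (a sketch, not an exact accounting: the parameter count is rounded and the per-weight overhead factor for quantization scales is an assumption):

```python
# Rough memory estimate for a 4-bit quantized ~8B-parameter model.
params = 8_000_000_000    # approximate parameter count for an 8B model
bits_per_weight = 4.5     # 4-bit weights plus group-wise scale/zero-point overhead (assumed)

weights_bytes = params * bits_per_weight / 8
weights_gib = weights_bytes / 2**30
print(f"~{weights_gib:.1f} GiB of weights")
```

This lands in the same ballpark as the 4.6 GiB file on the Hub (the remainder is embeddings, unquantized layers, and metadata), leaving ample headroom in 16 GiB for the KV cache and the OS.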

