← Dashboard
0xSero

0xSero ✓

44.7K followers
4 tweets
Communities: Local AI x/LocalLLaMA
# Tweet Community Topic Views Ratio Engagement Posted
1
[text] Breakthroughs: 1. Turboquant merged into vLLM 75% vram reduction for kvcache near losslsss 2. Someone merged M2.7 & M2.5 & got it to perform better than M2.7 3. 40% faster prefill on AMD strix halo (128gb for MoEs w 10B active params) 4. Megatrain 100B model trained on 1 GPU
Local AI Artificial Intelligence 54.2K 1.2x 1.3K Apr 16
2
[image] Locally Part 1 - Apple Silicon Macs give you large pools of memory to run big models, but the token generation speed will be lower than most are used to. Macs are best with large MoEs that have low ACTIVE params. Basically when you see a model like Qwen3.5-397B-A17B this
Local AI Artificial Intelligence 32.1K 0.7x 430 Apr 22
3
[text] Here’s what I’d recommend if you’re just getting started in AI, local or otherwise. 1. Work with the compute you have, even the dumbest LLMs can be useful if you treat them as a node in your system. Some basic problems of what could be useful to get you started - tag all
Local AI Artificial Intelligence 23.1K 0.6x 753 Apr 3
4
[image] Best harnesses for local models: 1. Droid: - Very good performance, forces the models to behave, you can wire in all your local LLMs very easily w BYOK - Allows you to use your local models as orchestrators/subagents so you can benefit from Cloud as models as well - Practically
Local AI Artificial Intelligence 19.5K 0.5x 478 Apr 4
5
[text] Help me spread this, I am on a role and need to squeeze every last bit out of it.
x/LocalLLaMA Artificial Intelligence 16.7K 0.5x 333 Mar 20
6
[image] Guide to running BIG B0Is on your small hardware. 1. Use REAPs: up to 50% savings 2. Use quantisations: 75% savings - AWQ / GPTQ / W4A16 / FP8 = FAST inference - GGUF / EXL3 = Slow but just works - MLX = Best for apple 3. Use 8bit KV cache: 50-75% savings
Local AI Artificial Intelligence 14.6K 0.3x 353 Apr 14
7
[text] Local AI is a human right, our children, families, neighbours, friends, and fellow humans deserve privacy and freedom.
Local AI Artificial Intelligence 6.2K 0.1x 202 Apr 7
8
[text] I meant to post this in here, I will be posting weekly models meant for limited hardware budgets, RN I am learning how to deal with the 30b~ class
x/LocalLLaMA Artificial Intelligence 6.0K 0.2x 94 Mar 28