← Dashboard
0xSero

0xSero ✓

53.4K followers
5 tweets
Communities: Local AI x/LocalLLaMA
# Tweet Community Topic Views Ratio Engagement Posted
1
[video] Local AI is about to be competitive. I will do everything in my power to make it better than Claude desktop/claude code by end of year.
Local AI 213.0K 4.5x 4.2K May 1
2
[image] Important: This is a summary of an amazing video by one of the best creators I know. Video in the first comment. If you've been struggling to setup a productive local environment I'll summarise, but you should watch the video. 1. Qwen3.6-27B with NO THINKING - 4bit - 16bit
Local AI 61.1K 1.3x 1.4K May 2
3
[image] Weekly best models for your hardware: ~~ 8 to 16gb ~~ Granite models are amazing: [NEW] - Gemma-E4B is a good general QA model - Qwen3.5-9B is the best at this level imo - ~~ 16 to 64gb ~~ Another
Local AI 56.6K 1.2x 1.3K May 2
4
[text] Breakthroughs: 1. Turboquant merged into vLLM 75% vram reduction for kvcache near losslsss 2. Someone merged M2.7 & M2.5 & got it to perform better than M2.7 3. 40% faster prefill on AMD strix halo (128gb for MoEs w 10B active params) 4. Megatrain 100B model trained on 1 GPU
Local AI 54.2K 1.2x 1.3K Apr 16
5
[image] Best models for your hardware - 4gb to 12gb vram - VibeThinker-3B - smokes everything remotely close to its weight class. Challenging 30b models! Last version was also topping math benchmarks - 12gb to 24gb vram - Gemma-12B-coder Built on top of
Local AI 46.5K 0.9x 1.0K Jun 18
6
[image] Locally Part 1 - Apple Silicon Macs give you large pools of memory to run big models, but the token generation speed will be lower than most are used to. Macs are best with large MoEs that have low ACTIVE params. Basically when you see a model like Qwen3.5-397B-A17B this
Local AI 37.2K 0.8x 469 Apr 22
7
[text] Here’s what I’d recommend if you’re just getting started in AI, local or otherwise. 1. Work with the compute you have, even the dumbest LLMs can be useful if you treat them as a node in your system. Some basic problems of what could be useful to get you started - tag all
Local AI 23.1K 0.6x 753 Apr 3
8
[image] Cheapest competitive build on Nvidia 2 sparks = 8000$ Total specs - 256gb - 8tb - 546gb/s memory bandwidth - tons of flops --- models: - Deepseek-v4-flash - MiMo-v2.5-flash fp4 - MiniMax-M2.7 - Qwen3.5-397b-reap Flaws: - Low mem bandwidth - You need 2 for best perf
Local AI 22.1K 0.4x 317 May 15
9
[image] Best harnesses for local models: 1. Droid: - Very good performance, forces the models to behave, you can wire in all your local LLMs very easily w BYOK - Allows you to use your local models as orchestrators/subagents so you can benefit from Cloud as models as well - Practically
Local AI 19.5K 0.5x 478 Apr 4
10
[image] One thing I've done this year is: - Download all my X data from settings/account - Download all my youtube, gmaps, gmail, google from takout google com - Download all my personal data from Claude, ChatGPT - Export a copy of every AI session on Cursor Claude Code, Codex, Droid,
Local AI 16.9K 0.4x 470 Apr 23
11
[text] Help me spread this, I am on a role and need to squeeze every last bit out of it.
x/LocalLLaMA 16.7K 0.5x 333 Mar 20
12
[image] Based on my knowledge of: - AI labs - hardware manufacturers - model size performance - training process - training economics If you want to buy hardware for local AI you need 500-750gb of 500gb/s+ memory bandwidth to make serious use of local hardware at frontier levels
Local AI 16.1K 0.3x 267 May 15
13
[image] To my fellow homelab friends, I need some help. My room is starting to cook, so I need to setup air conditioning (I already have a portable one but need something more pro) Essentially the rooms humidity is quite low and given I have cats there’s a lot of dust I need to clean
Local AI 14.9K 0.3x 200 May 29
14
[image] Guide to running BIG B0Is on your small hardware. 1. Use REAPs: up to 50% savings 2. Use quantisations: 75% savings - AWQ / GPTQ / W4A16 / FP8 = FAST inference - GGUF / EXL3 = Slow but just works - MLX = Best for apple 3. Use 8bit KV cache: 50-75% savings
Local AI 14.6K 0.3x 353 Apr 14
15
[image] Top 5 builds for AI inference in 2025-2026 I have spent around 12 months researching, building, experimenting, and bench-marking AI models, tools, hardware and costs. Top 3 picks will be the safest, best cost to performance ratios. The last 2 will be more interesting
Local AI 12.0K 0.2x 195 May 2
16
[image] I got 2 intel bad boys on Friday. At this point I’m struggling to find more power for all this. - 4x 6000s - 1x DGX Spark - AMD Strix - 4x 3090 - 2x intel arc b70 - Mac mini 16gb - MacBook Pro 32gb 544gb VRAM 300gb mixed Total = 844gb AI + 512gb ddr4 I need some guru help
Local AI 11.5K 0.2x 200 Jun 2
17
[image] Why I built vllm-studio - Storing configurations for models and engines - Fully integrated agent with Pi - Model downloading, exploration, and config - Creating benchmark datasets Backend can be deployed to multiple servers, and the frontend can connect to all from anywhere.
Local AI 7.0K 0.1x 196 May 22
18
[text] Local AI is a human right, our children, families, neighbours, friends, and fellow humans deserve privacy and freedom.
Local AI 6.2K 0.1x 202 Apr 7
19
[text] I meant to post this in here, I will be posting weekly models meant for limited hardware budgets, RN I am learning how to deal with the 30b~ class
x/LocalLLaMA 6.0K 0.2x 94 Mar 28