7B LLM @ 30+ TOK/s on Jetson Orin Nano
NanoMind is my personal edge AI project: running a 7-billion parameter LLM entirely on an NVIDIA Jetson Orin Nano with zero cloud dependency.
doranst@proton.me