← Back to writing
Local AIEngineering

Running capable models on a laptop

8 min read

A practical setup for local inference, why it matters for sensitive work, and the trade offs you accept when you leave the cloud.


There is a quiet revolution in small, capable models that run entirely on your own hardware.

bash</>
ollama run llama3.1:8b-instruct-q4_K_M