The I in LLM is for intelligence

May 28, 2026

The I in LLM is for intelligence

I've finally got DeepSeek running on my Nvidia 4070 and working through VS Code.

Yea, yea, the cloud platforms are way faster and much better that trying to do this with a locally hosted model but. I have a i9-11900KB with an RTX 4070 as a daily driver but can't afford Yet Another Subscription.

Originally, I planned to host Ollama and spent hours in vain trying to get that to live inside VS Code. Bottom line is that VS Code would never execute the tooling regardless of which plugin installed. I'd been running LM Studio on my laptop (Ultra 9 185H w/ 64G) and it worked.... v... e... r... r... y... s... l... o... w... l... y... So I tried that with the RTX 4070. This combination is workable. with a few gotchas.

Of course, CoPilot will stupidly refuse to work with anything 'not cloud' hosted no matter what tweaks you try out. I get the feeling this is a design choice and not an oversight. There is always one critical piece missing or in the way (in the name of security). And no, self-signed certificate authorities are not acceptable, nor is a local host proxy (not sure how it figured that out).

Conntinue.dev does work but not with config.yaml for some reason. You must use config.json; the old, deprecated configuration mechanism. I should probably look to see if there is a bug report on that yet.

For now, I'm running LM Studio on Ubuntu 24.4 (bleck) as a desktop app. llmster is not behaving well and I don't have time this week to look at it. Because of my limited VRAM on the 4070 I'm running:
- deepseek-r1-distill-qwen-14b for planning, chat, and analysis of existing code.
- qwen2.5-coder:9b for a coding assistant along with
- qwen2.5-coder:1.5b for code completion

I can't help but wonder if the creators of deepseek are fans of the Hitch Hiker's Guide series who ran a fowl of Vogon copyright law.

--- Updated 2026-06-15 ---

Went back to Ollama.
It turns out that no matter how you configure Ubuntu 24.4 desktop you can't really reliably keep it from going to sleep. Even with LMS running, as soon as you log out the computer goes to sleep, or the network device goes to sleep, or .... Yea, yea, yea, there are all kinds of suggestions on how to prevent this but all of them have various limitations including leaving the monitor on 24/7 even though I'm never using it as a workstation and my Acer display refuses to just be turned off.

Running as a server prevents this headache reliably!

The other key consideration is that only VS Code Insider's edition will actually allow copilot to connect to a private LLM. You still need random garbage for an API Key and to be logged in to github (¯\_(ツ)_/¯)

Search Me

jerunamuck - 猫父

The I in LLM is for intelligence

Comments

Post a Comment