If you have a high VRAM GPU (like 16GB or more) you can try running some model locally with LMStudio. You won’t get the same level as cloud models but depending on use case it might be good enough.
That’s poor but not horrible. LMStudio has Vulkan compute backend option that works on almost any card. You can also do partial GPU/RAM split giving you total of 72GB and increased performance vs CPU only.
If you have a high VRAM GPU (like 16GB or more) you can try running some model locally with LMStudio. You won’t get the same level as cloud models but depending on use case it might be good enough.
I only have an Radeon RX Vega 56 8GB HBM2, so limited to CPU-only. 64 GB memory there at least. Not upgrading unless hardware dies.
That’s poor but not horrible. LMStudio has Vulkan compute backend option that works on almost any card. You can also do partial GPU/RAM split giving you total of 72GB and increased performance vs CPU only.
Hmm, have to try it sometime when the hot phase at work is over. Thanks!