Can I run local LLMs on Intel ARC/AMD with 8GB of RAM?

@foggy@lemmy.world

You can rent super powerful GPUs by the minute via cloud infrastructure. It’s probably the most viable way.

@MigratingtoLemmy@lemmy.world

Sorry, but I don’t think that’s a private option, so I probably won’t be doing that.

@j4k3@lemmy.world

What software do you want to run?

I’ve been doing a lot of research on this over the last 2 weeks. I have my machine in the mail, but have not tried anything myself on my own hardware.

For Stable Diffusion, 8 GB of VRAM is usually considered the absolute minimum, and only for very basic work. 16 GB of VRAM or more is what you need for a decent workflow.

For AMD, I have seen multiple sources saying to avoid it, but a few people have working examples in the wild. Apparently, AMD only officially supports the 7000 series of GPUs for ROCm/HIP and AI workloads.

Officially, Stable Diffusion only supports Nvidia.

@MigratingtoLemmy@lemmy.world

I don’t know what kind of LLM I would want to run yet; I’m just going through some names. Would you be able to recommend anything that can learn from text?

Thanks. It would seem I need to stick with Nvidia, although I don’t like the idea very much. Unfortunate.

@j4k3@lemmy.world

This is a general list that was shared recently (it has Google Analytics, though):

https://llm.extractum.io/
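When going through a list like that, a rough way to guess whether a model could fit in 8 GB is to multiply the parameter count by the bytes per weight. This is only a back-of-the-envelope sketch I have not checked against real hardware: the function names are made up for illustration, and the `overhead_gib` allowance for context, cache and runtime is a placeholder assumption, not a measured figure.

```python
def weight_memory_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory taken by the model weights alone, in GiB."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billions: float, bits_per_weight: float,
         budget_gib: float = 8.0, overhead_gib: float = 1.5) -> bool:
    """Rough check: weights plus an ASSUMED fixed overhead within the budget.

    overhead_gib is a hypothetical allowance, not a measured value.
    """
    return weight_memory_gib(params_billions, bits_per_weight) + overhead_gib <= budget_gib


if __name__ == "__main__":
    # Compare a 7B model at 16-bit and 4-bit precision against an 8 GiB budget.
    for bits in (16, 4):
        size = weight_memory_gib(7, bits)
        print(f"7B at {bits}-bit: {size:.1f} GiB of weights, fits in 8 GiB: {fits(7, bits)}")
```

By this estimate a 7B model at 16-bit precision is well over 8 GiB for the weights alone, while the same model quantized to 4 bits is a bit over 3 GiB. Real usage depends on the runtime and the context length, so treat it as a first filter only.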

PrivateGPT is on my list to try after someone posted about it weeks ago, along with this how-to article (which has a view limit before a paywall) and the GitHub project repo:

