The problem is simple: consumer motherboards don’t have that many PCIe slots, and consumer CPUs don’t have enough lanes to run 3+ GPUs at full PCIe gen 3 or gen 4 speeds.
My idea was to buy 3-4 cheap computers, slot a GPU into each one, and run them in tandem. I imagine this will require some sort of agent running on each node, connected over a 10GbE network. I can get a 10GbE network running for this project.
Does Ollama or any other local AI project support this? Getting a server motherboard and CPU is going to get expensive very quickly, so this would be a great alternative.
Thanks
You don’t need the cards to have full bandwidth; the only time it matters is when you’re loading the model onto the card. Ideally you’d want a motherboard with x16 slots, but even x4 connections would be good enough. Running the model doesn’t need much bandwidth. Remember, you only load the model once and then reuse it.
An x4 PCIe Gen 4 slot has a theoretical transfer rate of ~7.9 GB/s (after encoding overhead), and an x16 slot has ~31.5 GB/s, so disk I/O is likely your limit even on an x4 slot.
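To put those numbers in perspective, here’s a rough back-of-the-envelope comparison of how long a one-time model load would take over different links (a Python sketch; the 20 GB model size and the NVMe figure are illustrative assumptions, and all rates are theoretical maxima):

```python
# Rough load-time estimate for a quantized model (assumed ~20 GB on disk),
# comparing PCIe link speeds with a 10GbE network hop and a typical NVMe drive.
# Real-world throughput will be somewhat lower than these theoretical figures.
model_size_gb = 20  # hypothetical model size in GB

links_gb_per_s = {
    "PCIe 4.0 x16": 31.5,    # ~31.5 GB/s after 128b/130b encoding overhead
    "PCIe 4.0 x4": 7.9,      # ~7.9 GB/s
    "PCIe 3.0 x4": 3.9,      # ~3.9 GB/s
    "10GbE network": 1.25,   # 10 Gbit/s = 1.25 GB/s, before TCP overhead
    "NVMe SSD (typ.)": 5.0,  # assumed fast Gen 4 drive, often the real bottleneck
}

for name, bw in links_gb_per_s.items():
    print(f"{name:>16}: {model_size_gb / bw:5.1f} s to move {model_size_gb} GB")
```

Even on an x4 slot the load takes only a few seconds, and after that the link mostly sits idle during inference, which is why the slot width matters so little here.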
I see. That solves a lot of the headaches I imagined I would have. Thank you so much for clearing that up