Csaba Kecskemeti PRO
csabakecskemeti
AI & ML interests
None yet
Recent Activity
updated
a model
32 minutes ago
DevQuasar/MiniMaxAI.MiniMax-M2.1-GGUF
published
a model
32 minutes ago
DevQuasar/MiniMaxAI.MiniMax-M2.1-GGUF
posted
an
update
about 5 hours ago
Just sharing a result of a homelab infrastructure experiment:
I've managed to setup a distributed inference infra at home using a DGX Spark (128GB unified gddr6) and a linux workstation with an RTX 6000 Pro (96GB gddr7) connected via 100Gbps RoCEv2. The model I've used (https://lnkd.in/gx6J7YuB) is about 140GB so could not fit either of the GPU. Full setup and tutorial soon on devquasar.com
Screen recording:
https://lnkd.in/gKM9H5GJ