LLM on Rockcup NPU

ali · February 24, 2025, 8:36pm

Anyone sucessful running DeepSeek-R1 medium models? I know that probably these board are not meant to be able to process a real big Large lanuage model, but just out of curiosity I wanted to check how it perfomes I couldn’t run even 7B model running(only 1.5B works) I constantly get this error:

I assume the model is too large to fit into the chip’s memory. On a normal CPU, swap helps to run the model, but apparently, for an NPU, it cannot run at all if it doesn’t fit in memory. I understand that even if it were possible, it would be too slow, but out of curiosity, I’d like to know if running it from scratch is feasible(On NPU). Any subtle hints would be appreciated as well.

Morgan · February 25, 2025, 7:36pm

hi, @ali

NPU must need an enough continuous memory space, which mean it can not use swap memory, for 8B rkllm model, the size have reach almost 8GB, if your 5b+ memory is 8GB it would be killed by system.

best,
Morgan

ali · February 25, 2025, 7:37pm

Okay thanks at least now I know it’s not even possible