Go-rknnlite support for RK3576 (Rock 4D)

3djelly · June 13, 2025, 4:50am

Recently I updated https://github.com/swdee/go-rknnlite/ to support other models in the RK35xx series, specifically to compare the two-core 6 TOPS NPU of the RK3576 (Rock 4D) with the three-core 6 TOPS NPU of the RK3588 (Rock 5B). I also added the RK3566 with its single-core 1 TOPS NPU (Zero 3E) for comparison, as other users were interested in that.

Overall the RK3576’s NPU is comparable to the RK3588’s, and sometimes it performs a bit faster because the Rock 4D has faster LPDDR5 memory. Inference models that need a lot of CPU post-processing (such as segmentation models) run slower on the RK3576, as its CPU cores are much slower than those of the RK3588. The raw CPU speed of the RK3576 is about the same as that of the Raspberry Pi 5.

dominik · June 13, 2025, 8:36am

This will always depend on the particular task. Which benchmarks give similar results on both?
The 4D is really interesting to me because of UFS (which can replace NVMe in some cases).
It would be nice to see the 5B+ in this comparison.

3djelly · June 13, 2025, 8:57am

I ran sbc-bench on the Rock 4D, so you can compare its results with those of the Pi 5.

The other interesting thing about the RK3576 is that it runs pretty cool. With the CPU under full load it sat at around 62 degrees C with only a small 10x15mm passive heat sink on it, which makes it well suited to embedded products.