I will compile and optimize mediapipe for your arm device with GPU acceleration

R
richter1976
R
richter1976
Richter

About this gig

MediaPipe doesn't ship ARM64 wheels. I build them with GPU acceleration.


I compile from Bazel source, patched for ARM Mali GPU with EGL/GBM headless support. You get a pip-installable .whl with GPU delegate working no X11, no display server, no Docker GPU headaches.


What you get:

Custom .whl for your ARM board + Python + MediaPipe version

GPU delegate via EGL GBM (truly headless)

Install script + verification test

Benchmark report (CPU vs GPU, latency + throughput)


Verified platforms:

RK3576 (Mali-G52) primary dev board

RK3588 (Mali-G610)

Raspberry Pi 5 (VideoCore VII)

Any ARM64 Linux with Mali/VideoCore GPU + DDK


Benchmark: https://asciinema.org/a/Mv4LEGvaroBSs6oJ


Why this matters:

Stock: CPU-only, 100+ms/frame on ARM

My build: GPU-accelerated, 44ms/frame (2.3x faster)

Headless: Docker, CI/CD, server rack

No NPU SDK needed standard GPU drivers only


What I need:

Board model + OS (Ubuntu, Debian, Yocto)

Python version (3.10/3.11/3.12)

Modules: Pose, Face, Hand, Holistic, or all


Contact me before ordering if your setup is unusual I'll confirm compatibility.


Get to know Richter

Richter
4.8(4)
  • FromChina
  • Member sinceOct 2024
  • Last delivery1 year
  • Languages

    English, Chinese, German
I build computer vision systems that ship — on NVIDIA CUDA servers and ARM edge. Not demos. Production. 6 projects deployed in 12 months: YOLO detection + tracking on CUDA and NPU (17x speedup), multi-camera RTSP pipelines with FFmpeg hardware decoding, MediaPipe GPU compiled from source for ARM Mali (2.3x faster, headless), PyTorch custom model training, and rPPG contactless vital signs from video. Stack: Python, C++, PyTorch, OpenCV, CUDA, ONNX, YOLO, Docker. GPUs: RTX 4060 Ti, Hailo-8L NPU, Mali-G52. 3600+ lines in a real school. 20K+ lines in a shipping edge AI product.

My Portfolio