Didn't expect that such a nano-demo would require an Nvidia GPU > 20 series and Linux (WSL) for Triton. Didn't read far /deep (pyproject.toml) enough, so I bumped into the Triton and then NV-GPU>20 ...