Works fine on RTX4090 , Cuda 12.9, python3.12. Takes less than 30s for example below.
· Sign up or log in to comment