Note: Audio length is trimmed to max 9 seconds, using 580 GPU credits.
Consider adding word at beginning of clip that is later trimmable since lipsync does not always start immediately.