alibaba/happyhorse-1.0-i2v

happyhorse-1.0-i2v
Docs

HappyHorse image-to-video model generates physically realistic and smoothly animated video content from a first frame image. The model can optionally use text prompts for guidance, supporting 720P/1080P resolution and 3-15 seconds duration. The output video aspect ratio follows the input first frame image automatically.

$0.1280~$0.2290/sec
image-to-video

Input

Optional text prompt to guide video generation. Supports any language input. Maximum length: 5000 non-Chinese characters or 2500 Chinese characters (automatically truncated if exceeded)
Video duration in seconds. Must be an integer between 3 and 15
5
First frame image URL or base64 encoded image. Supports HTTP/HTTPS URLs and data URI format. Image requirements: Format: JPEG, JPG, PNG, WEBP; Resolution: Width and height >= 300 pixels; Aspect ratio: 1:2.5 ~ 2.5:1; File size: <= 10MB
Hint: Drag and drop files, paste from clipboard (Ctrl/Cmd+V), or provide a URL.
Video resolution level. The model automatically scales to the nearest total pixels based on the selected resolution. The output video aspect ratio approximately matches the input first frame
1080P
Random seed for reproducibility. If not specified, the system generates a random seed. Note: Due to the probabilistic nature of model generation, even with the same seed, results may not be completely identical

Result

No results yet

Run the model to preview the output here.

More in this series