The Wan 2.6 Reference-to-Video model generates videos from reference URLs (images or videos), with support for multi-character interaction and role-playing. It generates silent videos by default.
$0.10 to $0.15 per second
video-to-video
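To make the per-second pricing concrete, here is a minimal sketch that estimates the cost range for a clip of a given length. It assumes billing is proportional to the duration of the generated video; the listing does not say what decides where in the range a given request falls, so the sketch reports both bounds. The function name `estimate_cost_range` is hypothetical.

```python
from decimal import Decimal

# Per-second price bounds taken from the listing above.
PRICE_MIN_PER_SEC = Decimal("0.10")
PRICE_MAX_PER_SEC = Decimal("0.15")


def estimate_cost_range(duration_sec: int) -> tuple[Decimal, Decimal]:
    """Return the (lowest, highest) estimated cost in USD for one clip.

    Assumption: cost scales linearly with the clip duration in seconds.
    """
    if duration_sec <= 0:
        raise ValueError("duration_sec must be positive")
    return (PRICE_MIN_PER_SEC * duration_sec, PRICE_MAX_PER_SEC * duration_sec)


low, high = estimate_cost_range(5)
print(f"5-second clip: ${low:.2f} to ${high:.2f}")
# prints "5-second clip: $0.50 to $0.75"
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.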
Input
Negative prompt describing unwanted content
Description of the video content, in Chinese or English, up to 1500 characters. Characters can be referenced in the prompt using the 'Character1/Character2' format.
Video duration in seconds (default: 5)
Array of reference URLs (images and videos combined, at most 5)
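The inputs above could be assembled and checked before submission along these lines. This is only a sketch: the listing does not give the actual parameter names, so the keys `prompt`, `negative_prompt`, `duration`, and `reference_urls`, the helper `build_payload`, and the example URLs are all hypothetical. Requiring at least one reference is also an assumption, and the set of allowed durations is not stated, so only a positive value is enforced.

```python
MAX_PROMPT_CHARS = 1500  # prompt length limit stated in the listing
MAX_REFERENCES = 5       # images and videos combined
DEFAULT_DURATION = 5     # value shown for the duration field


def build_payload(prompt, reference_urls, negative_prompt="",
                  duration=DEFAULT_DURATION):
    """Validate the inputs and return a request body as a dict.

    All key names are placeholders, not the service's confirmed schema.
    """
    if not prompt:
        raise ValueError("prompt must not be empty")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    # Assumption: a reference-to-video request needs at least one reference.
    if not 1 <= len(reference_urls) <= MAX_REFERENCES:
        raise ValueError(f"expected 1 to {MAX_REFERENCES} reference URLs")
    if duration <= 0:
        raise ValueError("duration must be a positive number of seconds")
    return {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "duration": duration,
        "reference_urls": list(reference_urls),
    }


payload = build_payload(
    prompt="Character1 waves to Character2 across a busy street.",
    reference_urls=[
        "https://example.com/person_a.png",  # placeholder URL
        "https://example.com/person_b.mp4",  # placeholder URL
    ],
    negative_prompt="blurry, low resolution",
)
print(payload["duration"], len(payload["reference_urls"]))
```

Validating on the client side catches an oversized prompt or too many references before a billable request is sent.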