VINO

Junyi Chen1, Tong He1, Zhoujie Fu3, Pengfei Wan2, Kun Gai2, Weicai Ye2✉️
1Shanghai Jiao Tong University, 2Kling Team, Kuaishou Technology 3Nanyang Technology University, ✉️Corresponding Author

What is VINO

VINO is a unified visual generator designed to do image and video generation and editing. Built on a single architecture, VINO integrates high-level text instructions, reference images, and video context to create high-quality visuals with impressive flexibility. VINO excels at generating content that aligns with user prompts, making it perfect for a wide range of creative tasks. Experience the future of visual content creation with VINO — where imagination meets innovation. Be patient for loading video on this website.

Image Generation

Place your mouse on image to see the prompt.

Video Generation

Place your mouse on video to see the prompt.

Customized Video Generation

Generate customized videos by specifying reference images.

Image Editing

Place your mouse on image to see the edited result.

Instruction-Based Video Editing

Edit videos through instruction

Image Ref Video Editing

Edit videos by providing reference image

Video generation driven by reference video

Generative videos by providing reference video (motion/expression/camera clone)

Understand then Generate

Place your mouse on video to see the prompt.