Mystarion AI
  • Image Tools
    Text to Image
    Image to Image
    Image Upscaler
    Background Remover

    Image Models

    Nano Banana
    Z-Image Turbo
    Nano Banana Pro
    Seedream 4.5
    Flux 2 Pro
    Seedream 4.0
    Flux Kontext Pro
    GPT Image 1.5
    Grok Imagine
    Qwen Image Edit
  • Video Tools
    Text to Video
    Image to Video
    Reference to Video

    Video Models

    Veo 3.1
    Sora 2
    Seedance 2.0
    Seedance 1.5 Pro
    Kling 2.6
    Kling 2.5 Turbo
    Wan 2.6
    Wan 2.5
    Hailuo 2.3
  • Pricing
  • My Creations
  • AI Studio


Bring your imagination to life with Mystarion AI, creating stunning videos and images effortlessly.

© 2026 Mystarion AI All Rights Reserved.

Reference to Video

Credits required: 20 Credits

Reference to Video AI

Turn reference images or videos into coherent AI clips with Veo 3.1 and Wan 2.6

FEATURES

Why Use Reference to Video

Reference to Video is built for visual consistency: upload multiple references, describe motion clearly, and generate fast 8-second clips suitable for social, concept, and creative workflows.

1-3 Reference Images

Use one to three images as visual anchors so subjects, style, and key composition cues remain more consistent across the generated clip.

Veo 3.1 Fast Pipeline

Runs on Veo 3.1 fast generation mode for a practical balance between turnaround speed and stable motion quality in everyday production.

Aspect Ratio Control

Choose Auto, 16:9, or 9:16 to match your distribution target, from landscape explainers to vertical short-form social formats.

Audio-Capable Output

Generated videos can include audio, with behavior determined by the upstream model, so clips arrive closer to review-ready.

Prompt Translation Enabled

Requests are submitted with translation enabled to improve prompt interpretation reliability in multilingual workflows and global teams.

Fixed 8s Duration

A fixed 8-second output keeps timing predictable for iteration, storyboard tests, and quick side-by-side model comparisons.

How It Works

Generate in 3 Steps

Upload references, describe motion, and create a ready-to-review clip in minutes.

🖼️STEP 1

Upload References

Add 1 to 3 reference images that define the subject, mood, and visual direction for your generated video.

✍️STEP 2

Write Motion Prompt

Describe camera movement, subject behavior, and transition intent. Set aspect ratio, then submit generation.

🎬STEP 3

Review and Download

Preview the 8-second result, inspect details in history, and download the clip for editing or publishing.

FAQ

Frequently Asked Questions

You can upload 1 to 3 reference images. At least one image is required, and uploads above three images are rejected by validation.
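The upload rule above can be sketched as a small validation helper. This is a minimal illustration, not the site's actual validator: the function name, constants, and error messages are hypothetical, and only the 1-to-3 limit comes from this page.

```python
# Hypothetical constants; only the 1-to-3 range is stated on this page.
MIN_REFERENCES = 1
MAX_REFERENCES = 3


def validate_reference_count(image_paths: list[str]) -> list[str]:
    """Return the list unchanged if it holds 1 to 3 reference images."""
    count = len(image_paths)
    if count < MIN_REFERENCES:
        raise ValueError("At least one reference image is required.")
    if count > MAX_REFERENCES:
        raise ValueError(
            f"At most {MAX_REFERENCES} reference images are allowed, got {count}."
        )
    return image_paths
```

Calling `validate_reference_count(["subject.png", "style.png"])` returns the list as-is, while an empty list or four paths raises `ValueError`.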

This page uses Veo generation type REFERENCE_2_VIDEO with the fast Veo 3.1 model profile, optimized for guided reference-based motion generation.
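To make the setup above concrete, here is a rough sketch of what a request body might look like. The generation type `REFERENCE_2_VIDEO`, the fixed 8-second duration, the aspect ratio options, and translation being enabled come from this page; every field name, the model identifier string, and the `build_request` function are assumptions for illustration and do not describe a documented API.

```python
FIXED_DURATION_SECONDS = 8  # fixed duration, per this page
ALLOWED_ASPECT_RATIOS = ("Auto", "16:9", "9:16")  # options listed on this page


def build_request(prompt: str, image_urls: list[str], aspect_ratio: str = "Auto") -> dict:
    """Assemble a hypothetical request body for a reference-to-video job."""
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"Unsupported aspect ratio: {aspect_ratio!r}")
    return {
        "generationType": "REFERENCE_2_VIDEO",  # named on this page
        "model": "veo-3.1-fast",                # hypothetical identifier
        "prompt": prompt,
        "imageUrls": list(image_urls),          # hypothetical field name
        "aspectRatio": aspect_ratio,            # hypothetical field name
        "durationSeconds": FIXED_DURATION_SECONDS,
        "enableTranslation": True,              # translation is on by default
    }
```

Duration and translation are hard-coded rather than exposed as parameters, which mirrors the page's description of them as fixed in this setup.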

Duration cannot be changed in this model setup. It is fixed at 8 seconds for predictable iteration speed and stable workflow behavior across repeated runs.

You can choose Auto, 16:9, or 9:16. Auto is convenient for quick tests, while explicit aspect ratios are better for production delivery targets.
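One way to pick an explicit ratio for a delivery target is to derive it from the target frame size. The helper below is a hypothetical sketch: the function name and the fallback to Auto are assumptions, and only the three option values come from this page.

```python
def pick_aspect_ratio(target_width: int, target_height: int) -> str:
    """Map a target frame size to one of the page's options: Auto, 16:9, 9:16."""
    if target_width <= 0 or target_height <= 0:
        raise ValueError("Width and height must be positive.")
    # Cross-multiply to compare ratios exactly without floating-point division.
    if target_width * 9 == target_height * 16:
        return "16:9"
    if target_width * 16 == target_height * 9:
        return "9:16"
    # Any other shape falls back to Auto (an assumption for this sketch).
    return "Auto"
```

For example, a 1920x1080 landscape target maps to `"16:9"`, a 1080x1920 vertical target maps to `"9:16"`, and a square 1080x1080 target falls back to `"Auto"`.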

Prompt translation is on by default. Requests are sent with translation enabled to improve prompt interpretation consistency when prompts are not originally written in English.

The upstream pipeline can generate audio along with the video. In rare sensitive scenarios, provider policy may still suppress the audio.

This model does not appear on other pages. It is scoped to the dedicated Reference to Video page, so the existing Text to Video and Image to Video model lists remain unchanged.

Use clear, high-quality references and precise motion prompts. Explicit camera verbs and scene intent usually improve consistency and reduce random drift.