News · Product Launch

Introducing Sonilo v1.0
our first video-to-music foundation model.

Overview

We recently launched Sonilo v1.0, our first-generation video-to-music foundation model. Built as an end-to-end video-conditioned music generation system, Sonilo v1.0 takes video as the primary input modality and directly generates full-length music that aligns with the video's structure. Unlike prompt-based systems, it learns from visual signals — including motion, pacing, transitions, and emotional cues — to produce music that naturally fits the narrative flow.

Compared with traditional music sourcing or prompt-driven generation tools, Sonilo v1.0 delivers a step-change in workflow efficiency and alignment quality. It eliminates the need for manual searching, trimming, and synchronization, enabling creators to generate perfectly timed music in seconds. The result is a more seamless, scalable, and intuitive creative process for video-first content production.

Key highlights

  1. 01

    Native Video-to-Music Generation

    Sonilo v1.0 generates music directly from video without requiring text prompts. The output is inherently aligned with the video's duration, pacing, and scene transitions, making music a native part of the creation pipeline rather than a post-production step.

  2. 02

    Strong Temporal and Emotional Alignment

    The model captures fine-grained temporal dynamics and emotional progression from video content. It synchronizes beats, intensity, and structure with motion, cuts, and narrative arcs, producing music that feels tightly integrated with the visuals.

  3. 03

    Real-Time Creative Workflow Efficiency

    With fast generation speeds — music from video in 20 seconds — Sonilo v1.0 enables rapid iteration for creators. Upload a video and receive a fully matched soundtrack in seconds, dramatically reducing production time compared to traditional workflows.

  4. 04

    Seamless Integration into Creation Pipelines

    Designed for modern creator ecosystems, Sonilo v1.0 integrates into video editing tools, AI workflows, and node-based platforms like ComfyUI, allowing music generation to happen directly within the content creation process.

  5. 05

    Commercial-Ready Output

    The model produces full-length, production-quality music suitable for creator content, marketing videos, and commercial use cases — supporting both individual creators and enterprise-scale applications.

Try Sonilo v1.0 at sonilo.com.