Video Merger

The Video Merger node concatenates multiple videos into a single output video, with support for custom ordering, resolution control, and audio management.

What does the Video Merger node do?

The Video Merger node takes multiple video files and concatenates them into a single video. It handles differences in codec, resolution, frame rate, and aspect ratio automatically, so you can combine videos from different sources seamlessly.

Common use cases:

Combining intro, main content, and outro clips into a single video
Merging multiple AI-generated video segments (e.g., from Image to Video nodes) into one continuous video
Assembling a compilation from videos stored in a Static Video node or Google Drive
Creating video slideshows from a sequence of short clips processed through a Loop

Quick setup

Add the node to the canvas

Open the Node Library, go to Tools > Video Processing, then drag and drop the Video Merger node onto your workspace.

Choose your input mode

Open the node settings. By default, the node accepts a single input (an array of video URLs). Enable Multiple Inputs to create 2–10 separate input handles, useful when combining videos from different nodes.

Connect your videos

Single input mode: Connect one node that outputs an array of video URLs (e.g., Static Video, or a Loop output).

Multi-input mode: Connect each upstream video source to its own input handle (video_1, video_2, etc.). Use the drag & drop interface in the settings to reorder the clips.

Configure output settings

Choose the output resolution (Auto detects from source videos, or pick a specific resolution). Select whether to keep or mute the audio track.

Connect the output

Connect the output to the next node. The merged video is available as a JSON object with the video URL, MIME type, and metadata.

Configuration parameters

Required fields

Name string required default: Video Merger

Node name — A label to identify this node on the canvas.

Description string required default: Concatenate multiple videos into one

Node description — A short description of what this merge operation does.

Optional fields

Multiple Inputs boolean default: false

Multiple inputs mode — When enabled, creates separate input handles (video_1, video_2, etc.) instead of a single array input. Useful when combining videos from different nodes. You can add between 2 and 10 inputs.

Output Resolution select default: auto

Output resolution — The resolution of the merged video. Options:

Auto — Automatically matches the source videos’ resolution
480p — 854×480
720p — 1280×720
1080p — 1920×1080
1440p — 2560×1440
2160p — 3840×2160 (4K)

Audio select default: keep

Audio — How to handle audio tracks:

Keep — Preserves the audio from all input videos
Mute — Removes all audio from the output video

Tip

In multi-input mode, use the drag & drop interface in the settings to reorder your video clips. The order in the list determines the order in the final merged video.

What does the node output?

The node outputs a JSON array containing the merged video information:

[
  {
    "video": "merged_output.mp4",
    "mimeType": "video/mp4",
    "video_count": 3,
    "resolution": "1080p"
  }
]

video string

The filename of the merged video.

mimeType string

The MIME type of the output video (typically video/mp4).

video_count number

The number of videos that were merged.

resolution string

The resolution of the output video.

Usage examples

Example 1: Merge intro + content + outro

You have three separate video clips and want to combine them into a final video.

Configuration:

Multiple Inputs: Enabled, 3 inputs
Video order: video_1 (Intro), video_2 (Content), video_3 (Outro)
Output Resolution: 1080p
Audio: Keep

Connect each clip to its respective input handle and reorder them using drag & drop if needed.

Example 2: Combine AI-generated video segments

You generate multiple short video clips from images using the Image to Video node inside a Loop, then merge them all.

Workflow:

Static Image — Upload source images
Loop — Iterate over each image
Image to Video — Generate a video clip per image
Video Merger — Combine all clips into one video (single input mode, receives the array from the Loop)

Configuration:

Multiple Inputs: Disabled (single array input)
Output Resolution: Auto
Audio: Mute

Example 3: Create a compilation from Google Drive videos

Pull videos from Google Drive and merge them into a compilation.

Workflow:

Google Drive Reader — Fetch video files
Video Merger — Concatenate them in order

Best practices

Tip

Use “Auto” resolution unless you need a specific output size. Auto resolution avoids unnecessary re-encoding and produces faster results.

Tip

Use multi-input mode when you need precise control over the order of clips from different sources. Use single-input mode when merging a dynamic list of videos (e.g., from a Loop).

Warning

Keep video count reasonable. Merging many large videos takes time. The node has an auto-timeout for very long operations. If you hit timeouts, try reducing the number of videos or lowering the resolution.

Warning

Different formats are normalized automatically, but this adds processing time. For fastest results, use videos with the same codec, resolution, frame rate, and aspect ratio.

Common issues

The merge takes a very long time or times out

Cause: Too many videos, very high resolution, or mixed formats requiring heavy re-encoding.

Solution: Reduce the number of videos, lower the output resolution, or ensure input videos share the same format. The node normalizes codec, FPS, and aspect ratio automatically, but this adds processing time.

The output video has no audio

Cause: The Audio setting is set to “Mute”, or the source videos don’t have audio tracks.

Solution: Check the Audio setting and make sure it’s set to “Keep”. If source videos are silent, the output will also be silent regardless of this setting.

Videos are in the wrong order

Cause: In multi-input mode, the default order may not match your intended sequence.

Solution: Open the node settings and use the drag & drop interface to reorder the video clips. The list order directly controls the output order.

The node rejects my input connection

Cause: The connected node doesn’t output video content (e.g., it outputs text or images).

Solution: Make sure the upstream node produces video output. The Video Merger accepts video from: Static Video, File input (with video files), Google Drive Reader (filtered to videos), Image to Video, and other video-producing nodes.

How does it fit into a workflow?

Video Merger is typically used at the end of a video production pipeline:

graph LR
    Source1[Video Source 1] --> Merger[Video Merger]
    Source2[Video Source 2] --> Merger
    Source3[Video Source 3] --> Merger
    Merger --> Output[Final Video]

Or after a Loop that generates multiple clips:

graph LR
    Images[Source Images] --> Loop[Loop]
    Loop --> I2V[Image to Video]
    I2V --> Merger[Video Merger]

Static Video

Provide fixed video files as input to the Video Merger.

Image to Video

Generate video clips from images, then merge them together.

Loop

Generate multiple video clips in a loop before merging them.

Extract Video Frame

Extract frames from the merged video for thumbnails or further processing.

Video Merger

What does the Video Merger node do?

Quick setup

Add the node to the canvas

Choose your input mode

Connect your videos

Configure output settings

Connect the output

Configuration parameters

Required fields

Optional fields

What does the node output?

Usage examples

Example 1: Merge intro + content + outro

Example 2: Combine AI-generated video segments

Example 3: Create a compilation from Google Drive videos

Best practices

Common issues

How does it fit into a workflow?

Related nodes