Video Processing
Process full videos with real-time preview and audio preservation.
Quick Recast processes videos frame by frame, applying face replacement to every frame in the video. This guide covers video-specific features including input formats, trimming, audio handling, progress tracking, and output configuration.
Video Input
To process a video, drag a video file onto the Target drop zone in the Quick Recast interface. You can also click the drop zone to open a file browser.
Supported Formats
Quick Recast supports the following video formats:
| Format | Extension | Notes |
|---|---|---|
| MPEG-4 | .mp4 | Most common format, recommended for best compatibility |
| AVI | .avi | Widely supported, larger file sizes |
| QuickTime | .mov | Common on macOS, Apple ecosystem |
| Matroska | .mkv | Open format, supports multiple tracks |
| Windows Media | .wmv | Windows platform format |
| Flash Video | .flv | Legacy web video format |
MP4 Recommended
Video Trimming
You do not need to process an entire video. Quick Recast includes a built-in video trimming tool that lets you select a specific range of frames to process.
Trim Range Slider
When a video is loaded, a dual-handle range slider appears below the target preview. The slider represents the full duration of the video:
- Left handle: Sets the start frame. Drag it right to skip the beginning of the video.
- Right handle: Sets the end frame. Drag it left to skip the end of the video.
- Timecodes: The start and end timecodes are displayed above the slider, showing the exact position in hours:minutes:seconds:frames format.
Only the frames within the selected range will be processed. This is useful for focusing on specific scenes, testing different models on a short clip before committing to the full video, or skipping intro/outro segments.
Test with a Short Clip First
Audio Preservation
Quick Recast automatically preserves the original audio from your source video. The pipeline works as follows:
Audio detection
Before processing, Recaster uses ffprobe to check whether the video contains an audio stream.
Video-only processing
The video frames are processed through the face swap pipeline without touching the audio. Each frame is replaced individually.
Audio muxing
After all frames are processed, the original audio track is combined ("muxed") with the processed video using ffmpeg. The result contains both the swapped video and the original audio.
FFmpeg Required for Audio
Processing Progress
Video processing provides detailed real-time feedback through several indicators:
Progress Bar
A progress bar shows the current completion percentage. Below the bar, the current frame number and total frame count are displayed (for example, "Frame 450/1800").
Real-Time Preview
The before/after comparison view updates periodically as frames are processed. You can see intermediate results without waiting for the entire video to finish. This allows you to spot quality issues early and stop processing if needed.
Stopping Processing
Click the Stop button at any time to cancel video processing. Frames that have already been processed are kept — you will get a partial video with all completed frames up to the point where you stopped.
Batch Processing Optimization
When processing videos that contain multiple faces per frame, Recaster uses batch processing to speed things up. Instead of processing each face one at a time, multiple faces are grouped and processed in a single GPU call. This optimization provides a 30-50% speed improvement for multi-face scenes.
Batch processing is automatic and works with InSwapper models. Ghost-style models use sequential processing by design, as each swap requires a unique source image reference.
Output Settings
The processed video is saved with the following defaults:
- Location: Same directory as the target file
- Filename: Original name with a
_recastsuffix (e.g.,scene_01_recast.mp4) - Format: Same as input format
- Frame rate: Preserved from original
- Audio: Original audio track muxed back in
You can customize the output location by clicking the folder icon next to the output path field before starting processing.
Performance Tips
Video processing speed depends on several factors. Here are tips to optimize performance:
| Tip | Impact |
|---|---|
| Use InSwapper 128-FP16 | 20-30% faster than standard InSwapper with minimal quality loss |
| Set detection to Fast (320px) | 2-3x faster face detection per frame |
| Disable enhancement for drafts | Skip the enhancement pass to save 30-50% processing time |
| Trim video to target scene | Only process the frames you actually need |
| Use remote processing | Cloud GPUs (RTX 3090/4090) process 5-10 FPS vs local lower-end GPUs |
Remote Video Processing Studio
Studio tier users can process videos on cloud GPUs. The workflow consists of three phases:
Upload
Source and target files are uploaded to the cloud instance via rsync. A progress indicator shows the upload percentage.
Process
The video is processed on the cloud GPU. Preview snapshots are streamed back at configurable intervals (default: every 5 seconds) so you can monitor results in real time.
Download
The final processed video is downloaded back to your local machine. A progress indicator shows the download percentage.
Internet Connection Required
Preview Update Interval
When processing remotely, you can configure how often preview snapshots are sent back from the cloud instance. The interval is set in seconds (range: 1-60 seconds, default: 5 seconds). Lower values give more frequent updates but increase network usage.
The interval is automatically converted to a frame count based on the video's FPS. For example, at 30fps with a 5-second interval, a preview snapshot is saved every 150 frames.
Typical Processing Times
Processing speed varies based on GPU, video resolution, number of faces, and model choice. Here are approximate times for common scenarios on cloud GPUs (RTX 3090/4090):
| Video | Upload | Processing | Download | Total |
|---|---|---|---|---|
| 30s @ 30fps | ~15s | ~90s | ~45s | ~2.5 min |
| 2 min @ 30fps | ~60s | ~6 min | ~3 min | ~10 min |
Was this page helpful?