Enhancement Options
GFPGAN, CodeFormer, GPEN, and RestoreFormer++ face enhancement.
Face enhancement is an optional post-processing step that improves the quality of swapped faces. After the face swap model replaces the source identity onto the target, an enhancement model restores facial details, corrects artifacts, and improves overall sharpness. Recaster supports multiple enhancement models with different characteristics and quality levels.
How Enhancement Works
The enhancement step runs after the face swap and operates on the swapped face region only. The pipeline flow is:
The enhancement result is blended with the swap result using configurable blend and weight controls. This lets you dial in exactly how much enhancement to apply — from subtle correction to full restoration.
Enabling Enhancement
Enhancement is disabled by default. To enable it:
- In the Quick Recast settings panel, check the Enable Enhancement checkbox
- Select an enhancement model from the dropdown
- Adjust the blend and weight sliders to taste
When enhancement is enabled, models marked with a lock icon require a Studio license. Free tier users can use GFPGAN variants.
Free Tier Enhancers
GFPGAN (Generative Facial Prior GAN) is a GAN-based face restoration model developed by Tencent ARC Lab. It is available in three versions on the Free tier, each improving on the last.
| Name | Tier | Variants | Quality | Description |
|---|---|---|---|---|
| GFPGAN 1.2 | Free | 1.2 | Good | Original GAN-based face restoration. Balanced quality and speed for general use. |
| GFPGAN 1.3 | Free | 1.3 | Good | Improved version with better detail preservation. Recommended default for Free tier. |
| GFPGAN 1.4 | Free | 1.4 | Very Good | Latest GFPGAN with best quality among Free tier enhancers. Improved eye and mouth details. |
GFPGAN Details
GFPGAN works by using a pre-trained StyleGAN2 generator as a facial prior. It takes the degraded (swapped) face as input and restores it to a high-quality version while preserving the identity from the swap. Key characteristics:
- Speed: Fast inference, minimal overhead on top of face swap
- Eye correction: Particularly good at restoring eye details and gaze direction
- Version differences: v1.4 has the best overall quality with improved handling of skin texture, eye details, and mouth region
Recommended Free Enhancer
Studio Tier Enhancers Studio
Studio tier users have access to three additional enhancement models that represent the state of the art in face restoration. These models offer significantly higher quality than GFPGAN, especially for challenging inputs.
| Name | Tier | Variants | Quality | Description |
|---|---|---|---|---|
| CodeFormer | Studio | — | Excellent | Transformer-based enhancement with exceptional detail recovery. Best for heavily degraded faces. |
| GPEN | Studio | 256, 512, 1024, 2048 | Excellent | Progressive enhancement network. Multiple resolutions for different quality/speed tradeoffs. |
| RestoreFormer++ | Studio | ++ | Excellent | State-of-the-art restoration architecture. Best overall quality for face restoration. |
CodeFormer Studio
CodeFormer uses a transformer-based architecture with a learned codebook of facial features. It excels at recovering fine details from heavily degraded or low-resolution faces. Key strengths:
- Detail recovery: Exceptional at restoring texture, pores, and fine facial details
- Artifact removal: Effectively removes common face swap artifacts like blur, color shifts, and edge seams
- Identity preservation: Better at maintaining the source identity compared to pure GAN-based approaches
GPEN Studio
GPEN (GAN Prior Embedded Network) operates at multiple resolutions for different quality and speed tradeoffs. Available in four resolution variants:
| Variant | Resolution | Speed | Best For |
|---|---|---|---|
| GPEN-256 | 256x256 | Fastest | Quick previews and low-resolution targets |
| GPEN-512 | 512x512 | Fast | Standard definition video, web content |
| GPEN-1024 | 1024x1024 | Moderate | HD video, close-up shots |
| GPEN-2048 | 2048x2048 | Slowest | 4K content, maximum detail restoration |
RestoreFormer++ Studio
RestoreFormer++ is the latest iteration of the RestoreFormer architecture. It combines a vector-quantized codebook with multi-head attention for state-of-the-art face restoration. Key advantages:
- Best overall quality: Consistently produces the most natural-looking restorations in comparative tests
- Robustness: Handles extreme degradation, heavy compression artifacts, and low resolution inputs
- Natural skin: Particularly good at producing realistic skin texture without the "plastic" look that some enhancers produce
Choosing an Enhancer
Blend Controls
Two sliders control how the enhancement result is mixed with the face swap result:
Enhancement Blend (0 – 1)
The blend slider controls the mix between the original face swap result and the enhanced version. A value of 0 means no enhancement (pure swap result), and 1 means full enhancement.
| Value | Effect | Use Case |
|---|---|---|
| 0.0 | No enhancement applied | A/B comparison with raw swap |
| 0.3 – 0.5 | Subtle enhancement, retains swap character | When swap quality is already good |
| 0.7 – 0.8 | Strong enhancement, recommended default | General use, best balance |
| 1.0 | Full enhancement, maximum restoration | Heavily degraded or low-res faces |
Enhancement Weight (0 – 1)
The weight slider controls the internal strength of the enhancement model itself. A value of 1.0 applies the model at full strength, while lower values reduce the model's effect before blending. This provides finer control than the blend slider alone.
Recommended Settings
Detection Quality
Detection quality affects how accurately faces are located before both the swap and enhancement steps. Higher detection quality finds more faces and provides tighter face crops, which improves enhancement quality at the cost of speed.
| Setting | Resolution | Speed | Enhancement Impact |
|---|---|---|---|
| Fast | 320px | 2-3x faster | Adequate for close-up faces, may miss small faces |
| Balanced | 640px | Standard | Better face crop accuracy, improved enhancement alignment |
| Accurate | 1024px | Slower | Best face detection, tightest crop for enhancement |
Detection vs Enhancement Quality
Model Comparison Summary
Here is a quick reference comparing all enhancement models:
| Model | Tier | Architecture | Strength | Speed |
|---|---|---|---|---|
| GFPGAN 1.4 | Free | GAN | Eye/mouth detail | Fast |
| CodeFormer | Studio | Transformer | Detail recovery, identity | Moderate |
| GPEN | Studio | GAN (progressive) | Resolution flexibility | Varies by resolution |
| RestoreFormer++ | Studio | Transformer + Codebook | Natural skin, robustness | Moderate |
Performance Considerations
Adding enhancement increases processing time by approximately 30-50% per frame compared to swap only. Here are tips for managing performance:
- Skip for previews: When testing different swap models, disable enhancement to iterate faster. Enable it only for the final render.
- Match model to target: Use lower GPEN resolutions (256/512) for standard definition content. There is no benefit to running GPEN-2048 on a 480p video.
- GPU memory: Enhancement models require additional GPU memory (~1-2 GB). If you encounter out-of-memory errors, try a lower resolution GPEN variant or use GFPGAN instead.
- Remote processing: Studio tier cloud GPUs have ample memory (24 GB) and can run any combination of swap model and enhancer without memory constraints.
Model Download
All enhancement models are downloaded from Hugging Face on first use. Download sizes are typically 60-100 MB per model. After the initial download, models are cached locally and load instantly.
Model cache locations:
- macOS:
~/Library/Application Support/Recaster/models/ - Windows:
%APPDATA%\Recaster\models\ - Linux:
~/.config/Recaster/models/
Was this page helpful?