SoraWatermarkCleaner

This project provides an elegant way to remove the sora watermark in the sora2 generated videos.

Case1(25s)	keigo_matsumaru_sora_watermark_removed_10mb.mp4
Case2(10s)	man_in_snowmountain_sora_watermark_removed_combined.mp4
Case3(10s)	STH_soar2_sora_watermark_removed_combined.mp4

Commercial Hosted Service & Sponsorship

If you prefer a one-click online service instead of running everything locally, you can use the hosted Sora watermark remover here:

👉 https://www.sorawatermarkremover.ai/

SoraWatermarkRemover runs SoraWatermarkCleaner under the hood and provides GPU-backed processing, credits-based pricing and an easy web UI. This service financially supports the ongoing development and maintenance of SoraWatermarkCleaner.

⭐️:

I'm excited to release DeMark-World – to the best of my knowledge, the first model capable of removing any watermark from AI-generated videos.
We have provided another model which could preserve time consistency without flicker!
We support batch processing now.
For the new watermark with username, the Yolo weights has been updated, try the new version watermark detect model, it should work better.
We have uploaded the labelled datasets into huggingface, check this dataset out. Free free to train your custom detector model or improve our model!
One-click portable build is available — Download here for Windows users! No installation required.

💝 If you find this project helpful, please consider buying me a coffee to support the development!

1. Method

The SoraWatermarkCleaner(we call it SoraWm later) is composed of two parsts:

SoraWaterMarkDetector: We trained a yolov11s version to detect the sora watermark. (Thank you yolo!)
WaterMarkCleaner: We refer iopaint's implementation for watermark removal using the lama model.

(This codebase is from https://github.com/Sanster/IOPaint#, thanks for their amazing work!)

Our SoraWm is purely deeplearning driven and yields good results in many generated videos.

2. Installation

FFmpeg is needed for video processing, please install it first. We highly recommend using the uv to install the environments:

installation:

uv sync

now the envs will be installed at the .venv, you can activate the env using:
source .venv/bin/activate

Downloaded the pretrained models:

The trained yolo weights will be stored in the resources dir as the best.pt. And it will be automatically download from https://github.com/linkedlist771/SoraWatermarkCleaner/releases/download/V0.0.1/best.pt . The Lama model is downloaded from https://github.com/Sanster/models/releases/download/add_big_lama/big-lama.pt, and will be stored in the torch cache dir. Both downloads are automatic, if you fail, please check your internet status.

Batch processing Use the cli.py for batch processing

python cli.py [-h] -i INPUT -o OUTPUT [-p PATTERN] [-m MODEL] [--quiet]

examples:

# Process all .mp4 files in input folder
python cli.py -i /path/to/input -o /path/to/output
# Process all .mov files
python cli.py -i /path/to/input -o /path/to/output --pattern "*.mov"
# Process all video files (mp4, mov, avi)
python cli.py -i /path/to/input -o /path/to/output --pattern "*.{mp4,mov,avi}"
# Use e2fgvi_hq model for time-consistent results (slower, requires CUDA)
python cli.py -i /path/to/input -o /path/to/output --model e2fgvi_hq
# Without displaying the Tqdm bar inside sorawm procrssing.
python cli.py -i /path/to/input -o /path/to/output --quiet

3. One-Click Portable Version

For users who prefer a ready-to-use solution without manual installation, we provide a one-click portable distribution that includes all dependencies pre-configured.

Download Links

Google Drive:

Download from Google Drive

Baidu Pan (百度网盘) - For users in China:

Link: https://pan.baidu.com/s/1onMom81mvw2c6PFkCuYzdg?pwd=jusu
Extract Code (提取码): jusu

Features

✅ No installation required
✅ All dependencies included
✅ Pre-configured environment
✅ Ready to use out of the box

Simply download, extract, and run!

4. Performance Optimization

We provide several options to speed up processing:

Detector	Batch	Cleaner	TorchCompile	Bf16	Time (s)	Speedup
YOLO	×	LAMA	×	×	44.33	-
YOLO	×	E2FGVI	×	×	142.42	1.00×
YOLO	×	E2FGVI	✓	×	117.19	1.22×
YOLO	4	E2FGVI	✓	×	82.63	1.72×
YOLO	4	E2FGVI	✓	✓	58.60	2.43×

Speedup is calculated relative to the E2FGVI baseline. LAMA uses a different cleaning approach and is not directly comparable.

YOLO Batch Detection: Default batch size is 4 (detect_batch_size=4), enables batch inference for watermark detection, provides ~40% speedup
TorchCompile (E2FGVI only): Enabled by default (enable_torch_compile=True), provides ~22% speedup
Bf16 Inference (E2FGVI only): Enable with use_bf16=True(Default False), provides up to 2.43× speedup. Note: quality may slightly decrease, and the first inference will be slow (~90s) due to compilation overhead; subsequent runs will be much faster (~58s) as artifacts are cached.

You can customize these settings when initializing SoraWM:

from sorawm.core import SoraWM
from sorawm.schemas import CleanerType

# LAMA with batch detection (fast)
sora_wm = SoraWM(
    cleaner_type=CleanerType.LAMA,
    detect_batch_size=4  # default: 4
)

# E2FGVI_HQ with all optimizations (time-consistent)
sora_wm = SoraWM(
    cleaner_type=CleanerType.E2FGVI_HQ,
    enable_torch_compile=True,  # default: True
    detect_batch_size=8         # custom batch size
)

# E2FGVI_HQ with bf16 for maximum speed (may have slight quality loss)
sora_wm = SoraWM(
    cleaner_type=CleanerType.E2FGVI_HQ,
    enable_torch_compile=True,
    detect_batch_size=4,
    use_bf16=True  # enables bfloat16 inference
)

5. Demo

To have a basic usage, just try the example.py:

We provide two models to remove watermark. LAMA is fast but may have flicker on the cleaned area, which E2FGVI_HQ compromise this only requires cuda otherwise very slow on CPU or MPS.

from pathlib import Path

from sorawm.core import SoraWM
from sorawm.schemas import CleanerType

if __name__ == "__main__":
    input_video_path = Path("resources/dog_vs_sam.mp4")
    output_video_path = Path("outputs/sora_watermark_removed")
    
    # 1. LAMA is fast and good quality, but not time consistent.
    sora_wm = SoraWM(cleaner_type=CleanerType.LAMA)
    sora_wm.run(input_video_path, Path(f"{output_video_path}_lama.mp4"))
    
    # 2. E2FGVI_HQ ensures time consistency, but will be very slow on no-cuda device.
    sora_wm = SoraWM(cleaner_type=CleanerType.E2FGVI_HQ)
    sora_wm.run(input_video_path, Path(f"{output_video_path}_e2fgvi_hq.mp4"))

We also provide you with a streamlit based interactive web page, try it with:

We also provide the switch here.

streamlit run app.py

Batch processing is also supported, now you can drag a folder or select multiple files to process.

6. WebServer

Here, we provide a FastAPI-based web server that can quickly turn this watermark remover into a service.

We also have a frontUI for the webserver, to try this:

cd frontend && bun install && bun run build

And then start the server, the frontend UI will be just ready in root route:

The task statuses are recoreded and can resume when server is down.

Simply run:

python start_server.py

The web server will start on port 5344.

You can view the FastAPI documentation for more details.

There are three routes available:

submit_remove_task

After uploading a video, a task ID will be returned, and the video will begin processing immediately.

get_results

You can use the task ID obtained above to check the task status.

It will display the percentage of video processing completed.

Once finished, the returned data will include a download URL.

download

You can use the download URL from step 2 to retrieve the cleaned video.

7. Datasets

We have uploaded the labelled datasets into huggingface, check this out https://huggingface.co/datasets/LLinked/sora-watermark-dataset. Free free to train your custom detector model or improve our model!

8. API

Packaged as a Cog and published to Replicate for simple API based usage.

9. License

Apache License

10. Citation

If you use this project, please cite:

@misc{sorawatermarkcleaner2025,
  author = {linkedlist771},
  title = {SoraWatermarkCleaner},
  year = {2025},
  url = {https://github.com/linkedlist771/SoraWatermarkCleaner}
}

11. Acknowledgments

IOPaint for the LAMA implementation
Ultralytics YOLO for object detection

Name		Name	Last commit message	Last commit date
Latest commit History 154 Commits
.streamlit		.streamlit
assests		assests
benchmark		benchmark
datasets		datasets
ffmpeg		ffmpeg
frontend		frontend
mds		mds
notebooks		notebooks
profile		profile
resources		resources
scirpts		scirpts
sorawm		sorawm
tests		tests
train		train
version		version
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
app.py		app.py
cli.py		cli.py
example.py		example.py
model_version.json		model_version.json
one-click-portable.md		one-click-portable.md
pyproject.toml		pyproject.toml
start_server.py		start_server.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SoraWatermarkCleaner

1. Method

2. Installation

3. One-Click Portable Version

Download Links

Features

4. Performance Optimization

5. Demo

6. WebServer

7. Datasets

8. API

9. License

10. Citation

11. Acknowledgments

About

Uh oh!

Releases 4

Packages

Contributors 7

Uh oh!

Languages

License

linkedlist771/SoraWatermarkCleaner

Folders and files

Latest commit

History

Repository files navigation

SoraWatermarkCleaner

1. Method

2. Installation

3. One-Click Portable Version

Download Links

Features

4. Performance Optimization

5. Demo

6. WebServer

7. Datasets

8. API

9. License

10. Citation

11. Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 7

Uh oh!

Languages

Packages