MaxMark: High-Capacity Diffusion-Native Watermarking via Robust and Invertible Latent Embedding - CVPR

Model Description

This repository contains the model weights from our CVPR paper on latent space watermarking. The models demonstrate robust watermarking capabilities in the latent representation space with different bit configurations for embedding imperceptible watermarks with invertible neural networks.

Model Details

Model Type: INN
Framework: PyTorch
Model Description:
- 16384-bit configuration: latent-4x64x64_margin10.0_wm_16384bits.pth
- 8192-bit configuration: latent-4x64x64_margin10.0_wm_8192bits.pth

Key Features

Latent Space Processing: Operates on 4×64×64 latent representations
Margin Parameter: 10.0 margin for watermark robustness
Multiple Bit Depths: Supports less than 16384 and 8192-bit watermarks, while 8192-bit model has less impact on latent.
Robust Watermarking: Imperceptible watermarks resistant to common attacks

Citation

If you use this model in your research, please cite our paper on OpenReview:

@inproceedings{maxmark2026,
  title={MaxMark: High-Capacity Diffusion-Native Watermarking via Robust and Invertible Latent Embedding},
  author={Xuanhang Chang, Zhonghao Yang, Cheng Zhuo, Yu Li},
  booktitle={CVPR 2026},
  year={2026},
  url={https://openreview.net/pdf?id=8yhxO86boG}
}

License

This model is released under the MIT License.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support