Simulating analogue film damage to analyse and improve artefact restoration on high-resolution scans

Abstract

Digital scans of analogue photographic film typically contain artefacts such as dust and scratches. Automated removal of these is an important part of preservation and dissemination of photographs of historical and cultural importance.

While state-of-the-art deep learning models have shown impressive results in general image inpainting and denoising, film artefact removal is an understudied problem. It has particularly challenging requirements, due to the complex nature of analogue damage, the high resolution of film scans, and potential ambiguities in the restoration. There are no publicly available high-quality datasets of real-world analogue film damage for training and evaluation, making quantitative studies impossible.

We address the lack of ground-truth data for evaluation by collecting a dataset of 4K damaged analogue film scans paired with manually-restored versions produced by a human expert, allowing quantitative evaluation of restoration performance. We construct a larger synthetic dataset of damaged images with paired clean versions using a statistical model of artefact shape and occurrence learnt from real, heavily-damaged images. We carefully validate the realism of the simulated damage via a human perceptual study, showing that even expert users find our synthetic damage indistinguishable from real. In addition, we demonstrate that training with our synthetically damaged dataset leads to improved artefact segmentation performance when compared to previously proposed synthetic analogue damage.

Finally, we use these datasets to train and analyse the performance of eight state-of-the-art image restoration methods on high-resolution scans. We compare both methods which directly perform the restoration task on scans with artefacts, and methods which require a damage mask to be provided for the inpainting of artefacts.

FILM-R: Film Image Library with Manual Restorations

Damage Segmentation Task

Segmentation from U-Net trained on our synthetically damaged data.

Segmentation by BOPB [WZC*20].

Segmentation from U-Net trained on damage overlays by DeepRemaster [ISS19].

Approximate ground truth: binarised difference of damaged and manually restored scans.

Qualitative comparison of segmentations of artefacts from our authentic damage dataset.

More examples

Segmentation from U-Net trained on our synthetically damaged data.

Segmentation by BOPB [WZC*20].

Segmentation from U-Net trained on damage overlays by DeepRemaster [ISS19].

Approximate ground truth: binarised difference of damaged and manually restored scans.

Segmentation from U-Net trained on our synthetically damaged data.

Segmentation by BOPB [WZC*20].

Segmentation from U-Net trained on damage overlays by DeepRemaster [ISS19].

Approximate ground truth: binarised difference of damaged and manually restored scans.

Segmentation from U-Net trained on our synthetically damaged data.

Segmentation by BOPB [WZC*20].

Segmentation from U-Net trained on damage overlays by DeepRemaster [ISS19].

Approximate ground truth: binarised difference of damaged and manually restored scans.

Damage Restoration Task

Input: 4K film scan with authentic damage (hairs and dirt)

Segmentation from our U-Net trained on synthetically damaged data.

Segmentationby BOPB [WZC*20].

Restoration by U-Net [ISW22]: using originally provided model weights.

Restoration by BOPB [WZC*20]: using our predicted segmentation.

Restoration by BOPB [WZC*20]: using their predicted segmentation.

Restoration by U-Net [ISW22]: retrained on our synthetic damage.

Restoration by LaMa [SLM*22]: best performing model, using our predicted segmentation.

Restoration by Stable Diffusion [RBL*21]: using our predicted segmentation.

Restoration by BVMR [ISW22]:retrained on our synthetic damage.

Restoration by RePaint [LDR*22]: using our predicted segmentation.

Ground Truth: manually restored in Photoshop by a human expert

Input and ground truth from our authentic artefact damage dataset, along with restorations from some of the models we evaluated, presented at full resolution. Mouse over to zoom in.

More examples

Input: 4K film scan with authentic damage (hairs and dirt)

Segmentation from our U-Net trained on synthetically damaged data.

Segmentationby BOPB [WZC*20].

Restoration by U-Net [ISW22]: using originally provided model weights.

Restoration by BOPB [WZC*20]: using our predicted segmentation.

Restoration by BOPB [WZC*20]: using their predicted segmentation.

Restoration by U-Net [ISW22]: retrained on our synthetic damage.

Restoration by LaMa [SLM*22]: best performing model, using our predicted segmentation.

Restoration by Stable Diffusion [RBL*21]: using our predicted segmentation.

Restoration by BVMR [ISW22]:retrained on our synthetic damage.

Restoration by RePaint [LDR*22]: using our predicted segmentation.

Ground Truth: manually restored in Photoshop by a human expert

Input: 4K film scan with authentic damage (hairs and dirt)

Segmentation from our U-Net trained on synthetically damaged data.

Segmentationby BOPB [WZC*20].

Restoration by U-Net [ISW22]: using originally provided model weights.

Restoration by BOPB [WZC*20]: using our predicted segmentation.

Restoration by BOPB [WZC*20]: using their predicted segmentation.

Restoration by U-Net [ISW22]: retrained on our synthetic damage.

Restoration by LaMa [SLM*22]: best performing model, using our predicted segmentation.

Restoration by Stable Diffusion [RBL*21]: using our predicted segmentation.

Restoration by BVMR [ISW22]:retrained on our synthetic damage.

Restoration by RePaint [LDR*22]: using our predicted segmentation.

Ground Truth: manually restored in Photoshop by a human expert

BibTeX

@article{ivanova23analogue, title = {Simulating analogue film damage to analyse and improve artefact restoration on high-resolution scans}, author = {Daniela Ivanova and John Williamson and Paul Henderson}, year = 2023, journal = {Computer Graphics Forum (Proc. Eurographics 2023)}, volume = {42}, number = {2}, doi = {https://doi.org/10.1111/cgf.14749}, copyright = {Creative Commons Attribution 4.0 International} }

Simulating analogue film damage to analyse and improve artefact restoration on high-resolution scans

This work proposes a statistical model of analogue film damage (e.g. scratches, dust, hairs), which can be then used to train and evaluate models at the task of analogue film damage detection and restoration at high resolutions.

Abstract

Contents

FILM-R: Film Image Library with Manual Restorations

FILM-AA: Film Image Library with Manual Artefact Annotations

Damage Segmentation Task

Damage Restoration Task

BibTeX