Images of the Russian Empire

Colorizing the Prokudin-Gorskii Photo Collection - CS 180

Francesco Crivelli

💡

Context

Sergei Mikhailovich Prokudin-Gorskii (1863-1944) captured color photos of the Russian Empire using glass plates with red, green, and blue filters. Although color printing technology wasn't available at the time, his vision was to combine these images into full-color photos. The Library of Congress later digitized these glass plate negatives.

Overview

The goal of this project is to automatically align and merge these digitized images to produce high-quality color photos with minimal artifacts.

My Implementation

Using l2-norm, the green and red channels are aligned to the blue channel using exhaustive search for small images.

For large images, an image pyramid algorithm is employed, reducing image size at each level to speed up the alignment process. Additionally, automated border cropping helps remove unwanted edges for a cleaner final image.

Methodology

This project involved reconstructing color images from Prokudin-Gorskii's digitized glass plate negatives by aligning the red, green, and blue channels. The alignment was achieved through a multi-step approach combining exhaustive search, image pyramids, border cropping, and edge detection.

Exhaustive Search:
I began by implementing an exhaustive search within a 15x15 pixel window to align the green and red channels to the blue channel, using the L2 norm (SSD) to evaluate alignment accuracy.

Image Pyramid for Efficiency:
To improve performance, I introduced an image pyramid technique, which recursively downsizes the images to align them at different scales. I made the pyramid levels adaptive such that the coarsest level's resolution was set to approximately 300 pixels. This allowed me to align images at lower resolutions, progressively refining the alignment at higher resolutions. The optimal shift found at each level was scaled and applied to the next finer level, thus significantly speeding up the alignment process without sacrificing accuracy.

Dynamic Border Cropping:
Recognizing the impact of noisy borders on alignment, I implemented an automated border cropping strategy. The border size was dynamically set as 15% of the image size at each pyramid level to account for variations in resolution. This step helped to remove edge artifacts and focus the alignment on the central part of the images, where the content is most important.

Normalization and Window Refinement:
For each pyramid level, the window size was refined dynamically, starting with a larger window at coarser resolutions and reducing it at finer resolutions. This allowed for more precise shifts at higher resolutions and ensured the process remained computationally efficient while maintaining alignment accuracy.