The goal of this project was to implement techniques for image filters and fiddling around with frequencies for creating hybrid images or blending images. Using gaussian filters, I could filter out the higher frequencies of an image, which helped me sharpen them. I also implemented gaussian and laplacian stacks to help blend images together, using a mask.
Above are the finite difference filters in the x and y directions. Let the image be M. Then, the partial derivative in the x direction is G_x = M_x * D_x. The partial derivative in the y direction is G_y = M_y * D_y. To get the gradient magnitude of the image, we simply do G_m = sqrt(G_x^2 + G_y^2). So, the gradient magnitude is just the Euclidean distance of the two partial derivatives. Finally, to turn this into an edge image, we binarize G_m by thresholding it, where we only accept values above a threshold. The results are displayed below:
First, we try the blur then redo the previous methods. For this, we first apply the gaussian filter to the original image. Afterwards, we apply the same methods as before. Compared to the results from part 1.1, these images are significantly smoother. This can be especially seen in the partial derivative images, where they feel less sharp. Also, the gradient magnitude is less noisy, and the binarized image is much more smooth. Below are the results:
Now, we try the second method by creating the DoG filters. As seen below, the results are the same as the previous method. Both methods in part 1.2 produce the same results at the end, because convolution is communitive. So, whether we are applying the gaussian filter to smooth the original image or to the finite difference operators, we will eventually get the same result. Below are the results:
In total, I have sharpened three images. Each row reflects the progression of each image as it is sharpened. Below are the results:
As seen above, as alpha increased, each image got gradually sharper. I wanted to experiment with the light and dog images, because when I took the light image in Japan, I thought it was too blury. For the dog images, it had a lot of fur, so sharpening the image will provide more contrast to these details. Interestingly, for the light image, because overall my photo quality was too blurry, it mainly captured the edge features in the middle. This location in Tokyo is where the light through the clouds was shining on. So, as alpha increased, this region got highlighted more and more. The dog's fur showed created contrast and sharpness, as expected. My favorite image was the dog's progression, and as I thought its fur was already pretty clear, I chose this image for the evaluation. Below are the results from blurring the dog image, then re-sharpening it:
This section of the project was to replicate the SIGGRAPH paper on hybrid images. The idea is when viewed from close and far distances, the image would look drastically different. This illusion is created by combining the low frequencies of one image with the high frequencies of another. Below are the results:
I could not find a way to make Babak and Bamman's image work, mainly because of Babak's facial features. While tuning the code, I could see Bamman's head structure when viewing the image from far away. However, Baback's eyebrows would always appear instead of Bamman's glasses. Also because Baback's image has darker colors, they tended to dominate the hybrid image. My favorite result was of the puppy looking either right or left. Below is the fourier frequency analysis of the images:
Bells and Whistles: I used color to enhance the effects of the hybrid images. It seems to be working much better for the low-frequency component. This is because the high-frequency components captures details such as edges, meaning that most pixels aren't captured. So, since the low pass filter lets much more pixels through, the majority of the color would be then from the low-frequency image.
This section of the project is setting up the stacks in order to replicate the oraple result. Below are the recreated results for the laplacian stack levels, which are very similar to the original paper's:
In this section, I added up the combined laplacian stack levels to create a blended image. Below are the original two images, the mask applied, and the resulting blend:
It was really fun to implement these filters and manipulate the frequency of the images. What I enjoyed most was making the oraple, because this combined the other techniques I learned about filters and low vs high frequencies to make a seamless image.