as a freshman, my first research-paper implementation (as part of the AI Club of IIT Madras) was Pix2Pix, a conditional GAN for image-to-image translation.
we read the paper Image-to-Image Translation with Conditional Adversarial Networks by Isola et al. and built and trained the full pipeline from scratch (a U-Net generator paired with a PatchGAN discriminator) reproducing the core results from the original work.
we then extended the model into a practical demo that converts real images into cartoon-style outputs through an interactive application.
learned: how to read and implement a research paper, PyTorch, Kaggle, streamlit (for app interface), and that nervous feeling when you watch your model train/learn :)
some screenshots from training (on Kaggle P100 GPU)
