Exp 5 — Autoencoder on image dataset

Record-ready template Fill placeholders with your dataset, code, outputs, plots, and viva.
AIML355 • Fundamentals of Deep Learning Lab

EXP05 — Autoencoder on Image Dataset

Record-ready template Replace placeholders with your final work (code + outputs + screenshots).
Submission checklist
Aim ✓ • Environment ✓ • Dataset ✓ • Procedure ✓ • Code ✓ • Output ✓ • Discussion ✓ • Viva ✓

1) Aim

To apply Autoencoders on an image dataset for reconstruction/denoising.

Learning outcomes
  • Train a convolutional autoencoder for reconstruction.
  • Evaluate reconstruction quality (visual + loss).
  • Optionally perform denoising and compare outputs.

2) Requirements / Environment

Software
  • Python 3.10+ (recommended)
  • TensorFlow/Keras (or PyTorch where applicable)
  • NumPy, Pandas, Matplotlib
  • Jupyter/Colab optional
Hardware
  • CPU is OK for small runs; GPU optional
  • RAM: 4–8 GB+ recommended
Reproducibility
Record library versions and random seed in your final report.

3) Dataset

  • Source: Use a relevant dataset and cite the source in your report.
  • Features/Labels: [Describe X and y; mention classes if classification]
  • Split: [Train/Validation/Test or K-fold]
  • Preprocessing: [Scaling/Normalization, resizing, tokenization, etc.]

4) Procedure / Steps

  1. Load dataset and perform preprocessing.
  2. Define model architecture and justify key choices.
  3. Compile model (loss + optimizer + metrics).
  4. Train with validation and log curves.
  5. Evaluate on test set and compute required metrics.
  6. Summarize observations and limitations.
Model hint
Convolutional autoencoder; optionally add noise for denoising autoencoder.

5) Code (Skeleton)

Paste your complete runnable code below (or attach notebook link in the final submission).

import tensorflow as tf
from tensorflow import keras

# TODO: build convolutional autoencoder for images
inputs = keras.Input(shape=(IMG, IMG, 3))
# encoder
x = keras.layers.Rescaling(1./255)(inputs)
x = keras.layers.Conv2D(32, 3, activation='relu', padding='same')(x)
x = keras.layers.MaxPooling2D(2, padding='same')(x)
# decoder
x = keras.layers.Conv2DTranspose(32, 3, strides=2, activation='relu', padding='same')(x)
outputs = keras.layers.Conv2D(3, 3, activation='sigmoid', padding='same')(x)
ae = keras.Model(inputs, outputs)
ae.compile(optimizer='adam', loss='binary_crossentropy')

6) Results / Output

  • Metrics: [Write your final values: accuracy/F1 or MAE/MSE]
  • Plots: [Attach loss/metric curves; prediction vs actual plots if forecasting]
  • Screenshots: [Paste screenshots of outputs, confusion matrix, sample predictions]

7) Observations / Discussion

  • [Observation 1: what changed when you tuned epochs/batch size?]
  • [Observation 2: evidence of overfitting/underfitting?]
  • [Observation 3: what improved performance (augmentation, regularization, fine-tuning)?]

8) Conclusion

Write 3–6 lines summarizing what you implemented, key result, and what you learned.

9) Viva Questions

  1. What is denoising autoencoder?
  2. Why use Conv2DTranspose in the decoder?
  3. What is the difference between training and validation data?
  4. Explain overfitting and two ways to reduce it.
  5. Why do we normalize/scale inputs?
  6. What does batch size and epoch mean?
  7. How do you choose a loss function for a task?

10) Post-lab Assignment

  • Add noise and implement denoising autoencoder.
  • Compare MSE vs BCE reconstruction loss.
Tip: press Esc to close.