update instructions for exercise 2 and 3

CarlaMue · CarlaMue · commit 1c258b4a9122 · 2026-02-09T16:04:44.000+01:00
diff --git a/README.md b/README.md
@@ -29,9 +29,9 @@ Reuse your MNIST digit recognition code. Implement IG as discussed in the lectur
 \text{IntegratedGrads}_i(x) = (x_i - x_i') \cdot \frac{1}{m} \sum_{k=1}^m \frac{\partial F (x' + \frac{k}{m} \cdot (x - x'))}{\partial x_i}.
 ```
 
-F partial xi denotes the gradients with respect to the input color-channels i.
-x prime denotes a baseline black image. And x symbolizes an input we are interested in.
-Finally, m denotes the number of summation steps from the black baseline image to the interesting input.
+$\frac{\partial F}{\partial x_i}$ denotes the gradients with respect to the input color-channels $i$.
+$x'$ denotes a baseline black image. And $x$ symbolizes an input we are interested in.
+Finally, $m$ denotes the number of summation steps from the black baseline image to the interesting input.
 
 Follow the todos in `./src/mnist_integrated.py` and then run `scripts/integrated_gradients.slurm`.
 
@@ -66,19 +66,16 @@ The desired outcome is to have a folder called `ffhq_style_gan` in the project d
 The `load_folder` function from the `util` module loads both real and fake data.
 Code to load the data is already present in the `deepfake_interpretation.py` file.
 
-Compute log-scaled frequency domain representations of samples from both sources via
+1. Implement the `transform` function to compute log-scaled frequency domain representations of samples from both sources via
 
-``` math
-\mathbf{F}_I =  \log_e (| \mathcal{F}_{2d}(\mathbf(I)) | + \epsilon ), \text{ with } \mathbf{I} \in \mathbb{R}^{h,w,c}, \epsilon \approx 0 .
-```
-
-Above `h`, `w` and `c` denote image height, width and columns. `Log` denotes the natural logarithm, and bars denote the absolute value. A small epsilon is added for numerical stability.
-
-Use the numpy functions `np.log`, `np.abs`, `np.fft.fft2`. By default, `fft2` transforms the last two axes. The last axis contains the color channels in this case. We are looking to transform the rows and columns.
+   ``` math
+   \mathbf{F}_I =  \log_e (| \mathcal{F}_{2d}(\mathbf(I)) | + \epsilon ), \text{ with } \mathbf{I} \in \mathbb{R}^{h,w,c}, \epsilon \approx 0 .
+   ```
 
-Plot mean spectra for real and fake images as well as their difference over the entire validation or test sets. For that complete the TODOs in `src/deepfake_interpretation.py` and run the script `scripts/train.slurm`.
+   Above `h`, `w` and `c` denote image height, width and columns. `Log` denotes the natural logarithm, and bars denote the absolute value. A small epsilon is added for numerical stability.
 
+   Use the numpy functions `np.log`, `np.abs`, `np.fft.fft2`. By default, `fft2` transforms the last two axes. The last axis contains the color channels in this case. We are looking to transform the rows and columns.
 
-## 3.3 Training and interpreting a linear classifier
-Train a linear classifier consisting of a single `nn.Linear`-layer on the log-scaled Fourier coefficients using Torch. Plot the result. What do you see?
+2. Plot mean spectra for real and fake images as well as their difference over the entire validation or test sets. For that run the script `scripts/train.slurm`.
 
+3. `scripts/train.slurm` also trains a linear classifier (consisting of a single `nn.Linear`-layer) to distinguish real from fake images on the log-scaled Fourier coefficients. We want to visualize the weights of the trained classifier. For that go to `src/deepfake_interpretation.py` and implement the TODO at the end of the file. What do you see?
diff --git a/src/deepfake_interpretation.py b/src/deepfake_interpretation.py
@@ -87,7 +87,7 @@ def eval_step(net, loss, img, labels):
 
 def transform(image_data):
     """Transform image data."""
-    # TODO: Implement the function given in the readme
+    # 3.2.1 TODO: Implement the function given in the readme
     return np.zeros_like(image_data)
 
 
@@ -249,7 +249,7 @@ def transform(image_data):
         plt.colorbar()
         plt.savefig("mean_freq_difference.jpg")
 
-        # TODO: Visualize the weight array `net.dense.weight`.
+        # 3.2.3 TODO: Visualize the weight array `net.dense.weight`.
         # By reshaping and plotting the weight matrix.
 
     if type(net) is CNN:
diff --git a/src/mnist_integrated.py b/src/mnist_integrated.py
@@ -115,20 +115,20 @@ def integrate_gradients(net, test_images, output_digit, steps_m=300):
     g_list = []
     for test_image_x in tqdm(test_images, desc="Integrating Gradients"):
 
-        # TODO: create a list for the gradients.
+        # list for the gradients
         step_g_list = []
-        
+
         # TODO: create a black reference image via `zeros_like`` .
-        
+
         # TODO: Loop over the integration steps.
         for current_step_k in range(steps_m):
             pass
             # TODO: compute the input to F from equation 5 in the slides.
-        
+
             # TODO: define a forward pass for torch.func.grad
-        
+
             # TODO: use torch.grad to find the gradient with repsect to the input image.
-            
+
             # TODO: append the gradient to your list
 
         # TODO: Return the sum of the of the list elements.