@@ -74,32 +74,68 @@ format:
7474
7575## Few Shot Learning in General
7676
77- tbd
77+ ### Few-Shot Learning (FSL)
78+ - Learning new ** tasks, labels, or segmentations** from very few labeled examples
79+ * (N-way, K-shot)*
80+ - ** Motivation** :
81+ - Data scarcity
82+ - Expensive and time-consuming annotation
7883
84+ ### Few-Shot Semantic Segmentation (FSSS)
85+ - ** Goal** : Segment novel object classes using only a few annotated examples
86+ - Assigning a class label to ** every pixel**
87+
88+
89+ ---
90+
91+ ### Prototypical Networks (ProtNets)
92+ - Learn a shared ** embedding space**
93+ - Pixels belonging to the same class are ** close in feature space**
94+ - Class representations are formed as ** prototypes**
95+ - Training follows an ** episodic framework**
96+ - Each episode consists of:
97+ - ** Support set** :
98+ - Few images with ** pixel-level masks**
99+ - Defines the target classes
100+ - ** Query image** :
101+ - Image where the model must segment the target classes
79102
80103
81- ## Prototypical Network
82104
83- ![ ] ( figures/illustration_prototypical_network.png ) {width=100%}
84105
85- (modified figure from paper) [ SRPNet] ( https://arxiv.org/abs/2210.16829 )
86106
87- - high-level schematic (support → prototype → similarity → segmentation)
88107
89- - 1-way-1 shot --> explain what it means
90108
91- - Data Preprocessing (e.g. Augementation, Geographic Splits)
109+ ## Prototypical Network Overview
92110
93- - Model Architecture (feature Exctraction, CNN --> Number of Layers, Backbone)
111+ ### Workflow
112+ - Support Image → Prototype → Similarity → Query Segmentation
94113
95- - Training strategy
96114
97- - Loss function
115+ ### Feature Extraction
116+ - ** Backbone:** ResNet-18 CNN, pretrained on ImageNet
117+ - ** Projection:** feature maps → embedding dimension (256 channels)
98118
99- - Evaluation metrics
100119
120+ ### Evaluation Metric
121+ $$
122+ \mathrm{IoU} = \frac{|A \cap B|}{|A \cup B|}
123+ $$
101124
102125
126+ ---
127+
128+ ## Prototypical Network Overview
129+
130+ ::: {#fig-protnet fig-align="center"}
131+ ![ ] ( figures/illustration_prototypical_network.png ) {width=100%}
132+
133+ * (Modified figure from* [ SRPNet] ( https://arxiv.org/abs/2210.16829 ) * )*
134+ :::
135+
136+
137+ ---
138+
103139## (Preliminary) Results
104140
105141- Show performance for 1-shot / 5-shot / full-data comparison
@@ -110,23 +146,27 @@ tbd
110146
111147## Wrap-Up/ Discussion
112148
113- [ GitHub Repo] ( https://github.com/hertie-data-science-lab/tutorial-new-tutorial-group-1/tree/main )
149+ ** What we still want to work on** :
150+
151+ - Testing different kind of pretrained models as our encoder (ResNet-50, ResNet pretrained on satellite images)
114152
153+ - Play around with different distance metrics (Cosine Similarity and Fidelity)
115154
116- ** What we still need to finalize ** :
155+ - Evaluate different K values and see how they perform
117156
118- - insert bullet point here
119157
120- - insert bullet point here
158+ ** Discussion and Key Takeaways **
121159
160+ - Strong 1-shot performance: even with minimal labeled data, results were impressive
122161
123- ** Questions to discuss in class/ lynn **
162+ - Dataset brought its own challenges (designed primarily for PV assessment, not general segmentation)
124163
125- - strengths
164+ - Limited scope: focused only on roofs in Geneva → raises questions of generalizability
126165
127- - weaknesses
166+ - More diverse data or complex models could improve performance
128167
129- - failure cases (shadows, tiny rooftops)
168+
169+ [ GitHub Repo] ( https://github.com/hertie-data-science-lab/tutorial-new-tutorial-group-1/tree/main )
130170
131171
132172
0 commit comments