|
| 1 | +--- |
| 2 | +title: "Few-Shot Learning for Rooftop Detection in Satellite Imagery" |
| 3 | +subtitle: "Deep Learning Tutorial" |
| 4 | +author: "Giorgio Coppala, Nadine Daum, Elena Dreyer, Nico Reichardt" |
| 5 | +bibliography: refs.bib |
| 6 | + |
| 7 | + |
| 8 | +resources: |
| 9 | + - img/** |
| 10 | + |
| 11 | +format: |
| 12 | + revealjs: |
| 13 | + theme: dimmery.scss |
| 14 | + slide-number: true |
| 15 | + default-image-width: 70% |
| 16 | + preview-links: auto |
| 17 | + logo: "" |
| 18 | + footer: "" |
| 19 | + transition: slide |
| 20 | + background-transition: fade |
| 21 | + self-contained: true |
| 22 | + html-math-method: |
| 23 | + method: mathjax |
| 24 | + url: https://cdn.jsdelivr.net/npm/mathjax@4/tex-mml-chtml.js |
| 25 | + include-in-header: include.html |
| 26 | + resources: |
| 27 | + - img/** |
| 28 | +--- |
| 29 | + |
| 30 | + |
| 31 | +## Policy Relevance |
| 32 | + |
| 33 | +- Many public auhorities face the problem of **limited labeled data** |
| 34 | + (annotation is expensive, slow, or requires domain expertise) |
| 35 | + |
| 36 | +- **Applications:** |
| 37 | + - medical sector: **rare disease detection** |
| 38 | + - emergency management: **flood extent mapping** |
| 39 | + - climate & energy: **solar PV rooftop assessment** |
| 40 | + - urban planning: **building footprints & infrastructure mapping** |
| 41 | + |
| 42 | +- **Few-shot learning (FSL)** can help: |
| 43 | + - Learns to **generalize** from *1–5 labeled support examples per class* |
| 44 | + - (in our case) learns a **feature embedding** and constructs **class prototypes** |
| 45 | + - Enables segmentation in a **new city** with *minimal additional annotation* |
| 46 | + |
| 47 | + |
| 48 | + |
| 49 | +## Problem Setting |
| 50 | + |
| 51 | +::: {.columns} |
| 52 | + |
| 53 | +::: {.column width="55%"} |
| 54 | + |
| 55 | +- Goal of the tutorial: apply **Prototypical Networks** to |
| 56 | +rooftop segmentation using only a few labeled tiles |
| 57 | + |
| 58 | +- **Few-shot segmentation** allows the model to learn characteristic |
| 59 | +rooftop shapes and textures from a small Geneva subset |
| 60 | + |
| 61 | +- Demonstrates how rooftop maps can be produced for solar potential estimation in a **new geographic setting** with limited labels |
| 62 | + |
| 63 | +::: |
| 64 | + |
| 65 | +::: {.column width="45%"} |
| 66 | +{width="100%" style="margin-top: 1rem;"} |
| 67 | + |
| 68 | +<div style="font-size: 0.75rem; color:#666; text-align:center; margin-top:0.2rem;"> |
| 69 | +Demonstration use case (self-made visualization) |
| 70 | +</div> |
| 71 | +::: |
| 72 | + |
| 73 | +::: |
| 74 | + |
| 75 | + |
| 76 | +## Dataset: [Roofs of Geneva](https://huggingface.co/datasets/raphaelattias/overfitteam-geneva-satellite-images) |
| 77 | + |
| 78 | +- **Size**: 1,050 labeled image-mask pairs |
| 79 | + |
| 80 | +- **Task**: Binary segmentation masks (rooftop vs background) |
| 81 | + |
| 82 | +- **Geographic splits**: 3 grids/ neighborhoods (North, Center, South) |
| 83 | + |
| 84 | +- **Image size**: 250x250 pixels |
| 85 | + |
| 86 | +- **Categories**: Industrial, Residential |
| 87 | + |
| 88 | + |
| 89 | +## Inside the dataset |
| 90 | + |
| 91 | +<div style="text-align:center;"> |
| 92 | +{width="50%"} |
| 93 | +</div> |
| 94 | + |
| 95 | +<div style="font-size:0.75rem; text-align:center; color:#666; margin-top:0.5rem;"> |
| 96 | +Geneva Animation: raw image → overlay rooftop → binary mask |
| 97 | +</div> |
| 98 | + |
| 99 | + |
| 100 | +## Discussion |
| 101 | + |
| 102 | +**Room for improvement:** |
| 103 | + |
| 104 | +- Fine-tune / tweak model parameters |
| 105 | + - Add regularization |
| 106 | + - Increase number of epochs |
| 107 | + |
| 108 | +- Implement rough approximation of solar potential |
| 109 | + - e.g. based on IoU over roof area |
| 110 | + |
| 111 | + |
| 112 | +**Open for discussion:** |
| 113 | + |
| 114 | +- Try a different encoder ? |
| 115 | + - e.g. ResNet-50 |
| 116 | + |
| 117 | +- Change train / test split strategy ? |
| 118 | + - e.g. random shuffle regardless of geographic regions |
| 119 | + |
| 120 | + |
| 121 | + |
| 122 | +<div style="text-align:center; margin-top:3.5em; font-size:1.1em;"> |
| 123 | + <a href="https://github.com/hertie-data-science-lab/tutorial-new-tutorial-group-1/tree/main" |
| 124 | + target="_blank" |
| 125 | + style="text-decoration:none;"> |
| 126 | + GitHub Repo |
| 127 | + </a> |
| 128 | +</div> |
| 129 | + |
| 130 | + |
| 131 | + |
| 132 | +## References |
| 133 | + |
| 134 | +::: {.refs-super-small} |
| 135 | + |
| 136 | +- **Alsentzer, E., Li, M. M., Kobren, S. N., Noori, A., Undiagnosed Diseases Network, Kohane, I. S., & Zitnik, M.** (2025). Few shot learning for phenotype-driven diagnosis of patients with rare genetic diseases. *npj Digital Medicine, 8*(1), 380. https://doi.org/10.1038/s41746-025-01749-1 |
| 137 | + |
| 138 | +- **Castello, R., Walch, A., Attias, R., Cadei, R., Jiang, S., & Scartezzini, J.-L.** (2021). Quantification of the suitable rooftop area for solar panel installation from overhead imagery using convolutional neural networks. *Journal of Physics: Conference Series, 2042*(1), 012002. https://doi.org/10.1088/1742-6596/2042/1/012002 |
| 139 | + |
| 140 | +- **Chen, Y., Wei, C., Wang, D., Ji, C., & Li, B.** (2022). Semi-supervised contrastive learning for few-shot segmentation of remote sensing images. *Remote Sensing, 14*(17), 4254. https://doi.org/10.3390/rs14174254 |
| 141 | + |
| 142 | +- **Ding, H., Zhang, H., & Jiang, X.** (2022). Self-regularized prototypical network for few-shot semantic segmentation. *Pattern Recognition, 132*, 109018. https://doi.org/10.1016/j.patcog.2022.109018 |
| 143 | + |
| 144 | +- **Finn, C., Abbeel, P., & Levine, S.** (2017). Model-agnostic meta-learning for fast adaptation of deep networks. In *International Conference on Machine Learning* (pp. 1126–1135). https://doi.org/10.48550/arXiv.1703.03400 |
| 145 | + |
| 146 | +- **Ge, Z., Fan, X., Zhang, J., & Jin, S.** (2025). SegPPD-FS: Segmenting plant pests and diseases in the wild using few-shot learning. *Plant Phenomics*, 100121. https://doi.org/10.1016/j.plaphe.2025.100121 |
| 147 | + |
| 148 | +- **Hu, Y., Liu, C., Li, Z., Xu, J., Han, Z., & Guo, J.** (2022). Few-shot building footprint shape classification with relation network. *ISPRS International Journal of Geo-Information, 11*(5), 311. https://doi.org/10.3390/ijgi11050311 |
| 149 | + |
| 150 | +- **Jadon, S.** (2021). COVID-19 detection from scarce chest X-ray image data using few-shot deep learning. In *Medical Imaging 2021* (pp. 161–170). https://doi.org/10.1117/12.2581496 |
| 151 | + |
| 152 | +- **Lee, G. Y., Dam, T., Ferdaus, M. M., Poenar, D. P., & Duong, V.** (2025). Enhancing Few-Shot Classification of Benchmark and Disaster Imagery with ATTBHFA-Net. *arXiv preprint* arXiv:2510.18326. https://doi.org/10.48550/arXiv.2510.18326 |
| 153 | + |
| 154 | +- **Li, X., He, Z., Zhang, L., Guo, S., Hu, B., & Guo, K.** (2025). CDCNet: Cross-domain few-shot learning with adaptive representation enhancement. *Pattern Recognition, 162*, 111382. https://doi.org/10.1016/j.patcog.2025.111382 |
| 155 | + |
| 156 | +- **Puthumanaillam, G., & Verma, U.** (2023). Texture based prototypical network for few-shot semantic segmentation of forest cover: Generalizing for different geographical regions. *Neurocomputing, 538*, 126201. https://doi.org/10.1016/j.neucom.2023.03.062 |
| 157 | + |
| 158 | +- **Snell, J., Swersky, K., & Zemel, R.** (2017). Prototypical networks for few-shot learning. *Advances in Neural Information Processing Systems, 30*. https://doi.org/10.48550/arXiv.1703.05175 |
| 159 | + |
| 160 | +- **Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P. H., & Hospedales, T. M.** (2018). Learning to compare: Relation network for few-shot learning. In *CVPR* (pp. 1199–1208). https://doi.org/10.1109/CVPR.2018.00131 |
| 161 | +::: |
| 162 | + |
| 163 | + |
0 commit comments