Version Checks (indicate both or one)
Issue Description
Re-generating the collection with the extend_by_vres option produces a very high number of installed powerplants, particularly for PV and Wind.
The image below uses the default configuration (Europe). Wind is quite extreme.

For PV, I suspect the cause might be due mismatches in the coding of some technologies. Notice PV, Pv, Photovoltaics and Photovoltaics ground.

Reproducible Example
# Took 1h on my computer
import powerplantmatching as ppm
collection = ppm.collection.powerplants(
update=True,
extend_by_vres=True,
fill_geopositions=True
)
stats = ppm.data.Capacity_stats()
ppm.plot.fueltype_totals_bar([collection, stats])
Expected Behavior
Two suggestions:
- A schema should be used to validate that the processed datasets conform to a given shape before they are passed to the matching algorithm. I suggest pandera.
- Consider adding some logic that 'trims' excessive capacities (when compared to annual country totals), as an optional callable function for further post-processing.
Installed Versions
Details
0.7.1
Version Checks (indicate both or one)
I have confirmed this bug exists on the lastest release of powerplantmatching.
I have confirmed this bug exists on the current
masterbranch of powerplantmatching.Issue Description
Re-generating the collection with the
extend_by_vresoption produces a very high number of installed powerplants, particularly for PV and Wind.The image below uses the default configuration (Europe). Wind is quite extreme.
For PV, I suspect the cause might be due mismatches in the coding of some technologies. Notice
PV,Pv,PhotovoltaicsandPhotovoltaics ground.Reproducible Example
Expected Behavior
Two suggestions:
Installed Versions
Details
0.7.1