Skip to content

Commit 744bb77

Browse files
SBM 2026 updates (#759)
* upd index, md and docking pages
1 parent 7da717c commit 744bb77

3 files changed

Lines changed: 28 additions & 44 deletions

File tree

education/molmod_online/docking.md

Lines changed: 3 additions & 28 deletions
Original file line numberDiff line numberDiff line change
@@ -204,39 +204,14 @@ solely on the evolutionary conservation analysis?
204204

205205
### Predicting interface residues
206206

207-
Besides sequence conservation, other features can be used to predict possible interfaces on protein
208-
structures. For example, certain residues tend to be overrepresented at protein-protein interfaces.
209-
This information, combined with evolutionary conservation and with a surface clustering algorithm
210-
that finds groups of surface residues meeting both the previous criteria results in reasonably
211-
accurate predictions. This is the basis of the
212-
[WHISCY](https://wenmr.science.uu.nl/whiscy/){:target="_blank"} server. A more advanced
213-
predictor, the [CPORT](https://alcazar.science.uu.nl/services/CPORT/){:target="_blank"} web server, judiciously
214-
combines (up to) 6 different predictors to provide a consensus prediction that is more robust and
215-
more reliable than any of the individual predictors alone. CPORT was designed to provide
216-
predictions for HADDOCK. The server also returns a PDB file of the
217-
original structure loaded with the predictions in the temperature factor column. This is extremely
218-
helpful to visualize the predictions in PyMOL.
219-
220-
<a class="prompt prompt-info">
221-
Submit the homology model of mouse MDM2 to the CPORT web server and load the resulting PDB file
222-
in Pymol.
223-
</a>
224-
<a class="prompt prompt-pymol">
225-
spectrum b, cyan_red, cport
226-
</a>
227-
<a class="prompt prompt-question">
228-
Do the predictions highlight a particular region of the homology model?
229-
</a>
230-
<a class="prompt prompt-info">
231-
Note down the list of residues predicted by CPORT to be part of an interface.
232-
</a>
207+
Besides sequence conservation, other features can be used to predict possible interfaces on protein structures. For example, certain residues tend to be overrepresented at protein-protein interfaces. This information, combined with evolutionary conservation and with a surface clustering algorithm that finds groups of surface residues meeting both the previous criteria results in reasonably accurate predictions. This is the basis of the [WHISCY](https://wenmr.science.uu.nl/whiscy/){:target="_blank"} server. A more advanced predictor, the [CPORT](https://alcazar.science.uu.nl/services/CPORT/){:target="_blank"} web server, judiciously combines (up to) 6 different predictors to provide a consensus prediction that is more robust and more reliable than any of the individual predictors alone.
233208

234-
Many tools in science are developed by dedicated PhD students and postdocs. Unfortunately, over time, some of these tools may become unavailable as maintaining and supporting them requires significant time and effort. In such cases, it may be necessary to transition to alternative tools.
209+
Many tools in science are developed by dedicated PhD students and postdocs. Unfortunately, over time, some of these tools may become unavailable as maintaining and supporting them requires significant time and effort. In such cases, it may be necessary to use alternative tools.
235210

236211
### Obtain known interfaces of homologous proteins
237212

238213
Another way to obtain information about possible interface residues is by analysing known interfaces found in **homologous** proteins.
239-
This can easily be performed by [ARCTIC-3D](https://wenmr.science.uu.nl/arctic3d/){:target="_blank"}, a [new tool](https://www.nature.com/articles/s42003-023-05718-w){:target="_blank"} dedicated to an automatic retrieval and clustering of interfaces in complexes from 3D structural information.
214+
This can easily be performed by [ARCTIC-3D](https://wenmr.science.uu.nl/arctic3d/){:target="_blank"}, a [tool](https://www.nature.com/articles/s42003-023-05718-w){:target="_blank"} dedicated to an automatic retrieval and clustering of interfaces in complexes from 3D structural information.
240215
As structural information of the human MDM2 interacting with other partners is available, ARCTIC-3D will extract interacting residues and cluster them into binding surfaces. Not all residues of a binding surface are relevant, as some amino acids may be rarely present among the interfaces that define that patch.
241216
Wisely define a probability threshold and note down the residue indices, as you will need them to define *active* residues in HADDOCK.
242217

education/molmod_online/index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ By the end of this tutorial, you should know the steps involved in:
5151
### Part 3: [Protein-peptide data-driven docking](/education/molmod_online/docking)
5252

5353
The third module introduces protein-peptide docking using the [HADDOCK2.4 web server](https://wenmr.science.uu.nl/haddock2.4/).
54-
It also introduces the CPORT web server for interface prediction, based on evolutionary conservation and other biophysical properties.
54+
It also introduces the ARCTIC3D web server for interface prediction, based on clustering of structural protein interface information.
5555
By the end of this tutorial, you should know how to:
5656

5757
* setup a HADDOCK run

education/molmod_online/simulation.md

Lines changed: 24 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -155,15 +155,19 @@ Take your time to know your system and what particularities its simulation entai
155155
To run the actual simulation, you will need access to a computing cluster. Running on your laptop is likely to take far too long. In our hands, the simulations of this system take ~2 full days on 18 CPU cores in our dedicated cluster.
156156
</a>
157157

158+
<a class="prompt prompt-attention">
159+
You may have noticed that NMRBox is in the process of migrating its virtual machines from Ubuntu 20 to Ubuntu 24. The “Selecting an initial structure” section of this course was developed with Ubuntu 20 in mind and is currently not functional under Ubuntu 24.
160+
However, Ubuntu 24 can be used for the remaining of this part of the course.
161+
</a>
158162

159163
In NMRBox, after you open the terminal prompt you notice `username@machine`, where your username is the same as the NMRbox username.
160-
You will find your own copy of the course material in `~/EVENTS/2025-struct-bioinfo-uu/` directory.
164+
You will find your own copy of the course material in `~/EVENTS/2026-struct-bioinfo-uu/` directory.
161165
You can store your data in your `home` directory but we recommend creating a new directory where you will store your data and work in.
162166

163167

164168
__Note__: The data are automatically copied to your home directory under the `EVENTS` directory provided you have registered for this event on NMRBox. The event can be found at [https://nmrbox.nmrhub.org/events](https://nmrbox.nmrhub.org/events){:target="_blank"}. In order to register for the course you need to have an NMRBox account.
165169

166-
__Note__: In case you are following this tutorial on your own, you will have to manually copy all the required data and edit possibly some files to correct the paths (e.g. the `setup.sh` and the `bashrc` scripts). The data for the course can be found once logged in into a VM in the following directory: `/public/EVENTS/2025-struct-bioinfo-uu/`.This directory will however automatically be copied to your home directory when you register for the course on NMRBox
170+
__Note__: In case you are following this tutorial on your own, you will have to manually copy all the required data and edit possibly some files to correct the paths (e.g. the `setup.sh` and the `bashrc` scripts). The data for the course can be found once logged in into a VM in the following directory: `/public/EVENTS/2026-struct-bioinfo-uu/`.This directory will however automatically be copied to your home directory when you register for the course on NMRBox
167171

168172
Open the terminal and create a directory where you will work in with name of your choice:
169173
<a class="prompt prompt-cmd">
@@ -351,25 +355,29 @@ these atoms when reading the structure and (re)generate their coordinates using
351355
parameters defined in the force field. Also, the program allows the user to define the status of
352356
the termini of the molecule through the `-ter` flag. Termini can be either charged (e.g.
353357
NH<sub>3</sub><sup>+</sup> and COO<sup>-</sup>), uncharged (e.g. NH<sub>2</sub> and COOH), or
354-
capped by an additional chemical group (e.g. N-terminal acetyl and C-terminal amide). This is very
355-
important since leaving the termini charged (default) can lead to artificial charge-charge
356-
interactions, particular in small molecules. If a peptide is part of a larger structure, then it
357-
makes sense to cap the termini in order to neutralize their charge, as it would happen in reality.
358+
capped by an additional chemical group (e.g. N-terminal acetyl and C-terminal amide).
359+
360+
<a class="prompt prompt-attention">
361+
This is very important since leaving the termini charged (default) can lead to artificial charge-charge
362+
interactions, particular in small-sized molecules.
363+
</a>
364+
365+
If a peptide is part of a larger structure, then it makes sense to cap the termini in order to neutralize their charge,
366+
as it would happen in reality.
358367
Terminal capping should be performed prior to topology generation using the `pdb_cap.py` script.
359368
This script replaces the first residue with an ACE cap and the last residue with an NME cap
360369
by modifying atom and residue names in the PDB file, making them compatible with the CHARMM36m force field.
361-
For capping to work correctly, the input structure must include one additional residue at both the N- and C-termini
370+
For capping to work correctly, **the input structure must include one additional residue** at both the N- and C-termini
362371
(i.e. residues *−1* and *N+1* relative to the peptide of interest).
363372
These residues act as placeholders and will be converted into caps. In practice, we add two glycine residues,
364373
one at each end of the peptide sequence, before capping.
365-
Capping is performed with:
366-
374+
Capping is performed with a python script `$MOLMOD_BIN/pdb_cap.py`, read it's help message to learn how to use it:
367375
<a class="prompt prompt-cmd">
368-
python3.10 $MOLMOD_BIN/pdb_cap.py --pdb peptide_helix.pdb --cap
376+
python3.10 $MOLMOD_BIN/pdb_cap.py -h
369377
</a>
370378

371-
The script produces a new file named peptide_helix_capped.pdb, which should then be used as input for pdb2gmx.
372-
Once capped, pdb2gmx will recognize the ACE and NME residues automatically when using the CHARMM36m force field.
379+
The script will produce a new file, which should then be used as input for pdb2gmx.
380+
Once capped, `pdb2gmx` will recognize the ACE and NME residues automatically when using the CHARMM36m force field.
373381
Read through the output of `pdb2gmx` and check the choices the program made for histidine
374382
protonation states and the resulting charge of the peptide.
375383

@@ -420,7 +428,8 @@ Protein 3
420428
</pre>
421429

422430
<a class="prompt prompt-info">
423-
Look at the partial charge that each atom carries (column 7) and note the differences between different types of atom.
431+
Look at the partial charge that each atom carries (column 7) and note the differences between different types of atom.
432+
Note that displayed file was generated using the default settings.
424433
</a>
425434

426435
<a class="prompt prompt-question">
@@ -717,8 +726,8 @@ gmx mdrun -v -deffnm peptide-EM-solvated
717726
Despite dissipating most of the strain in the system, energy minimization does not consider
718727
temperature, and therefore velocities and kinetic energy. When first running molecular dynamics,
719728
the algorithm assigns velocities to the atoms, which again stresses the system and might cause the
720-
simulation to become unstable. To avoid possible instabilities, the preparation setup here
721-
described includes several stages of molecular dynamics that progressively remove constraints on
729+
simulation to become unstable. To avoid possible instabilities, the preparation setup described
730+
here includes several stages of molecular dynamics that progressively remove constraints on
722731
the system and as such, let it slowly adapt to the conditions in which the production simulation
723732
will run.
724733

0 commit comments

Comments
 (0)