Skip to content
Snippets Groups Projects
Commit 101b0023 authored by Juan Cortes's avatar Juan Cortes
Browse files

Update file README.md

parent 312083e5
No related branches found
No related tags found
No related merge requests found
......@@ -42,7 +42,7 @@ The installation procedure works correctly with recent versions of Linux (Ubuntu
## DEMO (data example)
To test WARIO, we provide a [MD simulation](https://gitlab.laas.fr/moma/methods/analysis/WARIO/-/tree/main/wario/P113) of the peptide P113 using the CHARMM36IDPFF force-field. Data have been extracted from [Jephthah _et al_ (2021)](https://doi.org/10.1021/acs.jctc.1c00408), where details on the simulation and the force-field can be found. The P113 ensemble contains $n=100001$ conformations for a sequence of length $L=12$. To run WARIO on this data, the user can uncomment the lines
To test WARIO, we provide a dataset corresponding to the conformational ensemble of a small peptide, P-113, obtained from [MD simulations](https://gitlab.laas.fr/moma/methods/analysis/WARIO/-/tree/main/wario/P113). Data was extracted from [(Jephthah _et al_, 2021)](https://doi.org/10.1021/acs.jctc.1c00408), where details on the simulation and the force-field can be found. The P-113 ensemble contains $n=100001$ conformations for a sequence of length $L=12$. To run WARIO on this data, the user can uncomment the lines
```
#ensemble_folder = "/".join([path_to_notebook,'P113']) # Path to the folder containing trajectory files
......@@ -50,7 +50,7 @@ To test WARIO, we provide a [MD simulation](https://gitlab.laas.fr/moma/methods/
```
in the second code cell of the [contact_clustering](https://gitlab.laas.fr/moma/methods/analysis/WARIO/-/blob/main/wario/contact_clustering.ipynb) notebook, and proceed following the usual guidelines. Then, results will be automatically saved in the [P113 folder](https://gitlab.laas.fr/moma/methods/analysis/WARIO/-/tree/main/wario/P113). The typical running time for this data example is around 30 minutes in a normal desktop computer using 5 threads.
By setting the clustering precision at the default level (i.e. minimum cluster size = 1% of the total number of conformations), WARIO partitions the P113 ensemble into 12 clusters and classifies the 93.04% of conformations. The ensemble is characterized by the following family of weighted cluster-specific contact maps:
By setting the clustering precision at the default level (i.e. minimum cluster size = 1% of the total number of conformations), WARIO partitions the P-113 ensemble into 12 clusters and classifies the 93.04% of conformations. The ensemble is characterized by the following family of weighted cluster-specific contact maps:
<br />
<br />
......@@ -73,8 +73,8 @@ By setting the clustering precision at the default level (i.e. minimum cluster s
<br />
<br />
Each cluster can be complementary described by the average DSSP propensities of its conformations. The corresponding plots for P113 are available [here](https://gitlab.laas.fr/moma/methods/analysis/WARIO/-/tree/main/wario/demo).
Structural properties such as the average DSSP propensities of each cluster can be then computed. The corresponding plots for P-113 are available [here](https://gitlab.laas.fr/moma/methods/analysis/WARIO/-/tree/main/wario/demo).
## Large ensembles
The current implementation of WARIO can be run in a normal desktop computer for ensembles with $nL\lessapprox 3\cdot 10^7$. Limitations are mainly due to memory constraints. We are currently working on a more efficient implementation for large ensembles, that can be run in remote servers for larger ensembles. Until it is released, we can [provide guidance](mailto:javier.gonzalezdelgado@mcgill.ca) on how to run the current code in limiting settings $nL\approx 3-4\cdot 10^7$ more efficiently.
The current implementation of WARIO can be run in a normal desktop computer for ensembles with $nL\lessapprox 3\cdot 10^7$. Limitations are mainly due to memory constraints. We are currently working on a more efficient implementation for large ensembles, which can be run in remote servers. Until it will be released, we can [provide guidance](mailto:javier.gonzalezdelgado@mcgill.ca) on how to run the current code in limiting settings $nL\approx 3-4\cdot 10^7$ more efficiently.
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment