[AMBER] Questions about Principle Component Analysis from Li,Haoxi on 2021-04-15 (Amber Archive Apr 2021)

From: Li,Haoxi <hl2500.chem.ufl.edu>
Date: Thu, 15 Apr 2021 18:56:50 +0000

Dear Amber Users,

I’m recently doing principle component analysis. I’m relatively new to this. Can I ask some questions?

1. If I would like to compare different MD trajectories by using PCA, should I combine the trajectories and then perform the analysis, like what was done in the 2 DNA Amber tutorial? Or can I perform PCA on each trajectory individually and then compare the results? In the later case, would the randomness of the sign of eigenvector cause inconsistency?

2. If I have to combine different trajectories, how can I compare different proteins with different number of atoms.

3. The PCA Amber tutorial says that we will need at least as many input frames to calculate the coordinate covariance matrix as we have rows/columns (i.e. 3 * # selected atoms). Can I ask why the frames number should be as many as the atom coordinates? It is not very obvious to me how this would influence the calculation of the covariance matrix.

4. I’m a little confused about nmwizvecs calculated from diagmatrix command. Does it have an actual meaning? From the .nmd output file, it seems nmwizvecs are a number of vectors which point from the average coordinates to the coordinates + eigenvector, so they are just parts of the eigenvector, 3 in a group for each atom for 3D visualization purpose? Is there a better way to think of this? Why they are starting with the lowest frequency mode?

5. Is the unit of the projection on eigenvectors in Angstroms?

Thank you so much in advance!

Best wishes,
Haoxi

_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Thu Apr 15 2021 - 12:00:04 PDT