1
RNA structures (DSSR) / discarded nucleotides in json output
« on: March 04, 2018, 08:36:01 am »
Hi,
First, many thanks for your great software suite. It is a real pleasure to work with such complete and convenient outputs for my parsing purposes. I am a strong believer in collaboration in methods development trough connectible building blocs, and it is nice to have others with such view in the RNA-modeling field
I am building specific structural fragment libraries for RNA docking, and I try to parse dssr json outputs to automatically keep track of my fragments characteristics (2D structure, interactions ...). For this, I need to now which nucleotides were discarded because of e.g. weird geometry. As I see it, a "&" is inserted in the "bseq" entry for chain breaks. But I can't find any indication of if a nucleotide was discarded or the break was already present in the input pdb, and retrieving it by comparing bseq to the input sequence can be ambiguous.
I am considering using the "Summary of structural features of xx nucleotides" in the text output, by running dssr w. and wo. the "--json" option. Is there a dssp option I could use to get the info directly from the json output and avoid double work (as I am analysing thousands of pdb files)?
Thanks in advance for your help,
Isaure C. de Beauchene
PS: I'm using version v1.7.2-2017nov20
First, many thanks for your great software suite. It is a real pleasure to work with such complete and convenient outputs for my parsing purposes. I am a strong believer in collaboration in methods development trough connectible building blocs, and it is nice to have others with such view in the RNA-modeling field
I am building specific structural fragment libraries for RNA docking, and I try to parse dssr json outputs to automatically keep track of my fragments characteristics (2D structure, interactions ...). For this, I need to now which nucleotides were discarded because of e.g. weird geometry. As I see it, a "&" is inserted in the "bseq" entry for chain breaks. But I can't find any indication of if a nucleotide was discarded or the break was already present in the input pdb, and retrieving it by comparing bseq to the input sequence can be ambiguous.
I am considering using the "Summary of structural features of xx nucleotides" in the text output, by running dssr w. and wo. the "--json" option. Is there a dssp option I could use to get the info directly from the json output and avoid double work (as I am analysing thousands of pdb files)?
Thanks in advance for your help,
Isaure C. de Beauchene
PS: I'm using version v1.7.2-2017nov20