1201
General discussions (Q&As) / Re: different geometries
« on: November 06, 2012, 10:18:50 am »
Could you be more specific by providing an example to put your question in context?
Xiang-Jun
Xiang-Jun
This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.
Netiquette · Download · News · Gallery · G-quadruplexes · DSSR-Jmol · DSSR-PyMOL · Video Overview · DSSR v2.6.0 (DSSR Manual) · Homepage
REMARK 3DNA by Dr. Xiang-Jun Lu [2012-10-26] (xiangjun@x3dna.org)
ATOM 1 C1' 5CM A 1 -2.477 5.402 0.000 1.00 0.00 C
ATOM 2 N1 5CM A 1 -1.285 4.542 0.000 1.00 0.00 N
ATOM 3 C2 5CM A 1 -1.472 3.158 0.000 1.00 0.00 C
ATOM 4 O2 5CM A 1 -2.628 2.709 0.001 1.00 0.00 O
ATOM 5 N3 5CM A 1 -0.391 2.344 0.000 1.00 0.00 N
ATOM 6 C4 5CM A 1 0.837 2.868 0.000 1.00 0.00 C
ATOM 7 N4 5CM A 1 1.875 2.027 0.001 1.00 0.00 N
ATOM 8 C5 5CM A 1 1.056 4.275 0.000 1.00 0.00 C
ATOM 9 C5A 5CM A 1 2.466 4.961 0.001 1.00 0.00 C
ATOM 10 C6 5CM A 1 -0.023 5.068 0.000 1.00 0.00 C
END
mutate_bases 'chain=A snum=1 m=5CM; chain=B snum=23 m=5CM' 355d.pdb 355d_AC1BC23_5CM.pdb
mutate_bases 'chain=A snum=2 name=C m=5CM' 6tna.pdb 6tna_C2_5CM.pdb
The original atomic coordinates of a cytosine from PDB entry 6tna:
--------------------------------------------------------------------------------
ATOM 25 P C A 2 31.659 20.469 70.978 1.00 10.00 P
ATOM 26 OP1 C A 2 32.973 21.044 71.364 1.00 10.00 O
ATOM 27 OP2 C A 2 30.973 21.143 69.849 1.00 10.00 O
ATOM 28 O5' C A 2 31.815 18.912 70.652 1.00 10.00 O
ATOM 29 C5' C A 2 30.629 18.184 70.293 1.00 10.00 C
ATOM 30 C4' C A 2 30.507 16.914 71.139 1.00 10.00 C
ATOM 31 O4' C A 2 29.293 17.051 71.947 1.00 10.00 O
ATOM 32 C3' C A 2 30.455 15.607 70.367 1.00 10.00 C
ATOM 33 O3' C A 2 31.724 14.971 70.316 1.00 10.00 O
ATOM 34 C2' C A 2 29.411 14.815 71.146 1.00 10.00 C
ATOM 35 O2' C A 2 29.987 14.227 72.301 1.00 10.00 O
ATOM 36 C1' C A 2 28.473 15.927 71.630 1.00 10.00 C
ATOM 37 N1 C A 2 27.474 16.346 70.621 1.00 10.00 N
ATOM 38 C2 C A 2 26.658 15.368 70.068 1.00 10.00 C
ATOM 39 O2 C A 2 26.802 14.198 70.441 1.00 10.00 O
ATOM 40 N3 C A 2 25.726 15.730 69.143 1.00 10.00 N
ATOM 41 C4 C A 2 25.601 17.008 68.767 1.00 10.00 C
ATOM 42 N4 C A 2 24.682 17.314 67.872 1.00 10.00 N
ATOM 43 C5 C A 2 26.436 18.041 69.324 1.00 10.00 C
ATOM 44 C6 C A 2 27.351 17.658 70.243 1.00 10.00 C
--------------------------------------------------------------------------------
The coordinates of the mutant 5-methylcytosine generated by 'mutate_bases'
REMARK Mutation#1 A:...2@:[..C] to [5CM]
ATOM 25 P 5CM A 2 31.659 20.469 70.978 1.00 10.00 P
ATOM 26 OP1 5CM A 2 32.973 21.044 71.364 1.00 10.00 O
ATOM 27 OP2 5CM A 2 30.973 21.143 69.849 1.00 10.00 O
ATOM 28 O5' 5CM A 2 31.815 18.912 70.652 1.00 10.00 O
ATOM 29 C5' 5CM A 2 30.629 18.184 70.293 1.00 10.00 C
ATOM 30 C4' 5CM A 2 30.507 16.914 71.139 1.00 10.00 C
ATOM 31 O4' 5CM A 2 29.293 17.051 71.947 1.00 10.00 O
ATOM 32 C3' 5CM A 2 30.455 15.607 70.367 1.00 10.00 C
ATOM 33 O3' 5CM A 2 31.724 14.971 70.316 1.00 10.00 O
ATOM 34 C2' 5CM A 2 29.411 14.815 71.146 1.00 10.00 C
ATOM 35 O2' 5CM A 2 29.987 14.227 72.301 1.00 10.00 O
ATOM 36 C1' 5CM A 2 28.473 15.927 71.630 1.00 10.00 C
ATOM 37 N1 5CM A 2 27.475 16.353 70.620 1.00 1.00 N
ATOM 38 C2 5CM A 2 26.651 15.372 70.062 1.00 1.00 C
ATOM 39 O2 5CM A 2 26.789 14.195 70.427 1.00 1.00 O
ATOM 40 N3 5CM A 2 25.726 15.730 69.141 1.00 1.00 N
ATOM 41 C4 5CM A 2 25.610 17.009 68.776 1.00 1.00 C
ATOM 42 N4 5CM A 2 24.685 17.316 67.863 1.00 1.00 N
ATOM 43 C5 5CM A 2 26.437 18.028 69.328 1.00 1.00 C
ATOM 44 C5A 5CM A 2 26.359 19.547 68.947 1.00 1.00 C
ATOM 45 C6 5CM A 2 27.347 17.660 70.238 1.00 1.00 C
open_file <./Atomic.g.pdb> failed: No such file or directory
means 3DNA (find_pair) is trying to locate the file in your current working directory (CWD), instead of the default system setting $X3DNA/config where the file Atomic.g.pdb is available, among other Atomic*.pdb files. x3dna_utils cp_std bdna
cp -f $X3DNA/config/Atomic.*.pdb .
rm -f Atomic*.pdb
find_pair 1ehz.pdb stdout
(-0.6616 -0.1982)/2 =-0.4299, which was different to the given value -0.4490Here Rm, as a proper rotation matrix, is orthonormal; its x-axis and y-axis are normalized, which is precisely where the discrepancy comes from.
# The following is extracted from your attached "struct_1.pdb"
ATOM 30 P G 2 -3.465 14.386 -4.840 0.00 0.00
ATOM 31 OP1 G 2 -4.272 15.372 -4.078 0.00 0.00
ATOM 32 OP2 G 2 -2.267 13.805 -4.173 0.00 0.00
ATOM 17 P DG A 2 23.337 31.278 21.156 1.00 13.26 P
ATOM 18 OP1 DG A 2 24.761 31.571 21.391 1.00 13.17 O
ATOM 19 OP2 DG A 2 22.651 31.834 19.956 1.00 12.34 O
# The above in red is taken from PDB entry 355d
"OP1 " instead of the normal " OP1",
and that explains the message you saw: no matching entry for atom name [OP1 ] (OP..) in 'atomlist.dat'Adding an entry "OP.. O" (note the two dots in place of digit and space) to file 'atomlist.dat' will make the info message go away.
Selection Criteria:
NDB ID: ad OR bd
Classification: DNA
Structure Description: Double Helix
Conformation Type: A OR B
No Drug, No Mismatch
No Modifiers (Base/Sugar/Phosphate)
Resolution better than 2.0 A
=======================
34 A-DNA and 27 B-DNA
For B-DNA, delete bd0012, bd0013 & bdf068 (following HMB)
bd0001 bd0006_A
bd0014: coordinates from PDB 463D
bd0005 bd0016_A (with repeated atoms!)
bd0018 bd0019 bdj017 bdj019 bdj025 bdj031 bdj036 bdj037 bdj051
bdj052 bdj060 bdj061
bdj081 (Uses helix #1 with strands A and B. The other two are
disordered)
bdl001 bdl005 bdl020 bdl084
bd0023_A bd0029
-------------------------- 27-3=24 structures
For A-DNA
ad0002 ==> (ad0002_AB + ad0002_CD)
ad0003 ad0004 adh008 adh010 adh0102 adh0103 adh0104 adh0105
adh014 adh026 adh027 adh029 adh033 adh034 adh038 adh039 adh047
adh070 adh078 adj0102 adj0103 adj0112 adj0113 adj022 adj049
adj050 adj051 adj065 adj066 adj067 adj075
adl025 (suspicious! big Buckle, alternating Propeller)
adl047 (with B-steps, not good either!)
-------------------------- 34+1-2=33 structures
Outliers:
A-DNA: ad0002_CD, steps 3-4, bps 3-4-5
ad0004, steps 3-4-5, bps 3-4-5-6
B-DNA: bdj025, step 3, bps 3-4
bdj031, step 3, bps 3-4
bdj037, step 3, bps 3-4
This folder (3DNA-NAR03-Fig5) contains all the data files and scripts
to reproduce Figure 5 of the 2003 3DNA paper in Nucleic Acids Research
(NAR03). The contents are taken from the original materials I used to
create Figure 5 of NAR03, with slight editing. Specifically, I revised
the Matlab scripts to work in GNU Octave v3.2.4 for verification.
If you have any questions or comments, please do post them on the 3DNA
Forum.
2012-09-06 -- Xiang-Jun Lu (http://x3dna.org)
========================================================================
Data selections:
'note-AB-datasets' -- datasets of selected A- and B-DNA structures
'note-TA-dataset' -- dataset of selected TA-DNA structures
Data files:
'A-heli-pars.dat' -- six helical parameters
'A-step-pars.dat' -- six step parameters
'A-zp-zph.dat' -- Zp and ZpH parameters
Selected parameters of the A-DNA dataset. Note that the order
the parameters is as in .out file from running 'analyze'
'B-heli-pars.dat', 'B-step-pars.dat', 'B-zp-zph.dat' for B-DNA
'TA-heli-pars.dat', 'TA-step-pars.dat', 'TA-zp-zph.dat' for TA-DNA
Scripts:
'incl_xdsp.m' -- script to generate Figure 5(a), Inclination vs Tip
'incl_xdsp.png' -- output file from running the script
'roll_slide.m' -- script to generate Figure 5(b), Roll vs Slide
'roll_slide.png' -- output file from running the script
'zph_zp.m' -- script to generate Figure 5(c), Zp(h) vs Zp
'zph_zp.png' -- output file from running the script
'draw_ellipse.m', 'get_pars.m', 'open_file.m' -- supporting scripts
Selection Criteria:
NDB ID: ad OR bd
Classification: DNA
Structure Description: Double Helix
Conformation Type: A OR B
No Drug, No Mismatch
No Modifiers (Base/Sugar/Phosphate)
Resolution better than 2.0 A
=======================
34 A-DNA and 27 B-DNA
For B-DNA, delete bd0012, bd0013 & bdf068 (following HMB)
bd0001 bd0006_A
bd0014: coordinates from PDB 463D
bd0005 bd0016_A (with repeated atoms!)
bd0018 bd0019 bdj017 bdj019 bdj025 bdj031 bdj036 bdj037 bdj051
bdj052 bdj060 bdj061
bdj081 (Uses helix #1 with strands A and B. The other two are
disordered)
bdl001 bdl005 bdl020 bdl084
bd0023_A bd0029
-------------------------- 27-3=24 structures
For A-DNA
ad0002 ==> (ad0002_AB + ad0002_CD)
ad0003 ad0004 adh008 adh010 adh0102 adh0103 adh0104 adh0105
adh014 adh026 adh027 adh029 adh033 adh034 adh038 adh039 adh047
adh070 adh078 adj0102 adj0103 adj0112 adj0113 adj022 adj049
adj050 adj051 adj065 adj066 adj067 adj075
adl025 (suspicious! big Buckle, alternating Propeller)
adl047 (with B-steps, not good either!)
-------------------------- 34+1-2=33 structures
Outliers:
A-DNA: ad0002_CD, steps 3-4, bps 3-4-5
ad0004, steps 3-4-5, bps 3-4-5-6
B-DNA: bdj025, step 3, bps 3-4
bdj031, step 3, bps 3-4
bdj037, step 3, bps 3-4
pd0070, pd0112, pd0154, pd0155, pd0156 pd0157, pd0158, pd0159, pd0160,
pd0161, pd0162, pd0163, pd0164, pdr031 pdt009, pdt012, pdt024, pdt025,
pdt032, pdt034, pdt036
This directory contains TATA box segments. It is normally 8-bp long, and
has the sequence: T-A-T-A-@-A-@-N. There are two kinks at the terminal
steps.
* means non-WC base-pair which is eliminated from further analysis
NDB ID ## Sequence Res(A) R-fac(%) chainID and residue range
--------------------------------------------------------------------
pd0070 01 T-T-T-A-A-A-T-A 2.4 20.0 C 1410 1417 D 1432 1439
pd0112 02 T-A-T-A-A-A-A-G 2.65 23.1 K 8 15 L 105 112
03 T-A-T-A-A-A-A-G C 8 15 D 105 112
04 T-A-T-A-A-A-A-G G 8 15 H 105 112
05 T-A-T-A-A-A-A-G O 8 15 P 105 112
06 T-A-T-A-A-A-A-G S 8 15 T 105 112
pd0154 07 T-A-T-A-A-A-A-T 1.86 21.0 C 203 210 D 219 226
08 T-A-T-A-A-A-A-T E 203 210 F 219 226
pd0155 09 T-A-T-A-A-G-A-G* 1.93 19.6 C 203 209 D 220 226
10 T-A-T-A-A-G-A-G* E 203 209 F 220 226
pd0156 11 T-A-T-A-A-T-A-G* 2.1 19.3 C 203 209 D 220 226
12 T-A-T-A-A-T-A-G* E 203 209 F 220 226
pd0157 13 T-A-T-A-T-A-A-G* 2.3 19.4 C 203 209 D 220 226
14 T-A-T-A-T-A-A-G* E 203 209 F 220 226
pd0158 15 T-A-T-T-A-A-A-G* 2.1 19.4 C 203 209 D 220 226
16 T-A-T-T-A-A-A-G* E 203 209 F 220 226
pd0159 17 T-A-C-A-A-A-A-G* 1.9 20.9 C 203 209 D 220 226
18 T-A-C-A-A-A-A-G* E 203 209 F 220 226
pd0160 19 T-T-T-A-A-A-A-G* 1.8 19.3 C 203 209 D 220 226
20 T-T-T-A-A-A-A-G* E 203 209 F 220 226
pd0161 21 T-A-T-A-A-A-T-G* 2.23 19.1 C 203 209 D 220 226
22 T-A-T-A-A-A-T-G* E 203 209 F 220 226
pd0162 23 A-A-T-A-A-A-A-G* 2.3 18.2 C 203 209 D 220 226
24 A-A-T-A-A-A-A-G* E 203 209 F 220 226
pd0163 25 T-A-T-A-A-A-A-G 1.9 19.7 C 203 210 D 219 226
26 T-A-T-A-A-A-A-G E 203 210 F 219 226
pd0164 27 T-A-T-A-A-A-C*G* 1.95 19.9 C 203 208 D 221 226
28 T-A-T-A-A-A-C*G* E 203 208 F 221 226
pdr031 29 T-T-T-t-t-A-A-A 2.1 21.2 C 1408 1415 E 1420 1427
pdt009 30 T-A-T-A-A-A-A-G 2.25 20.2 A 203 210 B 305 312
31 T-A-T-A-A-A-A-G C 403 410 D 505 512
pdt012 32 T-A-T-A-T-A-A-A 1.8 20.1 C 2 9 C 21 28
33 T-A-T-A-T-A-A-A D 2 9 D 21 28
pdt024 34 T-A-T-A-T-A-T-A 2.9 21.4 B 103 110 C 115 122
pdt025 35 T-A-T-A-A-A-A-G 1.9 19.4 C 203 210 D 219 226
36 T-A-T-A-A-A-A-G E 303 310 F 319 326
pdt032 37 T-A-T-A-A-A-A-G 2.7 21.5 C 4 11 D 106 113
pdt034 38 T-A-T-A-A-A-A-G 1.9 18.9 B 5 12 C 105 112
pdt036 39 T-A-T-A-A-A-A-C 2.5 23.5 E 9 16 F 1 8
--ensemble, -e <s>: Ensemble delineated with MODEL/ENDMDL pairs
--models, -m <s>: File containing an explicit list of model numbers
--pattern, -p <s>: Pattern of model files to process (e.g., *.pdb)
--list, -l <s>: File containing an explicit list of models
I work in Gromacs program with Amber force field. I converted my trajectory, in separeted pdbs files such as example folder.
I'd like to ask about the DNA set used for the analysis that is presented in Fig 5. in the NAR 2003 paper. Are those structures previously classified as A, B and TA DNA by other means (?) before doing the Zp and Zp(h) calculations to confirm their differences? Where can I look for the structures which were used? (I guess it is somewhere in reference 81, Patikoglou,G.A. et al (1999))
A detailed structural analysis of two early examples of the TATA‐box DNA bound to the TATA‐box binding protein (TBP) (10,79) led Guzikevich‐Guerstein and Shakked (80) to propose that the 8 bp TATA‐box adopts a novel TA‐DNA conformation, different from either A or B DNA. The structures of many more such complexes have since been determined (81) and, as shown in Table 2 and Figure 5, all TATA‐box regions share similar conformational features.
mkdir find_pair-p-examples
cd find_pair-p-examples
ls
wget ftp://ndbserver.rutgers.edu/NDB/coordinates/na-biol/1j6s.pdb1
find_pair -p -all_model 1j6s.pdb1 1j6s.mbp
1: #8 [1]...1>A:...1_:[BRU]u + [2]...1>A:...2_:[..G]G + [47]...2>A:...1_:[BRU]u + [48]...2>A:...2_:[..G]G + [93]...3>A:...1_:[BRU]u + [94]...3>A:...2_:[..G]G + [139]...4>A:...1_:[BRU]u + [140]...4>A:...2_:[..G]G
2: #4 [3]...1>A:...3_:[..A]A + [49]...2>A:...3_:[..A]A + [95]...3>A:...3_:[..A]A + [141]...4>A:...3_:[..A]A
3: #4 [4]...1>A:...4_:[..G]G + [50]...2>A:...4_:[..G]G + [96]...3>A:...4_:[..G]G + [142]...4>A:...4_:[..G]G
4: #4 [5]...1>A:...5_:[..G]G + [51]...2>A:...5_:[..G]G + [97]...3>A:...5_:[..G]G + [143]...4>A:...5_:[..G]G
ex_str -1 multiplets.pdb oct.pdb
r3d_atom -od -r=0.1 -b=0.2 oct.pdb stdout | render -png > oct.png
ex_str -2 multiplets.pdb A-tetrad.pdb
r3d_atom -od -r=0.1 -b=0.2 A-tetrad.pdb stdout | render -png > A-tetrad.png
ex_str -3 multiplets.pdb G-tetrad.pdb
r3d_atom -od -r=0.1 -b=0.2 G-tetrad.pdb stdout | render -png > G-tetrad.png
find_pair -s 1egk.pdb stdout | analyze -chain_markers='+|x*' stdin
1 (0.013) ....>A:...1_:[..A]A +
2 (0.020) ....>A:...2_:[..G]G |
3 (0.019) ....>A:...3_:[..G]G |
4 (0.014) ....>A:...4_:[..A]A |
5 (0.014) ....>A:...5_:[..G]G |
6 (0.016) ....>A:...6_:[..A]A |
7 (0.020) ....>A:...7_:[..G]G |
8 (0.015) ....>A:...8_:[..A]A |
9 (0.028) ....>A:...9_:[..G]G |
10 (0.015) ....>A:..10_:[..A]A |
11 (0.015) ....>A:..11_:[..U]U |
12 (0.022) ....>A:..12_:[..G]G |
13 (0.015) ....>A:..13_:[..G]G |
14 (0.021) ....>A:..14_:[..G]G |
15 (0.025) ....>A:..15_:[..U]U |
16 (0.016) ....>A:..16_:[..G]G |
17 (0.016) ....>A:..17_:[..C]C |
18 (0.016) ....>A:..18_:[..G]G |
19 (0.012) ....>A:..19_:[..A]A |
20 (0.017) ....>A:..20_:[..G]G x
21 (0.010) ....>B:..21_:[..C]C +
22 (0.018) ....>B:..22_:[..T]T |
23 (0.007) ....>B:..23_:[..C]C |
24 (0.016) ....>B:..24_:[..G]G |
25 (0.011) ....>B:..25_:[..C]C |
26 (0.013) ....>B:..26_:[..A]A |
27 (0.011) ....>B:..27_:[..C]C |
28 (0.006) ....>B:..28_:[..C]C |
29 (0.010) ....>B:..29_:[..C]C |
find_pair 1egk.pdb stdout
# default as before
find_pair -chain_markers='o|x+' 1egk.pdb stdout
# with helix beginning character assigned to 'o'
Funded by the NIH R24GM153869 grant on X3DNA-DSSR, an NIGMS National Resource for Structural Bioinformatics of Nucleic Acids
Created and maintained by Dr. Xiang-Jun Lu, Department of Biological Sciences, Columbia University