THIS IS PROGRAM TRACTS(PUR), UNIX VERSION 5.5 2/96.
By G. Yagil. cf. DNA SEQUENCE 1:157-172 (1991).
DATA FILE: AT
ID HIL42023 1830135 bp DNA circular BCT 27-SEP-1996
SQ H influenzae Rd complete genome. 567624a 350720c 347436g 564240t 115 others
BASE 47036 N
BASE 131342 N
BASE 131362 N
BASE 1475991 N
BASE 1545334 N
BASE 1567294 N
BASE 1768171 N
1.SQ H influenzae Rd complete genome. 567624a 350720c 347436g Clone = HI
FIRST BASE is 1; LAST BASE is ******
SEQUENCE LENGTH= ****** IN SECTION(S) OF ****** REGION OF****** BASES
MODE is NOT alternating. LEVEL = 100% ALT= W;S 1:1 = ; 2:3 =W;S
Sequences longer than 14
(partial)
I = Intercoding C = Coding w- watson c = crick
Code begins ends length from to type sequence
HIL4 I - I - 100.0 16 521 536 W;S TAAAAATAATTTAAAA
HIL4 I - I - 100.0 21 4839 4859 W;S TTAAATTATTTTTATATTTAT
HIL4 C c C c 100.0 15 11128 11142 W;S TATTAATTTTAAATT
HIL4 C c C c 100.0 15 13650 13664 W;S TAAAATTTTATTTAA
HIL4 I - I - 100.0 15 17851 17865 W;S AAAATAATAATAAAA
HIL4 I - I - 100.0 20 17900 17919 W;S AAATTATTTTATATTTATTT
HIL4 I - I - 100.0 29 17933 17961 W;S TTATTTTTTTAATTAATTTTTATAAATTT
HIL4 I - I - 100.0 25 17979 18003 W;S ATTAATTTTTAAATAAAATTTATAT
HIL4 C w C w 100.0 17 18343 18359 W;S TATTAAATATATATAAA
HIL4 C c C c 100.0 15 20099 20113 W;S TATTTAATAATAATA
HIL4 I - I - 100.0 16 23516 23531 W;S TATAAAAATAAATTTT
HIL4 C c C c 100.0 17 24239 24255 W;S TAATAATTTTTTAATAA
HIL4 I - C 6 100.0 15 27641 27655 W;S AATTTTAAATTATTT
HIL4 I - I - 100.0 16 29594 29609 W;S TAAAAATAAAATAAAT
HIL4 C w C w 100.0 15 39559 39573 W;S TTATTTAAAAAAATT
HIL4 C c C c 100.0 15 44020 44034 W;S AAATAAAATTATTTA
HIL4 C c C c 100.0 16 44177 44192 W;S TTTTAAAATAAATAAA
HIL4 C x C 100.0 15 44615 44629 W;S TAAATTTTATTTTTA
HIL4 I - I - 100.0 15 46395 46409 W;S TAAAATAAATAAAAA
HIL4 I - C 0 100.0 22 47737 47758 W;S AAATAAATTTATTTAATAATAT
HIL4 C c C c 100.0 17 48761 48777 W;S AAATATTAATAATATTT
HIL4 C c C c 100.0 16 52084 52099 W;S ATATTTTTTATAAAAA
HIL4 C 0 C h 100.0 20 53064 53083 W;S ATTAATTTTTAAATAATTTT
HIL4 C w C w 100.0 17 54587 54603 W;S ATATTTTAAATTTAAAA
HIL4 C c C c 100.0 17 56871 56887 W;S TTAATAAAAATAATATT
HIL4 I - I - 100.0 15 65065 65079 W;S TAAAAATAAATTTTT
HIL4 C w C w 100.0 16 66588 66603 W;S TTTATTAAAAAATATT
HIL4 C c C c 100.0 15 1808249 1808263 W;S TTTAAAAATTTTATT
HIL4 I - I - 100.0 17 1808330 1808346 W;S AATTTTATTTTATATTT
HIL4 I - I - 100.0 25 1808441 1808465 W;S TTAATTTATTTTTTATTTAAAATAA
HIL4 I - I - 100.0 16 1810620 1810635 W;S TTATTATTTTAAAAAA
HIL4 I - C 7 100.0 18 1813642 1813659 W;S AAAATTATATAAAAATAA
HIL4 I - I - 100.0 23 1815711 1815733 W;S TAATTAATAAAATTAATTATTTA
HIL4 I - I - 100.0 15 1815830 1815844 W;S TTAATTTAATTTTAT
HIL4 I - I - 100.0 21 1819428 1819448 W;S AAAATTAAATTTAAAATTAAA
HIL4 I - C 3 100.0 15 1821763 1821777 W;S AATTTATAAATTAAA
HIL4 I - I - 100.0 16 1823021 1823036 W;S AATTTATTTTAAAAAA
---------- SEQUENCES ---------- ---------- BASES ----------- ------- BASES GE. --------
LENGTH PYRS PURS SUM EXPCTD DIFF FOUND EXPECTD DIFF RATIO CUMULTV. FOUND EXPCTD RATIO
1 135112 218306 353418 431827.78 -78409.8 353418 431827.78 -78409.78 0.82 353418 1830135 1830135.00 1.00
2 94687 101630 196317 203783.05 -7466.0 392634 407566.09 -14932.09 0.96 746052 1476717 1398307.50 1.06
3 54467 43527 97994 101891.52 -3897.5 293982 305674.56 -11692.56 0.96 1040034 1084083 990741.50 1.09
4 36988 20665 57653 53808.09 3844.9 230612 215232.38 15379.63 1.07 1270646 790101 685066.94 1.15
5 29156 7928 37084 29766.37 7317.6 185420 148831.86 36588.14 1.25 1456066 559489 469834.47 1.19
6 15551 2249 17800 17070.13 729.9 106800 102420.82 4379.18 1.04 1562866 374069 321002.59 1.17
7 11070 801 11871 10046.64 1824.4 83097 70326.48 12770.52 1.18 1645963 267269 218581.77 1.22
8 7636 343 7979 6018.87 1960.1 63832 48150.98 15681.02 1.33 1709795 184172 148255.27 1.24
9 3942 134 4076 3648.33 427.7 36684 32834.94 3849.06 1.12 1746479 120340 100104.28 1.20
10 2601 33 2634 2228.15 405.9 26340 22281.49 4058.51 1.18 1772819 83656 67269.35 1.24
11 1807 21 1828 1367.31 460.7 20108 15040.43 5067.57 1.34 1792927 57316 44987.88 1.27
12 928 8 936 841.57 94.4 11232 10098.85 1133.15 1.11 1804159 37208 29947.45 1.24
13 603 1 604 518.95 85.1 7852 6746.33 1105.67 1.16 1812011 25976 19848.60 1.31
14 414 0 414 320.38 93.6 5796 4485.27 1310.73 1.29 1817807 18124 13102.27 1.38
15 227 0 227 197.93 29.1 3405 2968.92 436.08 1.15 1821212 12328 8617.00 1.43
16 140 0 140 122.33 17.7 2240 1957.35 282.65 1.14 1823452 8923 5648.08 1.58
17 111 0 111 75.63 35.4 1887 1285.75 601.25 1.47 1825339 6683 3690.73 1.81
18 59 0 59 46.77 12.2 1062 841.81 220.19 1.26 1826401 4796 2404.98 1.99
19 59 0 59 28.92 30.1 1121 549.50 571.50 2.04 1827522 3734 1563.18 2.39
20 38 0 38 17.89 20.1 760 357.73 402.27 2.12 1828282 2613 1013.67 2.58
21 20 0 20 11.06 8.9 420 232.31 187.69 1.81 1828702 1853 655.95 2.82
22 11 0 11 6.84 4.2 242 150.52 91.48 1.61 1828944 1433 423.64 3.38
23 9 0 9 4.23 4.8 207 97.33 109.67 2.13 1829151 1191 273.12 4.36
24 9 0 9 2.62 6.4 216 62.82 153.18 3.44 1829367 984 175.79 5.60
25 8 0 8 1.62 6.4 200 40.47 159.53 4.94 1829567 768 112.97 6.80
26 3 0 3 1.00 2.0 78 26.03 51.97 3.00 1829645 568 72.50 7.83
27 1 0 1 0.62 0.4 27 16.72 10.28 1.61 1829672 490 46.46 10.55
28 7 0 7 0.38 6.6 196 10.73 185.27 18.27 1829868 463 29.74 15.57
29 5 0 5 0.24 4.8 145 6.87 138.13 21.10 1830013 267 19.02 14.04
30 1 0 1 0.15 0.9 30 4.40 25.60 6.82 1830043 122 12.15 10.04
SUM 791316 1830043 (:2 =915021.5) DEVIATION: 13.52% %G,C = 0.381
Summary for AT sequences .GE.15 at level 100%
CODING INTERCODING INTRONS TOTAL BASES
TOTAL: 1622885 207250 0 1830135
FOUND: 6011 6225 0 12236
EXPECTED: 7641.19 975.82 0.00 8617.00
RATIO: 0.79 6.38 0.00 1.42
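The RATIO row in the summary above appears to be FOUND divided by EXPECTED for each region class. A minimal Python sketch that re-derives it from the printed AT figures is given here. The function name `enrichment_ratio` is ours and not part of TRACTS, and returning 0.0 when nothing is expected is an assumption taken from the 0.00 printed under INTRONS.

```python
def enrichment_ratio(found, expected):
    """Observed/expected ratio; 0.0 when nothing is expected (assumed convention)."""
    if expected == 0:
        return 0.0
    return found / expected


# (FOUND, EXPECTED) pairs copied from the AT summary (tracts >= 15, level 100%).
at_summary = {
    "CODING": (6011, 7641.19),
    "INTERCODING": (6225, 975.82),
    "INTRONS": (0, 0.00),
    "TOTAL": (12236, 8617.00),
}

# Round to two decimals, matching the precision of the printed RATIO row.
ratios = {
    region: round(enrichment_ratio(found, expected), 2)
    for region, (found, expected) in at_summary.items()
}

for region, ratio in ratios.items():
    print(f"{region:12s} {ratio:.2f}")
```

Read this way, a ratio below 1 (CODING) means fewer tract bases than the model expects, and a ratio above 1 (INTERCODING) means more.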
Level 100 Mode AC Sequence:
PIU,QIU = ******* 0.0
Sequences longer than 14
(partial)
I = Intercoding C = Coding w- watson c = crick
Code begins ends length from to type sequence
HIL4 C w C w 100.0 16 9686 9701 K.M AAAAAACACCACCCAA
HIL4 C c C c 100.0 19 16498 16516 K.M CACAAACAACCCAACCCAC
HIL4 C w C w 100.0 17 69492 69508 K.M AACAAAAAACCAAACAC
HIL4 C w C w 100.0 17 86101 86117 K.M GTTTGGTTGTTGTTTTG
HIL4 I - C 8 100.0 16 87884 87899 K.M TGTGGGTGTTGGTTTG
HIL4 C c C c 100.0 17 110040 110056 K.M TTGTTGTTTTTGTGTTT
HIL4 C c C c 100.0 17 111176 111192 K.M AAAAACCCACACCAAAC
HIL4 I - I - 100.0 16 122781 122796 K.M AAAAAAACAAAAACCC
HIL4 C c C c 100.0 15 134588 134602 K.M AAAACACAAACCACC
HIL4 C c C c 100.0 15 135303 135317 K.M AACACCACCACCAAC
HIL4 I - I - 100.0 15 168850 168864 K.M CAACAACAACCCCAA
HIL4 I - I - 100.0 17 169295 169311 K.M CACACCACCAAAACAAA
HIL4 C c C c 100.0 17 171077 171093 K.M AAAAACCAACCACACCC
HIL4 C c C c 100.0 15 174420 174434 K.M TTTTTTGTTTTGTTG
HIL4 C w C w 100.0 15 1670583 1670597 K.M CAACAACACAAACAC
HIL4 C w C w 100.0 16 1673741 1673756 K.M GGTGTTGGTGGTGGTT
HIL4 C c C c 100.0 16 1676198 1676213 K.M TTTTGGTTTTTTGTGG
HIL4 C w C w 100.0 16 1741843 1741858 K.M AAACACAAAAACACAA
HIL4 C w C w 100.0 16 1755805 1755820 K.M AAAAAACACAACAAAA
HIL4 C c C c 100.0 17 1775321 1775337 K.M GGTTTTGTTTGTTTTTG
HIL4 I - I - 100.0 16 1789793 1789808 K.M TTTGTTTTGTTGTTTT
HIL4 I - I - 100.0 15 1805532 1805546 K.M GGTTGTTTTTTTGTT
---------- SEQUENCES ---------- ---------- BASES ----------- ------- BASES GE. --------
LENGTH PYRS PURS SUM EXPCTD DIFF FOUND EXPECTD DIFF RATIO CUMULTV. FOUND EXPCTD RATIO
1 190363 192559 382922 457527.50 -74605.5 382922 457527.50 -74605.50 0.84 382922 1830135 1830135.00 1.00
2 99305 98725 198030 228760.64 -30730.6 396060 457521.28 -61461.28 0.87 778982 1447213 1372607.25 1.05
3 59684 59259 118943 114380.32 4562.7 356829 343141.00 13688.00 1.04 1135811 1051153 915086.31 1.15
4 29164 28808 57972 57190.94 781.1 231888 228763.75 3124.25 1.01 1367699 694324 571945.31 1.21
5 17465 17104 34569 28596.25 5972.8 172845 142981.23 29863.77 1.21 1540544 462436 343181.50 1.35
6 9628 9340 18968 14298.71 4669.3 113808 85792.25 28015.75 1.33 1654352 289591 200200.22 1.45
7 4969 4884 9853 7149.74 2703.3 68971 50048.21 18922.79 1.38 1723323 175783 114407.99 1.54
8 2686 2583 5269 3575.12 1693.9 42152 28600.93 13551.07 1.47 1765475 106812 64359.78 1.66
9 1455 1492 2947 1787.70 1159.3 26523 16089.33 10433.67 1.65 1791998 64660 35758.86 1.81
10 764 765 1529 893.94 635.1 15290 8939.37 6350.63 1.71 1807288 38137 19669.51 1.94
11 438 392 830 447.02 383.0 9130 4917.19 4212.81 1.86 1816418 22847 10730.14 2.13
12 232 233 465 223.54 241.5 5580 2682.43 2897.57 2.08 1821998 13717 5812.95 2.36
13 120 110 230 111.78 118.2 2990 1453.18 1536.82 2.06 1824988 8137 3130.52 2.60
14 58 73 131 55.90 75.1 1834 782.60 1051.40 2.34 1826822 5147 1677.34 3.07
15 37 30 67 27.95 39.0 1005 419.32 585.68 2.40 1827827 3313 894.74 3.70
16 24 23 47 13.98 33.0 752 223.68 528.32 3.36 1828579 2308 475.42 4.85
17 17 11 28 6.99 21.0 476 118.85 357.15 4.01 1829055 1556 251.74 6.18
18 7 6 13 3.50 9.5 234 62.93 171.07 3.72 1829289 1080 132.89 8.13
19 2 2 4 1.75 2.3 76 33.22 42.78 2.29 1829365 846 69.96 12.09
20 2 3 5 0.87 4.1 100 17.49 82.51 5.72 1829465 770 36.74 20.96
21 0 2 2 0.44 1.6 42 9.18 32.82 4.57 1829507 670 19.25 34.81
23 1 2 3 0.11 2.9 69 2.52 66.48 27.42 1829576 628 5.25 119.56
68 1 0 1 0.00 1.0 68 0.00 68.00 ********* 1829644 559 0.00 *********
78 1 0 1 0.00 1.0 78 0.00 78.00 ********* 1829722 491 0.00 *********
85 0 1 1 0.00 1.0 85 0.00 85.00 ********* 1829807 413 0.00 *********
87 0 1 1 0.00 1.0 87 0.00 87.00 ********* 1829894 328 0.00 *********
151 0 1 1 0.00 1.0 151 0.00 151.00 0.00 1830045 241 0.00 0.00
SUM 832832 1830045 (:2 =915022.5) DEVIATION: 8.98% %G,T = 0.498
Summary for AC sequences .GE.15 at level 100%
CODING INTERCODING INTRONS TOTAL BASES
TOTAL: 1622885 207250 0 1830135
FOUND: 2429 794 0 3223
EXPECTED: 793.42 101.32 0.00 894.74
RATIO: 3.06 7.84 0.00 3.60
Level 100 Mode Sequence:
PIU,QIU = **************
Sequences longer than 14
(partial)
I = Intercoding C = Coding w- watson c = crick
Code begins ends length from to type sequence
HIL4 C o C o 100.0 17 4024 4040 R.Y TTTCCTTTTCCTTCCCT
HIL4 C c C c 100.0 17 15560 15576 R.Y TTTTTTCCCTTCTTTCT
HIL4 C c C c 100.0 15 37186 37200 R.Y AAAGAAGAAAAAAAG
HIL4 C w C y 100.0 17 68344 68360 R.Y TTTTTCCTTTTTTTCTC
HIL4 C w C w 100.0 16 68985 69000 R.Y AGAAGAAGAAAAAAAA
HIL4 C w C w 100.0 15 69213 69227 R.Y AAAAAAAGGAGAAAG
HIL4 C w C w 100.0 15 69429 69443 R.Y AAAAAAAGAGAAAAA
HIL4 C l C t 100.0 15 78905 78919 R.Y TTTTTTCTTTTTCCT
HIL4 C w C w 100.0 21 1633907 1633927 R.Y AAGGGAAAGAAAGAGAGAAAG
HIL4 I - I - 100.0 15 1647971 1647985 R.Y TCCCCTCCTCCTTTT
HIL4 I - I - 100.0 21 1654893 1654913 R.Y TTCCTTTCTTTTTTCTTCTTC
HIL4 O 6 O v 100.0 15 1686202 1686216 R.Y TTTTTCTCTCTTTTT
HIL4 I - I - 100.0 21 1728812 1728832 R.Y TTTTTCCTTTTCTTCTTTTTT
HIL4 I - I - 100.0 16 1753038 1753053 R.Y AGGAAAAAGAAAGAAA
HIL4 C o C b 100.0 17 1779826 1779842 R.Y TTCTTCTCCCCTCTTTT
HIL4 C s C c 100.0 19 1789866 1789884 R.Y TCTTTCCCTCTCCTCTCTT
HIL4 I - I - 100.0 15 1810630 1810644 R.Y AAAAAAGAGAAAAGG
---------- SEQUENCES ---------- ---------- BASES ----------- ------- BASES GE. --------
LENGTH PYRS PURS SUM EXPCTD DIFF FOUND EXPECTD DIFF RATIO CUMULTV. FOUND EXPCTD RATIO
1 215522 215706 431228 457533.75 -26305.8 431228 457533.75 -26305.75 0.94 431228 1830135 1830135.00 1.00
2 116680 116699 233379 228766.88 4612.1 466758 457533.75 9224.25 1.02 897986 1398907 1372600.75 1.02
3 55500 55265 110765 114383.44 -3618.4 332295 343150.28 -10855.28 0.97 1230281 932149 915067.50 1.02
4 27578 27591 55169 57191.72 -2022.7 220676 228766.88 -8090.88 0.96 1450957 599854 571917.25 1.05
5 14498 14480 28978 28595.86 382.1 144890 142979.30 1910.70 1.01 1595847 379178 343150.31 1.10
6 7538 7639 15177 14297.93 879.1 91062 85787.57 5274.43 1.06 1686909 234288 200171.02 1.17
7 4131 3965 8096 7148.96 947.0 56672 50042.75 6629.25 1.13 1743581 143226 114383.43 1.25
8 2301 2310 4611 3574.48 1036.5 36888 28595.86 8292.14 1.29 1780469 86554 64340.68 1.35
9 1159 1150 2309 1787.24 521.8 20781 16085.17 4695.83 1.29 1801250 49666 35744.82 1.39
10 566 595 1161 893.62 267.4 11610 8936.21 2673.79 1.30 1812860 28885 19659.65 1.47
11 325 339 664 446.81 217.2 7304 4914.91 2389.09 1.49 1820164 17275 10723.45 1.61
12 140 190 330 223.41 106.6 3960 2680.86 1279.14 1.48 1824124 9971 5808.53 1.72
13 100 95 195 111.70 83.3 2535 1452.13 1082.87 1.75 1826659 6011 3127.67 1.92
14 50 59 109 55.85 53.1 1526 781.92 744.08 1.95 1828185 3476 1675.54 2.07
15 29 26 55 27.93 27.1 825 418.88 406.12 1.97 1829010 1950 893.62 2.18
16 7 18 25 13.96 11.0 400 223.41 176.59 1.79 1829410 1125 474.74 2.37
17 11 9 20 6.98 13.0 340 118.68 221.32 2.86 1829750 725 251.33 2.88
18 3 0 3 3.49 -0.5 54 62.83 -8.83 0.86 1829804 385 132.65 2.90
19 2 4 6 1.75 4.3 114 33.16 80.84 3.44 1829918 331 69.81 4.74
20 1 2 3 0.87 2.1 60 17.45 42.55 3.44 1829978 217 36.65 5.92
21 2 1 3 0.44 2.6 63 9.16 53.84 6.88 1830041 157 19.20 8.18
SUM 892286 1830041 (:2 =915020.5) DEVIATION: 2.48% %A,G = 0.500
Summary for RY sequences .GE.15 at level 100%
CODING INTERCODING INTRONS TOTAL BASES
TOTAL: 1622885 207250 0 1830135
FOUND: 1266 590 0 1856
EXPECTED: 792.42 101.20 0.00 893.62
RATIO: 1.60 5.83 0.00 2.08
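The EXPCTD tract counts in the RY table look consistent with a simple independent-base model, in which a maximal run of n bases from a class with frequency p must be flanked on both sides by a base of the other class. The sketch below is an assumption about that model, not TRACTS source code, and it ignores end effects and the unassigned N bases. With p = 0.5 and the genome length of 1830135 it gives values close to the first EXPCTD entries of the RY table (457533.75, 228766.88, 114383.44). The function name `expected_tracts` is hypothetical.

```python
def expected_tracts(total_bases, p, n):
    """Expected number of maximal single-class runs of exactly n bases.

    p is the frequency of one base class and q = 1 - p the frequency of
    the other. A run of n bases of one class needs a base of the other
    class on each side, so both classes contribute a term.
    """
    q = 1.0 - p
    return total_bases * (q * q * p ** n + p * p * q ** n)


GENOME_LENGTH = 1830135  # from the ID line of the listing

for n in (1, 2, 3):
    print(n, round(expected_tracts(GENOME_LENGTH, 0.5, n), 2))
```

For the AT mode the same formula with p set to the A+T fraction gives numbers near, but not identical to, the printed ones, so the program presumably normalizes the base frequencies somewhat differently.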
PERCENT OCCUPIED BY TRACTS.GE. 10: 8.09%
1. End Sequence: HIL42023 1830135 bp DNA circular BCT 27-SEP-1996
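At level 100% in non-alternating mode, the tract listings above appear to report maximal runs drawn from a single base class: W (A/T) in AT mode, M (A/C) or K (G/T) in AC mode, and R (A/G) or Y (C/T) in RY mode. The following Python sketch shows one way such runs could be located. The `MODES` table and the function `find_tracts` are illustrative names, not part of TRACTS, and the sketch does not cover levels below 100%, alternating mode, or the coding/intercoding annotation.

```python
from itertools import groupby

# Base classes per mode, assumed from the "type" column of the listing.
MODES = {
    "AT": {"W": "AT", "S": "CG"},
    "AC": {"M": "AC", "K": "GT"},
    "RY": {"R": "AG", "Y": "CT"},
}


def find_tracts(seq, mode, min_len=15):
    """Return (start, end, length, class, sequence) for each maximal run.

    Positions are 1-based and inclusive, as in the listing. Bases outside
    the four standard letters (for example N) break a run.
    """
    lookup = {
        base: name
        for name, bases in MODES[mode].items()
        for base in bases
    }
    tracts = []
    pos = 1
    # groupby splits the sequence into consecutive runs of the same class.
    for cls, group in groupby(seq.upper(), key=lambda b: lookup.get(b)):
        run = "".join(group)
        if cls is not None and len(run) >= min_len:
            tracts.append((pos, pos + len(run) - 1, len(run), cls, run))
        pos += len(run)
    return tracts


# The first AT tract of the listing, embedded between two G/C bases.
demo = "GC" + "TAAAAATAATTTAAAA" + "GC"
print(find_tracts(demo, "AT"))
```

The default `min_len=15` mirrors the "Sequences longer than 14" threshold printed above each listing.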