Starcevic et al. 10.1073/pnas.0707388105.
Scheme 1. Presumed biosynthetic pathway for the production of natural UV-suncreening agents (MAAs) in UV-tolerant marine algae via the plant shikimic acid pathway. Enzymes of the pathway are 1) 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase, 2) 3-dehydroquinate synthase, 3) 3-dehydroquinate dehydratase, 4) shikimate 5-dehydrogenase, 5) shikimate kinase, 6) 5-enolpyruvylshikimate-3-phosphate (EPSP) synthase, and 7) chorismate synthase. [Reconstructed with permission from ref. 1 (Copyright 2002, Annual Reviews, www.annualreviews.org).]
1. Shick JM, Dunlap WC (2002) Mycosporine-like amino acids and related Gadusols: Biosynthesis, acumulation, and UV-protective functions in aquatic organisms. Annu Rev Physiol 64:223-262.
Dataset 1
Query sequence: scaffold_33_6
Accession: [none]
Description: [none]
Scores for sequence family classification (score includes all domains):
Model Description Score E-value N
-------- ----------- ----- ------- ---
EPSP_synthase EPSP synthase (3-phosphoshikimate 1-c 282.6 8.4e-85 2
Parsed for domains:
Model Domain seq-f seq-t hmm-f hmm-t score E-value
-------- ------- ----- ----- ----- ----- ----- -------
EPSP_synthase 1/2 38577 38962 .. 1 425 [. 271.8 1.5e-81
EPSP_synthase 2/2 39293 39314 .. 430 451 .] 10.8 1e-05
EPSP_synthase: domain 1 of 2, from 38577 to 38962: score 271.8, E = 1.5e-81
*->vtggsrLnGeVkvPGSKSishRaLllAALAegeepstItNlLdsdDt
v+gg +L+G++++PG +++++++L++A+L+++ p+++tN + D+
scaffold_3 38577 VRGGYPLRGTIRIPG-AKNAALPLMAASLLTT-KPVRLTNIPKVTDV 38621
rlmleaLraLGaevieldeekevviveGlggqfeapyesdlvldlGNSGT
+m+ +L++ G++v++ +++ v ++++ + +++ s ++ S+
scaffold_3 38622 NAMAVILQSHGVAVEWR-PDDSLVLDARNAQGIPSI--SSTYAPIRSSIF 38668
amRpLlgrlalaqsnevvLtGddsi..geRPidrlldaLrqlGAeIesre
+++p gr+++a + +G+++i++g RPid ++ a r lGA ++ ++
scaffold_3 38669 TLGPAMGRFGEAM---IQVPGGCQIsqGGRPIDLHFYAMRKLGAIVDEES 38715
gegyaPlavrggglklggveidgsiSSqfvTslLmlApllAegdvttiie
g l v+++g +l+g+ i ++++S+++T + ++A+ l g ti+e
scaffold_3 38716 G-----L-VKTNGNRLRGARITFDKVSVGATINALMAACLVQG--KTILE 38757
nGklasePyiddTlnmLkkfGakiegsgtetsftvkGgqkYklpgveylV
n +a e +idd++ mL+k+Ga+i++ + +++ ++G+ l+g+++ V
scaffold_3 38758 N--AAMEAEIDDLVCMLRKMGAQIDKNRDTKTWFITGVSS--LHGADHGV 38803
egDaSsAayFlaAAaitgGStVlvenvginslqpGDiravlkvLedmGen
++D++ A+++++AA++tgG + +++ +g+ ++ p+ vl L+ +G
scaffold_3 38804 VPDRIVAGTYAVAAVMTGG-ELTLT-LGPCPV-PALMGCVLTCLRAAG-- 38848
eaevtqeedadivvgppvnsmLkglkgidvdintapDpapttAvlaafAe
aev + ++i+v++ g ++ v i t p+p+++t+++ + ++
scaffold_3 38849 -AEVMELA-EGIRVRG-------GRRPRSVSITTSPYPGFPTDMQPQWMA 38889
GtsrieGiselRvKEtDRlfamatELrklGaeveegpDGliiigsiitav
+ ++G e++++++D++f ++ ELrklGa+ e + G ++ +
scaffold_3 38890 LMCMAQGSCEVKETIFDHRFQHVKELRKLGANLECT--G------KNVVR 38931
vhGve..qLkgaevdtygDHRiAMafaLaGLv<-*
v Gv + +++ v ++ D+R+A+a+ L+GL+
scaffold_3 38932 VQGVDlsLMQPSLVQAT-DLRAAAALLLVGLA 38962
EPSP_synthase: domain 2 of 2, from 39293 to 39314: score 10.8, E = 1e-05
*->geviIddpectdksfPdFfekL<-*
ge++I d++++ ++++d +++L
scaffold_3 39293 GETVIQDIHHLERGYEDVVRVL 39314
#############################################################################
Query sequence: scaffold_85_5
Accession: [none]
Description: [none]
Scores for sequence family classification (score includes all domains):
Model Description Score E-value N
-------- ----------- ----- ------- ---
DHQ_synthase 3-dehydroquinate synthase 91.0 1.7e-28 2
Parsed for domains:
Model Domain seq-f seq-t hmm-f hmm-t score E-value
-------- ------- ----- ----- ----- ----- ----- -------
DHQ_synthase 1/2 134772 134859 .. 22 110 .. 60.4 7.3e-20
DHQ_synthase 2/2 134885 134941 .. 136 193 .. 30.6 2e-11
DHQ_synthase: domain 1 of 2, from 134772 to 134859: score 60.4, E = 7.3e-20
*->vvivtdetvaklygekveeaLkaaGfevevivipdGEtsKtletlek
v+i+ d+ v klyge+++ ++ +++ +v+p+ E K+++ +ek
scaffold_8 134772 VAII-DDKVDKLYGEPLKLYFDTHNIKLWKLVFPGNEVDKDISAVEK 134817
iydaLleagltRsdlliAlGGGvigDlaGFaAAtymRGipfi<-*
+ L +++R+ ++ GGGvi D+aGFaAA y+R p++
scaffold_8 134818 MLVELKKIKVSRDQPILVMGGGVISDIAGFAAALYHRNTPYV 134859
DHQ_synthase: domain 2 of 2, from 134885 to 134941: score 30.6, E = 2e-11
*->KNliGaFyqPkaVliDtdfLkTLPeRElraGmAEvISKygaIaDwel
KNl G++++P + l D f +TL +r+G+AE++ K+++++D+el
scaffold_8 134885 KNLYGSYHPPVLTLTDRSFFRTLHHGWIRHGIAEIV-KMAVVKDEEL 134930
fhwLeeeafal<-*
f++Le+ +++l
scaffold_8 134931 FNLLEQVSSTL 134941
BLASTP results for AroA scaffold_33_6
################################################################################
Score E
Sequences producing significant alignments: (Bits) Value
ref|YP_932319.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 311 5e-83 Gene info
ref|ZP_00800949.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 308 5e-82
ref|ZP_01362358.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 308 6e-82
ref|YP_286592.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 307 9e-82 Gene info
ref|YP_865138.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 306 1e-81 Gene info
ref|ZP_01504774.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 306 1e-81
ref|ZP_01199610.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 305 4e-81
ref|ZP_01513509.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 303 9e-81
ref|YP_315649.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 303 1e-80 Gene info
ref|ZP_01739735.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 303 1e-80
ref|NP_900110.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 303 1e-80 Gene info
ref|ZP_00985309.1| COG0766: UDP-N-acetylglucosamine enolpyruv... 302 3e-80
ref|YP_560589.1| UDP-N-acetylglucosamine1-carboxyvinyltransfe... 302 3e-80 Gene info
ref|NP_283098.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 301 4e-80 Gene info
ref|YP_154801.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 301 5e-80 Gene info
ref|YP_367767.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 301 5e-80 Gene info
ref|YP_001237059.1| UDP-N-acetylglucosamine 1-carboxyvinyltra... 301 6e-80 Gene info
ref|YP_001118249.1| UDP-N-acetylglucosamine 1-carboxyvinyltra... 300 9e-80 Gene info
>ref|YP_932319.1| Gene info UDP-N-acetylglucosamine 1-carboxyvinyltransferase [Azoarcus sp.
BH72]
emb|CAL93432.1| Gene info UDP-N-acetylglucosamine 1-carboxyvinyltransferase [Azoarcus sp.
BH72]
Length=416
Score = 311 bits (796), Expect = 5e-83, Method: Composition-based stats.
Identities = 187/405 (46%), Positives = 257/405 (63%), Gaps = 14/405 (3%)
Query 1 VRGGYPLRGTIRIPGAKNAALPLMAASLLTTKPVRLTNIPKVTDVNAMAVILQSHGVAVE 60
+ GG L G + I GAKNAALP++ A+LLT +PV TN+P++ D+ + +L GV VE
Sbjct 6 IEGGRRLSGEVAISGAKNAALPILCAALLTREPVTFTNVPRLNDIGTLLKLLGQMGVKVE 65
Query 61 WRPDDSLVLDARNAQGIPSISSTYAPIRSSIFTLGPAMGRFGEAMIQVPGGCQISQGGRP 120
R DD + LDA + +R+SI LGP + R G+A + +PGGC I G RP
Sbjct 66 -REDDRVTLDASALDNPVAPYEMVKTMRASILVLGPLVARCGDARVSLPGGCAI--GARP 122
Query 121 IDLHFYAMRKLGAIVDEESGLVKTNGNRLRGARITFDKVSVGATINALMAACLVQGKTIL 180
+D H ++ +GA V E G V+ RL+GAR+ D V+V T N +MAACL QG+T++
Sbjct 123 VDQHIKGLQAMGAEVRVEHGYVQAQVPRLKGARLFTDMVTVTGTENLMMAACLAQGETVI 182
Query 181 ENAAMEAEIDDLVCMLRKMGAQIDKNRDTKTWFITGVSSLHGADHGVVPDRIVAGTYAVA 240
ENAA E E+ DL L MGAQI T I GV +LHGA H ++PDRI GTY A
Sbjct 183 ENAAREPEVVDLANCLVAMGAQI-SGAGTDVIRIRGVDALHGATHRIMPDRIETGTYLCA 241
Query 241 AVMTGGELTLT-LGPCPVPALMGCVLTCLRAAGAEVMELAEGIRVRGGRRPRSVSITTSP 299
A +TGGE+ LT C + A V+ L AG EV+ + IR+ RRP++V++ T+P
Sbjct 242 AAVTGGEVRLTGTSSCYLDA----VIDKLMDAGCEVVSERDAIRLAAPRRPQAVNLRTAP 297
Query 300 YPGFPTDMQPQWMALMCMAQGSCEVKETIFDHRFQHVKELRKLGANLECTGKNVVRVQGV 359
YP FPTDMQ Q+MAL C+A G+ ++ETIF++RF H EL++LGA++ G V V+GV
Sbjct 298 YPAFPTDMQAQFMALNCVADGAAMIRETIFENRFMHAVELQRLGADIRIDGNTAV-VRGV 356
Query 360 DLSLMQPSLVQATDLRAAAALLLVGLA--GETVIQDIHHLERGYE 402
+ +Q + V ATDLRA+A+L++ GL GETVI+ I+HL+RGYE
Sbjct 357 E--RLQGATVMATDLRASASLVVAGLVAEGETVIERIYHLDRGYE 399
BLASTP results for scaffold_85_5 AroB
###############################################################################
Score E
Sequences producing significant alignments: (Bits) Value
gb|ABF61768.1| chloroplast 3-dehydroquinate synthase [Karlodiniu 245 2e-63
gb|ABF61766.1| chloroplast 3-dehydroquinate synthase/O-methyl... 244 4e-63
gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase ... 218 3e-55
ref|ZP_00514848.1| 3-dehydroquinate synthase [Crocosphaera wa... 158 3e-37
ref|ZP_01727903.1| 3-dehydroquinate synthase [Cyanothece sp. ... 148 3e-34
>gb|ABF61766.1| chloroplast 3-dehydroquinate synthase/O-methyltransferase fusion
[Heterocapsa triquetra]
Length=951
Score = 244 bits (569), Expect = 4e-63
Identities = 96/169 (56%), Positives = 116/169 (68%), Gaps = 35/169 (20%)
Query 1 VAIIDDKVDKLYGEPLKLYFDTHNI--KLWKLVFPGNEVDKDISAVEKMLVELKKIKVS- 57
VA++DDKV+ LYG+ L YF H I K KL+F GNEVDKDI VE++LV LKK S
Sbjct 322 VAVVDDKVEALYGKDLDAYFAHHGIEYK--KLIFSGNEVDKDIRDVERILVALKK---SG 376
Query 58 --RDQPILVMGGGVISDIAGFAAALYHRNTPYV-------------------------KN 90
R +P+LV+GGGVI+DIAGFAAALY RNTPYV KN
Sbjct 377 QGRHEPLLVVGGGVIADIAGFAAALYSRNTPYVMLCTSIVSGIDAGPSPRVCCNGFDYKN 436
Query 91 LYGSYHPPVLTLTDRSFFRTLHHGWIRHGIAEIVKMAVVKDEELFNLLE 139
LYG+YHPPVLT+TDR F++TLH GW+RHG+AEI+KMAV+KD LF L+E
Sbjct 437 LYGAYHPPVLTITDRGFWKTLHPGWLRHGVAEIIKMAVMKDLSLFELME 485
>gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase fusion [Oxyrrhis marina]
Length=864
Score = 218 bits (508), Expect = 3e-55
Identities = 92/165 (55%), Positives = 112/165 (67%), Gaps = 27/165 (16%)
Query 1 VAIIDDKVDKLYGEPL-KLYFDTHNIKLWKLVFPGNEVDKDISAVEKMLVELKKIKVSRD 59
VA++D VD+ +GE L K YF H ++L KLV+ E DKDIS VE++L +LK VSR+
Sbjct 231 VAVVDQFVDEKWGEDLCK-YFAHHGVELTKLVYRAMEADKDISTVEEILKDLKMHSVSRN 289
Query 60 QPILVMGGGVISDIAGFAAALYHRNTPYV-------------------------KNLYGS 94
+P+L++GGGVI+D+ GFA ALYHRNT YV KNLYG+
Sbjct 290 EPVLIVGGGVIADVGGFATALYHRNTAYVMLCTSIVSGIDAGPSPRTCCDGFGYKNLYGA 349
Query 95 YHPPVLTLTDRSFFRTLHHGWIRHGIAEIVKMAVVKDEELFNLLE 139
YHPPVLTLTDR+FF TL GW+RHGIAEIVKMAVVKD LF LLE
Sbjct 350 YHPPVLTLTDRTFFNTLKEGWVRHGIAEIVKMAVVKDLSLFELLE 394
BLASTP results: scaffold_85 translated against fusion protein from Oxyrrhis marina
#######################################################################
Score E
Sequences producing significant alignments: (Bits) Value
gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase ... 973 0.0
>gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase fusion [Oxyrrhis marina]
Length=864
Score = 973 bits (2288), Expect = 0.0
Identities = 434/760 (57%), Positives = 528/760 (69%), Gaps = 85/760 (11%)
Query 143 YQTNADYLEKADPHAVYPTSIYRSCEGYVLAKPDASLIESTMSTTLTTTIKIVNDVLDPE 202
Y +NA YLE ADPHAV+PTSIYR C+G+V A DAS+IE MST++TTTIKI VL+P
Sbjct 157 YSSNAAYLECADPHAVFPTSIYRMCDGHVHANQDASVIEGVMSTSITTTIKIQTGVLNPS 216
Query 203 NKELK--NAYKPFGRCVAIIDDKVDKLYGEPL-KLYFDTHNIKLWKLVFPGNEVDKDISA 259
N L YKP G+CVA++D VD+ +GE L K YF H ++L KLV+ E DKDIS
Sbjct 217 N--LTLCKVYKPIGKCVAVVDQFVDEKWGEDLCK-YFAHHGVELTKLVYRAMEADKDIST 273
Query 260 VEKMLVELKKIKVSRDQPILVMGGGVISDIAGFAAALYHRNTPYVMLCTSIVSGIDAGPS 319
VE++L +LK VSR++P+L++GGGVI+D+ GFA ALYHRNT YVMLCTSIVSGIDAGPS
Sbjct 274 VEEILKDLKMHSVSRNEPVLIVGGGVIADVGGFATALYHRNTAYVMLCTSIVSGIDAGPS 333
Query 320 PRTCCDGFGFKNLYGSYHPPVLTLTDRSFFRTLHHGWIRHGIAEIVKMAVVKDEELFNLL 379
PRTCCDGFG+KNLYG+YHPPVLTLTDR+FF TL GW+RHGIAEIVKMAVVKD LF LL
Sbjct 334 PRTCCDGFGYKNLYGAYHPPVLTLTDRTFFNTLKEGWVRHGIAEIVKMAVVKDLSLFELL 393
Query 380 EQVSSTLVWTKFGT---EIDDLSGDKETFENVCDLIIGKALEGYVRSEYGNLWETHQCRP 436
E+ S L+ TKFGT E D E F +CD IIGKA+EGYV+SEYGNLWETHQCRP
Sbjct 394 EKAGSRLITTKFGTTCPE------DTE-FGEMCDAIIGKAMEGYVKSEYGNLWETHQCRP 446
Query 437 HAYGHTWSPGYELPAGMLHGHAVATGMGFGAYLSFCNDWITKQELYRILNLLSGLELSLW 496
HAYGHTWSPGYE+PAGMLHGHAVATGMGFGA+L+F +IT+ E RI+ L+S LELSLW
Sbjct 447 HAYGHTWSPGYEIPAGMLHGHAVATGMGFGAHLAFREGFITEGESRRIMKLISDLELSLW 506
Query 497 HPVMCDNMK-IYKAQEKMIEKRGGNLAAPIPKG-IGNCGYLNHLPFELLQKRLREYKEIC 554
HP++ D+ ++ +QEKM++KRGGNL AP+PKG IG CGY+N + E L+K + EYK +C
Sbjct 507 HPIL-DDTDVVWASQEKMVQKRGGNLCAPVPKGQIGVCGYINDVSRERLEKTMAEYKTVC 565
Query 555 QEFPRKGLGIEAHCKDVGLEDPATVGGV--NKELPEENVDAVSNGDAVPNGDAVPNGCEN 612
QEFPR G+GI+ HC DVGLE P T GV K EE+ A G +VP+G A
Sbjct 566 QEFPRAGVGIDPHCHDVGLEHPGTT-GVCKKK---EEDQAAAEEG-SVPSGAA------- 613
Query 613 GIENENKKRKINLSYQEWIEKVQKKRN-------GGITRKVSLKQAE-DTPHPPEFEPNQ 664
LSY EWIE+ Q++R GG AE PP F+ N
Sbjct 614 ------------LSYNEWIEQCQQQRASSHTERLGG---------AEGGAAKPPVFDENT 652
Query 665 L--CRP--EDYAGDLSEPP--SADIQKIAII--TEQQQMFVPCMVGHLESQFLKMMAQIA 716
L P E YA LS+ S D+ A++ T+++ +F PCMVG LE QFLKM A+
Sbjct 653 LFY--PVVEAYA--LSQTTLGSKDVN--AVVESTDKEGLFAPCMVGQLEGQFLKMFAKST 706
Query 717 NAKRVLDVGTFTGMSAMAFAEGIPPDGQVVTIEFDQTIASTADKLFRD-SAQAHKLALKV 775
A RVLDVGTFTG SA++FAEGI G+VVT+E D IA A LF D SAQ K+ L V
Sbjct 707 KASRVLDVGTFTGYSALSFAEGIAAGGKVVTLESDTKIAGVAKSLF-DGSAQKEKIELIV 765
Query 776 GDAVDVMTDL---KSAQEKFDIIFLDAAKDQYITYYHLAL-SMLTPTGFILADNSLCALL 831
GDA M L K Q+ FDI+FLDA K+ Y+TYY L + +L P G ILADNSLC+L+
Sbjct 766 GDARAAMRKLLEDK--QQ-FDIVFLDADKENYVTYYDLTMDGLLAPGGVILADNSLCSLV 822
Query 832 YDPDDSRRQALHDFNQLVKNDKRVEQLALPFREGVSIIRP 871
Y D RRQ LHDFN+ V+ D RVEQ+ L REG+++I+P
Sbjct 823 YTEGDERRQKLHDFNEHVRKDARVEQVVLTVREGITLIQP 862
Score = 49.0 bits (108), Expect = 0.002
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 0/40 (0%)
Query 50 LLINVSRLAVFSKELAKSFRGDLDSLTLIKQLKYFYNQPI 89
+LI VSRL VFS E+A + DL L +K LKYFY+ P+
Sbjct 67 MLIYVSRLPVFSAEVAGELKADLGLLAAVKHLKYFYSIPV 106
Malate synthase like protein on the left side of AroB-methylase fusion protein(most similar to Strongylocentrotus purpuratus hypothetical protein)
Score E
Sequences producing significant alignments: (Bits) Value
ref|XP_782946.2| PREDICTED: hypothetical protein [Strongyloce... 564 4e-159 UniGene infoGene info
ref|XP_001512361.1| PREDICTED: hypothetical protein [Ornithorhyn 554 5e-156
ref|XP_001377783.1| PREDICTED: hypothetical protein [Monodelphis 553 9e-156
emb|CAF91513.1| unnamed protein product [Tetraodon nigroviridis] 551 5e-155
ref|XP_685378.1| PREDICTED: hypothetical protein isoform 1 [Dani 529 2e-148
ref|XP_788098.1| PREDICTED: hypothetical protein, partial [St... 480 1e-133
gb|EAT45696.1| malate synthase [Aedes aegypti] 462 3e-128
ref|ZP_01532792.1| Malate synthase A [Roseiflexus castenholzi... 377 1e-102
###############################################################################
RuvB-like protein on the right side of AroB-methylase fusion protein.
Sequences producing significant alignments: (Bits) Value
ref|NP_062659.1| RuvB-like protein 1 [Mus musculus] >ref|NP_6... 761 0.0 UniGene infoGene info
ref|XP_001366708.1| PREDICTED: similar to TIP49 [Monodelphis dom 760 0.0
gb|AAP36457.1| Homo sapiens RuvB-like 1 (E. coli) [synthetic ... 760 0.0
ref|NP_003698.1| RuvB-like 1 [Homo sapiens] >ref|XP_848712.1|... 760 0.0
AroA
Query sequence: scaffold_33_6
Accession: [none]
Description: [none]
Scores for sequence family classification (score includes all domains):
Model Description Score E-value N
-------- ----------- ----- ------- ---
EPSP_synthase EPSP synthase (3-phosphoshikimate 1-c 282.6 8.4e-85 2
Parsed for domains:
Model Domain seq-f seq-t hmm-f hmm-t score E-value
-------- ------- ----- ----- ----- ----- ----- -------
EPSP_synthase 1/2 38577 38962 .. 1 425 [. 271.8 1.5e-81
EPSP_synthase 2/2 39293 39314 .. 430 451 .] 10.8 1e-05
AroB
Query sequence: scaffold_3395_1
Accession: [none]
Description: [none]
Scores for sequence family classification (score includes all domains):
Model Description Score E-value N
-------- ----------- ----- ------- ---
DHQ_synthase 3-dehydroquinate synthase 385.3 1e-115 2
Parsed for domains:
Model Domain seq-f seq-t hmm-f hmm-t score E-value
-------- ------- ----- ----- ----- ----- ----- -------
DHQ_synthase 2/2 1944 2248 .. 1 339 [] 385.2 1.1e-115
AroC
Query sequence: scaffold_3300_6
Accession: [none]
Description: [none]
Scores for sequence family classification (score includes all domains):
Model Description Score E-value N
-------- ----------- ----- ------- ---
Chorismate_synt Chorismate synthase 623.6 4.1e-195 1
Parsed for domains:
Model Domain seq-f seq-t hmm-f hmm-t score E-value
-------- ------- ----- ----- ----- ----- ----- -------
Chorismate_synt 1/1 220 560 .. 1 401 [] 623.6 4.1e-195
AroE
Query sequence: scaffold_5070_2
Accession: [none]
Description: [none]
Scores for sequence family classification (score includes all domains):
Model Description Score E-value N
-------- ----------- ----- ------- ---
Shikimate_dh_N Shikimate dehydrogenase substrate bin 101.4 3e-30 1
Shikimate_DH Shikimate / quinate 5-dehydrogenase 24.7 4.4e-09 1
Parsed for domains:
Model Domain seq-f seq-t hmm-f hmm-t score E-value
-------- ------- ----- ----- ----- ----- ----- -------
Shikimate_dh_N 1/1 2168 2250 .. 1 84 [] 101.4 3e-30
Shikimate_DH 1/1 2279 2369 .] 22 157 .. 24.7 4.4e-09
AroA from scaffold_5181 most similar to Tenacibaculum sp. MED152
Score E
Sequences producing significant alignments: (Bits) Value
ref|ZP_01119134.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 376 1e-102
ref|ZP_01053050.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 369 1e-100
ref|YP_860405.1| 3-phosphoshikimate 1-carboxyvinyltransferase... 320 7e-86
ref|ZP_01059205.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 315 2e-84
ref|ZP_01890962.1| putative 3-phosphoshikimate 1-carboxyvinyl... 311 2e-83
ref|ZP_00951385.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 311 3e-83
ref|ZP_01107281.1| putative 3-phosphoshikimate 1-carboxyvinyl... 307 5e-82
emb|CAC82655.1| 5-enolpyruvylshikimate 3-phosphate synthase [... 306 1e-81
ref|ZP_01122370.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 302 2e-80

########################################################################################
AroB from scaffold_3395 most similar to Tenacibaculum sp. MED152
Score E
Sequences producing significant alignments: (Bits) Value
ref|ZP_01052169.1| 3-dehydroquinate synthase [Tenacibaculum s... 466 1e-129
ref|ZP_01118687.1| 3-dehydroquinate synthase [Polaribacter ir... 445 2e-123
ref|ZP_01061679.1| putative 3-dehydroquinate synthase [Flavob... 378 3e-103
ref|ZP_01051584.1| putative 3-dehydroquinate synthase [Cellul... 378 4e-103
ref|ZP_01106521.1| 3-dehydroquinate synthase [Flavobacteriale... 375 3e-102
ref|YP_861532.1| 3-dehydroquinate synthase [Gramella forsetii... 374 7e-102
ref|YP_001195153.1| 3-dehydroquinate synthase [Flavobacterium... 370 7e-101

########################################################################################
AroC from scaffold 3300 most similar to Flavobacteria, among them Tenacibaculum sp. MED152
Score E
Sequences producing significant alignments: (Bits) Value
ref|ZP_01734660.1| chorismate synthase [Flavobacteria bacteri... 586 6e-166
emb|CAL42868.1| Chorismate synthase [Flavobacterium psychrophilu 577 4e-163
ref|YP_001196894.1| Chorismate synthase [Flavobacterium johns... 567 5e-160
ref|YP_861667.1| chorismate synthase [Gramella forsetii KT080... 542 1e-152
ref|ZP_01060121.1| chorismate synthase [Flavobacterium sp. ME... 538 2e-151
ref|ZP_01050894.1| chorismate synthase [Cellulophaga sp. MED1... 535 2e-150
ref|ZP_01891850.1| chorismate synthase [unidentified eubacter... 533 7e-150
ref|ZP_01106758.1| chorismate synthase [Flavobacteriales bact... 525 1e-147
ref|ZP_00950854.1| chorismate synthase [Croceibacter atlantic... 514 2e-144
ref|ZP_01121103.1| chorismate synthase [Robiginitalea biforma... 508 2e-142
ref|ZP_01052472.1| chorismate synthase [Tenacibaculum sp. MED... 508 3e-142
ref|ZP_01118000.1| chorismate synthase [Polaribacter irgensii... 502 1e-140
ref|ZP_01202724.1| chorismate synthase (5-enolpyruvylshikimat... 501 3e-140

########################################################################################
AroE from scaffold 5070 most similar to Tenacibaculum sp. MED152
Score E
Sequences producing significant alignments: (Bits) Value
ref|ZP_01733030.1| putative shikimate 5-dehydrogenase [Flavob... 296 4e-79
ref|YP_001194935.1| Shikimate dehydrogenase substrate binding... 235 2e-60
ref|ZP_01052545.1| shikimate 5-dehydrogenase [Tenacibaculum s... 228 2e-58
ref|ZP_01117836.1| putative shikimate 5-dehydrogenase [Polari... 228 2e-58

######################################################################################
(a) Sequence of the 985-bp 16S fragment of N. vectensis from StellaBase (http://evodevo.bu.edu/stellabase) in entry c439003225.Contig1 (length = 1,241 bp) aligned to the genus Pseudomonas
tgacgttacctacagaagaagcaccggctaactccgtgccagcagccgcggtaatacggagggtgcaagcgttaatcggaattactgggcgtaaagcgcgcgtaggcggctaggtcagttggatgtgaaatccccgggctcaacctgggaattgcatccaaaactgcctggctagagtacagaagagggtggtggaatttcctgtgtagcggtgaaatgcgtagatataggaaggaacatcagtggcgaaggcggccacctggactgatactgacactgaggtgcgaaagcgtggggagcaaacaggattagataccctggtagtccacgccgtaaacgatgtcaactagccgttgggagtcttgaactcttagtggcgcagctaacgcattaagttgaccgcctggggagtacggccgcaaggttaaaactcaaatgaattgacgggggcccgcacaagcggtggagcatgtggtttaattcgaagcaacgcgaagaaccttacctggccttgacatgctgagaactttctagagatagattggtgccttcgggaactcagacacaggtgctgcatggctgtcgtcagctcgtgtcgtgagatgttgggttaagtcccgtaacgagcgcaacccttgtccttagttaccagcacgtaatggtgggaactctaaggagactgccggtgacaaaccggaggaaggtggggatgacgtcaagtcatcatggcccttacggccagggctacacacgtgctacaatggtcggtacaaagggttgccaagccgcgaggtggagctaatcccataaaaccgatcgtagtccggatcgcagtctgcaactcgactgcgtgaagtcggaatcgctagtaatcgtgaatcagaatgtcacggtgaatacgttcccgggccttgtacacaccgcccgtcacaccatgggagtgggttgcaccagaagtagctagtctaaccttcgggaggacggt
(b) Sequence of the 720-bp 16S fragment of N. vectensis from StellaBase (http://evodevo.bu.edu/stellabase) in entry c429301624.Contig1 (length = 5,682 bp) aligned to the family Flavobacteriaceae used in construction of Fig. 1
GCCGTAAACGATGGATACTAGCTGTTCGGATTTCGGTCTGAGTGGCTAAGCGAAAGTGATAAGTATCCCACCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGATGATACGCGAGGAACCTTACCAGGGCTTAAATGGGAGACGACGTTATTGGAAACAGTAATTTCTTCGGACGTCTTTCAAGGTGCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCAGGTTAAGTCCTATAACGAGCGCAACCCCTGTCGTTAGTTGCCAGCGAGTGATGTCGGGAACTCTAACGAGACTGCCGGTGCAAACCGTGAGGAAGGTGGGGATGACGTCAAATCATCACGGCCCTTACGTCCTGGGCCACACACGTGCTACAATGGCCGGTACAGAGAGCAGCTACATGGTGACATGATGCGAATCTTCAAAACCGGTCTCAGTTCGGATCGGAGTCTGCAACTCGACTCCGTGAAGCTGGAATCGCTAGTAATCGGATATCAGCCATGATCCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCAAGCCATGGAAGCTGGGGGTACCTGAAGTCGGTGACCGTAAGGAGCTGCCTAGGGTAAAACTGGTAACTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGAACACCTCCTTT
Carsonella ruddii
length 159,662 bp
Result: one putative shikimate gene:
>sp|Q5NGG6|AROC_FRATT Chorismate synthase (EC 4.2.3.5) (5-enolpyruvylshikimate-3-phosphate phospholyase)
Francisella tularensis subsp. tularensis.
Length: 352 aa
Query frame: +3
Score: 901, Expect: 1e-68
Identical: 151/347 (43%), Positive: 233/347 (67%)
Indels: 5/347 (1%), Gaps: 3
Q: 74478 NSYGEIIKISTFGESHGLIIGALIDGFFSNLYISEKFIQKNLNLRKPFTSLFSTQRREQD 74657
|++|+| ++| ||||| + |+||| ||+ + | || |+ ||| | |+|||+| |
D: 4 NTFGKIFTVTTCGESHGDSLAAIIDGCPSNIPLCEADIQLELDRRKPGQSKFTTQRKEPD 63
Q: 74658 KVKIFTGIFKNKTTGAPVLMLIKNNDKQSSDYNNISLNFRPGHADYTYFLKYKFRDYRGG 74837
+||| +|+|+ |||| |+ ++||| |++| ||+ | ||||||||||| || ||||||
D: 64 EVKIISGVFEGKTTGTPIGLIIKNQDQKSKDYSEIKDKFRPGHADYTYFKKYGIRDYRGG 123
Q: 74838 GRSSARETACRVASGCVFKNLIYNKGVIVRSYIKKIGFLKINFKYWNYTLNR--FFSNLL 75011
||||||||| |||+| + | ++ + |+ + + +|| |||+| ++ | +|
D: 124 GRSSARETAMRVAAGAIAKKILKHYGIEIYGFCSQIGSLKIDFIDKDFINQNPFFIANKN 183
Q: 75012 FINEIKDIINNCKNSCNSLSSEIVIIINGLEPSLGDPLYKKINSTISNYLLSINATKSIC 75191
+ +|+|++ + +|+ +|+ ++ ||| || |++ +++++|+ ++|||| |++
D: 184 AVPACEDLIHSIRKQGDSIGAEVTVVATGLEAGLGRPVFDRLDASIAYAMMSINAVKAVS 243
Q: 75192 FG--FNFKNKNSFQVKDEI-KNSGFTSNNNGGILAGITNGQPLVIKILFKPTSSTSRKIK 75362
| |+ + | +||| + || ||+ |||| ||+ || ++ |+ |||||| + |
D: 244 IGDGFDCVAQKGSQHRDEITQQQGFLSNHAGGILGGISTGQDIIAKLAFKPTSSILQPGK 303
Q: 75363 TINEKLKNITNKTYGRHDPCVGLRAVPVIESMLYTILINKILKKKIY 75503
+|+ + + | | ||||||||+| ||+ |+|| +|++++| + |
D: 304 SIDVQGNDTTVITKGRHDPCVGIRGVPIAEAMLALVLVDELLITRSY 350
sequence:
>putative shikimate
aaaaaaaactataaaaattataaattattataatgaataattcatacggtgaaattatta
aaatttcaacttttggagaaagtcatggtttaattattggtgctttaattgatggttttt
tttcaaatttatatattagtgaaaaatttattcaaaaaaatttaaacttaagaaaaccat
ttacttcattattttcaacacaaagaagagaacaagacaaagttaaaattttcaccggaa
tttttaaaaataaaacaacaggcgcacctgtattaatgttaataaaaaataatgataaac
aaagttcagattataataatataagtttaaattttagacctggacatgcagactatactt
attttttaaagtataaatttagagattatagaggtggaggtagatctagtgctagagaaa
cagcttgcagagttgcaagtggatgtgtgtttaaaaatttgatttataataaaggagtta
ttgttcgttcatatattaaaaaaattggttttttaaaaataaattttaaatattggaatt
atacattaaatagatttttttcaaatttattatttataaatgagattaaagatataatta
ataattgtaaaaattcatgcaattcgttaagttcagaaattgtaattattatcaacggtc
ttgaaccaagtttgggagatcctctttataaaaaaattaattctactatttctaattatt
tgttaagtattaatgcaactaaaagtatttgctttggttttaactttaaaaataaaaact
catttcaagtaaaagatgaaattaaaaattctggatttacttcaaacaataatggaggaa
tattagctggaataactaatggacaacctttagtaatcaaaatattatttaaacctacat
ctagtacttctagaaaaataaaaacaataaacgaaaaattaaaaaatattacaaataaaa
cttatggaagacatgatccttgtgttggtttaagagctgtaccagtaattgaatctatgt
tatatacaatattaataaataaaattttaaaaaaaaaaatt
Buchnera aphidicola str. Cc (Cinara cedri)
length: 416,380 bp
Results:3 previously identified shikimate proteins:
>gnl|BL_ORD_ID|1759 tr|Q5EU83|Q5EU83_9ENTR Shikimate 5-dehydrogenase
Buchnera aphidicola (Cinara cedri).
Length: 280 aa
Query frame: -1
Score: 1714, Expect: 0
Identical: 280/280 (100%), Positive: 280/280 (100%)
Indels: 0/280 (0%), Gaps: 0
Q: 353875 MQNSKYCEKKIHIALFGNPIEHSLSPLIHKNFSKEIKINYNYNSFLCTKSNFFVIVKNFF 353696
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
D: 1 MQNSKYCEKKIHIALFGNPIEHSLSPLIHKNFSKEIKINYNYNSFLCTKSNFFVIVKNFF 60
Q: 353695 QNGGFGCNITVPFKKKSFQISNKNTKYVKISNSVNVLKKSSNNNIIGYNTDGIGLIYDLN 353516
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
D: 61 QNGGFGCNITVPFKKKSFQISNKNTKYVKISNSVNVLKKSSNNNIIGYNTDGIGLIYDLN 120
Q: 353515 RLKYITENSFILILGSGGAVYSIVYHLLKKKCCIFILNRTISKSCILVNKFKKFGKIFVF 353336
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
D: 121 RLKYITENSFILILGSGGAVYSIVYHLLKKKCCIFILNRTISKSCILVNKFKKFGKIFVF 180
Q: 353335 DKNLYTKKFDIIINATSCGLYNFSPKFPKNLIFPNTKCYDISYSKNKKLTPFLSTCRDLG 353156
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
D: 181 DKNLYTKKFDIIINATSCGLYNFSPKFPKNLIFPNTKCYDISYSKNKKLTPFLSTCRDLG 240
Q: 353155 SRKYSDGLGMLVAQAAYSCYIWFNILPNIKKNINLLKSII 353036
||||||||||||||||||||||||||||||||||||||||
D: 241 SRKYSDGLGMLVAQAAYSCYIWFNILPNIKKNINLLKSII 280
sequence:
>gnl|BL_ORD_ID|1759 tr|Q5EU83|Q5EU83_9ENTR Shikimate 5-dehydrogenase - Buchnera aphidicola (Cinara cedri)
atgcaaaatagtaaatattgtgaaaaaaaaattcatattgctttgttcggaaatcccata
gagcattctttatctccattaatacataaaaatttttctaaagagataaaaattaattat
aattataactcttttttatgtacaaaatcaaatttttttgttatagtaaaaaattttttt
caaaatggtggttttggatgcaacattactgttccttttaaaaaaaaatcatttcaaatt
tcaaataaaaatactaaatatgtaaaaatttctaattcggtaaatgttttaaaaaaaagt
tcaaataataatattattggttacaacacggatggaattggattaatttatgatttaaat
cgtttaaaatatatcacagaaaattcttttattttaatattaggttcaggtggagctgta
tattctattgtatatcatttattaaaaaaaaaatgttgtatttttattttaaatcgaaca
attagtaaatcatgtattttagtaaataaatttaaaaaatttggaaaaatttttgttttt
gataaaaatttatatacaaaaaaatttgatattattattaatgccacttcatgcgggtta
tataatttctctcctaaatttccaaaaaatttgatatttccaaatacaaaatgttatgat
atttcttattcaaaaaataaaaaattaaccccttttttatctacttgtagagatttaggt
agtagaaaatattctgatggcttaggaatgttagtagcacaagcagcatattcatgttat
atatggtttaacatactaccaaacattaaaaaaaatattaatttattaaaatctattata
>sp|Q9ZHE9|AROC_BUCAP Chorismate synthase (EC 4.2.3.5) (5-enolpyruvylshikimate-3-phosphate phospholyase)
Buchnera aphidicola subsp. Schizaphis graminum.
Length: 353 aa
Query frame: -1
Score: 1591, Expect: 0
Identical: 247/353 (69%), Positive: 300/353 (84%)
Indels: 0/353 (0%), Gaps: 0
Q: 69088 MPGNSIGKIFKVTTCGESHGPMLAGIIDGVPPGLSLNNKDIQYELNRRRPGFSKFTSQRR 68909
| ||+|||+|+||| ||||| | +|||+|||| |++ |+||+||||||| |++|+||
D: 1 MAGNTIGKVFRVTTFGESHGTALGCVIDGMPPGLELSSDDLQYDLNRRRPGTSRYTTQRS 60
Q: 68908 EKDKVEIFSGIFKGITTGTSIGIRIKNIDIRSQDYSEIKNLYRPNHADYTYEKKYGIRDY 68729
| |+|+|+||+||| |||||||+ |+| | |||||||||+|+|| |||||||||||||||
D: 61 ELDEVQILSGVFKGTTTGTSIGLVIQNKDQRSQDYSEIKDLFRPGHADYTYEKKYGIRDY 120
Q: 68728 RGGGRSSARETAIRVAAGAIAKKYLKLQHNIKIRGYLSQIGSIYCPFQSWEEVEKNPFFC 68549
||||||||||||+|||||+|||||||+| | || ||| +| | |||+||||||+|||||
D: 121 RGGGRSSARETAMRVAAGSIAKKYLKIQTGIVIRAYLSAMGDIKCPFESWEEVEQNPFFC 180
Q: 68548 SNSEKIKKIIHFIKKLKKSGNSVGAKITIIAKNVPIGLGEPVFDRLNAEIAHSIMSINAA 68369
|| |+ ++ +||||||+|+|+||+|||||+|||+|+||||||||+|++||++||||||
D: 181 SNKNKVFQLEELIKKLKKTGDSIGAEITIIAQNVPVGFGEPVFDRLDADLAHALMSINAA 240
Q: 68368 KSIEIGDGIHVAKQTGVEHRDEILPNGFSSNHSGGILGGISNGEEIIVHAAFKPTSSIKI 68189
| +||||| | | | |+|||+ |||| ||| ||||||||||| | + ||||||||+
D: 241 KGVEIGDGFSVVNQKGSENRDEMTPNGFKSNHCGGILGGISNGENIFLKVAFKPTSSIRQ 300
Q: 68188 PGKTIDTFGKKRFIITKGRHDPCVGIRAVPIAEAMLAITLMDHVLRFKAQCGK 68030
| ||+ +| |+ |||||||||||||||||||+|| ||||+|||+||| |
D: 301 SGNTINKNNEKVKIVIKGRHDPCVGIRAVPIAEAMVAIVLMDHLLRFRAQCAK 353
sequence:
> chorismate synthase [Buchnera aphidicola str. Cc (Cinara cedri)
atgccgggaaattcaattggaaaaatatttaaagttactacatgtggggaatcacatgga
cctatgttagcaggaattattgatggagttccgcctggtttatctttaaacaataaagat
attcaatatgaattaaatagaagaagaccgggtttttctaaatttacatcacaaagaaga
gaaaaagataaagtagaaatattttcaggaatatttaaaggaataactactggaacaagt
ataggcataagaataaaaaatatagatattagatcacaagattattcagaaattaaaaat
ttatatcgacctaatcatgctgattatacatatgaaaaaaaatatggaatcagagattat
agaggtggaggaagatcctcagctagagaaacagctattcgagtagctgctggagctatt
gctaaaaaatatttaaaattacaacataatataaaaattagaggatatttgtcacaaata
ggatcaatttattgtccatttcaatcttgggaagaagtagaaaaaaatccttttttttgt
agtaattcagaaaaaataaaaaaaattatacattttatcaaaaaattaaaaaaaagtgga
aattcagtaggagcaaaaattactattattgctaaaaatgttcctattggattaggagaa
cctgtatttgatagactaaatgctgaaattgcacattctataatgagtattaatgctgct
aaatctattgaaattggagatggaattcatgtagctaaacaaacaggagttgaacataga
gatgaaattctacctaacggattttctagcaatcattccggaggaatattaggtggtatt
agtaacggggaagaaattattgtacatgcagcttttaaacctacatctagtattaaaata
ccgggaaaaacaatagatacatttggaaaaaaaagatttattataacaaaaggaagacat
gatccttgtgtaggaattcgagctgttccaattgcagaagcaatgttagcaattacatta
atggaccatgttttacgtttcaaagctcaatgtggaaaa
>sp|Q59178|AROA_BUCAP 3-phosphoshikimate 1-carboxyvinyltransferase (EC 2.5.1.19) (5- enolpyruvylshikimate-3-phosphate synthase) (EPSP synthase) (EPSPS)
Buchnera aphidicola subsp. Schizaphis graminum.
Length: 428 aa
Query frame: -1
Score: 1573, Expect: 0
Identical: 260/429 (60%), Positive: 324/429 (75%)
Indels: 3/429 (0%), Gaps: 2
Q: 225559 MQDSLTLKPVDYIQGKINIPGSKSISNRVLLLSALSNGKTILKNLLYSDDIKYMLKALLK 225380
|| | |||| || | | +|||||||||||||||++|| | | ||| | | +||| || |
D: 1 MQKFLELKPVSYINGTIYLPGSKSISNRVLLLSAMANGITCLTNLLDSQDTQYMLNALRK 60
Q: 225379 LGIFYKLDKKKSKCTIYGISDAFSVKNKIKLFLGNAGTAMRPLLAILSLKKNKIILTGEK 225200
+|| + | + | ++|| || + + | ||||||||||||||| ||| +| ++|+|+
D: 61 IGIKFFLSNNNTTCHVHGIGKAFHLSHPISLFLGNAGTAMRPLLAALSLYENNVVLSGDD 120
Q: 225199 RMKERPIHHLVDSLRQGGANITYKNKKKFPPLYIKGGFKGGKIFIDGSISSQFLSSLLMA 225020
|| |||| ||||+|+|||| + || +||+ ||||||| | +|||||||||+||||
D: 121 RMHERPIAHLVDALKQGGATLEYKKGIGYPPVLTKGGFKGGSIMLDGSISSQFLTSLLMV 180
Q: 225019 APLAELDTEIIVKNQLVSKPYINLTINLMEKFGISVSILND-YKHFYIKGNQKYISPKKY 224843
|||| +| | +| |||||||++|+|||+ || |+|+|| || ||||||||| || |
D: 181 APLALQNTNIFIKGNLVSKPYIDITLNLMKSFG--VNIVNDCYKSFYIKGNQKYESPGNY 238
Q: 224842 YIESDLSSATYFLAAAAIKGGSIQINGIQKKSIQGDINFIKILKQMGVSIQWKKNSVICK 224663
+| | |||+||||||||||||+++ |+ |||+|||| | +|++|| | | + ++|+
D: 239 LVEGDASSASYFLAAAAIKGGSVKVVGVGKKSVQGDIKFADVLEKMGAIIDWGDSFIVCR 298
Q: 224662 KNKLLGITVDCNHIPDAAMTIAILGVFSKKKVYIKNIYNWRVKETDRIYAMSTELKKIGA 224483
||| | +| ||||||||||||+ +|+| ||||||||||||||+ ||| ||||+||
D: 299 HNKLEKIDLDMNHIPDAAMTIAIVALFAKGTSIIKNIYNWRVKETDRLSAMSKELKKVGA 358
Q: 224482 RVITGKDYIKVYPVKNFIHAKINTYNDHRIAMCFSLISLSGTSVTLLNPKCVNKTFPSFF 224303
+ |+| + + | | |+|+||||||+||||||| ||| || +||| |++|||||+|
D: 359 IIKEGRDCLSITPPNFFKFAEIDTYNDHRMAMCFSLICLSGISVRILNPNCISKTFPSYF 418
Q: 224302 KNFYSICHY 224276
+|| | +
D: 419 ENFLKISRF 427
sequence:
>3-phosphoshikimate 1-carboxyvinyltransferase [Buchnera aphidicola
str. Cc (Cinara cedri)]
atgcaagatagtttaactttaaaaccagtagattatattcaaggaaaaattaatattcca
ggttcaaaaagtatttctaatcgtgttcttttattatcagctttatctaatggaaaaaca
attttaaaaaatttgttatatagtgatgatattaaatatatgttaaaagctttattaaaa
ttaggtattttttataaattagacaaaaaaaaatctaagtgtactatttatggaatatct
gatgcattttctgtaaaaaataaaattaaattatttttaggtaatgctggtaccgctatg
cgtccactattagcaattttatcattaaaaaaaaataaaattatacttactggtgaaaaa
agaatgaaagaaagacctattcatcatttagtagactctttacgtcagggtggagcaaat
ataacttataaaaataaaaaaaaatttcctccattatatattaaaggtggttttaaaggt
ggaaaaatttttatagatggatctatttctagtcaatttttaagttctttattaatggcc
gctcctttagcagaattagatactgaaattatagtaaaaaatcaattagtatctaaacct
tatattaatttaacaataaatttaatggaaaaatttggtatatcagtaagtattttaaat
gattataaacatttctatataaaaggaaaccagaaatatatttctcctaaaaaatattat
attgaaagtgatctttcttctgctacttattttttagctgcggctgcaataaaaggtgga
tcaattcaaataaatggaatacaaaaaaaaagtattcaaggagacataaattttattaaa
attttaaaacaaatgggtgtatcaattcaatggaaaaaaaattcagttatttgtaaaaaa
aataagttattaggtattacagtagattgcaatcatatacctgatgcagctatgactata
gctattcttggagtattttctaaaaaaaaagtatatattaaaaatatatataattggaga
gttaaagaaactgatcgaatatatgctatgagtacagaattaaaaaaaatcggagctcga
gtaattacaggtaaagattatataaaagtctatccagtaaaaaattttatacatgctaaa
ataaatacttataatgatcatagaatagctatgtgtttttctttaatttcactgtctgga
acttctgtaactttactaaatccaaaatgtgttaataaaacatttccatcattttttaaa
aacttttattctatttgtcattat