Enzymes of the shikimic acid pathway encoded in the genome of a basal metazoan, Nematostella vectensis , have microbial origins

Starcevic et al. 10.1073/pnas.0707388105.

Supporting Information

Files in this Data Supplement:

SI Scheme 1
SI Dataset 1
SI Dataset 2
SI Dataset 3
SI Dataset 4
SI Dataset 5
SI Dataset 6
SI Dataset 7
SI Dataset 8
SI Dataset 9




SI Scheme 1

Scheme 1. Presumed biosynthetic pathway for the production of natural UV-suncreening agents (MAAs) in UV-tolerant marine algae via the plant shikimic acid pathway. Enzymes of the pathway are 1) 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase, 2) 3-dehydroquinate synthase, 3) 3-dehydroquinate dehydratase, 4) shikimate 5-dehydrogenase, 5) shikimate kinase, 6) 5-enolpyruvylshikimate-3-phosphate (EPSP) synthase, and 7) chorismate synthase. [Reconstructed with permission from ref. 1 (Copyright 2002, Annual Reviews, www.annualreviews.org).]

1. Shick JM, Dunlap WC (2002) Mycosporine-like amino acids and related Gadusols: Biosynthesis, acumulation, and UV-protective functions in aquatic organisms. Annu Rev Physiol 64:223-262.





Dataset 1

Query sequence: scaffold_33_6

Accession: [none]

Description: [none]

Scores for sequence family classification (score includes all domains):

Model Description Score E-value N

-------- ----------- ----- ------- ---

EPSP_synthase EPSP synthase (3-phosphoshikimate 1-c 282.6 8.4e-85 2

Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

-------- ------- ----- ----- ----- ----- ----- -------

EPSP_synthase 1/2 38577 38962 .. 1 425 [. 271.8 1.5e-81

EPSP_synthase 2/2 39293 39314 .. 430 451 .] 10.8 1e-05

EPSP_synthase: domain 1 of 2, from 38577 to 38962: score 271.8, E = 1.5e-81

*->vtggsrLnGeVkvPGSKSishRaLllAALAegeepstItNlLdsdDt

v+gg +L+G++++PG +++++++L++A+L+++ p+++tN + D+

scaffold_3 38577 VRGGYPLRGTIRIPG-AKNAALPLMAASLLTT-KPVRLTNIPKVTDV 38621

rlmleaLraLGaevieldeekevviveGlggqfeapyesdlvldlGNSGT

+m+ +L++ G++v++ +++ v ++++ + +++ s ++ S+

scaffold_3 38622 NAMAVILQSHGVAVEWR-PDDSLVLDARNAQGIPSI--SSTYAPIRSSIF 38668

amRpLlgrlalaqsnevvLtGddsi..geRPidrlldaLrqlGAeIesre

+++p gr+++a + +G+++i++g RPid ++ a r lGA ++ ++

scaffold_3 38669 TLGPAMGRFGEAM---IQVPGGCQIsqGGRPIDLHFYAMRKLGAIVDEES 38715

gegyaPlavrggglklggveidgsiSSqfvTslLmlApllAegdvttiie

g l v+++g +l+g+ i ++++S+++T + ++A+ l g ti+e

scaffold_3 38716 G-----L-VKTNGNRLRGARITFDKVSVGATINALMAACLVQG--KTILE 38757

nGklasePyiddTlnmLkkfGakiegsgtetsftvkGgqkYklpgveylV

n +a e +idd++ mL+k+Ga+i++ + +++ ++G+ l+g+++ V

scaffold_3 38758 N--AAMEAEIDDLVCMLRKMGAQIDKNRDTKTWFITGVSS--LHGADHGV 38803

egDaSsAayFlaAAaitgGStVlvenvginslqpGDiravlkvLedmGen

++D++ A+++++AA++tgG + +++ +g+ ++ p+ vl L+ +G

scaffold_3 38804 VPDRIVAGTYAVAAVMTGG-ELTLT-LGPCPV-PALMGCVLTCLRAAG-- 38848

eaevtqeedadivvgppvnsmLkglkgidvdintapDpapttAvlaafAe

aev + ++i+v++ g ++ v i t p+p+++t+++ + ++

scaffold_3 38849 -AEVMELA-EGIRVRG-------GRRPRSVSITTSPYPGFPTDMQPQWMA 38889

GtsrieGiselRvKEtDRlfamatELrklGaeveegpDGliiigsiitav

+ ++G e++++++D++f ++ ELrklGa+ e + G ++ +

scaffold_3 38890 LMCMAQGSCEVKETIFDHRFQHVKELRKLGANLECT--G------KNVVR 38931

vhGve..qLkgaevdtygDHRiAMafaLaGLv<-*

v Gv + +++ v ++ D+R+A+a+ L+GL+

scaffold_3 38932 VQGVDlsLMQPSLVQAT-DLRAAAALLLVGLA 38962

EPSP_synthase: domain 2 of 2, from 39293 to 39314: score 10.8, E = 1e-05

*->geviIddpectdksfPdFfekL<-*

ge++I d++++ ++++d +++L

scaffold_3 39293 GETVIQDIHHLERGYEDVVRVL 39314

#############################################################################

Query sequence: scaffold_85_5

Accession: [none]

Description: [none]

Scores for sequence family classification (score includes all domains):

Model Description Score E-value N

-------- ----------- ----- ------- ---

DHQ_synthase 3-dehydroquinate synthase 91.0 1.7e-28 2

Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

-------- ------- ----- ----- ----- ----- ----- -------

DHQ_synthase 1/2 134772 134859 .. 22 110 .. 60.4 7.3e-20

DHQ_synthase 2/2 134885 134941 .. 136 193 .. 30.6 2e-11

DHQ_synthase: domain 1 of 2, from 134772 to 134859: score 60.4, E = 7.3e-20

*->vvivtdetvaklygekveeaLkaaGfevevivipdGEtsKtletlek

v+i+ d+ v klyge+++ ++ +++ +v+p+ E K+++ +ek

scaffold_8 134772 VAII-DDKVDKLYGEPLKLYFDTHNIKLWKLVFPGNEVDKDISAVEK 134817

iydaLleagltRsdlliAlGGGvigDlaGFaAAtymRGipfi<-*

+ L +++R+ ++ GGGvi D+aGFaAA y+R p++

scaffold_8 134818 MLVELKKIKVSRDQPILVMGGGVISDIAGFAAALYHRNTPYV 134859

DHQ_synthase: domain 2 of 2, from 134885 to 134941: score 30.6, E = 2e-11

*->KNliGaFyqPkaVliDtdfLkTLPeRElraGmAEvISKygaIaDwel

KNl G++++P + l D f +TL +r+G+AE++ K+++++D+el

scaffold_8 134885 KNLYGSYHPPVLTLTDRSFFRTLHHGWIRHGIAEIV-KMAVVKDEEL 134930

fhwLeeeafal<-*

f++Le+ +++l

scaffold_8 134931 FNLLEQVSSTL 134941




Dataset 2

BLASTP results for AroA scaffold_33_6

################################################################################

Score E

Sequences producing significant alignments: (Bits) Value

ref|YP_932319.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 311 5e-83 Gene info

ref|ZP_00800949.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 308 5e-82

ref|ZP_01362358.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 308 6e-82

ref|YP_286592.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 307 9e-82 Gene info

ref|YP_865138.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 306 1e-81 Gene info

ref|ZP_01504774.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 306 1e-81

ref|ZP_01199610.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 305 4e-81

ref|ZP_01513509.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 303 9e-81

ref|YP_315649.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 303 1e-80 Gene info

ref|ZP_01739735.1| UDP-N-acetylglucosamine 1-carboxyvinyltran... 303 1e-80

ref|NP_900110.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 303 1e-80 Gene info

ref|ZP_00985309.1| COG0766: UDP-N-acetylglucosamine enolpyruv... 302 3e-80

ref|YP_560589.1| UDP-N-acetylglucosamine1-carboxyvinyltransfe... 302 3e-80 Gene info

ref|NP_283098.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 301 4e-80 Gene info

ref|YP_154801.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 301 5e-80 Gene info

ref|YP_367767.1| UDP-N-acetylglucosamine 1-carboxyvinyltransf... 301 5e-80 Gene info

ref|YP_001237059.1| UDP-N-acetylglucosamine 1-carboxyvinyltra... 301 6e-80 Gene info

ref|YP_001118249.1| UDP-N-acetylglucosamine 1-carboxyvinyltra... 300 9e-80 Gene info

>ref|YP_932319.1| Gene info UDP-N-acetylglucosamine 1-carboxyvinyltransferase [Azoarcus sp.

BH72]

emb|CAL93432.1| Gene info UDP-N-acetylglucosamine 1-carboxyvinyltransferase [Azoarcus sp.

BH72]

Length=416

Score = 311 bits (796), Expect = 5e-83, Method: Composition-based stats.

Identities = 187/405 (46%), Positives = 257/405 (63%), Gaps = 14/405 (3%)

Query 1 VRGGYPLRGTIRIPGAKNAALPLMAASLLTTKPVRLTNIPKVTDVNAMAVILQSHGVAVE 60

+ GG L G + I GAKNAALP++ A+LLT +PV TN+P++ D+ + +L GV VE

Sbjct 6 IEGGRRLSGEVAISGAKNAALPILCAALLTREPVTFTNVPRLNDIGTLLKLLGQMGVKVE 65

Query 61 WRPDDSLVLDARNAQGIPSISSTYAPIRSSIFTLGPAMGRFGEAMIQVPGGCQISQGGRP 120

R DD + LDA + +R+SI LGP + R G+A + +PGGC I G RP

Sbjct 66 -REDDRVTLDASALDNPVAPYEMVKTMRASILVLGPLVARCGDARVSLPGGCAI--GARP 122

Query 121 IDLHFYAMRKLGAIVDEESGLVKTNGNRLRGARITFDKVSVGATINALMAACLVQGKTIL 180

+D H ++ +GA V E G V+ RL+GAR+ D V+V T N +MAACL QG+T++

Sbjct 123 VDQHIKGLQAMGAEVRVEHGYVQAQVPRLKGARLFTDMVTVTGTENLMMAACLAQGETVI 182

Query 181 ENAAMEAEIDDLVCMLRKMGAQIDKNRDTKTWFITGVSSLHGADHGVVPDRIVAGTYAVA 240

ENAA E E+ DL L MGAQI T I GV +LHGA H ++PDRI GTY A

Sbjct 183 ENAAREPEVVDLANCLVAMGAQI-SGAGTDVIRIRGVDALHGATHRIMPDRIETGTYLCA 241

Query 241 AVMTGGELTLT-LGPCPVPALMGCVLTCLRAAGAEVMELAEGIRVRGGRRPRSVSITTSP 299

A +TGGE+ LT C + A V+ L AG EV+ + IR+ RRP++V++ T+P

Sbjct 242 AAVTGGEVRLTGTSSCYLDA----VIDKLMDAGCEVVSERDAIRLAAPRRPQAVNLRTAP 297

Query 300 YPGFPTDMQPQWMALMCMAQGSCEVKETIFDHRFQHVKELRKLGANLECTGKNVVRVQGV 359

YP FPTDMQ Q+MAL C+A G+ ++ETIF++RF H EL++LGA++ G V V+GV

Sbjct 298 YPAFPTDMQAQFMALNCVADGAAMIRETIFENRFMHAVELQRLGADIRIDGNTAV-VRGV 356

Query 360 DLSLMQPSLVQATDLRAAAALLLVGLA--GETVIQDIHHLERGYE 402

+ +Q + V ATDLRA+A+L++ GL GETVI+ I+HL+RGYE

Sbjct 357 E--RLQGATVMATDLRASASLVVAGLVAEGETVIERIYHLDRGYE 399





Dataset 3

BLASTP results for scaffold_85_5 AroB

###############################################################################

Score E

Sequences producing significant alignments: (Bits) Value

gb|ABF61768.1| chloroplast 3-dehydroquinate synthase [Karlodiniu 245 2e-63

gb|ABF61766.1| chloroplast 3-dehydroquinate synthase/O-methyl... 244 4e-63

gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase ... 218 3e-55

ref|ZP_00514848.1| 3-dehydroquinate synthase [Crocosphaera wa... 158 3e-37

ref|ZP_01727903.1| 3-dehydroquinate synthase [Cyanothece sp. ... 148 3e-34

>gb|ABF61766.1| chloroplast 3-dehydroquinate synthase/O-methyltransferase fusion

[Heterocapsa triquetra]

Length=951

Score = 244 bits (569), Expect = 4e-63

Identities = 96/169 (56%), Positives = 116/169 (68%), Gaps = 35/169 (20%)

Query 1 VAIIDDKVDKLYGEPLKLYFDTHNI--KLWKLVFPGNEVDKDISAVEKMLVELKKIKVS- 57

VA++DDKV+ LYG+ L YF H I K KL+F GNEVDKDI VE++LV LKK S

Sbjct 322 VAVVDDKVEALYGKDLDAYFAHHGIEYK--KLIFSGNEVDKDIRDVERILVALKK---SG 376

Query 58 --RDQPILVMGGGVISDIAGFAAALYHRNTPYV-------------------------KN 90

R +P+LV+GGGVI+DIAGFAAALY RNTPYV KN

Sbjct 377 QGRHEPLLVVGGGVIADIAGFAAALYSRNTPYVMLCTSIVSGIDAGPSPRVCCNGFDYKN 436

Query 91 LYGSYHPPVLTLTDRSFFRTLHHGWIRHGIAEIVKMAVVKDEELFNLLE 139

LYG+YHPPVLT+TDR F++TLH GW+RHG+AEI+KMAV+KD LF L+E

Sbjct 437 LYGAYHPPVLTITDRGFWKTLHPGWLRHGVAEIIKMAVMKDLSLFELME 485

>gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase fusion [Oxyrrhis marina]

Length=864

Score = 218 bits (508), Expect = 3e-55

Identities = 92/165 (55%), Positives = 112/165 (67%), Gaps = 27/165 (16%)

Query 1 VAIIDDKVDKLYGEPL-KLYFDTHNIKLWKLVFPGNEVDKDISAVEKMLVELKKIKVSRD 59

VA++D VD+ +GE L K YF H ++L KLV+ E DKDIS VE++L +LK VSR+

Sbjct 231 VAVVDQFVDEKWGEDLCK-YFAHHGVELTKLVYRAMEADKDISTVEEILKDLKMHSVSRN 289

Query 60 QPILVMGGGVISDIAGFAAALYHRNTPYV-------------------------KNLYGS 94

+P+L++GGGVI+D+ GFA ALYHRNT YV KNLYG+

Sbjct 290 EPVLIVGGGVIADVGGFATALYHRNTAYVMLCTSIVSGIDAGPSPRTCCDGFGYKNLYGA 349

Query 95 YHPPVLTLTDRSFFRTLHHGWIRHGIAEIVKMAVVKDEELFNLLE 139

YHPPVLTLTDR+FF TL GW+RHGIAEIVKMAVVKD LF LLE

Sbjct 350 YHPPVLTLTDRTFFNTLKEGWVRHGIAEIVKMAVVKDLSLFELLE 394




Dataset 4

BLASTP results: scaffold_85 translated against fusion protein from Oxyrrhis marina

#######################################################################

Score E

Sequences producing significant alignments: (Bits) Value

gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase ... 973 0.0

>gb|ABF61767.1| 3-dehydroquinate synthase/O-methyltransferase fusion [Oxyrrhis marina]

Length=864

Score = 973 bits (2288), Expect = 0.0

Identities = 434/760 (57%), Positives = 528/760 (69%), Gaps = 85/760 (11%)

Query 143 YQTNADYLEKADPHAVYPTSIYRSCEGYVLAKPDASLIESTMSTTLTTTIKIVNDVLDPE 202

Y +NA YLE ADPHAV+PTSIYR C+G+V A DAS+IE MST++TTTIKI VL+P

Sbjct 157 YSSNAAYLECADPHAVFPTSIYRMCDGHVHANQDASVIEGVMSTSITTTIKIQTGVLNPS 216

Query 203 NKELK--NAYKPFGRCVAIIDDKVDKLYGEPL-KLYFDTHNIKLWKLVFPGNEVDKDISA 259

N L YKP G+CVA++D VD+ +GE L K YF H ++L KLV+ E DKDIS

Sbjct 217 N--LTLCKVYKPIGKCVAVVDQFVDEKWGEDLCK-YFAHHGVELTKLVYRAMEADKDIST 273

Query 260 VEKMLVELKKIKVSRDQPILVMGGGVISDIAGFAAALYHRNTPYVMLCTSIVSGIDAGPS 319

VE++L +LK VSR++P+L++GGGVI+D+ GFA ALYHRNT YVMLCTSIVSGIDAGPS

Sbjct 274 VEEILKDLKMHSVSRNEPVLIVGGGVIADVGGFATALYHRNTAYVMLCTSIVSGIDAGPS 333

Query 320 PRTCCDGFGFKNLYGSYHPPVLTLTDRSFFRTLHHGWIRHGIAEIVKMAVVKDEELFNLL 379

PRTCCDGFG+KNLYG+YHPPVLTLTDR+FF TL GW+RHGIAEIVKMAVVKD LF LL

Sbjct 334 PRTCCDGFGYKNLYGAYHPPVLTLTDRTFFNTLKEGWVRHGIAEIVKMAVVKDLSLFELL 393

Query 380 EQVSSTLVWTKFGT---EIDDLSGDKETFENVCDLIIGKALEGYVRSEYGNLWETHQCRP 436

E+ S L+ TKFGT E D E F +CD IIGKA+EGYV+SEYGNLWETHQCRP

Sbjct 394 EKAGSRLITTKFGTTCPE------DTE-FGEMCDAIIGKAMEGYVKSEYGNLWETHQCRP 446

Query 437 HAYGHTWSPGYELPAGMLHGHAVATGMGFGAYLSFCNDWITKQELYRILNLLSGLELSLW 496

HAYGHTWSPGYE+PAGMLHGHAVATGMGFGA+L+F +IT+ E RI+ L+S LELSLW

Sbjct 447 HAYGHTWSPGYEIPAGMLHGHAVATGMGFGAHLAFREGFITEGESRRIMKLISDLELSLW 506

Query 497 HPVMCDNMK-IYKAQEKMIEKRGGNLAAPIPKG-IGNCGYLNHLPFELLQKRLREYKEIC 554

HP++ D+ ++ +QEKM++KRGGNL AP+PKG IG CGY+N + E L+K + EYK +C

Sbjct 507 HPIL-DDTDVVWASQEKMVQKRGGNLCAPVPKGQIGVCGYINDVSRERLEKTMAEYKTVC 565

Query 555 QEFPRKGLGIEAHCKDVGLEDPATVGGV--NKELPEENVDAVSNGDAVPNGDAVPNGCEN 612

QEFPR G+GI+ HC DVGLE P T GV K EE+ A G +VP+G A

Sbjct 566 QEFPRAGVGIDPHCHDVGLEHPGTT-GVCKKK---EEDQAAAEEG-SVPSGAA------- 613

Query 613 GIENENKKRKINLSYQEWIEKVQKKRN-------GGITRKVSLKQAE-DTPHPPEFEPNQ 664

LSY EWIE+ Q++R GG AE PP F+ N

Sbjct 614 ------------LSYNEWIEQCQQQRASSHTERLGG---------AEGGAAKPPVFDENT 652

Query 665 L--CRP--EDYAGDLSEPP--SADIQKIAII--TEQQQMFVPCMVGHLESQFLKMMAQIA 716

L P E YA LS+ S D+ A++ T+++ +F PCMVG LE QFLKM A+

Sbjct 653 LFY--PVVEAYA--LSQTTLGSKDVN--AVVESTDKEGLFAPCMVGQLEGQFLKMFAKST 706

Query 717 NAKRVLDVGTFTGMSAMAFAEGIPPDGQVVTIEFDQTIASTADKLFRD-SAQAHKLALKV 775

A RVLDVGTFTG SA++FAEGI G+VVT+E D IA A LF D SAQ K+ L V

Sbjct 707 KASRVLDVGTFTGYSALSFAEGIAAGGKVVTLESDTKIAGVAKSLF-DGSAQKEKIELIV 765

Query 776 GDAVDVMTDL---KSAQEKFDIIFLDAAKDQYITYYHLAL-SMLTPTGFILADNSLCALL 831

GDA M L K Q+ FDI+FLDA K+ Y+TYY L + +L P G ILADNSLC+L+

Sbjct 766 GDARAAMRKLLEDK--QQ-FDIVFLDADKENYVTYYDLTMDGLLAPGGVILADNSLCSLV 822

Query 832 YDPDDSRRQALHDFNQLVKNDKRVEQLALPFREGVSIIRP 871

Y D RRQ LHDFN+ V+ D RVEQ+ L REG+++I+P

Sbjct 823 YTEGDERRQKLHDFNEHVRKDARVEQVVLTVREGITLIQP 862

Score = 49.0 bits (108), Expect = 0.002

Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 0/40 (0%)

Query 50 LLINVSRLAVFSKELAKSFRGDLDSLTLIKQLKYFYNQPI 89

+LI VSRL VFS E+A + DL L +K LKYFY+ P+

Sbjct 67 MLIYVSRLPVFSAEVAGELKADLGLLAAVKHLKYFYSIPV 106




Dataset 5

Malate synthase like protein on the left side of AroB-methylase fusion protein(most similar to Strongylocentrotus purpuratus hypothetical protein)

Score E

Sequences producing significant alignments: (Bits) Value

ref|XP_782946.2| PREDICTED: hypothetical protein [Strongyloce... 564 4e-159 UniGene infoGene info

ref|XP_001512361.1| PREDICTED: hypothetical protein [Ornithorhyn 554 5e-156

ref|XP_001377783.1| PREDICTED: hypothetical protein [Monodelphis 553 9e-156

emb|CAF91513.1| unnamed protein product [Tetraodon nigroviridis] 551 5e-155

ref|XP_685378.1| PREDICTED: hypothetical protein isoform 1 [Dani 529 2e-148

ref|XP_788098.1| PREDICTED: hypothetical protein, partial [St... 480 1e-133

gb|EAT45696.1| malate synthase [Aedes aegypti] 462 3e-128

ref|ZP_01532792.1| Malate synthase A [Roseiflexus castenholzi... 377 1e-102

###############################################################################

RuvB-like protein on the right side of AroB-methylase fusion protein.

Sequences producing significant alignments: (Bits) Value

ref|NP_062659.1| RuvB-like protein 1 [Mus musculus] >ref|NP_6... 761 0.0 UniGene infoGene info

ref|XP_001366708.1| PREDICTED: similar to TIP49 [Monodelphis dom 760 0.0

gb|AAP36457.1| Homo sapiens RuvB-like 1 (E. coli) [synthetic ... 760 0.0

ref|NP_003698.1| RuvB-like 1 [Homo sapiens] >ref|XP_848712.1|... 760 0.0




Dataset 6

AroA

Query sequence: scaffold_33_6

Accession: [none]

Description: [none]

Scores for sequence family classification (score includes all domains):

Model Description Score E-value N

-------- ----------- ----- ------- ---

EPSP_synthase EPSP synthase (3-phosphoshikimate 1-c 282.6 8.4e-85 2

Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

-------- ------- ----- ----- ----- ----- ----- -------

EPSP_synthase 1/2 38577 38962 .. 1 425 [. 271.8 1.5e-81

EPSP_synthase 2/2 39293 39314 .. 430 451 .] 10.8 1e-05

AroB

Query sequence: scaffold_3395_1

Accession: [none]

Description: [none]

Scores for sequence family classification (score includes all domains):

Model Description Score E-value N

-------- ----------- ----- ------- ---

DHQ_synthase 3-dehydroquinate synthase 385.3 1e-115 2

Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

-------- ------- ----- ----- ----- ----- ----- -------

DHQ_synthase 2/2 1944 2248 .. 1 339 [] 385.2 1.1e-115

AroC

Query sequence: scaffold_3300_6

Accession: [none]

Description: [none]

Scores for sequence family classification (score includes all domains):

Model Description Score E-value N

-------- ----------- ----- ------- ---

Chorismate_synt Chorismate synthase 623.6 4.1e-195 1

Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

-------- ------- ----- ----- ----- ----- ----- -------

Chorismate_synt 1/1 220 560 .. 1 401 [] 623.6 4.1e-195

AroE

Query sequence: scaffold_5070_2

Accession: [none]

Description: [none]

Scores for sequence family classification (score includes all domains):

Model Description Score E-value N

-------- ----------- ----- ------- ---

Shikimate_dh_N Shikimate dehydrogenase substrate bin 101.4 3e-30 1

Shikimate_DH Shikimate / quinate 5-dehydrogenase 24.7 4.4e-09 1

Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

-------- ------- ----- ----- ----- ----- ----- -------

Shikimate_dh_N 1/1 2168 2250 .. 1 84 [] 101.4 3e-30

Shikimate_DH 1/1 2279 2369 .] 22 157 .. 24.7 4.4e-09




Dataset 7

AroA from scaffold_5181 most similar to Tenacibaculum sp. MED152

Score E

Sequences producing significant alignments: (Bits) Value

ref|ZP_01119134.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 376 1e-102

ref|ZP_01053050.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 369 1e-100

ref|YP_860405.1| 3-phosphoshikimate 1-carboxyvinyltransferase... 320 7e-86

ref|ZP_01059205.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 315 2e-84

ref|ZP_01890962.1| putative 3-phosphoshikimate 1-carboxyvinyl... 311 2e-83

ref|ZP_00951385.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 311 3e-83

ref|ZP_01107281.1| putative 3-phosphoshikimate 1-carboxyvinyl... 307 5e-82

emb|CAC82655.1| 5-enolpyruvylshikimate 3-phosphate synthase [... 306 1e-81

ref|ZP_01122370.1| 3-phosphoshikimate 1-carboxyvinyltransfera... 302 2e-80

########################################################################################

AroB from scaffold_3395 most similar to Tenacibaculum sp. MED152

Score E

Sequences producing significant alignments: (Bits) Value

ref|ZP_01052169.1| 3-dehydroquinate synthase [Tenacibaculum s... 466 1e-129

ref|ZP_01118687.1| 3-dehydroquinate synthase [Polaribacter ir... 445 2e-123

ref|ZP_01061679.1| putative 3-dehydroquinate synthase [Flavob... 378 3e-103

ref|ZP_01051584.1| putative 3-dehydroquinate synthase [Cellul... 378 4e-103

ref|ZP_01106521.1| 3-dehydroquinate synthase [Flavobacteriale... 375 3e-102

ref|YP_861532.1| 3-dehydroquinate synthase [Gramella forsetii... 374 7e-102

ref|YP_001195153.1| 3-dehydroquinate synthase [Flavobacterium... 370 7e-101

########################################################################################

AroC from scaffold 3300 most similar to Flavobacteria, among them Tenacibaculum sp. MED152

Score E

Sequences producing significant alignments: (Bits) Value

ref|ZP_01734660.1| chorismate synthase [Flavobacteria bacteri... 586 6e-166

emb|CAL42868.1| Chorismate synthase [Flavobacterium psychrophilu 577 4e-163

ref|YP_001196894.1| Chorismate synthase [Flavobacterium johns... 567 5e-160

ref|YP_861667.1| chorismate synthase [Gramella forsetii KT080... 542 1e-152

ref|ZP_01060121.1| chorismate synthase [Flavobacterium sp. ME... 538 2e-151

ref|ZP_01050894.1| chorismate synthase [Cellulophaga sp. MED1... 535 2e-150

ref|ZP_01891850.1| chorismate synthase [unidentified eubacter... 533 7e-150

ref|ZP_01106758.1| chorismate synthase [Flavobacteriales bact... 525 1e-147

ref|ZP_00950854.1| chorismate synthase [Croceibacter atlantic... 514 2e-144

ref|ZP_01121103.1| chorismate synthase [Robiginitalea biforma... 508 2e-142

ref|ZP_01052472.1| chorismate synthase [Tenacibaculum sp. MED... 508 3e-142

ref|ZP_01118000.1| chorismate synthase [Polaribacter irgensii... 502 1e-140

ref|ZP_01202724.1| chorismate synthase (5-enolpyruvylshikimat... 501 3e-140

########################################################################################

AroE from scaffold 5070 most similar to Tenacibaculum sp. MED152

Score E

Sequences producing significant alignments: (Bits) Value

ref|ZP_01733030.1| putative shikimate 5-dehydrogenase [Flavob... 296 4e-79

ref|YP_001194935.1| Shikimate dehydrogenase substrate binding... 235 2e-60

ref|ZP_01052545.1| shikimate 5-dehydrogenase [Tenacibaculum s... 228 2e-58

ref|ZP_01117836.1| putative shikimate 5-dehydrogenase [Polari... 228 2e-58

######################################################################################




Dataset 8

(a) Sequence of the 985-bp 16S fragment of N. vectensis from StellaBase (http://evodevo.bu.edu/stellabase) in entry c439003225.Contig1 (length = 1,241 bp) aligned to the genus Pseudomonas

tgacgttacctacagaagaagcaccggctaactccgtgccagcagccgcggtaatacggagggtgcaagcgttaatcggaattactgggcgtaaagcgcgcgtaggcggctaggtcagttggatgtgaaatccccgggctcaacctgggaattgcatccaaaactgcctggctagagtacagaagagggtggtggaatttcctgtgtagcggtgaaatgcgtagatataggaaggaacatcagtggcgaaggcggccacctggactgatactgacactgaggtgcgaaagcgtggggagcaaacaggattagataccctggtagtccacgccgtaaacgatgtcaactagccgttgggagtcttgaactcttagtggcgcagctaacgcattaagttgaccgcctggggagtacggccgcaaggttaaaactcaaatgaattgacgggggcccgcacaagcggtggagcatgtggtttaattcgaagcaacgcgaagaaccttacctggccttgacatgctgagaactttctagagatagattggtgccttcgggaactcagacacaggtgctgcatggctgtcgtcagctcgtgtcgtgagatgttgggttaagtcccgtaacgagcgcaacccttgtccttagttaccagcacgtaatggtgggaactctaaggagactgccggtgacaaaccggaggaaggtggggatgacgtcaagtcatcatggcccttacggccagggctacacacgtgctacaatggtcggtacaaagggttgccaagccgcgaggtggagctaatcccataaaaccgatcgtagtccggatcgcagtctgcaactcgactgcgtgaagtcggaatcgctagtaatcgtgaatcagaatgtcacggtgaatacgttcccgggccttgtacacaccgcccgtcacaccatgggagtgggttgcaccagaagtagctagtctaaccttcgggaggacggt

(b) Sequence of the 720-bp 16S fragment of N. vectensis from StellaBase (http://evodevo.bu.edu/stellabase) in entry c429301624.Contig1 (length = 5,682 bp) aligned to the family Flavobacteriaceae used in construction of Fig. 1

GCCGTAAACGATGGATACTAGCTGTTCGGATTTCGGTCTGAGTGGCTAAGCGAAAGTGATAAGTATCCCACCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGATGATACGCGAGGAACCTTACCAGGGCTTAAATGGGAGACGACGTTATTGGAAACAGTAATTTCTTCGGACGTCTTTCAAGGTGCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCAGGTTAAGTCCTATAACGAGCGCAACCCCTGTCGTTAGTTGCCAGCGAGTGATGTCGGGAACTCTAACGAGACTGCCGGTGCAAACCGTGAGGAAGGTGGGGATGACGTCAAATCATCACGGCCCTTACGTCCTGGGCCACACACGTGCTACAATGGCCGGTACAGAGAGCAGCTACATGGTGACATGATGCGAATCTTCAAAACCGGTCTCAGTTCGGATCGGAGTCTGCAACTCGACTCCGTGAAGCTGGAATCGCTAGTAATCGGATATCAGCCATGATCCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCAAGCCATGGAAGCTGGGGGTACCTGAAGTCGGTGACCGTAAGGAGCTGCCTAGGGTAAAACTGGTAACTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGAACACCTCCTTT




Dataset 9

Carsonella ruddii

length 159,662 bp

Result: one putative shikimate gene:

>sp|Q5NGG6|AROC_FRATT Chorismate synthase (EC 4.2.3.5) (5-enolpyruvylshikimate-3-phosphate phospholyase)

Francisella tularensis subsp. tularensis.

Length: 352 aa

Query frame: +3

Score: 901, Expect: 1e-68

Identical: 151/347 (43%), Positive: 233/347 (67%)

Indels: 5/347 (1%), Gaps: 3

Q: 74478 NSYGEIIKISTFGESHGLIIGALIDGFFSNLYISEKFIQKNLNLRKPFTSLFSTQRREQD 74657

|++|+| ++| ||||| + |+||| ||+ + | || |+ ||| | |+|||+| |

D: 4 NTFGKIFTVTTCGESHGDSLAAIIDGCPSNIPLCEADIQLELDRRKPGQSKFTTQRKEPD 63

Q: 74658 KVKIFTGIFKNKTTGAPVLMLIKNNDKQSSDYNNISLNFRPGHADYTYFLKYKFRDYRGG 74837

+||| +|+|+ |||| |+ ++||| |++| ||+ | ||||||||||| || ||||||

D: 64 EVKIISGVFEGKTTGTPIGLIIKNQDQKSKDYSEIKDKFRPGHADYTYFKKYGIRDYRGG 123

Q: 74838 GRSSARETACRVASGCVFKNLIYNKGVIVRSYIKKIGFLKINFKYWNYTLNR--FFSNLL 75011

||||||||| |||+| + | ++ + |+ + + +|| |||+| ++ | +|

D: 124 GRSSARETAMRVAAGAIAKKILKHYGIEIYGFCSQIGSLKIDFIDKDFINQNPFFIANKN 183

Q: 75012 FINEIKDIINNCKNSCNSLSSEIVIIINGLEPSLGDPLYKKINSTISNYLLSINATKSIC 75191

+ +|+|++ + +|+ +|+ ++ ||| || |++ +++++|+ ++|||| |++

D: 184 AVPACEDLIHSIRKQGDSIGAEVTVVATGLEAGLGRPVFDRLDASIAYAMMSINAVKAVS 243

Q: 75192 FG--FNFKNKNSFQVKDEI-KNSGFTSNNNGGILAGITNGQPLVIKILFKPTSSTSRKIK 75362

| |+ + | +||| + || ||+ |||| ||+ || ++ |+ |||||| + |

D: 244 IGDGFDCVAQKGSQHRDEITQQQGFLSNHAGGILGGISTGQDIIAKLAFKPTSSILQPGK 303

Q: 75363 TINEKLKNITNKTYGRHDPCVGLRAVPVIESMLYTILINKILKKKIY 75503

+|+ + + | | ||||||||+| ||+ |+|| +|++++| + |

D: 304 SIDVQGNDTTVITKGRHDPCVGIRGVPIAEAMLALVLVDELLITRSY 350

sequence:

>putative shikimate

aaaaaaaactataaaaattataaattattataatgaataattcatacggtgaaattatta

aaatttcaacttttggagaaagtcatggtttaattattggtgctttaattgatggttttt

tttcaaatttatatattagtgaaaaatttattcaaaaaaatttaaacttaagaaaaccat

ttacttcattattttcaacacaaagaagagaacaagacaaagttaaaattttcaccggaa

tttttaaaaataaaacaacaggcgcacctgtattaatgttaataaaaaataatgataaac

aaagttcagattataataatataagtttaaattttagacctggacatgcagactatactt

attttttaaagtataaatttagagattatagaggtggaggtagatctagtgctagagaaa

cagcttgcagagttgcaagtggatgtgtgtttaaaaatttgatttataataaaggagtta

ttgttcgttcatatattaaaaaaattggttttttaaaaataaattttaaatattggaatt

atacattaaatagatttttttcaaatttattatttataaatgagattaaagatataatta

ataattgtaaaaattcatgcaattcgttaagttcagaaattgtaattattatcaacggtc

ttgaaccaagtttgggagatcctctttataaaaaaattaattctactatttctaattatt

tgttaagtattaatgcaactaaaagtatttgctttggttttaactttaaaaataaaaact

catttcaagtaaaagatgaaattaaaaattctggatttacttcaaacaataatggaggaa

tattagctggaataactaatggacaacctttagtaatcaaaatattatttaaacctacat

ctagtacttctagaaaaataaaaacaataaacgaaaaattaaaaaatattacaaataaaa

cttatggaagacatgatccttgtgttggtttaagagctgtaccagtaattgaatctatgt

tatatacaatattaataaataaaattttaaaaaaaaaaatt

Buchnera aphidicola str. Cc (Cinara cedri)

length: 416,380 bp

Results:3 previously identified shikimate proteins:

>gnl|BL_ORD_ID|1759 tr|Q5EU83|Q5EU83_9ENTR Shikimate 5-dehydrogenase

Buchnera aphidicola (Cinara cedri).

Length: 280 aa

Query frame: -1

Score: 1714, Expect: 0

Identical: 280/280 (100%), Positive: 280/280 (100%)

Indels: 0/280 (0%), Gaps: 0

Q: 353875 MQNSKYCEKKIHIALFGNPIEHSLSPLIHKNFSKEIKINYNYNSFLCTKSNFFVIVKNFF 353696

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

D: 1 MQNSKYCEKKIHIALFGNPIEHSLSPLIHKNFSKEIKINYNYNSFLCTKSNFFVIVKNFF 60

Q: 353695 QNGGFGCNITVPFKKKSFQISNKNTKYVKISNSVNVLKKSSNNNIIGYNTDGIGLIYDLN 353516

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

D: 61 QNGGFGCNITVPFKKKSFQISNKNTKYVKISNSVNVLKKSSNNNIIGYNTDGIGLIYDLN 120

Q: 353515 RLKYITENSFILILGSGGAVYSIVYHLLKKKCCIFILNRTISKSCILVNKFKKFGKIFVF 353336

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

D: 121 RLKYITENSFILILGSGGAVYSIVYHLLKKKCCIFILNRTISKSCILVNKFKKFGKIFVF 180

Q: 353335 DKNLYTKKFDIIINATSCGLYNFSPKFPKNLIFPNTKCYDISYSKNKKLTPFLSTCRDLG 353156

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

D: 181 DKNLYTKKFDIIINATSCGLYNFSPKFPKNLIFPNTKCYDISYSKNKKLTPFLSTCRDLG 240

Q: 353155 SRKYSDGLGMLVAQAAYSCYIWFNILPNIKKNINLLKSII 353036

||||||||||||||||||||||||||||||||||||||||

D: 241 SRKYSDGLGMLVAQAAYSCYIWFNILPNIKKNINLLKSII 280

sequence:

>gnl|BL_ORD_ID|1759 tr|Q5EU83|Q5EU83_9ENTR Shikimate 5-dehydrogenase - Buchnera aphidicola (Cinara cedri)

atgcaaaatagtaaatattgtgaaaaaaaaattcatattgctttgttcggaaatcccata

gagcattctttatctccattaatacataaaaatttttctaaagagataaaaattaattat

aattataactcttttttatgtacaaaatcaaatttttttgttatagtaaaaaattttttt

caaaatggtggttttggatgcaacattactgttccttttaaaaaaaaatcatttcaaatt

tcaaataaaaatactaaatatgtaaaaatttctaattcggtaaatgttttaaaaaaaagt

tcaaataataatattattggttacaacacggatggaattggattaatttatgatttaaat

cgtttaaaatatatcacagaaaattcttttattttaatattaggttcaggtggagctgta

tattctattgtatatcatttattaaaaaaaaaatgttgtatttttattttaaatcgaaca

attagtaaatcatgtattttagtaaataaatttaaaaaatttggaaaaatttttgttttt

gataaaaatttatatacaaaaaaatttgatattattattaatgccacttcatgcgggtta

tataatttctctcctaaatttccaaaaaatttgatatttccaaatacaaaatgttatgat

atttcttattcaaaaaataaaaaattaaccccttttttatctacttgtagagatttaggt

agtagaaaatattctgatggcttaggaatgttagtagcacaagcagcatattcatgttat

atatggtttaacatactaccaaacattaaaaaaaatattaatttattaaaatctattata

>sp|Q9ZHE9|AROC_BUCAP Chorismate synthase (EC 4.2.3.5) (5-enolpyruvylshikimate-3-phosphate phospholyase)

Buchnera aphidicola subsp. Schizaphis graminum.

Length: 353 aa

Query frame: -1

Score: 1591, Expect: 0

Identical: 247/353 (69%), Positive: 300/353 (84%)

Indels: 0/353 (0%), Gaps: 0

Q: 69088 MPGNSIGKIFKVTTCGESHGPMLAGIIDGVPPGLSLNNKDIQYELNRRRPGFSKFTSQRR 68909

| ||+|||+|+||| ||||| | +|||+|||| |++ |+||+||||||| |++|+||

D: 1 MAGNTIGKVFRVTTFGESHGTALGCVIDGMPPGLELSSDDLQYDLNRRRPGTSRYTTQRS 60

Q: 68908 EKDKVEIFSGIFKGITTGTSIGIRIKNIDIRSQDYSEIKNLYRPNHADYTYEKKYGIRDY 68729

| |+|+|+||+||| |||||||+ |+| | |||||||||+|+|| |||||||||||||||

D: 61 ELDEVQILSGVFKGTTTGTSIGLVIQNKDQRSQDYSEIKDLFRPGHADYTYEKKYGIRDY 120

Q: 68728 RGGGRSSARETAIRVAAGAIAKKYLKLQHNIKIRGYLSQIGSIYCPFQSWEEVEKNPFFC 68549

||||||||||||+|||||+|||||||+| | || ||| +| | |||+||||||+|||||

D: 121 RGGGRSSARETAMRVAAGSIAKKYLKIQTGIVIRAYLSAMGDIKCPFESWEEVEQNPFFC 180

Q: 68548 SNSEKIKKIIHFIKKLKKSGNSVGAKITIIAKNVPIGLGEPVFDRLNAEIAHSIMSINAA 68369

|| |+ ++ +||||||+|+|+||+|||||+|||+|+||||||||+|++||++||||||

D: 181 SNKNKVFQLEELIKKLKKTGDSIGAEITIIAQNVPVGFGEPVFDRLDADLAHALMSINAA 240

Q: 68368 KSIEIGDGIHVAKQTGVEHRDEILPNGFSSNHSGGILGGISNGEEIIVHAAFKPTSSIKI 68189

| +||||| | | | |+|||+ |||| ||| ||||||||||| | + ||||||||+

D: 241 KGVEIGDGFSVVNQKGSENRDEMTPNGFKSNHCGGILGGISNGENIFLKVAFKPTSSIRQ 300

Q: 68188 PGKTIDTFGKKRFIITKGRHDPCVGIRAVPIAEAMLAITLMDHVLRFKAQCGK 68030

| ||+ +| |+ |||||||||||||||||||+|| ||||+|||+||| |

D: 301 SGNTINKNNEKVKIVIKGRHDPCVGIRAVPIAEAMVAIVLMDHLLRFRAQCAK 353

sequence:

> chorismate synthase [Buchnera aphidicola str. Cc (Cinara cedri)

atgccgggaaattcaattggaaaaatatttaaagttactacatgtggggaatcacatgga

cctatgttagcaggaattattgatggagttccgcctggtttatctttaaacaataaagat

attcaatatgaattaaatagaagaagaccgggtttttctaaatttacatcacaaagaaga

gaaaaagataaagtagaaatattttcaggaatatttaaaggaataactactggaacaagt

ataggcataagaataaaaaatatagatattagatcacaagattattcagaaattaaaaat

ttatatcgacctaatcatgctgattatacatatgaaaaaaaatatggaatcagagattat

agaggtggaggaagatcctcagctagagaaacagctattcgagtagctgctggagctatt

gctaaaaaatatttaaaattacaacataatataaaaattagaggatatttgtcacaaata

ggatcaatttattgtccatttcaatcttgggaagaagtagaaaaaaatccttttttttgt

agtaattcagaaaaaataaaaaaaattatacattttatcaaaaaattaaaaaaaagtgga

aattcagtaggagcaaaaattactattattgctaaaaatgttcctattggattaggagaa

cctgtatttgatagactaaatgctgaaattgcacattctataatgagtattaatgctgct

aaatctattgaaattggagatggaattcatgtagctaaacaaacaggagttgaacataga

gatgaaattctacctaacggattttctagcaatcattccggaggaatattaggtggtatt

agtaacggggaagaaattattgtacatgcagcttttaaacctacatctagtattaaaata

ccgggaaaaacaatagatacatttggaaaaaaaagatttattataacaaaaggaagacat

gatccttgtgtaggaattcgagctgttccaattgcagaagcaatgttagcaattacatta

atggaccatgttttacgtttcaaagctcaatgtggaaaa

>sp|Q59178|AROA_BUCAP 3-phosphoshikimate 1-carboxyvinyltransferase (EC 2.5.1.19) (5- enolpyruvylshikimate-3-phosphate synthase) (EPSP synthase) (EPSPS)

Buchnera aphidicola subsp. Schizaphis graminum.

Length: 428 aa

Query frame: -1

Score: 1573, Expect: 0

Identical: 260/429 (60%), Positive: 324/429 (75%)

Indels: 3/429 (0%), Gaps: 2

Q: 225559 MQDSLTLKPVDYIQGKINIPGSKSISNRVLLLSALSNGKTILKNLLYSDDIKYMLKALLK 225380

|| | |||| || | | +|||||||||||||||++|| | | ||| | | +||| || |

D: 1 MQKFLELKPVSYINGTIYLPGSKSISNRVLLLSAMANGITCLTNLLDSQDTQYMLNALRK 60

Q: 225379 LGIFYKLDKKKSKCTIYGISDAFSVKNKIKLFLGNAGTAMRPLLAILSLKKNKIILTGEK 225200

+|| + | + | ++|| || + + | ||||||||||||||| ||| +| ++|+|+

D: 61 IGIKFFLSNNNTTCHVHGIGKAFHLSHPISLFLGNAGTAMRPLLAALSLYENNVVLSGDD 120

Q: 225199 RMKERPIHHLVDSLRQGGANITYKNKKKFPPLYIKGGFKGGKIFIDGSISSQFLSSLLMA 225020

|| |||| ||||+|+|||| + || +||+ ||||||| | +|||||||||+||||

D: 121 RMHERPIAHLVDALKQGGATLEYKKGIGYPPVLTKGGFKGGSIMLDGSISSQFLTSLLMV 180

Q: 225019 APLAELDTEIIVKNQLVSKPYINLTINLMEKFGISVSILND-YKHFYIKGNQKYISPKKY 224843

|||| +| | +| |||||||++|+|||+ || |+|+|| || ||||||||| || |

D: 181 APLALQNTNIFIKGNLVSKPYIDITLNLMKSFG--VNIVNDCYKSFYIKGNQKYESPGNY 238

Q: 224842 YIESDLSSATYFLAAAAIKGGSIQINGIQKKSIQGDINFIKILKQMGVSIQWKKNSVICK 224663

+| | |||+||||||||||||+++ |+ |||+|||| | +|++|| | | + ++|+

D: 239 LVEGDASSASYFLAAAAIKGGSVKVVGVGKKSVQGDIKFADVLEKMGAIIDWGDSFIVCR 298

Q: 224662 KNKLLGITVDCNHIPDAAMTIAILGVFSKKKVYIKNIYNWRVKETDRIYAMSTELKKIGA 224483

||| | +| ||||||||||||+ +|+| ||||||||||||||+ ||| ||||+||

D: 299 HNKLEKIDLDMNHIPDAAMTIAIVALFAKGTSIIKNIYNWRVKETDRLSAMSKELKKVGA 358

Q: 224482 RVITGKDYIKVYPVKNFIHAKINTYNDHRIAMCFSLISLSGTSVTLLNPKCVNKTFPSFF 224303

+ |+| + + | | |+|+||||||+||||||| ||| || +||| |++|||||+|

D: 359 IIKEGRDCLSITPPNFFKFAEIDTYNDHRMAMCFSLICLSGISVRILNPNCISKTFPSYF 418

Q: 224302 KNFYSICHY 224276

+|| | +

D: 419 ENFLKISRF 427

sequence:

>3-phosphoshikimate 1-carboxyvinyltransferase [Buchnera aphidicola

str. Cc (Cinara cedri)]

atgcaagatagtttaactttaaaaccagtagattatattcaaggaaaaattaatattcca

ggttcaaaaagtatttctaatcgtgttcttttattatcagctttatctaatggaaaaaca

attttaaaaaatttgttatatagtgatgatattaaatatatgttaaaagctttattaaaa

ttaggtattttttataaattagacaaaaaaaaatctaagtgtactatttatggaatatct

gatgcattttctgtaaaaaataaaattaaattatttttaggtaatgctggtaccgctatg

cgtccactattagcaattttatcattaaaaaaaaataaaattatacttactggtgaaaaa

agaatgaaagaaagacctattcatcatttagtagactctttacgtcagggtggagcaaat

ataacttataaaaataaaaaaaaatttcctccattatatattaaaggtggttttaaaggt

ggaaaaatttttatagatggatctatttctagtcaatttttaagttctttattaatggcc

gctcctttagcagaattagatactgaaattatagtaaaaaatcaattagtatctaaacct

tatattaatttaacaataaatttaatggaaaaatttggtatatcagtaagtattttaaat

gattataaacatttctatataaaaggaaaccagaaatatatttctcctaaaaaatattat

attgaaagtgatctttcttctgctacttattttttagctgcggctgcaataaaaggtgga

tcaattcaaataaatggaatacaaaaaaaaagtattcaaggagacataaattttattaaa

attttaaaacaaatgggtgtatcaattcaatggaaaaaaaattcagttatttgtaaaaaa

aataagttattaggtattacagtagattgcaatcatatacctgatgcagctatgactata

gctattcttggagtattttctaaaaaaaaagtatatattaaaaatatatataattggaga

gttaaagaaactgatcgaatatatgctatgagtacagaattaaaaaaaatcggagctcga

gtaattacaggtaaagattatataaaagtctatccagtaaaaaattttatacatgctaaa

ataaatacttataatgatcatagaatagctatgtgtttttctttaatttcactgtctgga

acttctgtaactttactaaatccaaaatgtgttaataaaacatttccatcattttttaaa

aacttttattctatttgtcattat

This Article

  1. PNAS February 19, 2008 vol. 105 no. 7 2533-2537
  1. AbstractFree
  2. Figures Only
  3. Full Text
  4. Full Text (PDF)
  5. » Supporting Information