Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms

Moore et al. 10.1073/pnas.0708072104.

Supporting Information

Files in this Data Supplement:

SI Figure 3
SI Table 2
SI Table 3
SI Figure 4
SI Figure 5
SI Figure 6
SI Figure 7
SI Figure 8
SI Figure 9
SI Figure 10
SI Figure 11
SI Figure 12
SI Figure 13
SI Figure 14
SI Figure 15
SI Figure 16
SI Figure 17
SI Table 4
SI Figure 18
SI Table 5
SI Figure 19
SI Dataset 1
SI Text




SI Dataset 1

Dataset 1. Data set (in NEXUS format) used in phylogenetic analyses.





SI Figure 3

Fig. 3. A map of the Ceratophyllum demersum plastid genome. Genes with introns are indicated by asterisks; the introns themselves are indicated on the map by white boxes within the genes. IR = inverted repeat; LSC = large single-copy region; SSC = small single-copy region.





SI Figure 4

Fig. 4. Phylogram of the single MP tree (tree length = 86,825; CI = 0.408; RI = 0.496) for the 61-gene combined data set. Numbers associated with branches are MP bootstrap support values above 50%.





SI Figure 5

Fig. 5. Phylogram of the best ML tree as determined by GARLI (-ln L = 200,247.8) for the combined first and second codon positions of the 61-gene data set. Numbers associated with branches are ML bootstrap support values above 50%/Bayesian posterior probabilities above 0.5. The number in parentheses is the branch length separating the outgroups from angiosperms. The asterisk at the node uniting Amborella and Nymphaeales indicates that the GARLI ML bootstrap analysis weakly favors a topology where Amborella is sister to all remaining angiosperms (BS = 54%).





SI Figure 6

Fig. 6. Phylogram of the single MP tree (tree length = 31,043; CI = 0.488; RI = 0.527) for the combined first and second codon positions of the 61-gene data set. Numbers associated with branches are MP bootstrap support values above 50%. The number in parentheses is the branch length separating the outgroups from angiosperms.





SI Figure 7

Fig. 7. Phylogram of the best ML tree as determined by GARLI (-ln L = 294,406.3) for the fast gene partition. Numbers associated with branches are ML bootstrap support values above 50%/Bayesian posterior probabilities above 0.5. The number in parentheses is the branch length separating the outgroups from angiosperms.





SI Figure 8

Fig. 8. Phylogram of one of two MP trees (tree length = 57,756; CI = 0.421; RI = 0.502) for the fast gene partition. Numbers associated with branches are MP bootstrap support values above 50%. The arrow indicates the branch that collapses in the strict consensus.





SI Figure 9

Fig. 9. Phylogram of the best ML tree as determined by GARLI (-ln L = 163,717.8) for the slow gene partition. Numbers associated with branches are ML bootstrap support values above 50%/Bayesian posterior probabilities above 0.5. The number in parentheses is the branch length separating the outgroups from angiosperms.





SI Figure 10

Fig. 10. Phylogram of the single MP tree (tree length = 29,041; CI = 0.381; RI = 0.483) for the slow gene partition. Numbers associated with branches are MP bootstrap support values above 50%.





SI Figure 11

Fig. 11. Phylogram of the single MP tree (tree length = 83,948; CI = 0.416; RI = 0.504) for the 61-gene combined data set, excluding Piper. Numbers associated with branches are MP bootstrap support values above 50%.





SI Figure 12

Fig. 12. Phylogram of one of two MP trees (tree length = 55,854; CI = 0.430; RI = 0.511) for the fast gene partition, excluding Piper. Numbers associated with branches are MP bootstrap support values above 50%. The arrow indicates the branch that collapses in the strict consensus.





SI Figure 13

Fig. 13. Phylogram of the single MP tree (tree length = 28,060; CI = 0.390; RI = 0.491) for the slow gene partition, excluding Piper. Numbers associated with branches are MP bootstrap support values above 50%.





SI Figure 14

Fig. 14. Phylogram of the single MP tree (tree length = 84,673; CI = 0.415; RI = 0.501) for the 61-gene combined data set, excluding Ceratophyllum. Numbers associated with branches are MP bootstrap support values above 50%.





SI Figure 15

Fig. 15. Phylogram of the best ML tree as determined by GARLI (-ln L = 449,384.34) for the 61-gene combined data set, excluding Piper. Numbers associated with branches are ML bootstrap support values above 50%/Bayesian posterior probabilities above 0.5. The number in parentheses is the branch length separating the outgroups from angiosperms.






SI Figure 16

Fig. 16. Phylogram of the best ML tree as determined by GARLI (-ln L = 286,918.54) for the fast genes partition, excluding Piper. The number in parentheses is the branch length separating the outgroups from angiosperms. Chloranthaceae and magnoliids are sisters in this tree, as are monocots and Ceratophyllum/eudicots.





SI Figure 17

Fig. 17. Phylogram of the best ML tree as determined by GARLI (-ln L = 159,971.02) for the slow genes partition, excluding Piper. The number in parentheses is the branch length separating the outgroups from angiosperms.





SI Figure 18

Fig. 18. Chronogram depicting estimated angiosperm divergence times derived from PL analysis of the 61-gene ML tree. A time scale (in mya) is provided at the bottom. MAGN. = magnoliids.





SI Figure 19

Fig. 19. Transition/transversion saturation plots of first + second codon positions combined and third codon positions for the 61-gene combined data set. GTR + I + G corrected pairwise distance is given on the x axis; uncorrected pairwise (p) distance is given on the y axis; ti = transition; tv =transversion





Table 2. Ceratophyllum plastid genome sequence characteristics and 454 sequencing run characteristics

Characteristics

Ceratophyllum

Genome charasterics:

 

Total genome length

156,252 bp

IR length

25,319 bp

LSC length

86,607 bp

SSC length

19,007 bp

Overall G/C content

38.2%

454 sequencing run characteristics

 

Genome coverage

99.999%

Average read depth

40.1´

Proportion of bases ³ Q40

99.8%





Table 3. Sequence characteristics of data partitions and genes

Partition/gene

Total aligned length, bp

Analyzed aligned length, bp

Average pairwise distance

Number PI characters

PI/analyzedlength

61 genes combined

49,893

42,519

0.114

16,463

0.387

61 genes, first/second codon positions

33,262

28,345

0.063

6,861

0.242

Fast genes combined

29,283

22,682

0.144

10,970

0.484

Slow genes combined

20,610

19,837

0.081

5,493

0.277

Fast genes:

         

matK

1,992

1,506

0.219

992

0.659

rpl32

300

150

0.190

81

0.540

ccsA

1,110

906

0.175

480

0.530

rps15

363

249

0.171

140

0.562

rps3

798

645

0.163

345

0.535

rpl33

231

198

0.162

107

0.540

cemA

858

687

0.161

362

0.527

rpoC2

6,699

4,128

0.160

2,190

0.531

rps19

390

282

0.158

136

0.482

rps8

441

408

0.158

210

0.515

rpoA

1,158

1,002

0.157

513

0.512

clpP

756

585

0.154

308

0.526

rpl20

438

342

0.154

191

0.558

rps11

450

414

0.141

196

0.473

ycf4

678

549

0.134

239

0.435

atpF

597

552

0.130

258

0.467

psaJ

138

126

0.130

48

0.381

atpE

450

398

0.129

192

0.482

psbH

255

219

0.125

80

0.365

rps2

777

708

0.123

303

0.428

psbK

213

183

0.122

78

0.426

rpl16

444

405

0.122

159

0.393

rps4

723

591

0.122

253

0.428

rpoC1

2,367

1971

0.120

865

0.439

psaI

114

108

0.118

47

0.435

petL

255

93

0.115

41

0.441

rpl14

375

369

0.114

149

0.404

rpoB

3,501

3240

0.114

1354

0.418

rps18

951

285

0.114

120

0.421

petA

1017

963

0.113

364

0.378

rpl36

117

111

0.113

44

0.396

rps14

327

309

0.111

125

0.405

Slow genes:

         

atpA

1,542

1,521

0.104

527

0.346

atpI

765

744

0.097

243

0.327

atpB

1,512

1,494

0.095

474

0.317

psbT

117

102

0.095

35

0.343

rbcL

1,476

1,425

0.095

442

0.310

psaC

246

243

0.093

73

0.300

psbJ

123

120

0.092

34

0.283

psbM

126

102

0.092

32

0.314

psbI

168

108

0.091

29

0.269

psbB

1,536

1,524

0.088

443

0.291

petD

531

480

0.085

131

0.273

petB

648

645

0.083

170

0.264

psbZ

189

186

0.083

62

0.333

atpH

246

243

0.080

63

0.259

psbC

1,434

1,369

0.079

367

0.268

ycf3

567

501

0.079

138

0.275

psaA

2,292

2,250

0.078

605

0.269

psaB

2,208

2,202

0.077

568

0.258

petG

117

111

0.075

28

0.252

psbN

135

129

0.072

33

0.256

psbE

252

249

0.069

69

0.277

psbA

1,065

1,059

0.064

239

0.226

psbD

1,086

1,059

0.064

246

0.232

rpl2

918

816

0.061

223

0.273

psbF

120

117

0.060

28

0.239

petN

99

87

0.055

17

0.195

psbL

123

114

0.050

23

0.202

rps12

480

369

0.048

61

0.165

rps7

489

468

0.041

90

0.192

Analyzed aligned length equals the total aligned length minus excluded characters. PI = parsimony informative.





Table 4. Results of topology tests

       

P (values ≥ 0.05 in bold)

Tree

Topology

-ln L

Diff -ln L

AU

SH

WSH

1

((((Cerato,eud),mono),(chlor,magnol)),out)

460654.145

(best)

0.941

1.000

0.999

79

(((Cerato,(mono,eud)),(chlor,magnol)),out)

460663.508

9.4

0.478

0.779

0.763

27

(((((Cerato,eud),mono),magnol),chlor),out)

460662.804

8.7

0.297

0.789