Supplementary Information
Table S1: Sampling details. Tube worm specimens were sampled during the two cruises I: AT15-28 (“Fix08 I”, December 2007/January 2008) and II: AT15-38 (“Fix08 II”, October 2008) at different EPR vent sites. AD: Alvin Dive. The in situ chemistry measurements (“Chemistry”) were performed during the cruise AT15-28 at 9°50’N, Tica vent.
Table S2: Single gene homogeneity. Selected genes with relevant metabolic functions and the ITS and 16S rRNA sequences were compared a) between all three symbiont metagenomes pyrosequenced in this study (R1: Riftia 1 symbiont, R2: Riftia 2 symbiont, T: Tevnia symbiont) and b) between R1, R2, T and the Sanger-sequenced metagenome of 1: Cand. E. persephone as published by Robidart et al. (2008), using the Geneious ProTM tool. Homogeneity is given as percentage of average pairwise identities, i. e. the proportion of homologous base pairs at the same column with regard to the total number of pairs in the alignment. The homogeneity of all key metabolic gene sequences (excluding 16S rRNA and ITS genes) from R1, R2 and T averages 99.9 %, resulting in an average heterogeneity of 0.1 %. The key gene alignment of R1, R2, T and the previously published Cand. E. persephone metagenome resulted in an average of 99.6% homogeneity (0.4% heterogeneity). Five of the genes were not identified in the Cand. E. persephone metagenome. For accession numbers see Figures S2 and S3.
Table S3: Genes of particular interest. Selected representative genes encoding relevant metabolic key enzymes, proteins involved in oxidative stress response and in cell surface-associated processes, and proteins which are putatively related to symbiont-host interactions are listed with their respective GenBank accession numbers for the metagenomes of the Riftia 1 symbiont, the Riftia 2 symbiont and the Tevnia symbiont. Asterisk: Protein is also involved in organic carbon metabolism. 1) Two or more partial coding sequences for one gene; 2) two or more individual copies in separate locations of the metagenomes. n. a.: “not annotated”, nucleotide sequence was manually identified in the metagenome but was not detected during the initial automatic annotation. (Note: This list is not exhaustive but presents exemplary features of the three metagenomes.)
Table S4: Comparison of intracellular soluble proteins. Protein names with corresponding function, Enzyme Commission (E.C.) number, isoelectric point (pI), molecular weight (MW) and GenBank accession number are listed for all identified proteins with at least 1.5-fold change of spot volume (ratio) when comparing the Riftia symbiont and the Tevnia symbiont protein gels (Figure 2). Spot volume values are expressed in percentages (%Vol) of the total proteome on the respective symbiont reference map (pI range 4 – 7). Only proteins exhibiting relative volumes (%Vol) of at least 0.1 were included. Negative ratio values (light gray cells) correspond to comparatively higher spot volumes on the Tevnia symbiont master gel, whereas positive ratios (dark gray cells) indicate larger spot volumes for the Riftia symbiont proteins. Proteins were considered as unambiguously identified, if their MS identification was based on at least two individual peptides, a minimum score of 75 and a sequence coverage of at least 30%. Asterisk: Protein is also involved in organic carbon metabolism.
Figure S1: Tube worm clump at the sampling site. Tevnia jerichonana and Riftia pachyptila specimens at Tica vent, 9°50’N. The picture was taken on January 11th 2008 directly before collection of the tube worms. The long red plumes of the larger Riftia specimens in the center of the clump are more distant from the diffuse flow source at the seafloor than the plumes of the smaller Tevnia specimens.
Figure S2: Single gene comparison. 24 key genes encoding major metabolic enzymes involved in sulfide oxidation, carbon fixation, nitrogen metabolism and oxidative stress response from the Riftia 1 symbiont, the Riftia 2 symbiont, and the Tevnia symbiont metagenome were subjected to a mutual alignment and compared to the respective Riftia symbiont sequences published previously (Robidart et al., 2008) using the Geneious ProTM tool. The green bar indicates the nucleotide consensus sequence (shown on top), i. e. the homogeneity of the four symbiont metagenomes. It is interrupted in differing (non-homologous) DNA regions. Protein coding sequences (CDS) of the respective symbionts are indicated by a yellow bar and specified by their GenBank accession numbers on the left. Translations (above the DNA sequences) indicate whether nucleotide deviations impact the amino acid sequences.
Figure S3: ITS and 16S rRNA alignment. ITS and 16S rRNA sequences of the three metagenomes sequenced in this study (Riftia 1 symbiont, Riftia 2 symbiont, Tevnia symbiont) were compared to each other and to the sequences previously published by Robidart et al. (2008) using Geneious ProTM. Additionally, ITS sequences of two individual Riftia symbiont phylotypes described by Harmer et al. (2008) and of seven Riftia and Tevnia symbiont phylotypes as published by Di Meo et al. (2000) were included in the alignment. The 16S rRNA alignment also includes two 16S rRNA sequences published by Di Meo et al. (2000). GenBank accession numbers of publically accessible sequences are indicated on the left. The pink bar characterizes tRNA sequences located within the ITS sequence, whereas the rRNA sequence is shown as a red bar. The consensus sequence (homogeneity of the compared genes) is highlighted in green.
3