Table S1. Details of the datasets used to generate the phylum distribution profiles in Fig. 1a, 1b and 1c.

Dataset index in present review / Dataset index in original paper / NCBI or MG-RAST run ID or accession no. / Type of dataset / NGS platform / Reference
A1 / AH1d1, AH1d2 / SRR640619-20 / 16S rRNA amplicon / Ion Torrent PGM / 6
A2 / AK1d1, AK1d2, AK1d3 / SRR640636-38 / 16S rRNA amplicon / Ion Torrent PGM / 6
A3 / AL1d1, AL1d2, AL1d3 / SRR640662-64 / 16S rRNA amplicon / Ion Torrent PGM / 6
A4 / AVKd1, AVKd2, AVKd3 / SRR640689-91 / 16S rRNA amplicon / Ion Torrent PGM / 6
A5 / BDEd1, BDEd2, BDEd3 / SRR640698-700 / 16S rRNA amplicon / Ion Torrent PGM / 6
A6 / BY1d1, BY1d2, BY1d3 / SRR640707-09 / 16S rRNA amplicon / Ion Torrent PGM / 6
A7 / EBAd1, EBAd2, EBAd3 / SRR640725-27 / 16S rRNA amplicon / Ion Torrent PGM / 6
A8 / IQAd1, IQAd2, IQAd3 / SRR640734-36 / 16S rRNA amplicon / Ion Torrent PGM / 6
A9 / NORd1, NORd2, NORd3 / SRR640743-45 / 16S rRNA amplicon / Ion Torrent PGM / 6
A10 / RANd1, RANd2, RANd3 / SRR640752-54 / 16S rRNA amplicon / Ion Torrent PGM / 6
A11 / RUSd1, RUSd2, RUSd3 / SRR640761-63 / 16S rRNA amplicon / Ion Torrent PGM / 6
A12 / THUd1, THUd2, THUd3 / SRR640770-72 / 16S rRNA amplicon / Ion Torrent PGM / 6
C1 / CQM1 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C2 / CQM2 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C3 / CQM3 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C4 / DQM3 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C5 / DQM4 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C6 / DQM5 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C7 / DQM12 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C8 / DQM50 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
C9 / DQM200 / SRX731327 / 16S rRNA amplicon / Illumina MiSeq / 26
M1 / T23 2% I, T23 2% II / HM602044–HM622061, HQ457546–HQ462469 / 16S rRNA amplicon / 454 GS FLX / 9
M2 / T66 2% I, T66 2% II / HM602044–HM622061, HQ457546–HQ462469 / 16S rRNA amplicon / 454 GS FLX / 9
M3 / T23 5% I, T23 5% II / HM602044–HM622061, HQ457546–HQ462469 / 16S rRNA amplicon / 454 GS FLX / 9
PolA1 / A1_lib / ERR193622 / 16S rRNA amplicon / 454 GS FLX / 27
PolA2 / A2_lib / ERR193623 / 16S rRNA amplicon / 454 GS FLX / 27
PolB1 / B1_lib / ERR193624 / 16S rRNA amplicon / 454 GS FLX / 27
PolB2 / B2_lib / ERR193625 / 16S rRNA amplicon / 454 GS FLX / 27
PolB3 / B3_lib / ERR193626 / 16S rRNA amplicon / 454 GS FLX / 27
PolB4 / B4_lib / ERR193627 / 16S rRNA amplicon / 454 GS FLX / 27
Tb1 / Walagan bottom of active layer microcosm, 30% crude oil contamination / SRR1180575 / 16S rRNA amplicon / 454 GS FLX / 31
Tb2 / Walagan North bottom of active layer microcosm, 30% crude oil contamination / SRR1180576 / 16S rRNA amplicon / 454 GS FLX / 31
Tb3 / Taiyuan bottom of active layer microcosm, 30% crude oil contamination / SRR1180577 / 16S rRNA amplicon / 454 GS FLX / 31
Tb4 / Jiagedaqi bottom of active layer microcosm, 30% crude oil contamination / SRR1180578 / 16S rRNA amplicon / 454 GS FLX / 31
Tp1 / Walagan permafrost table microcosm, 30% crude oil contamination / SRR1180579 / 16S rRNA amplicon / 454 GS FLX / 31
Tp2 / Walagan North permafrost table microcosm, 30% crude oil contamination / SRR1180580 / 16S rRNA amplicon / 454 GS FLX / 31
Tp3 / Taiyuan permafrost table microcosm, 30% crude oil contamination / SRR1180581 / 16S rRNA amplicon / 454 GS FLX / 31
Tp4 / Jiagedaqi permafrost table microcosm, 30% crude oil contamination / SRR1180582 / 16S rRNA amplicon / 454 GS FLX / 31
Tu1 / Walagan upper active layer microcosm, 30% crude oil contamination / SRR1055249 / 16S rRNA amplicon / 454 GS FLX / 31
Tu2 / Walagan North upper active layer microcosm, 30% crude oil contamination / SRR1055249 / 16S rRNA amplicon / 454 GS FLX / 31
Tu3 / Taiyuan upper active layer microcosm, 30% crude oil contamination / SRR1055249 / 16S rRNA amplicon / 454 GS FLX / 31
Tu4 / Jiagedaqi upper active layer microcosm, 30% crude oil contamination / SRR1055249 / 16S rRNA amplicon / 454 GS FLX / 31
OSC1 / 2010Suncor Oil Sands Core Run 11 Subsample 1 / SRR573815 / 16S rRNA amplicon / 454 GS FLX / 2
OSC2 / 2010Suncor Oil Sands Core Run 11 Subsample 2 / SRR573816 / 16S rRNA amplicon / 454 GS FLX / 2
OSC3 / 2010Suncor Oil Sands Core Run 11 Subsample 3 / SRR573817 / 16S rRNA amplicon / 454 GS FLX / 2
OSC4 / 2010Suncor Oil Sands Core Run 11 Subsample 4 / SRR573818 / 16S rRNA amplicon / 454 GS FLX / 2
OSC5 / 2010Suncor Oil Sands Core Run 11 Subsample 5 / SRR573819 / 16S rRNA amplicon / 454 GS FLX / 2
OSC6 / 2010Suncor Oil Sands Core Run 11 Subsample 6 / SRR573820 / 16S rRNA amplicon / 454 GS FLX / 2
OSC7 / 2010Suncor Oil Sands Core Run 11 Subsample 7 / SRR573821 / 16S rRNA amplicon / 454 GS FLX / 2
OSC8 / 2010Suncor Oil Sands Core Run 11 Subsample 8 / SRR573822 / 16S rRNA amplicon / 454 GS FLX / 2
OSC9 / 2010Suncor Oil Sands Core Run 11 Subsample 9 / SRR573823 / 16S rRNA amplicon / 454 GS FLX / 2
OSC10 / 2010Suncor Oil Sands Core Run 11 Subsample 10 / SRR573824 / 16S rRNA amplicon / 454 GS FLX / 2
OSC11 / 2010Suncor Oil Sands Core Run 11 Subsample 11 / SRR573825 / 16S rRNA amplicon / 454 GS FLX / 2
OSC12 / 2010Suncor Oil Sands Core Run 11 Subsample 13 / SRR573826 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP1 / 2009Suncor Tailings Pond 5 (6.5 ft) / SRR572718 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP2 / 2009Suncor Tailings Pond 5 (15 ft) / SRR573716 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP3 / 2009Suncor Tailings Pond 5 (20 ft) / SRR573717 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP4 / 2009Suncor Tailings Pond 5 (30 ft) / SRR573725 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP5 / 2009Suncor Tailings Pond 5 (40 ft) / SRR573736 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP6 / 2009Suncor Tailings Pond 5 (50 ft) / SRR573738 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP7 / 2009Suncor Tailings Pond 5 (65 ft) / SRR573739 / 16S rRNA amplicon / 454 GS FLX / 2
OSTP8 / 2009Suncor Tailings Pond 5 (95 ft) / SRR573741 / 16S rRNA amplicon / 454 GS FLX / 2
41-0cm / Sample Core_41-0 / SRR090124 / Metagenomic shotgun sequencing / Ion Torrent PGM / 32
41-05cm / Sample Core_41-05 / SRR090125 / Metagenomic shotgun sequencing / Ion Torrent PGM / 32
41-10cm / Sample Core_41-10 / SRR090126 / Metagenomic shotgun sequencing / Ion Torrent PGM / 32
47-0cm / Sample Core_47-0 / SRR090127 / Metagenomic shotgun sequencing / Ion Torrent PGM / 32
47-05cm / Sample Core_47-05 / SRR090128 / Metagenomic shotgun sequencing / Ion Torrent PGM / 32
47-10cm / Sample Core_47-10 / SRR090129 / Metagenomic shotgun sequencing / Ion Torrent PGM / 32
BioP_t0 / Biopile_t0 / SRR069800 / Metagenomic shotgun sequencing / 454 GS FLX / 33
BioP_t1m / Biopile_t1m / SRR069801, SRR06982 / Metagenomic shotgun sequencing / 454 GS FLX / 33
BioP_t1y / Biopile_1y / SRR069803 / Metagenomic shotgun sequencing / 454 GS FLX / 33
BrMgv2 / BRMgv-2Clean / MG-RAST ID 4451034.3 / Metagenomic shotgun sequencing / 454 GS FLX / 3
Kandla / Petroleum Metagenome / SRR921501 / Metagenomic shotgun sequencing / Ion Torrent PGM / 15
BP278 / GoM Deep Sea Sediment 278 / MG-RAST ID 4465490.3 / Metagenomic shotgun sequencing / 454 GS FLX / 17
BP315 / GoM Deep Sea Sediment 315 / MG-RAST ID 4465491.3 / Metagenomic shotgun sequencing / 454 GS FLX / 17
SCA1 / SCADC 454: Short chain n-alkane (C6-C10) degrading methanogenic enrichment culture / SRR634694 / Metagenomic shotgun sequencing / 454 GS FLX+ / 28
SCA2 / SCADC 454: Short chain n-alkane (C6-C10) degrading methanogenic enrichment culture / SRR634695 / Metagenomic shotgun sequencing / 454 GS FLX+ / 28