1

DING proteins: numerous functions, elusive genes, a potential for health

Supplementary material

Online resource 1 Pairwise aminoacid identities between the representative members of the five DING protein sub-families.

MDR / BBc / SBW / HPBP
Wood / 73 / 70 / 75 / 69
MDR / X / 68 / 73 / 66
BBc / X / X / 74 / 68
SBW / X / X / X / 77

Online resource 2 Alignment of all known DING protein N-termini. Proteins are identified by species (a protein name or ligand is added when more than one DING protein is known for a given species). The first 34 residues are shown since this corresponds to the longest determined N-terminus of a DING protein (human genistein binding protein). Thus, all sequences shorter than 34 aminoacids are from purified DING proteins while the other onesare predicted from DNA sequences(Table 1). Variable residues used to define groups 1-7 (Table 1) are shown in blue. Residues differing from the consensus (Fig. 2) are shown in red.

* Procaryotic sequences

Online resource 3Alignment of pro-peptides deduced from Pseudomonas DING protein genes as well as two eucaryotic genes. Propeptides assemble in two very homogenous groups, one of them made of all P. aeruginosa sequences. It should be noted that eucaryotic ORFs extend further even though the clones are incomplete and do not allow identification of the initiation codon.

Human CW626261 RSFFMFKRNVLAVSMTLAALCSAQAAMADINGGG

Leishmania RSFFMFKRNVLAVSMTLAALCSAQAAMADINGGG

P. fluorescens BBc6R8 MFKRNVLAVSMTLAALCSAQAAMADINGGG

P. fluorescens SBW25 MFKRNVLAVSMTLAALCSAQAAMADINGGG

P. fluorescens Pf5 MFKRNVLAASLTLAALCSAQAAMADINGGG

P. fluorescens Wayne1 MFKRNVLAASLTLAALCSAQAAMADINGGG

P. fluorescensWoodR1/F113 MFKRNVLAVSLTLAGLCAAQAAMADVNGGG

Pseudomonas sp. PAMC25886 MFKRNVLAVSMTLAALCSAQAAMADVNGGG

P. extremaustralis MFKRNVLAVSMTLAALCSAQAAMADINGGG

P. brassicacearum mfkrnvlavsltlaglcaaqaamadvNggg

P. aeruginosa PA7 MFKRSLIAASLSVAALVSAQ-AMATVNGGG

P. aeruginosa 39016 MFKRSLIAASLSVAALVSAQ-AMAEINGGG

P. aeruginosa PA14 MYKRSLIAASLSVAALVSAQ-AMADINGGG

P. aeruginosa MDR1 MYKRSLIAASLSVAALVSAQ-AMAEINGGG

P. aeruginosa NCGM MFKRSLIAASLSVAALVSAQ-AMAEINGGG

*:**.::*.*:::*** *** *** :****

Online resource 4 Alignment (ClustalW, followed by hand edition for some short sequences) of DING protein sequences within each sub-family. When more than one ORF is known, nucleotide alignment is also shown.

HPBP sub-family (groups 1 and 2)– proteins

turkey/1-45 ------

rat_neurones/1-91 XINGGGATLPQKLYLTPNVLTAGFAPYI------IAFLENKYNQFGTDT------

human HPBP/1-376 DINGGGATLPQKLYLTPDVLTAGFAPYIGVGSGKGKIAFLENKYNQFGTDTTKNVHWAGS

Human_amniotic/1-215 ------AFLENKYNQFGTDTT------

tobacco_cDNA/1-54 ------GVGSGKGKIAFLENKYNQFGTDTTKNVHWAGS

turkey/1-45 ----TATELAT-YAADC------LIQVPAVATAVAIPFR------

rat_neurones/1-91 ------

human HPBP/1-376 DSKLTATELAT-YAADKEPGWGKLIQVPSVATSVAIPFRKAGANAVDLSVKELCGVFSGR

Human_amniotic/1-215 ------GKLIQVPSVATSVAIPFRKAGANAVDLSVKELCGVFSGR

tobacco_cDNA/1-54 DSKLTATELAT-YAADKEPGWGK------

turkey/1-45 ------

rat_neurones/1-91 ------

human HPBP/1-376 IADWSGITGAGRSGPIQVVYRAESSGTTELFTRFLNAKCTTEPGTFAVTTTFANSYSLGL

Human_amniotic/1-215 ------TGA-----IQVVYRAESSGTTELFTR------

tobacco_cDNA/1-54 ------

turkey/1-45 ------

rat_neurones/1-91 ------ITYISPDFAAPTLAGLDD------VGKG-WNG

human HPBP/1-376 TPLAGAVAATGSDGVMAALNDTTVAEGRITYISPDFAAPTLAGLDDATKVARVGKGVVNG

Human_amniotic/1-215 ------ITYISPDFAAPTLAGLDDATK------

tobacco_cDNA/1-54 ------

turkey/1-45 ------

rat_neurones/1-91 VAVEGK-PAAANVSAAISVVPLPA------

human HPBP/1-376 VAVEGKSPAAANVSAAISVVPLPAAADRGNPDVWVPVFGATTGGGVVAYPDSGYPILGFT

Human_amniotic/1-215 --VEGKSPAAANVSAAISVVPLPAAADRGNPDVWVPVF------YPDSGYPILGFT

tobacco_cDNA/1-54 ------

turkey/1-45 ------HYGTSSNNDSAIEANAK------

rat_neurones/1-91 ------

human HPBP/1-376 NLIFSQCYANATQTGQVRDFFTKHYGTSANNDAAIEANAFVPLPSNWKAAVRASFLTASN

Human_amniotic/1-215 NLIFSQCYANATQTGQVRDF---HYGTSANNDAAIEAN---PLPSNWKAAVRASFLTASN

tobacco_cDNA/1-54 ------

turkey/1-45 ------

rat_neurones/1-91 ------

human HPBP/1-376 ALSIGNTNVCNGKGRPQ-

Human_amniotic/1-215 ALSIGNTNVCNGKGRP-

tobacco_cDNA/1-54 ------

Online resource 4 (continued)

SBW sub-family (groups 3 and 4) - proteins

Arabidopsis_1a/1-55 ------GVGSGNGKAAFLTNDYTKFVAGVSNKNVHWAG

Arabidopsis_1b/1-55 ------GVGSGNGKAAFLTNDYTNFVAGVSNKNVHWAG

S.solfataricus/1-210 DINGGGATLPQKLYQTSGVLTAGFAPYIGVGSGNGKAAFLTNDYTK------

Potato_prot/1-209 DINGGGATLPQKLYQTSGVLTAGFAPYI------AAFLTNDYT------

SBW25/1-370 DINGGGATLPQALYQTSGVLTAGFAQYIGVGSGNGKAAFLNNDYTKFQAGVTNKNVHWAG

P38/1-362* DINGGGATLPQALYQTSGVLTAGFAPYIGVGSGNGKAAFLNNDYTKFQAGVTNKNVHWAG

X-DING-CD4_prot/1-89 DINGGGATLPQK------VLTAGFAPYL------TNDY-KFVAGVSNKNVHWAG

Arabidopsis_1a/1-55 SDSKLTATELSTYATNKQPTWGK------

Arabidopsis_1b/1-55 SDSKLTATELSTYATNKQPTWGK------

S.solfataricus/1-210 ----LTATELSTYATNLQPTWGKLIQVPSVATSVAIPFR------

Potato_prot/1-209 ---KLTATELSTYATNK------LIQVPSVATSVAIPF----ANAVDLSVSELCGVFSGR

SBW25/1-370 SDSKLSATELSTYASAKQPTWGKLIQVPSVGTSVAIPFNKSGSAAVDLSVQELCGVFSGR

P38/1-362 SDSKLSATELSTYASAKQPTWGKLIQVPSVGTAVAIPFNKSGTAAVDLSVSELCGVFSGR

X-DING-CD4_prot/1-89 SDSKLTATELSTYATNKQ------QVPSVATSVAL------

Arabidopsis_1a/1-55 ------

Arabidopsis_1b/1-55 ------

S.solfataricus/1-210 ITDWSGISGAGRTGPITVVYR------

Potato_prot/1-209 ITDWSGLY--GRTGPITVVYRSESSGTTELFTR------

SBW25/1-370 INTWDGISGSGRTGPIVVVYRSESSGTTELFTRFLNAKCNAETGNFAVTTTFGTSFSGGL

P38/1-362 ITDWSGISGSGRTGAITVVYRSESSGTTELFTRFLNAKC-AETGTFNISTTFGTSYTGGL

X-DING-CD4_prot/1-89 ------

Arabidopsis_1a/1-55 ------

Arabidopsis_1b/1-55 ------

S.solfataricus/1-210 ------ITYM-SPDFAASTLAGLDDATK------GV

Potato_prot/1-209 ------ITYPYSPVYAASTLAGLDDATK------

SBW25/1-370 PAGAVAATGSQGVMTALAAGDGRITYM-SPDFAAPTLAGLDDATKVARVGKNVATNTQGV

P38/1-362 PAGAVSAAGSQGVMTALAGADGGITYM-SPDFAAPTLAGLDDATKVARVGKDVATNTAGV

X-DING-CD4_prot/1-89 ------FAASTLAGLDDATK------

Arabidopsis_1a/1-55 ------

Arabidopsis_1b/1-55 ------

S.solfataricus/1-210 SPAPSNVSDAIAQV-LP-PNDPSAPLDVTNP--DDGVAGVQPYPDSGYPILGFTNLIFS-

Potato_prot/1-209 -----NVSK------RTLQWYPVP------

SBW25/1-370 SPAAANVSAAIGAVPVPAAADRSNPDAWVPVFGPDNTAGVQPYPTSGYPILGFTNLIFSQ

P38/1-362 SPAAANVSAAINAVPVPASTEK--PEFGKA-----NTAGVQPYPTSGYPILGFTNLIFSQ

X-DING-CD4_prot/1-89 ------

Arabidopsis_1a/1-55 ------

Arabidopsis_1b/1-55 ------

S.solfataricus/1-210 ------AFFTKHFGDTNNNDDAITANRFVPLPDNWK------

Potato_prot/1-209 ------VFGKHFGDTNNTQDAITANRFVPLPDNWKATITDNFVTASSALSIGK

SBW25/1-370 CYADATQTTQVRDFFTKHYGASNNNDAAITANAFVPLPTAWKATVRASFLTASNALSIGN

P38/1-362 CYADATQTSQVRDFFAKHYGASNNNDAAITANAFVPLPTAWKATVRASFLTASNALSIGN

X-DING-CD4_prot/1-89 ------

Arabidopsis_1a/1-55 ------

Arabidopsis_1b/1-55 ------

S.solfataricus/1-210 ------

Potato_prot/1-209 TNVCNGIGRGPL

SBW25/1-370 TNVCNGIGR-PL

P38/1-362 TNVCNGIGR-PL

X-DING-CD4_prot/1-89 TNVCNGLGR-PL

* P38 represents both the sequence from human genome (glioblastoma cells) and St.John’s wort (see Table 1)

Online resource 4 (continued)SBW sub-family (groups 3 and 4) – genes

P38/1-1095 ATGGCCGATATAAACGGTGGTGGTGCGACACTACCCCAAGCGCTGTACCAGACTTCCGGC

SBW25/1-1185 ATGGCTGACATCAATGGCGGTGGTGCCACCCTGCCACAAGCGCTGTACCAGACCTCCGGC

Arabidopsis_1a/1-165 ------GGTGTGGGCAGCGGGAATGGCAAGGCAGCC

Arabidopsis_1b/1-165 ------GGTGTGGGCAGCGGGAATGGCAAGGCAGCC

P38/1-1095 GTGCTGACTGCCGGTTTTGCCCCGTACATCGGCGTGGGCAGCGGTAATGGCAAGGCCGCC

SBW25/1-1185 GTGTTGACTGCCGGTTTCGCCCAGTACATTGGTGTCGGCAGCGGTAACGGCAAGGCAGCC

Arabidopsis_1a/1-165 TTCTTGACCAACGATTACACCAAGTTCGTGGCTGGCGTGAGCAACAAGAACGTGCACTGG

Arabidopsis_1b/1-165 TTCTTGACCAACGATTACACCAATTTCGTGGCTGGCGTGAGCAACAAGAACGTGCACTGG

P38/1-1095 TTCCTGAACAACGACTACACCAAGTTCCAGGCCGGCGTGACGAACAAGAATGTGCACTGG

SBW25/1-1185 TTCCTGAACAACGACTACACCAAGTTCCAGGCTGGCGTGACGAACAAGAACGTGCACTGG

Arabidopsis_1a/1-165 GCCGGTAGCGATTCGAAGCTGACTGCGACTGAACTGTCGACCTACGCCACCAACAAACAA

Arabidopsis_1b/1-165 GCCGGTAGCGATTCGAAGCTGACTGCGACTGAACTGTCGACCTACGCCACCAACAAACAA

P38/1-1095 GCCGGCAGCGACTCCAAGCTGAGCGCCACTGAGCTGTCGACCTACGCGTCTGCCAAGCAA

SBW25/1-1185 GCGGGCAGCGACTCCAAGCTGAGCGCCACTGAGTTGTCGACCTACGCGTCTGCCAAGCAA

Arabidopsis_1a/1-165 CCTACCTGGGGCAAA

Arabidopsis_1b/1-165 CCTACCTGGGGCAAA

P38/1-1095 CCGACCTGGGGCAAGTTGATCCAGGTGCCGTCGGTGGGTACTGCGGTCGCTATTCCCTTC

SBW25/1-1185 CCGACCTGGGGCAAGTTGATCCAGGTGCCATCGGTGGGGACTTCGGTTGCCATTCCTTTC

P38/1-1095 AACAAAAGCGGCACCGCAGCGGTTGACCTGAGCGTCAGCGAGCTGTGCGGTGTGTTCTCG

SBW25/1-1185 AATAAAAGCGGTTCCGCCGCTGTAGACCTGAGCGTTCAAGAGTTGTGCGGCGTGTTCTCG

P38/1-1095 GGGCGTATCACTGACTGGAGCGGTATTTCCGGTTCCGGCCGTACCGGCGCGATCACCGTG

SBW25/1-1185 GGCCGTATCAATACCTGGGACGGTATTTCCGGTTCTGGCCGTACCGGTCCGATCGTTGTG

P38/1-1095 GTCTACCGTAGCGAAAGCAGTGGCACCACCGAGTTGTTCACCCGTTTCCTCAACGCCAAG

SBW25/1-1185 GTTTATCGCAGCGAAAGCAGTGGTACCACTGAGCTGTTCACCCGTTTCCTCAATGCCAAG

P38/1-1095 TG---TGCTGAAACCGGCACCTTCAATATCTCCACCACGTTCGGCACCAGCTACACCGGT

SBW25/1-1185 TGCAACGCAGAAACAGGCAACTTCGCCGTCACCACCACCTTCGGCACCAGCTTCTCCGGT

P38/1-1095 GGTTTGCCTGCCGGCGCCGTTTCTGCCGCCGGCAGCCAAGGTGTTATGACCGCGTTGGCC

SBW25/1-1185 GGCTTGCCTGCCGGCGCCGTTGCTGCTACTGGCAGCCAAGGTGTAATGACTGCCCTGGCC

P38/1-1095 GGCGCCGACGGTGGTATCACCTACATGAGCCCTGATTTCGCGGCCCCAACCCTGGCCGGT

SBW25/1-1185 GCCGGCGATGGCCGCATCACCTACATGAGCCCTGACTTCGCCGCCCCTACATTGGCCGGT

P38/1-1095 CTGGACGACGCGACCAAAGTGGCACGTGTCGGCAAGGACGTCGCGACCAACACTGCGGGC

SBW25/1-1185 CTGGACGACGCTACCAAAGTGGCTCGCGTGGGCAAAAACGTCGCCACTAACACCCAGGGC

P38/1-1095 GTTTCGCCTGCCGCCGCTAACGTCTCCGCTGCCATCAACGCTGTGCCAGTTCCAGCATCA

SBW25/1-1185 GTTTCGCCTGCCGCCGCCAACGTGTCTGCCGCTATCGGCGCAGTACCGGTACCGGCTGCC

P38/1-1095 ACCGA------AAAGCCGGAA------TTCGGCAAAGCCAACACCGCCGGT

SBW25/1-1185 GCTGATCGTTCCAACCCGGACGCCTGGGTTCCAGTCTTCGGTCCGGACAACACCGCCGGT

P38/1-1095 GTGCAGCCTTACCCTACCTCGGGCTACCCGATCCTGGGCTTCACCAACCTGATCTTCAGC

SBW25/1-1185 GTACAGCCTTACCCAACCTCGGGCTACCCGATCCTGGGCTTTACCAACCTGATCTTCAGC

P38/1-1095 CAGTGCTACGCCGATGCCACCCAGACCAGCCAAGTGCGTGATTTCTTCGCCAAGCACTAC

SBW25/1-1185 CAGTGCTACGCCGACGCGACCCAGACCACCCAAGTGCGTGATTTCTTCACCAAGCACTAC

P38/1-1095 GGCGCCTCCAACAACAACGATGCAGCCATCACCGCCAACGCTTTCGTGCCGCTGCCAACC

SBW25/1-1185 GGCGCCTCCAACAACAACGATGCAGCCATCACCGCCAACGCTTTCGTTCCACTGCCAACC

P38/1-1095 GCTTGGAAAGCCACCGTTCGCGCCAGCTTCCTGACCGCGAGCAACGCCCTGAGCATCGGC

SBW25/1-1185 GCTTGGAAAGCCACCGTTCGCGCCAGTTTCCTGACCGCGAGCAACGCCCTGAGCATCGGC

P38/1-1095 AACACCAACGTCTGCAACGGCATCGGCCGTCCGCTGTAA

SBW25/1-1185 AACACCAACGTCTGCAACGGTATCGGTCGTCCGCTGTAA

Online resource 4 (continued)MDR sub-family (group 5) - proteins

MDR1/1-369 EINGGGATLPQQLYQEPGVLTAGFAAYIGAGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG

39016/1-369 EINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG

NCGM2/1-369 EINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG

138244/1-253 EINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG

PA14/1-369 DINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG

PA7/1-369 TVNGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTDKNVHWAG

GP56/1-87 ------AAFLNNDYTKFVAGTTNKNVHWAG

P.extremaustralis/1-369 DINGGGATLPQQLYQTPGVLTAGFAQYIGVGSGNGKAAFLTNDYTKFVAGVSNKNVHWAG

MDR1/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS

39016/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS

NCGM2/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS

138244/1-253 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS

PA14/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS

PA7/1-369 SDSKLNPTSETSPYLAAHGAAWGPLIQVPSVATSVAIPFNKAGSNAVNFADVNTLCGVFS

GP56/1-87 SDSK------LIQVPSVATSVALPFRK------

P.extremaustralis/1-369 SDSKLSAT-ELSTYATNKQPTWGKLIQVPSVATSVAIPFNKAGTAAVNL-SVNQLCGVFS

MDR1/1-369 GRLTDWSQIPGSGRSGAITVAYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF

39016/1-369 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF

NCGM2/1-369 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF

138244/1-253 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF

PA14/1-369 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSTLEGGTFAITTSFGSSF

PA7/1-369 GRLTDWSQIPDSGRTGAITVVYRSESSGTTELFTRFLNASCSSATEGGTFAITTSFGSSF

GP56/1-87 --LTDWSQITGAGRSGAITVVYRSESSGTTELFTR------

P.extremaustralis/1-369 GRLTNWNQITGSGRTGAIKVVYRSESSGTTELFTRFLNAKCS---EAKAFAITTTFSSSY

MDR1/1-369 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA

39016/1-369 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA

NCGM2/1-369 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA

138244/1-253 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA

PA14/1-369 SGGLPAGAVSAQGSQAVMNALNAAQGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA

PA7/1-369 SGGLPSGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQIRGVSPAPA

GP56/1-87 ------

P.extremaustralis/1-369 GNGVPAGAVAATGSQGVMTTLNATDGGITYMSPDYAATTLAGLDDATKVAKVAGVSPAPA

MDR1/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT

39016/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT

NCGM2/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT

138244/1-253 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT

PA14/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATANPNDPSVRPYPTSGYPILGFT

PA7/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT

GP56/1-87 ------

P.extremaustralis/1-369 NVSAAIGQIPAPTEA-RADGSFIDATNQDNWVPVFAATAS-GPATFA-YPTTGYPILGFT

MDR1/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN

39016/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN

NCGM2/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN

138244/1-253 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN

PA14/1-369 NLIFSQCYANATQTQQVRDFFTRHYGATANNDTAITNHRFVPLPASWKLAVRQSFLTSTN

PA7/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDAAITNHRFVPLPASWKLAVRQSFLTSTN

GP56/1-87 ------FVPLPDSWK------

P.extremaustralis/1-369 NLIFSQCYADPTQTTQVQDFFLQHFGAFNNNDAAINDNRFVPLPSNWKQAITDTFATVSS

MDR1/1-369 NLYIGHSNVCNGIGRPL

39016/1-369 NLYIGHSNVCNGIGRPL

NCGM2/1-369 NLYIGHSNVCNGIGRPL

138244/1-253 NLYIGHSNVCNGIGRPL

PA14/1-369 NLYIGHSNVCNGIGRPL

PA7/1-369 NLYIGHTNVCNGIGRPL

GP56/1-87 ------

P.extremaustralis/1-369 GQGIGNTSVCNGIGRPL

Online resource 4 (continued)

MDR sub-family (group 5) – genes

39016/1-1190 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA

NCGM2/1-1113 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA

138244/1-1116 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA

MDR1/1-1179 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA

PA14/1-1116 ATGGCCGACATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA

PA7/64-1179 ATGGCCACCGTCAATGGCGGCGGCGCTACCCTGCCGCAGCAGCTGTACCA

P.extremaustralis/1-1118 ATGGCTGACATCAACGGCGGTGGTGCCACCCTGCCTCAGCAGCTGTACCA

***** **** ***** ** ** ******** ** ***********

39016/1-1190 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTAGGCA

NCGM2/1-1113 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTAGGCA

138244/1-1116 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTAGGCA

MDR1/1-1179 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGCAGGCA

PA14/1-1116 GGAGCCCGGCGTCCTGACCGCCGGCTTTGCCGCCTACATCGGCGTAGGCA

PA7/64-1179 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTCGGCA

P.extremaustralis/1-1118 GACTCCGGGCGTGCTGACGGCCGGTTTCGCCCAATACATCGGCGTGGGCA

* ** ***** ***** ***** ** *** ********** ****

39016/1-1190 GCGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC

NCGM2/1-1113 GCGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC

138244/1-1116 GCGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC

MDR1/1-1179 GTGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC

PA14/1-1116 GTGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC

PA7/64-1179 GCGGCAACGGCAAGGCTGCCTTCCTGAACAACGACTACACCAAGTTCGTC

P.extremaustralis/1-1118 GCGGTAATGGCAAGGCCGCCTTCCTGACCAACGACTACACCAAGTTCGTG

* ** ** ******** ********** *********************

39016/1-1190 GCCGGCACTACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT

NCGM2/1-1113 GCCGGCACTACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT

138244/1-1116 GCCGGCACCACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT

MDR1/1-1179 GCCGGCACTACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT

PA14/1-1116 GCCGGCACCACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT

PA7/64-1179 GCCGGCACCACCGACAAGAACGTGCACTGGGCTGGCAGCGACTCCAAGCT

P.extremaustralis/1-1118 GCCGGTGTGAGCAACAAGAACGTGCACTGGGCCGGTAGCGATTCCAAGCT

***** * * ************* ***** ** ***** ***** **

39016/1-1190 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT

NCGM2/1-1113 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT

138244/1-1116 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT

MDR1/1-1179 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT

PA14/1-1116 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT

PA7/64-1179 GAACCCGACCAGCGAAACCAGCCCCTACCTGGCCGCCCATGGCGCCGCCT

P.extremaustralis/1-1118 GAGCGCGACTGAACTGTCGA---CCTACGCCACCAACAAACAACCGACCT

** * *** * * **** * * * * ***

39016/1-1190 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG

NCGM2/1-1113 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG

138244/1-1116 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG

MDR1/1-1179 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG

PA14/1-1116 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCTGTCGCTCTGCCG

PA7/64-1179 GGGGTCCGCTGATCCAGGTTCCCTCGGTCGCCACCTCGGTCGCCATTCCG

P.extremaustralis/1-1118 GGGGCAAGCTGATCCAGGTGCCATCGGTGGCGACTTCGGTTGCCATTCCG

**** ************ ** ***** ** ** ** ** ** * ***

39016/1-1190 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT

NCGM2/1-1113 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT

138244/1-1116 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT

MDR1/1-1179 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT

PA14/1-1116 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT

PA7/64-1179 TTCAACAAGGCAGGCAGCAACGCCGTCAACTTCGCCGACGTGAACACCCT

P.extremaustralis/1-1118 TTCAACAAGGCCGGTACTGCAGCGGTTAACCT---GAGCGTTAACCAACT

********* * ** * ** ** *** * *** *** **

39016/1-1190 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT

NCGM2/1-1113 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT

138244/1-1116 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT

MDR1/1-1179 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT

PA14/1-1116 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT

PA7/64-1179 CTGCGGCGTGTTCTCCGGCCGCCTGACCGACTGGAGCCAGATCCCCGACT

P.extremaustralis/1-1118 GTGCGGCGTGTTCTCCGGTCGCCTGACCAACTGGAACCAGATCACGGGTT

***** ** ******** ** ****** * **** ***** * * *

39016/1-1190 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC

NCGM2/1-1113 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC

138244/1-1116 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC

MDR1/1-1179 CGGGCCGTAGCGGCGCCATCACAGTGGCCTACCGTTCCGAGAGCAGCGGC

PA14/1-1116 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC

PA7/64-1179 CGGGCCGCACCGGCGCCATCACCGTGGTCTACCGTTCGGAAAGCAGCGGT

P.extremaustralis/1-1118 CCGGCCGCACGGGCGCGATCAAAGTCGTTTACCGCAGTGAATCCAGCGGT

* ***** * ***** **** ** * ***** ** ******

39016/1-1190 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT

NCGM2/1-1113 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT

138244/1-1116 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT

MDR1/1-1179 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT

PA14/1-1116 ACCACCGAACTGTTCACCCGCTTCCTCAACGCCTCCTGCTCCAGCACCCT

PA7/64-1179 ACCACCGAGCTGTTCACCCGCTTCCTCAACGCTTCCTGCTCCAGCGCCAC

P.extremaustralis/1-1118 ACTACCGAACTCTTCACCCGCTTCCTGAACGCCAAGTGC---AGCGAAGC

** ***** ** ************** ***** *** ***

39016/1-1190 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG

NCGM2/1-1113 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG

138244/1-1116 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG

MDR1/1-1179 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG

PA14/1-1116 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAGCAGCTTCTCCG

PA7/64-1179 CGAGGGCGGTACCTTCGCCATCACCACCAGCTTCGGCAGCAGCTTCTCTG

P.extremaustralis/1-1118 CAAAG------CGTTCGCCATCACCACCACCTTCTCGTCGAGCTACGGCA

* * * * **************** **** **** *

39016/1-1190 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG

NCGM2/1-1113 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG

138244/1-1116 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG

MDR1/1-1179 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG

PA14/1-1116 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG

PA7/64-1179 GCGGCCTGCCGTCCGGCGCCGTCTCGGCCCAGGGCAGCCAGGCCGTGATG

P.extremaustralis/1-1118 ACGGCGTGCCGGCCGGTGCCGTTGCCGCGACCGGCAGCCAAGGCGTGATG

**** ***** * ** ***** * ** ******** * *******

39016/1-1190 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT

NCGM2/1-1113 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT

138244/1-1116 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT

MDR1/1-1179 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT

PA14/1-1116 AATGCGCTCAACGCCGCCCAAGGCCGCATCACCTACATGAGCCCGGACTT

PA7/64-1179 AACACGCTCAACGCCGCCGAAGGCCGCATCACCTACATGAGCCCGGACTT

P.extremaustralis/1-1118 ACTACGCTGAACGCTACGGACGGTGGTATCACCTACATGAGCCCGGACTA

* **** ***** * * ** * ******** *************

39016/1-1190 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG

NCGM2/1-1113 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG

138244/1-1116 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG

MDR1/1-1179 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG

PA14/1-1116 CGCCGCGCCGACCCTGGCCGGTCTCGACGACGCCACCAAGGTCGCCCAGG

PA7/64-1179 CGCCGCGCCGACCCTGGCCGGTCTCGACGATGCCACCAAGGTCGCCCAGA

P.extremaustralis/1-1118 CGCCGCGACTACTTTGGCCGGTCTGGATGACGCCACCAAAGTCGCCAAGG

******* * ** ******* ** ** ** ******** ****** **

39016/1-1190 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC

NCGM2/1-1113 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC

138244/1-1116 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC

MDR1/1-1179 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC

PA14/1-1116 TTCGCGGCGTATCCCCGGCGCCGGCCAACGTTTCGGCGGCCATCGGCGCC

PA7/64-1179 TTCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCCGCCATCGGCGCC

P.extremaustralis/1-1118 TTGCCGGTGTTTCGCCTGCGCCTGCCAACGTGTCGGCCGCTATCGGTCAG

* *** ** ** ** ** ** ******** ***** ** *****

39016/1-1190 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------

NCGM2/1-1113 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------

138244/1-1116 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------

MDR1/1-1179 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------

PA14/1-1116 GTTACTCCGCCGACTA----CTGCCCAG------CGTTCC------

PA7/64-1179 GTTACCCCGCCGACCA----CCGCCCAA------CGCTCC------

P.extremaustralis/1-1118 ATCCCTGCGCCAACCGAAGCCCGCGCCGATGGCTCGTTCATCGACGCCAC

* * **** ** * ** * ** **

39016/1-1190 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG

NCGM2/1-1113 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG

138244/1-1116 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG

MDR1/1-1179 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG

PA14/1-1116 -GATCCGAACAACTGGGTACCGGTCTTCGCTGCCACCGCCAACCCCAACG

PA7/64-1179 -GATCCGAACAACTGGGTGCCGGTCTTCGCCGCCACCGCCAGCGCGACCG

P.extremaustralis/1-1118 TAACCAGGACAACTGGGTGCCGGTATTTGCCGCCACCGCAAGT------G

* * * ********** ***** ** ** ***** ** * *

39016/1-1190 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC

NCGM2/1-1113 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC

138244/1-1116 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC

MDR1/1-1179 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC

PA14/1-1116 ACCCGAGCGTGCGTCCGTATCCGACCAGCGGCTACCCGATCCTCGGCTTC

PA7/64-1179 ATCCGAGCGTACGTCCGTACCCCACCACCGGCTACCCGATCCTCGGCTTC

P.extremaustralis/1-1118 GTCCAGCTACCTTCGCCTACCCAACGACCGGCTACCCGATCCTGGGCTTC

** * ** ** ** * ****** ******** ******

39016/1-1190 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA

NCGM2/1-1113 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA

138244/1-1116 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA

MDR1/1-1179 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA

PA14/1-1116 ACCAACCTGATCTTCAGCCAGTGCTACGCCAACGCCACCCAGACCCAGCA

PA7/64-1179 ACCAACCTGATCTTCAGCCAGTGCTACGCGGACGCCACCCAGACCCAGCA

P.extremaustralis/1-1118 ACTAACCTGATCTTCAGCCAGTGCTATGCCGACCCAACCCAGACCACCCA

** *********************** ** ** * ********* **

39016/1-1190 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACA

NCGM2/1-1113 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACA

138244/1-1116 GGTGCGCGACTTCTTCACCCGCCACTACGGCGCCAGCGTCAACAACGACA

MDR1/1-1179 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACA

PA14/1-1116 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCACCGCCAACAACGACA

PA7/64-1179 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACG

P.extremaustralis/1-1118 AGTGCAGGACTTCTTCCTGCAGCACTTCGGTGCATTCAATAACAACGACG

**** ********* * **** *** ** * *********

39016/1-1190 CTGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT

NCGM2/1-1113 CTGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT

138244/1-1116 CTGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT

MDR1/1-1179 CCGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT

PA14/1-1116 CCGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT

PA7/64-1179 CCGCGATCACCAACCATCGCTTCGTGCCGCTGCCTGCTTCCTGGAAGCTC

P.extremaustralis/1-1118 CCGCCATCAACGACAACCGCTTCGTACCGCTGCCATCCAACTGGAAACAA

* ** **** * ** * ******** ******** * ****** *

39016/1-1190 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA

NCGM2/1-1113 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA

138244/1-1116 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA

MDR1/1-1179 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA

PA14/1-1116 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACTTGTACATCGGCCA

PA7/64-1179 GCGGTGCGTCAGTCGTTCCTGACTTCCACCAACAACCTGTACATCGGCCA

P.extremaustralis/1-1118 GCGATCACCGACACCTTCGCGACTGTTTCCAGTGGTCAGGGCATCGGCAA

** * * * *** *** *** * ******* *

39016/1-1190 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA

NCGM2/1-1113 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA

138244/1-1116 TTCCAACGTCTGCAACGGTATCGGCCGTCCGCTCTAA

MDR1/1-1179 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA

PA14/1-1116 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA

PA7/64-1179 CACCAACGTCTGCAACGGCATTGGCCGTCCGCTCTGA

P.extremaustralis/1-1118 CACCAGCGTCTGCAACGGCATCGGTCGTCCGCTGTAA

*** ************ ** ** ******** * *

Online resource 4 (continued)

BBc sub-family (group 6) - proteins

BBc/1-363 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

human_CAI/1-281 DINGGGATLPQPLYQTSGVLTAGFAPYI------LAFLNNDYSQFGTGTKNVHWA---

Leishmania/1-318 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

human_CW626261/25-218 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

X-DING-CD4_cDNA/1-124 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

potato_gene/1-207 ------GVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

Arabidopsis_Col/1-53 ------GVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

tobacco_1b/1-53 ------GVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD

tobacco_1a/1-53 ------GVGSGKGKLAFLNNDYSQFGTGTENVHWAGSD

BBc/1-363 SKLTSTELSTYASTKQATWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT

human_CAI/1-281 --LTSTELSTYASTK------LIQVPSVATSVAIPFNK--TNAVDLSVDQLCGVFSG-IT

Leishmania/1-318 SKLTSTELSTYASTKQAAWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT

human_CW626261/25-218 SKLTSTELSTYASTKQATWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT

X-DING-CD4_cDNA/1-124 SKLTSTELSTYASTKQATWGKLIQVPSVATSVAIPFNKPGTNAVDLSVDQLCGVFSGRIT

potato_gene/1-207 SKLTSTELSTYASTKQAAWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT

Arabidopsis_Col/1-53 SKLTSTELSTYASTKQAAWGK------

tobacco_1b/1-53 SKLTSTELSTYASTKQAAWGK------

tobacco_1a/1-53 SKLTSTELSTYSSTKQAAWGK------

BBc/1-363 TWNQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPPGA

human_CAI/1-281 TWDQLPATGRTGNIVVVYRNEA------ESKKFVVTTNFADSFGVPAGA

Leishmania/1-318 TWDQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPAGA

human_CW626261/25-218 TWNQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPPGA

X-DING-CD4_cDNA/1-124 TWNQ------

potato_gene/1-207 TWNQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPPGA

human_TC105368/1-65 NESKKFVVTTNFADSFGVPAGA

BBc/1-363 VPAVTSQGVMTALNAGDGRITYMSPDYAAPTLAGLDDATKVAKVAGVSPAPDNVSAAIAA

human_CAI/1-281 VPAVTSQGVMDALN-----ITYMSPDYAAPTLAGL------DNVSAAIAA

Leishmania/1-318 VPAVTSQGVMDALNAGDGRITYMSPDYAAPTLAGLDDATKVAKVAGVSPAPDNVSAAIAA

human_CW626261/25-218 VPAVTSQGVMTKLN------

potato_gene/1-207 VPAVTSQGVMDALNAGDGRITYMSPDYAAPTLAGLDDATKVAKVAGVSPAPDNVS-----

human_TC105368/1-65 VPAVTSEGVMDALSAGDGRITYMSPDYAAPTLAGLDDATIYWL

monkey/1-49 NLIFSQCYADATQTSQ

BBc/1-363 VPVPAAANVANQNAWVPVFAAAADANDPSVVPYPSSGYPILGFTNLIFSQCYADATQTSQ

human_CAI/1-281 VPVPAAANVA-QNAWVPVFA------DPSVVPYPSTGYPILGFTNLIFSQCYADATQTSQ

Leishmania/1-318 VPVPAAANVALQNAWVPVFAAAADANDPSVVPYPSTGYPILGFTNLIFSQCYADATQTSQ

monkey/1-49 VRAFFTRHYGASALNTNDNAIKANRFVPLPTAW

BBc/1-363 VRAFFTRHFGASALNSNDNAIKANRFVPLPAAWKAAITSNFVTATSALSIGKSDVCNAIG

human_CAI/1-281 VRAFFTRHYGASALN--DNAIKANRFVPLPTAWKAAITSNFVTATSALSI------

Leishmania/1-318 VRAFFTRHYGASALYTYD------

BBc/1-363 RPL-
Online resource 4 (continued)

BBc sub-family (group 6) – genes

Leishmania/1-1013 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC

BBc6R8/1-1164 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC

X-DING-CD4_cDNA/1-382 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC

human_CW626261/1-693 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC

tobacco_1a/1-159 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG

potato_gene/1-622 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG

Leishmania/1-1013 GTATTGACTGCCGGTTTCGCCCCGTACATCGGTGTGGGCAGCGGCAAAGGCAAACTGGCG

tobacco_1b/1-159 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG

Arabidopsis_Col/1-159 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG

BBc6R8/1-1164 GTATTGACCGCCGGTTTCGCCCCGTACATCGGCGTGGGCAGCGGCAAGGGCAAACTGGCG

X-DING-CD4_cDNA/1-382 GTATTGACAGCCGGTTTCGCCCCGTACATCGGTGTGGGCAGCGGCAAAGGCAAACTGGCG

human_CW626261/1-693 GTATTGACCGCTGGTTTCGCCCCGTACATCGGCGTGGGCAGCGGCAAGGGCAAACTGGCG

tobacco_1a/1-159 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCGAGAACGTGCACTGGGCGGGC

potato_gene/1-622 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCAAGAACGTGCACTGGGCGGGC

Leishmania/1-1013 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTGCACTGGGCGGGC

tobacco_1b/1-159 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCAAGAACGTGCACTGGGCGGGC

Arabidopsis_Col/1-159 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCAAGAACGTGCACTGGGCGGGC

BBc6R8/1-1164 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTTCACTGGGCGGGC

X-DING-CD4_cDNA/1-382 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTGCACTGGGCGGGC

human_CW626261/1-693 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTTCACTGGGCGGGC

tobacco_1a/1-159 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACTCCTCCACCAAACAAGCCGCC

potato_gene/1-622 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC

Leishmania/1-1013 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC

tobacco_1b/1-159 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC

Arabidopsis_Col/1-159 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC

BBc6R8/1-1164 AGCGACTCGAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCGACC

X-DING-CD4_cDNA/1-382 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCGACC

human_CW626261/1-693 AGCGACTCGAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCGACC

tobacco_1a/1-159 TGGGGCAAG------

potato_gene/1-622 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCATTCAACAAG

Leishmania/1-1013 TGGGGCAAGCTGATTCAAGTGCCTTCGGTAGCCACTTCGGTTGCCATTCCATTCAACAAG

tobacco_1b/1-159 TGGGGCAAG------

Arabidopsis_Col/1-159 TGGGGCAAG------

BBc6R8/1-1164 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCATTCAACAAG

X-DING-CD4_cDNA/1-382 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCATTCAACAAG

human_CW626261/1-693 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTAGCCATTCCATTCAACAAG

potato_gene/1-622 GCCGGTACCAACGCAGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC

Leishmania/1-1013 GCCGGTACCAACGCGGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC

BBc6R8/1-1164 GCCGGTACCAACGCAGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC

X-DING-CD4_cDNA/1-382 CCCGGTACCAACGCGGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC

human_CW626261/1-693 GCCGGTACCAACGCGGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC

potato_gene/1-622 ATCACCACCTGGAACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC

Leishmania/1-1013 ATCACCACCTGGGACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC

BBc6R8/1-1164 ATCACCACGTGGAACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC

X-DING-CD4_cDNA/1-382 ATCACCACCTGGAACCAAC------

human_CW626261/1-693 ATCACCACCTGGAACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC

potato_gene/1-622 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCGGCCAAGTGCGTC

Leishmania/1-1013 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCAGCCAAGTGCGTC

BBc6R8/1-1164 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCGGCCAAGTGCGTC

human_CW626261/1-693 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCGGCCAAGTGCGTC

human_TC105368/1-198 ------TC

potato_gene/1-622 AATGAGTCCAAGAAATTTGTGGTCACTACCAACTTCGCGGACAGCTTCGGCGTGCCACCA

Leishmania/1-1013 AATGAGTCCAAGAAGTTTGTGGTCACCACCAACTTTGCAGACAGCTTCGGCGTGCCGGCA

BBc6R8/1-1164 AATGAGTCCAAGAAATTTGTGGTCACCACCAACTTCGCGGACAGCTTCGGCGTGCCACCA

human_CW626261/1-693 AATGAGTCCAAGAAATTTGTGGTCACCACCAACTTCGCGGACAGCTTCGGCGTGCCACCA

human_TC105368/1-198 AATGAGTCCAAGAAGTTTGTGGTCACCACCAACTTTGCAGACAGCTTCGGCGTGCCGGCT

potato_gene/1-622 GGCGCCGTGCCTGCCGTCACCAGCCAAGGCGTGATGGACGCGCTGAACGCCGGTGATGGC

Leishmania/1-1013 GGTGCCGTGCCTGCCGTCACCAGCCAAGGCGTGATGGACGCGCTGAACGCCGGTGATGGT

BBc6R8/1-1164 GGCGCCGTACCTGCCGTGACCAGCCAAGGCGTGATGACCGCGCTGAACGCCGGTGATGGT

human_CW626261/1-693 GGCGCCGTGCCTGCCGTGACCAGCCAAGGCGTGATGACCAAGCTGAACGA

human_TC105368/1-198 GGTGCCGTGCCTGCCGTTACCAGCGAAGGCGTGATGGACGCGCTGAGCGCCGGTGATGGT

potato_gene/1-622 CGCATCACTTATATGAGCCCGGACTATGCCGCGCCTACCTTGGCTGGCCTGGATGACGCC

Leishmania/1-1013 CGCATCACTTACATGAGCCCGGACTATGCCGCTCCTACCTTGGCTGGCCTGGATGACGCC

BBc6R8/1-1164 CGTATCACCTACATGAGCCCGGACTATGCCGCGCCTACCTTGGCTGGCCTGGATGACGCC

human_TC105368/1-198 CGCATCACTTACATGAGCCCGGACTATGCCGCTCCTACCTTGGCTGGCCTGGATGACGCC

potato_gene/1-622 ACCAAAGTGGCCAAGGTCGCCGGTGTTTCTCCAGCCCCTGATAACGTTTCCG------

Leishmania/1-1013 ACCAAAGTGGCCAAGGTCGCCGGTGTTTCTCCAGCCCCTGACAACGTTTCCGCTGCCATT

BBc6R8/1-1164 ACCAAAGTGGCCAAGGTCGCCGGTGTTTCTCCAGCCCCTGATAACGTTTCCGCCGCCATC

human_TC105368/1-198 ACCATATATTGGCTCA

Leishmania/1-1013 GCTGCTGTGCCAGTACCGGCGGCTGCCAACGTCGCCCTGCAGAACGCCTGGGTACCAGTA

BBc6R8/1-1164 GCTGCTGTGCCCGTACCGGCTGCTGCCAACGTCGCCAACCAGAACGCCTGGGTACCGGTG

Leishmania/1-1013 TTTGCTGCTGCTGCCGACGCCAATGACCCAAGCGTCGTGCCTTACCCAAGCACCGGCTAT

BBc6R8/1-1164 TTTGCTGCCGCCGCCGATGCCAACGACCCAAGCGTCGTGCCTTACCCAAGCAGCGGTTAT

Leishmania/1-1013 CCAATCCTGGGCTTCACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCCACCCAAACC

BBc6R8/1-1164 CCGATCCTGGGCTTCACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCCACCCAGACC

monkey/1-149 AACCTGATCTTCAGCCAGTGCTACGCCGACGCCACCCAAACC

Leishmania/1-1013 TCGCAAGTGCGTGCGTTCTTCACCCGTCACTACGGTGCCAGCGCCCTCTACACCTACGAC

BBc6R8/1-1164 TCGCAAGTGCGTGCATTCTTCACCCGTCACTTCGGTGCCAGCGCCTTGAACAGCAACGAC

monkey/1-149 TCGCAAGTGCGTGCGTTCTTCACCCGTCACTACGGTGCCAGCGCCCTCAACACCAACGAC

Leishmania/1-1013 TAACGTCTTTCAGGCCTACCGTTTTGTGTCGCTGTCATCTGTCTGGATAGACG------

BBc6R8/1-1164 AACGCGATCAAGGCCAACCGCTTCGTGCCACTGCCAGCTGCCTGGAAAGCCGCGATCACC

monkey/1-149 AACGCCATCAAGGCCAACCGCTTTGTGCCGCTGCCAACTGCTTGGAA

BBc6R8/1-1164 AGCAACTTCGTTACCGCAACCAGCGCGCTGAGCATCGGCAAGTCTGATGTCTGCAACGCC

BBc6R8/1-1164 ATCGGTCGTCCGCTGTAA

Online resource 4 (continued)Woodsub-family(group 7) - proteins

B.rapa/1-37 ------AAFLNNDYTK------

R.centenum/1-76 ------AAFLNNDYTK------

Pf5/1-368 DINGGGATLPQPLYQTAGVLTAGFAPYIGVGSGNGKAAFLNNDYTKFVAGTT-GKNVHWA

Wayne1/1-351 ------GVLTAGFAPYIGVGSGNGKAAFLNNDYTKFVAG-TTGKNVHWA

P.brassicacearum/1-372 DVNGGGATLPQPLYQTSGVLTAGFAPYIGVGSGAGKSAFLTNDYTKFVPGDTSGKKVHWA

Wood1R/1-372 DVNGGGATLPQPLYQTSGVLTAGFAPYIGVGSGAGKSAFLTNDYTKFVPGDTSGKKVHWA

F113/1-366 DVNGGGATLPQPLYQTSGVLTAGFAPYIGVGSGAGKSAFLNNDYTKFVSGNTS-KNVHWA

PAMC25886/1-383 DVNGGGATLPQPLYQTAGVLTAGFAPYIGVGSGNGKAAFLNNDYSKLDATVT-GKNVHWA

B.rapa/1-37 -----LSAAELLAYK------LIQVPSVATSVAIPFNK------

R.centenum/1-76 ------LIQVPSVATSVAVPFNK------

Pf5/1-368 GSDSKLSAAELKGYEDNHQAAWGKLIQVPSVATSVAVPFNKAGSTA-VD-LSVNDLCGVF

Wayne1/1-351 GSDSKLSAAELKGYEDNHQAAWGKLIQVPSVATSVAVPFNKAGSTA-VD-LSVNDLCGVF

P.brassicacearum/1-372 GSDSKLSATELSTYVSAHGAAWGPLIQVPSVATSVAIPFNKTGTAN-VD-LSVNQLCGVF

Wood1R/1-372 GSDSKLSATELSTYVSAHGAAWGPLIQVPSVATSVAIPFNKTG-TANVD-LSVNQLCGVF

F113/1-366 GSDSKLSQAELDGYVANHGAAWGPLIQVPSVATSVAIPFNMTG-TANVD-LSVNQLCGVF

PAMC25886/1-383 GSDSKLSSTELSNYVSAHGSAWGPLIQVPSVATSVAIPFNKAGVAGKTVNLSVNDLCGVF

B.rapa/1-37 ------

R.centenum/1-76 ------

Pf5/1-368 SGRVSKWEQLPTSGRTGAITVVYRNESSGTTELFTRFLNAKCAET---GTFAVTTNFASS

Wayne1/1-351 SGRVSKWEQLPTSGRTGAITVVYRNESSGTTELFTRFLNAKCAET---GTFAVTTNFASS

P.brassicacearum/1-372 SGRLTDWSQITGSGRTGAITVVYRAESSGTSELFTRFLNAKCAET---GTFAITTNFASS

Wood1R/1-372 SGRLTDWSQITGSGRTGAITVVYRAESSGTSELFTRFLNAKC---AETGTFAITTNFASS

F113/1-366 SGRLTDWSQITGSGRTGAITVVYRSDLSGTTELFTRFLNAKC---AETGTFAITTNFANS

PAMC25886/1-383 SGRLTTWNLIPDSGRTGPITVIYRKENSGTTELFTRFLNAKCSPALEGGTFAVTQAFGSS

B.rapa/1-37 ------

R.centenum/1-76 ------ITYMSPDYAAPTLAGLDDAT

Pf5/1-368 YSGGLPAGAVAAVTSQGVMDALNAGDGR------ITYMSPDYAAPTLAGLDDAT

Wayne1/1-351 YSGGLPAGAVAAVTSQGVMDALNAGDGR------ITYMSPDYAAPTLAGLDDAT

P.brassicacearum/1-372 YSGGLPASAVSATGSQAVMTALNAAQGR------ITYMSPDYAATTLAGLDDAT

Wood1R/1-372 YSGGLPASAVSATGSQAVMTALNAAQGR------ITYMSPDYAATTLAGLDDAT

F113/1-366 YSGGVPAGAVFASGSANVM------TALNAAQGRITYMSPDYAATTLAGLDDAT

PAMC25886/1-383 FSGGLPAGALNPLQQANPAGGFYADTSAGVMSTLNAADGRITYMSPDYAAATLAGLDDAT

B.rapa/1-37 ------

R.centenum/1-76 KVAV------

Pf5/1-368 KVAKVAGVSPAPANVSSAIAAVAVPDTTVRGDQNLWVPVFTSQANIDAN-PSDKSLRLYP

Wayne1/1-351 KVAKVAGVSPAPANVSSAIAAVAVPDTTVRGDQNLWVPVFTSQANIDAN-PSDKSLRLYP

P.brassicacearum/1-372 KVARVGGLSPAPANVSVAINAVPVPAAADRSNPNAWVPVFTSQKVIDETIPADPSLRLYP

Wood1R/1-372 KVARVGGLSPAPANVSVAINAVPVPAAADRSNPNAWVPVFTSQKVIDETIPADPSLRLYP

F113/1-366 KVARVAGVSPAPANVSAAIAAVAVPAAANRANPNAWVPVFAATTNPNDPSVVAYPATGYP

PAMC25886/1-383 KVATVAGVSPAPGNVSAAIGAVAVPAIANRTLPNNWVPVFAATTSASDPSVVAYPSTGYP

B.rapa/1-37 ------

R.centenum/1-76 ------

Pf5/1-368 TSGYPILGFTNLIFSQCYADANQTSQVRAFFSRHY-GALVNN-DTAINNNRFVPLPAAWK

Wayne1/1-351 TSGYPILGFTNLIFSQCYADANQTSQVRAFFSRHY-GALVNN-DTAINNNRFVPLPAAWK

P.brassicacearum/1-372 TTGYPILGFTNVIFSQCYANAAQTTQVRDFFTRHYNGTAANSNDAAITANRFVPLPGAWK

Wood1R/1-372 TTGYPILGFTNVIFSQCYANAAQTTQVRDFFTRHYNGTAANSNDAAITANRFVPLPGAWK

F113/1-366 ILGF-----TNVIFSQCYANAAQSTQVRDFFTRHYGAVAANNNDAAITANRFVPLPTTWK

PAMC25886/1-383 ILGF-----TNVVFSQCYANADQTSQVRTFFTRHYNTNAFSSNDTAIRNNRFVPLPTTWK

B.rapa/1-37 ------

R.centenum/1-76 ----DSFVTASSGLSIGNASVCNAIGRPL

Pf5/1-368 TAVRDSFVTASSGLSIGNASVCNAIGRPL

Wayne1/1-351 TAVRDSFVTASSGLSIGNASVCNAIGRPL

P.brassicacearum/1-372 SAIRGSFLTATNAQSIGNTNVCNGIGRPL

Wood1R/1-372 SAIRGSFLTATNAQSIGNTNVCNGIGRPL

F113/1-366 NAIRGSFLTTTSAQSIGNTNVCNGIGRPL

PAMC25886/1-383 TAINDTFLSAGSDLSIGKSNICNGIGRPL

Online resource 4 (continued)

Woodsub-family(group 7) - genes

Pf5/1-1179 ATGGCTGATATCAACGGCGGTGGCGCAACCCTGCCACAACCGCTGTACCA

Wayne1/1-1056 ------

PAMC25886/1-1158 ATGGCTGACGTCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCA

P.brassicacearum/1-1191 ATGGCTGATGTCAACGGCGGTGGTGCTACCTTGCCTCAGCCGCTGTACCA

Wood1R/1-1125 ATGGCTGATGTCAACGGCGGTGGTGCTACCTTGCCTCAGCCGCTGTACCA

F113/1-1107 ATGGCTGATGTCAATGGCGGTGGTGCTACCTTGCCTCAGCCGCTGTACCA

Pf5/1-1179 GACCGCTGGCGTACTGACCGCCGGCTTCGCTCCCTACATCGGCGTAGGCA

Wayne1/1-1056 ------GGCGTACTGACCGCCGGCTTCGCTCCCTACATCGGCGTAGGCA

PAMC25886/1-1158 GACTGCCGGCGTATTGACTGCCGGTTTCGCCCCGTACATCGGCGTGGGCA

P.brassicacearum/1-1191 GACCTCCGGTGTACTGACTGCCGGTTTCGCCCCTTACATCGGCGTGGGCA

Wood1R/1-1125 GACCTCCGGTGTACTGACTGCCGGTTTCGCCCCTTACATCGGCGTGGGCA

F113/1-1107 GACCTCCGGCGTACTGACTGCCGGTTTTGCCCCATACATCGGCGTGGGCA

** *** **** ***** ** ** ** *********** ****

Pf5/1-1179 GCGGCAACGGCAAGGCTGCCTTCCTGAACAACGACTACACCAAGTTCGTG

Wayne1/1-1056 GCGGCAACGGCAAGGCTGCCTTCCTGAACAACGACTACACCAAGTTCGTG

PAMC25886/1-1158 GCGGCAACGGCAAGGCTGCTTTCCTGAACAACGACTACAGCAAGCTGGAC

P.brassicacearum/1-1191 GCGGTGCTGGCAAGTCGGCTTTCCTGACCAACGACTACACCAAGTTCGTG

Wood1R/1-1125 GCGGTGCTGGCAAGTCGGCTTTCCTGACCAACGACTACACCAAGTTCGTG

F113/1-1107 GCGGTGCTGGCAAGTCGGCTTTCCTGAACAACGACTACACCAAGTTCGTA

**** ****** * ** ******* *********** **** * *

Pf5/1-1179 GCCGGCACCACCG---GCAAGAACGTGCACTGGGCCGGTAGCGATTCCAA

Wayne1/1-1056 GCCGGCACCACCG---GCAAGAACGTGCACTGGGCCGGTAGCGATTCCAA

PAMC25886/1-1158 GCCACCGTCACCG---GCAAGAACGTGCACTGGGCAGGCAGCGATTCCAA

P.brassicacearum/1-1191 CCTGGCGACACCAGCGGCAAGAAAGTGCACTGGGCTGGTAGCGACTCCAA

Wood1R/1-1125 CCTGGCGACACCAGCGGCAAGAAAGTGCACTGGGCTGGTAGCGACTCCAA

F113/1-1107 TCTGGCAACACCA---GCAAGAACGTGCACTGGGCGGGTAGTGATTCGAA

* * **** ******* *********** ** ** ** ** **

Pf5/1-1179 GCTCAGCGCAGCAGAGCTCAAAGGTTATGAAGACAATCACCAAGCGGCCT

Wayne1/1-1056 GCTCAGCGCAGCAGAGCTCAAAGGTTATGAAGACAATCACCAAGCGGCCT

PAMC25886/1-1158 GCTGTCTTCTACCGAGCTGAGCAACTACGTGTCTGCCCATGGTTCTGCCT

P.brassicacearum/1-1191 GCTCAGCGCCACTGAACTGAGCACCTACGTCAGTGCCCACGGTGCCGCCT

Wood1R/1-1125 GCTCAGCGCCACTGAACTGAGCACCTACGTCAGTGCCCACGGTGCCGCCT

F113/1-1107 GCTCAGCCAGGCTGAACTGGATGGCTACGTTGCCAACCACGGTGCCGCCT

*** * ** ** ** * ** * ****

Pf5/1-1179 GGGGCAAGCTGATCCAGGTGCCTTCGGTGGCCACTTCGGTTGCCGTTCCA

Wayne1/1-1056 GGGGCAAGCTGATCCAGGTGCCTTCGGTGGCCACTTCGGTTGCCGTTCCA

PAMC25886/1-1158 GGGGCCCGCTGATCCAAGTGCCTTCGGTGGCCACTTCGGTTGCCATTCCG

P.brassicacearum/1-1191 GGGGTCCATTGATCCAGGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCG

Wood1R/1-1125 GGGGTCCATTGATCCAGGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCG

F113/1-1107 GGGGTCCATTGATCCAGGTGCCTTCGGTTGCCACTTCGGTTGCGATTCCA

**** ******* *********** ************** ****

Pf5/1-1179 TTCAACAAGGCTGGTTCCACCG------CCGTTGACCTGAGCGTCAATGA

Wayne1/1-1056 TTCAACAAGGCGGGTTCCACCG------CCGTTGACCTGAGCGTCAATGA

PAMC25886/1-1158 TTCAACAAGGCCGGTGTGGCGGGTAAAACCGTCAACCTCAGTGTCAACGA

P.brassicacearum/1-1191 TTCAACAAGACCGGCACTGCCA------ACGTCGACCTGAGCGTCAACCA

Wood1R/1-1125 TTCAACAAGACCGGCACTGCCA------ACGTCGACCTGAGCGTCAACCA

F113/1-1107 TTCAACATGACCGGCACTGCCA------ACGTCGATCTGAGCGTCAACCA

******* * * ** * *** * ** ** ***** *

Pf5/1-1179 CCTGTGCGGCGTGTTCTCGGGGCGCGTTTCCAAGTGGGAGCAGCTTCCGA

Wayne1/1-1056 CCTGTGCGGCGTGTTCTCGGGGCGCGTTTCCAAGTGGGAGCAGCTTCCGA

PAMC25886/1-1158 TCTGTGCGGTGTATTCTCGGGTCGTCTGACTACCTGGAACCTGATCCCGG

P.brassicacearum/1-1191 ACTGTGCGGCGTGTTCTCCGGCCGTCTGACCGACTGGAGCCAGATCACTG

Wood1R/1-1125 ACTGTGCGGCGTGTTCTCCGGCCGTCTGACCGACTGGAGCCAGATCACTG

F113/1-1107 GCTGTGCGGCGTGTTCTCTGGTCGTCTGACTGACTGGAGCCAGATCACCG

******** ** ***** ** ** * * *** * * * *

Pf5/1-1179 CTTCGGGCCGTACCGGCGCCATCACCGTGGTTTACCGCAATGAAAGCAGC

Wayne1/1-1056 CTTCGGGCCGTACCGGCGCCATCACCGTGGTTTACCGCAATGAAAGCAGC

PAMC25886/1-1158 ACTCCGGCCGTACCGGCCCGATCACCGTGATCTATCGCAAAGAAAACAGC

P.brassicacearum/1-1191 GTTCGGGCCGTACCGGCGCGATCACCGTGGTTTACCGTGCCGAGAGCAGC

Wood1R/1-1125 GTTCGGGCCGTACCGGCGCGATCACCGTGGTTTACCGTGCCGAGAGCAGC

F113/1-1107 GTTCTGGCCGTACTGGTGCGATCACTGTGGTTTACCGCAGCGACCTCAGT

** ******** ** * ***** *** * ** ** ** ***

Pf5/1-1179 GGCACCACCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCC-----

Wayne1/1-1056 GGCACCACCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCC-----

PAMC25886/1-1158 GGCACCACCGAGCTGTTCACGCGCTTCCTGAACGCCAAGTGCAGCCCGGC

P.brassicacearum/1-1191 GGCACCTCCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCT-----

Wood1R/1-1125 GGCACCTCCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCT-----

F113/1-1107 GGCACCACAGAATTGTTCACTCGCTTCTTGAACGCCAAGTGCGCT-----

****** * ** ******* ****** **************

Pf5/1-1179 ----GAGACCGGCACCTTCGCCGTGACCACCAACTTCGCTTCCAGCTACT

Wayne1/1-1056 ----GAGACCGGCACCTTCGCCGTGACCACCAACTTCGCTTCCAGCTACT

PAMC25886/1-1158 CCTGGAAGGCGGCACCTTCGCCGTGACTCAGGCCTTCGGCAGCAGCTTCT

P.brassicacearum/1-1191 ----GAAACCGGCACCTTTGCCATCACCACCAACTTCGCTTCCAGCTACA

Wood1R/1-1125 ----GAAACCGGCACCTTTGCCATCACCACCAACTTCGCTTCCAGCTACA

F113/1-1107 ----GAAACCGGCACCTTTGCCATCACCACCAACTTCGCTAACAGCTACA

** ********* *** * ** ***** ***** *

Pf5/1-1179 CGGGCGGCCTGCCTGCCGGTGCGGTTG--CCGCTGTTACCAGCCAA----

Wayne1/1-1056 CGGGCGGCCTGCCTGCCGGTGCGGTTG--CCGCTGTTACCAGCCAA----

PAMC25886/1-1158 CCGGCGGCTTGCCGGCCGGCGCACTTAACCCATTGCAACAAGCCAACCCG

P.brassicacearum/1-1191 GTGGTGGTCTGCCAGCCAGCGCCGTAT--CCGCTACTGGCAGCCAG----

Wood1R/1-1125 GTGGTGGTCTGCCAGCCAGCGCCGTAT--CCGCTACTGGCAGCCAG----

F113/1-1107 GCGGTGGCGTACCGGCCGGCGCCGTTT--TCGCCTCCGGCAGTGCG----

** ** * ** *** * ** * * **

Pf5/1-1179 ------GGCGTCATGGACGCGCTGAA

Wayne1/1-1056 ------GGCGTCATGGACGCGCTGAA

PAMC25886/1-1158 GCCGGTGGTTTCTACGCCGACACCAGCGCTGGCGTAATGAGCACGCTGAA

P.brassicacearum/1-1191 ------GCTGTAATGACAGCGTTGAA

Wood1R/1-1125 ------GCTGTAATGACAGCGTTGAA

F113/1-1107 ------AACGTCATGACTGCGCTCAA

** *** ** * **

Pf5/1-1179 CGCAGGCGACGGTCGCATCACCTACATGAGCCCGGACTACGCCGCGCCGA

Wayne1/1-1056 CGCTGGCGACGGTCGCATCACCTACATGAGCCCGGACTACGCCGCGCCGA

PAMC25886/1-1158 CGCTGCTGACGGCCGCATCACTTACATGAGCCCGGATTACGCGGCCGCTA

P.brassicacearum/1-1191 CGCTGCCCAAGGTCGTATCACCTACATGAGCCCGGACTATGCCGCTACTA

Wood1R/1-1125 CGCTGCCCAAGGTCGTATCACCTACATGAGCCCGGACTATGCCGCTACTA

F113/1-1107 CGCAGCCCAAGGCCGTATCACCTACATGAGCCCGGATTACGCGGCAACTA

*** * * ** ** ***** ************** ** ** ** * *

Pf5/1-1179 CCCTGGCCGGCCTGGACGACGCTACCAAGGTGGCCAAGGTGGCTGGCGTT

Wayne1/1-1056 CCCTGGCCGGCCTGGACGACGCTACCAAGGTGGCCAAGGTGGCTGGCGTT

PAMC25886/1-1158 CCCTGGCCGGTCTGGATGACGCCACCAAGGTGGCAACCGTCGCCGGTGTT

P.brassicacearum/1-1191 CCCTGGCCGGTCTGGATGACGCCACCAAGGTCGCTCGTGTTGGCGGTCTT

Wood1R/1-1125 CCCTGGCCGGTCTGGATGACGCCACCAAGGTCGCTCGTGTTGGCGGTCTT

F113/1-1107 CTCTGGCAGGTCTGGACGACGCCACCAAGGTTGCCCGCGTCGCCGGTGTT

* ***** ** ***** ***** ******** ** ** * ** **

Pf5/1-1179 TCCCCGGCGCCTGCCAACGTTTCCTCGGCTATCGCCGCGGTTGCCGTGCC

Wayne1/1-1056 TCCCCGGCGCCTGCCAACGTTTCCTCGGCTATCGCCGCGGTTGCCGTGCC

PAMC25886/1-1158 TCGCCTGCTCCAGGCAACGTGTCGGCGGCCATCGGCGCGGTGGCTGTACC

P.brassicacearum/1-1191 TCCCCAGCACCGGCCAACGTGTCGGTAGCGATCAATGCCGTTCCCGTTCC

Wood1R/1-1125 TCCCCAGCACCGGCCAACGTGTCGGTAGCGATCAATGCCGTTCCCGTTCC

F113/1-1107 TCCCCTGCTCCAGCCAACGTGTCGGCCGCGATTGCAGCTGTTGCCGTGCC

** ** ** ** * ****** ** ** ** ** ** * ** **

Pf5/1-1179 TGATACCACTGTTCGTGGCGACCAGAACCTGTGGGTTCCTGTCTTCACTT

Wayne1/1-1056 TGATACCACTGTTCGTGGCGACCAGAACCTGTGGGTTCCTGTCTTCACTT

PAMC25886/1-1158 TGCCATCGCCAACCGTACCCTGCCCAACAACTGGGTCCCAGTGTTTGC--

P.brassicacearum/1-1191 GGCTGCTGCCGACCGGTCAAACCCGAATGCCTGGGTTCCTGTTTTCACTT

Wood1R/1-1125 GGCTGCTGCCGACCGGTCAAACCCGAATGCCTGGGTTCCTGTTTTCACTT

F113/1-1107 TGCTGCTGCCAACCGTGCCAACCCGAACGCCTGGGTGCCAGTGTTCGCC-

* * ** * ** ***** ** ** ** *

Pf5/1-1179 CCCA---GGCCAAC-----ATCGACGCCAACCCAAGCGACAAGAGCCTGC

Wayne1/1-1056 CCCA---GGCCAAC-----ATCGACGCCAACCCAAGCGACAAGAGCCTGC

PAMC25886/1-1158 ------GGCTACC-----ACCAGCGCTAGC----GACCCAAGCGTCGTC

P.brassicacearum/1-1191 CTCAAAAAGTGATTGATGAAACAATTCCGGCT---GATCCTAGC--TTGC

Wood1R/1-1125 CTCAAAAAGTGATTGATGAAACAATTCCGGCT---GATCCTAGC--TTGC

F113/1-1107 ------GCAACC-----ACCAACCCTAAC----GATCCAAGCG-TTGT

* * * * * * * * **

Pf5/1-1179 GCCTGTACCCAACCAGCGGTTACCCAATCCTGGGCTTCACCAACCTGATC

Wayne1/1-1056 GCCTGTACCCAACCAGCGGTTACCCAATCCTGGGCTTCACCAACCTGATC

PAMC25886/1-1158 GCCT--ACCCAAGCACCGGTTACCCAATCCTGGGCTTCACCAACGTGGTG

P.brassicacearum/1-1191 GCCTCTACCCAACCACCGGTTATCCGATCCTGGGCTTCACCAACGTGATC

Wood1R/1-1125 GCCTCTACCCAACCACCGGTTATCCGATCCTGGGCTTCACCAACGTGATC

F113/1-1107 GGCT-TACCCAGCCACCGGCTATCCGATCCTGGGCTTCACCAACGTGATC

* ** ***** ** *** ** ** ****************** ** *

Pf5/1-1179 TTCAGCCAGTGCTACGCCGACGCTAACCAGACTTCGCAAGTACGGGCTTT

Wayne1/1-1056 TTCAGCCAGTGCTACGCCGACGCTAACCAGACTTCGCAAGTACGGGCTTT

PAMC25886/1-1158 TTCAGCCAGTGCTACGCCAACGCTGACCAGACCTCCCAGGTCCGCACGTT

P.brassicacearum/1-1191 TTCAGCCAGTGCTACGCCAATGCTGCACAAACCACTCAGGTGCGTGATTT

Wood1R/1-1125 TTCAGCCAGTGCTACGCCAATGCTGCACAAACCACTCAGGTGCGTGATTT

F113/1-1107 TTCAGCCAGTGCTACGCCAACGCCGCCCAAAGCACCCAGGTGCGTGATTT

****************** * ** ** * * ** ** ** **

Pf5/1-1179 CTTCAGCCGTCACTACGGTGCCCTGGTG------AACAACGACACCGCCA

Wayne1/1-1056 CTTCAGCCGTCACTACGGTGCCCTGGTG------AACAACGACACCGCCA

PAMC25886/1-1158 CTTCACCCGTCATTACAACACCAACGCGTTCAGCAGCAACGATACGGCTA

P.brassicacearum/1-1191 CTTCACCCGTCACTACAACGGCACCGCTGCCAACAGCAACGACGCGGCGA

Wood1R/1-1125 CTTCACCCGTCACTACAACGGCACCGCTGCCAACAGCAACGACGCGGCGA

F113/1-1107 CTTCACCCGCCACTACGGTGCAGTCGCTGCCAACAACAACGATGCCGCCA

***** *** ** *** * * ****** * ** *

Pf5/1-1179 TCAACAACAACCGCTTCGTGCCTCTGCCAGCTGCCTGGAAAACCGCAGTA

Wayne1/1-1056 TCAACAACAACCGCTTCGTGCCTCTGCCAGCTGCCTGGAAAACCGCAGTA

PAMC25886/1-1158 TCCGCAACAACCGCTTCGTGCCACTGCCAACCACCTGGAAAACCGCAATC

P.brassicacearum/1-1191 TCACTGCCAACCGCTTCGTTCCACTGCCTGGCGCTTGGAAATCTGCCATC

Wood1R/1-1125 TCACTGCCAACCGCTTCGTTCCACTGCCTGGCGCTTGGAAATCTGCCATC

F113/1-1107 TCACTGCCAACCGCTTCGTGCCGCTGCCAACGACCTGGAAAAACGCCATC

** ************ ** ***** * ****** ** *

Pf5/1-1179 CGTGACTCGTTCGTCACTGCCTCCAGCGGCCTGAGCATCGGTAACGCCAG

Wayne1/1-1056 CGTGACTCGTTCGTCACTGCCTCCAGCGGCCTGAGCATCGGTAACGCCAG

PAMC25886/1-1158 AACGACACCTTCCTCAGCGCTGGCAGCGACCTGAGCATCGGCAAGTCGAA

P.brassicacearum/1-1191 CGTGGCAGCTTCCTGACCGCTACCAACGCTCAAAGCATCGGCAACACCAA

Wood1R/1-1125 CGTGGCAGCTTCCTGACCGCTACCAACGCTCAAAGCATCGGCAACACCAA

F113/1-1107 CGTGGCAGCTTCCTGACCACTACCAGCGCTCAAAGCATCGGCAACACCAA

* * *** * * * ** ** * ******** ** * *

Pf5/1-1179 CGTCTGCAACGCCATCGGCCGTCCGCTGTAA

Wayne1/1-1056 CGTCTGCAACGCCATCGGCCGTCCGCTGTAA

PAMC25886/1-1158 CATCTGCAACGGTATTGGTCGTCCGCTGTAA

P.brassicacearum/1-1191 CGTGTGCAATGGCATCGGTCGTCCGCTGTAA

Wood1R/1-1125 CGTGTGCAATGGCATCGGTCGTCCGCTGTAA

F113/1-1107 CGTCTGCAACGGCATCGGTCGTCCGCTGTAA

* * ***** * ** ** ************