DING proteins: numerous functions, elusive genes, a potential for health
Supplementary material
Online resource 1 Pairwise amino acid identities between the representative members of the five DING protein sub-families.
Sub-family / MDR / BBc / SBW / HPBP
Wood / 73 / 70 / 75 / 69
MDR / X / 68 / 73 / 66
BBc / X / X / 74 / 68
SBW / X / X / X / 77
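As an illustration of how a pairwise identity of this kind can be computed from two aligned sequences, here is a minimal Python sketch. It assumes that identity is the number of identical residues divided by the number of columns in which neither sequence has a gap; the method actually used for the table is not stated here, so this definition, the function name `percent_identity` and the variable names are assumptions. The two fragments are the first 60 aligned residues of MDR1 and SBW25 as shown in Online resource 4, so the result applies to that fragment only and is not expected to reproduce the full-length values in the table.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only the columns in which both sequences carry a residue.
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


# First 60 aligned residues, copied from the alignments in Online resource 4.
MDR1_NTERM = "EINGGGATLPQQLYQEPGVLTAGFAAYIGAGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG"
SBW25_NTERM = "DINGGGATLPQALYQTSGVLTAGFAQYIGVGSGNGKAAFLNNDYTKFQAGVTNKNVHWAG"

if __name__ == "__main__":
    print(f"{percent_identity(MDR1_NTERM, SBW25_NTERM):.1f}")
```

Other conventions (for example, dividing by the full alignment length or by the length of the shorter sequence) give slightly different percentages, so the denominator should always be stated alongside the value.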
Online resource 2 Alignment of all known DING protein N-termini. Proteins are identified by species (a protein name or ligand is added when more than one DING protein is known for a given species). Only the first 34 residues are shown, since this corresponds to the longest N-terminus determined for a DING protein (human genistein binding protein). Thus, all sequences shorter than 34 amino acids are from purified DING proteins, while the others are predicted from DNA sequences (Table 1). Variable residues used to define groups 1-7 (Table 1) are shown in blue. Residues differing from the consensus (Fig. 2) are shown in red.
* Procaryotic sequences
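The idea of flagging residues that differ from a consensus can be sketched in Python as follows. This is only an illustration: it assumes a simple majority-rule consensus per column, which may differ from the consensus of Fig. 2, and the function names `consensus` and `differing_positions` are hypothetical. The five 15-residue N-termini are copied from the alignments in Online resource 4 (MDR1, PA14, PA7, SBW25 and BBc).

```python
from collections import Counter


def consensus(seqs, gap="-"):
    """Majority-rule consensus of equal-length aligned sequences (gaps ignored)."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("aligned sequences must have the same length")
    out = []
    for column in zip(*seqs):
        residues = [r for r in column if r != gap]
        # most_common(1) returns the most frequent residue in the column.
        out.append(Counter(residues).most_common(1)[0][0] if residues else gap)
    return "".join(out)


def differing_positions(seq, cons, gap="-"):
    """1-based positions where a sequence departs from the consensus."""
    return [i + 1 for i, (r, c) in enumerate(zip(seq, cons)) if r != gap and r != c]


N_TERMINI = {
    "MDR1": "EINGGGATLPQQLYQ",
    "PA14": "DINGGGATLPQQLYQ",
    "PA7": "TVNGGGATLPQQLYQ",
    "SBW25": "DINGGGATLPQALYQ",
    "BBc": "DINGGGATLPQPLYQ",
}

if __name__ == "__main__":
    cons = consensus(list(N_TERMINI.values()))
    print(cons)
    for name, seq in N_TERMINI.items():
        print(name, differing_positions(seq, cons))
```

With only five sequences the consensus is sensitive to the choice of input; a column with tied residues would be resolved arbitrarily by this sketch.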
Online resource 3 Alignment of pro-peptides deduced from Pseudomonas DING protein genes and from two eucaryotic genes. The pro-peptides fall into two very homogeneous groups, one of which comprises all P. aeruginosa sequences. Note that the eucaryotic ORFs extend further, even though the clones are incomplete and do not allow identification of the initiation codon.
Human CW626261 RSFFMFKRNVLAVSMTLAALCSAQAAMADINGGG
Leishmania RSFFMFKRNVLAVSMTLAALCSAQAAMADINGGG
P. fluorescens BBc6R8 MFKRNVLAVSMTLAALCSAQAAMADINGGG
P. fluorescens SBW25 MFKRNVLAVSMTLAALCSAQAAMADINGGG
P. fluorescens Pf5 MFKRNVLAASLTLAALCSAQAAMADINGGG
P. fluorescens Wayne1 MFKRNVLAASLTLAALCSAQAAMADINGGG
P. fluorescens WoodR1/F113 MFKRNVLAVSLTLAGLCAAQAAMADVNGGG
Pseudomonas sp. PAMC25886 MFKRNVLAVSMTLAALCSAQAAMADVNGGG
P. extremaustralis MFKRNVLAVSMTLAALCSAQAAMADINGGG
P. brassicacearum mfkrnvlavsltlaglcaaqaamadvNggg
P. aeruginosa PA7 MFKRSLIAASLSVAALVSAQ-AMATVNGGG
P. aeruginosa 39016 MFKRSLIAASLSVAALVSAQ-AMAEINGGG
P. aeruginosa PA14 MYKRSLIAASLSVAALVSAQ-AMADINGGG
P. aeruginosa MDR1 MYKRSLIAASLSVAALVSAQ-AMAEINGGG
P. aeruginosa NCGM MFKRSLIAASLSVAALVSAQ-AMAEINGGG
*:**.::*.*:::*** *** *** :****
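The `*` symbols in the last line mark columns in which all aligned residues are identical. A minimal Python sketch of that check is given below, restricted to the five P. aeruginosa pro-peptides of the alignment above. It reproduces only the fully conserved (`*`) columns; the `:` and `.` symbols depend on the residue similarity groups used by the alignment program and are not modelled here. The function name `conservation_line` is hypothetical.

```python
def conservation_line(seqs, gap="-"):
    """Return '*' for columns with one identical residue and no gap, else ' '."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("aligned sequences must have the same length")
    marks = []
    for column in zip(*seqs):
        conserved = gap not in column and len(set(column)) == 1
        marks.append("*" if conserved else " ")
    return "".join(marks)


# P. aeruginosa pro-peptides copied from the alignment above.
P_AERUGINOSA = [
    "MFKRSLIAASLSVAALVSAQ-AMATVNGGG",  # PA7
    "MFKRSLIAASLSVAALVSAQ-AMAEINGGG",  # 39016
    "MYKRSLIAASLSVAALVSAQ-AMADINGGG",  # PA14
    "MYKRSLIAASLSVAALVSAQ-AMAEINGGG",  # MDR1
    "MFKRSLIAASLSVAALVSAQ-AMAEINGGG",  # NCGM
]

if __name__ == "__main__":
    for seq in P_AERUGINOSA:
        print(seq)
    print(conservation_line(P_AERUGINOSA))
```

Within this group only the second residue and the two residues preceding the NGGG motif vary, which is consistent with the description of the P. aeruginosa pro-peptides as a very homogeneous group.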
Online resource 4 Alignment (ClustalW, followed by manual editing for some short sequences) of DING protein sequences within each sub-family. When more than one ORF is known, the nucleotide alignment is also shown.
HPBP sub-family (groups 1 and 2) – proteins
turkey/1-45 ------
rat_neurones/1-91 XINGGGATLPQKLYLTPNVLTAGFAPYI------IAFLENKYNQFGTDT------
human HPBP/1-376 DINGGGATLPQKLYLTPDVLTAGFAPYIGVGSGKGKIAFLENKYNQFGTDTTKNVHWAGS
Human_amniotic/1-215 ------AFLENKYNQFGTDTT------
tobacco_cDNA/1-54 ------GVGSGKGKIAFLENKYNQFGTDTTKNVHWAGS
turkey/1-45 ----TATELAT-YAADC------LIQVPAVATAVAIPFR------
rat_neurones/1-91 ------
human HPBP/1-376 DSKLTATELAT-YAADKEPGWGKLIQVPSVATSVAIPFRKAGANAVDLSVKELCGVFSGR
Human_amniotic/1-215 ------GKLIQVPSVATSVAIPFRKAGANAVDLSVKELCGVFSGR
tobacco_cDNA/1-54 DSKLTATELAT-YAADKEPGWGK------
turkey/1-45 ------
rat_neurones/1-91 ------
human HPBP/1-376 IADWSGITGAGRSGPIQVVYRAESSGTTELFTRFLNAKCTTEPGTFAVTTTFANSYSLGL
Human_amniotic/1-215 ------TGA-----IQVVYRAESSGTTELFTR------
tobacco_cDNA/1-54 ------
turkey/1-45 ------
rat_neurones/1-91 ------ITYISPDFAAPTLAGLDD------VGKG-WNG
human HPBP/1-376 TPLAGAVAATGSDGVMAALNDTTVAEGRITYISPDFAAPTLAGLDDATKVARVGKGVVNG
Human_amniotic/1-215 ------ITYISPDFAAPTLAGLDDATK------
tobacco_cDNA/1-54 ------
turkey/1-45 ------
rat_neurones/1-91 VAVEGK-PAAANVSAAISVVPLPA------
human HPBP/1-376 VAVEGKSPAAANVSAAISVVPLPAAADRGNPDVWVPVFGATTGGGVVAYPDSGYPILGFT
Human_amniotic/1-215 --VEGKSPAAANVSAAISVVPLPAAADRGNPDVWVPVF------YPDSGYPILGFT
tobacco_cDNA/1-54 ------
turkey/1-45 ------HYGTSSNNDSAIEANAK------
rat_neurones/1-91 ------
human HPBP/1-376 NLIFSQCYANATQTGQVRDFFTKHYGTSANNDAAIEANAFVPLPSNWKAAVRASFLTASN
Human_amniotic/1-215 NLIFSQCYANATQTGQVRDF---HYGTSANNDAAIEAN---PLPSNWKAAVRASFLTASN
tobacco_cDNA/1-54 ------
turkey/1-45 ------
rat_neurones/1-91 ------
human HPBP/1-376 ALSIGNTNVCNGKGRPQ-
Human_amniotic/1-215 ALSIGNTNVCNGKGRP-
tobacco_cDNA/1-54 ------
Online resource 4 (continued)
SBW sub-family (groups 3 and 4) - proteins
Arabidopsis_1a/1-55 ------GVGSGNGKAAFLTNDYTKFVAGVSNKNVHWAG
Arabidopsis_1b/1-55 ------GVGSGNGKAAFLTNDYTNFVAGVSNKNVHWAG
S.solfataricus/1-210 DINGGGATLPQKLYQTSGVLTAGFAPYIGVGSGNGKAAFLTNDYTK------
Potato_prot/1-209 DINGGGATLPQKLYQTSGVLTAGFAPYI------AAFLTNDYT------
SBW25/1-370 DINGGGATLPQALYQTSGVLTAGFAQYIGVGSGNGKAAFLNNDYTKFQAGVTNKNVHWAG
P38/1-362* DINGGGATLPQALYQTSGVLTAGFAPYIGVGSGNGKAAFLNNDYTKFQAGVTNKNVHWAG
X-DING-CD4_prot/1-89 DINGGGATLPQK------VLTAGFAPYL------TNDY-KFVAGVSNKNVHWAG
Arabidopsis_1a/1-55 SDSKLTATELSTYATNKQPTWGK------
Arabidopsis_1b/1-55 SDSKLTATELSTYATNKQPTWGK------
S.solfataricus/1-210 ----LTATELSTYATNLQPTWGKLIQVPSVATSVAIPFR------
Potato_prot/1-209 ---KLTATELSTYATNK------LIQVPSVATSVAIPF----ANAVDLSVSELCGVFSGR
SBW25/1-370 SDSKLSATELSTYASAKQPTWGKLIQVPSVGTSVAIPFNKSGSAAVDLSVQELCGVFSGR
P38/1-362 SDSKLSATELSTYASAKQPTWGKLIQVPSVGTAVAIPFNKSGTAAVDLSVSELCGVFSGR
X-DING-CD4_prot/1-89 SDSKLTATELSTYATNKQ------QVPSVATSVAL------
Arabidopsis_1a/1-55 ------
Arabidopsis_1b/1-55 ------
S.solfataricus/1-210 ITDWSGISGAGRTGPITVVYR------
Potato_prot/1-209 ITDWSGLY--GRTGPITVVYRSESSGTTELFTR------
SBW25/1-370 INTWDGISGSGRTGPIVVVYRSESSGTTELFTRFLNAKCNAETGNFAVTTTFGTSFSGGL
P38/1-362 ITDWSGISGSGRTGAITVVYRSESSGTTELFTRFLNAKC-AETGTFNISTTFGTSYTGGL
X-DING-CD4_prot/1-89 ------
Arabidopsis_1a/1-55 ------
Arabidopsis_1b/1-55 ------
S.solfataricus/1-210 ------ITYM-SPDFAASTLAGLDDATK------GV
Potato_prot/1-209 ------ITYPYSPVYAASTLAGLDDATK------
SBW25/1-370 PAGAVAATGSQGVMTALAAGDGRITYM-SPDFAAPTLAGLDDATKVARVGKNVATNTQGV
P38/1-362 PAGAVSAAGSQGVMTALAGADGGITYM-SPDFAAPTLAGLDDATKVARVGKDVATNTAGV
X-DING-CD4_prot/1-89 ------FAASTLAGLDDATK------
Arabidopsis_1a/1-55 ------
Arabidopsis_1b/1-55 ------
S.solfataricus/1-210 SPAPSNVSDAIAQV-LP-PNDPSAPLDVTNP--DDGVAGVQPYPDSGYPILGFTNLIFS-
Potato_prot/1-209 -----NVSK------RTLQWYPVP------
SBW25/1-370 SPAAANVSAAIGAVPVPAAADRSNPDAWVPVFGPDNTAGVQPYPTSGYPILGFTNLIFSQ
P38/1-362 SPAAANVSAAINAVPVPASTEK--PEFGKA-----NTAGVQPYPTSGYPILGFTNLIFSQ
X-DING-CD4_prot/1-89 ------
Arabidopsis_1a/1-55 ------
Arabidopsis_1b/1-55 ------
S.solfataricus/1-210 ------AFFTKHFGDTNNNDDAITANRFVPLPDNWK------
Potato_prot/1-209 ------VFGKHFGDTNNTQDAITANRFVPLPDNWKATITDNFVTASSALSIGK
SBW25/1-370 CYADATQTTQVRDFFTKHYGASNNNDAAITANAFVPLPTAWKATVRASFLTASNALSIGN
P38/1-362 CYADATQTSQVRDFFAKHYGASNNNDAAITANAFVPLPTAWKATVRASFLTASNALSIGN
X-DING-CD4_prot/1-89 ------
Arabidopsis_1a/1-55 ------
Arabidopsis_1b/1-55 ------
S.solfataricus/1-210 ------
Potato_prot/1-209 TNVCNGIGRGPL
SBW25/1-370 TNVCNGIGR-PL
P38/1-362 TNVCNGIGR-PL
X-DING-CD4_prot/1-89 TNVCNGLGR-PL
* P38 represents both the sequence from the human genome (glioblastoma cells) and that from St. John’s wort (see Table 1)
Online resource 4 (continued)
SBW sub-family (groups 3 and 4) – genes
P38/1-1095 ATGGCCGATATAAACGGTGGTGGTGCGACACTACCCCAAGCGCTGTACCAGACTTCCGGC
SBW25/1-1185 ATGGCTGACATCAATGGCGGTGGTGCCACCCTGCCACAAGCGCTGTACCAGACCTCCGGC
Arabidopsis_1a/1-165 ------GGTGTGGGCAGCGGGAATGGCAAGGCAGCC
Arabidopsis_1b/1-165 ------GGTGTGGGCAGCGGGAATGGCAAGGCAGCC
P38/1-1095 GTGCTGACTGCCGGTTTTGCCCCGTACATCGGCGTGGGCAGCGGTAATGGCAAGGCCGCC
SBW25/1-1185 GTGTTGACTGCCGGTTTCGCCCAGTACATTGGTGTCGGCAGCGGTAACGGCAAGGCAGCC
Arabidopsis_1a/1-165 TTCTTGACCAACGATTACACCAAGTTCGTGGCTGGCGTGAGCAACAAGAACGTGCACTGG
Arabidopsis_1b/1-165 TTCTTGACCAACGATTACACCAATTTCGTGGCTGGCGTGAGCAACAAGAACGTGCACTGG
P38/1-1095 TTCCTGAACAACGACTACACCAAGTTCCAGGCCGGCGTGACGAACAAGAATGTGCACTGG
SBW25/1-1185 TTCCTGAACAACGACTACACCAAGTTCCAGGCTGGCGTGACGAACAAGAACGTGCACTGG
Arabidopsis_1a/1-165 GCCGGTAGCGATTCGAAGCTGACTGCGACTGAACTGTCGACCTACGCCACCAACAAACAA
Arabidopsis_1b/1-165 GCCGGTAGCGATTCGAAGCTGACTGCGACTGAACTGTCGACCTACGCCACCAACAAACAA
P38/1-1095 GCCGGCAGCGACTCCAAGCTGAGCGCCACTGAGCTGTCGACCTACGCGTCTGCCAAGCAA
SBW25/1-1185 GCGGGCAGCGACTCCAAGCTGAGCGCCACTGAGTTGTCGACCTACGCGTCTGCCAAGCAA
Arabidopsis_1a/1-165 CCTACCTGGGGCAAA
Arabidopsis_1b/1-165 CCTACCTGGGGCAAA
P38/1-1095 CCGACCTGGGGCAAGTTGATCCAGGTGCCGTCGGTGGGTACTGCGGTCGCTATTCCCTTC
SBW25/1-1185 CCGACCTGGGGCAAGTTGATCCAGGTGCCATCGGTGGGGACTTCGGTTGCCATTCCTTTC
P38/1-1095 AACAAAAGCGGCACCGCAGCGGTTGACCTGAGCGTCAGCGAGCTGTGCGGTGTGTTCTCG
SBW25/1-1185 AATAAAAGCGGTTCCGCCGCTGTAGACCTGAGCGTTCAAGAGTTGTGCGGCGTGTTCTCG
P38/1-1095 GGGCGTATCACTGACTGGAGCGGTATTTCCGGTTCCGGCCGTACCGGCGCGATCACCGTG
SBW25/1-1185 GGCCGTATCAATACCTGGGACGGTATTTCCGGTTCTGGCCGTACCGGTCCGATCGTTGTG
P38/1-1095 GTCTACCGTAGCGAAAGCAGTGGCACCACCGAGTTGTTCACCCGTTTCCTCAACGCCAAG
SBW25/1-1185 GTTTATCGCAGCGAAAGCAGTGGTACCACTGAGCTGTTCACCCGTTTCCTCAATGCCAAG
P38/1-1095 TG---TGCTGAAACCGGCACCTTCAATATCTCCACCACGTTCGGCACCAGCTACACCGGT
SBW25/1-1185 TGCAACGCAGAAACAGGCAACTTCGCCGTCACCACCACCTTCGGCACCAGCTTCTCCGGT
P38/1-1095 GGTTTGCCTGCCGGCGCCGTTTCTGCCGCCGGCAGCCAAGGTGTTATGACCGCGTTGGCC
SBW25/1-1185 GGCTTGCCTGCCGGCGCCGTTGCTGCTACTGGCAGCCAAGGTGTAATGACTGCCCTGGCC
P38/1-1095 GGCGCCGACGGTGGTATCACCTACATGAGCCCTGATTTCGCGGCCCCAACCCTGGCCGGT
SBW25/1-1185 GCCGGCGATGGCCGCATCACCTACATGAGCCCTGACTTCGCCGCCCCTACATTGGCCGGT
P38/1-1095 CTGGACGACGCGACCAAAGTGGCACGTGTCGGCAAGGACGTCGCGACCAACACTGCGGGC
SBW25/1-1185 CTGGACGACGCTACCAAAGTGGCTCGCGTGGGCAAAAACGTCGCCACTAACACCCAGGGC
P38/1-1095 GTTTCGCCTGCCGCCGCTAACGTCTCCGCTGCCATCAACGCTGTGCCAGTTCCAGCATCA
SBW25/1-1185 GTTTCGCCTGCCGCCGCCAACGTGTCTGCCGCTATCGGCGCAGTACCGGTACCGGCTGCC
P38/1-1095 ACCGA------AAAGCCGGAA------TTCGGCAAAGCCAACACCGCCGGT
SBW25/1-1185 GCTGATCGTTCCAACCCGGACGCCTGGGTTCCAGTCTTCGGTCCGGACAACACCGCCGGT
P38/1-1095 GTGCAGCCTTACCCTACCTCGGGCTACCCGATCCTGGGCTTCACCAACCTGATCTTCAGC
SBW25/1-1185 GTACAGCCTTACCCAACCTCGGGCTACCCGATCCTGGGCTTTACCAACCTGATCTTCAGC
P38/1-1095 CAGTGCTACGCCGATGCCACCCAGACCAGCCAAGTGCGTGATTTCTTCGCCAAGCACTAC
SBW25/1-1185 CAGTGCTACGCCGACGCGACCCAGACCACCCAAGTGCGTGATTTCTTCACCAAGCACTAC
P38/1-1095 GGCGCCTCCAACAACAACGATGCAGCCATCACCGCCAACGCTTTCGTGCCGCTGCCAACC
SBW25/1-1185 GGCGCCTCCAACAACAACGATGCAGCCATCACCGCCAACGCTTTCGTTCCACTGCCAACC
P38/1-1095 GCTTGGAAAGCCACCGTTCGCGCCAGCTTCCTGACCGCGAGCAACGCCCTGAGCATCGGC
SBW25/1-1185 GCTTGGAAAGCCACCGTTCGCGCCAGTTTCCTGACCGCGAGCAACGCCCTGAGCATCGGC
P38/1-1095 AACACCAACGTCTGCAACGGCATCGGCCGTCCGCTGTAA
SBW25/1-1185 AACACCAACGTCTGCAACGGTATCGGTCGTCCGCTGTAA
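The gene and protein alignments of a sub-family can be cross-checked by translating a nucleotide row and comparing the result with the corresponding protein row. The Python sketch below does this for the first and last rows of the P38 gene shown above. It assumes the standard genetic code and an in-frame sequence starting at the first base; the names `CODON_TABLE` and `translate` are illustrative and not part of any published pipeline.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; alignment gaps are removed first."""
    dna = dna.upper().replace("-", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":  # stop codon ends the translation
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the P38 gene, copied from the alignment above.
P38_FIRST_ROW = "ATGGCCGATATAAACGGTGGTGGTGCGACACTACCCCAAGCGCTGTACCAGACTTCCGGC"
P38_LAST_ROW = "AACACCAACGTCTGCAACGGCATCGGCCGTCCGCTGTAA"

if __name__ == "__main__":
    print(translate(P38_FIRST_ROW))
    print(translate(P38_LAST_ROW))
```

The first row translates to MA followed by the DINGGGATLPQALYQTSG N-terminus of the P38 protein row, and the last row ends with NVCNGIGRPL before the stop codon, in agreement with the protein alignment.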
Online resource 4 (continued)
MDR sub-family (group 5) - proteins
MDR1/1-369 EINGGGATLPQQLYQEPGVLTAGFAAYIGAGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG
39016/1-369 EINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG
NCGM2/1-369 EINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG
138244/1-253 EINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG
PA14/1-369 DINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVHWAG
PA7/1-369 TVNGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTDKNVHWAG
GP56/1-87 ------AAFLNNDYTKFVAGTTNKNVHWAG
P.extremaustralis/1-369 DINGGGATLPQQLYQTPGVLTAGFAQYIGVGSGNGKAAFLTNDYTKFVAGVSNKNVHWAG
MDR1/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS
39016/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS
NCGM2/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS
138244/1-253 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS
PA14/1-369 SDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFS
PA7/1-369 SDSKLNPTSETSPYLAAHGAAWGPLIQVPSVATSVAIPFNKAGSNAVNFADVNTLCGVFS
GP56/1-87 SDSK------LIQVPSVATSVALPFRK------
P.extremaustralis/1-369 SDSKLSAT-ELSTYATNKQPTWGKLIQVPSVATSVAIPFNKAGTAAVNL-SVNQLCGVFS
MDR1/1-369 GRLTDWSQIPGSGRSGAITVAYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF
39016/1-369 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF
NCGM2/1-369 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF
138244/1-253 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSF
PA14/1-369 GRLTDWSQIPGSGRSGAITVVYRSESSGTTELFTRFLNASCSSTLEGGTFAITTSFGSSF
PA7/1-369 GRLTDWSQIPDSGRTGAITVVYRSESSGTTELFTRFLNASCSSATEGGTFAITTSFGSSF
GP56/1-87 --LTDWSQITGAGRSGAITVVYRSESSGTTELFTR------
P.extremaustralis/1-369 GRLTNWNQITGSGRTGAIKVVYRSESSGTTELFTRFLNAKCS---EAKAFAITTTFSSSY
MDR1/1-369 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA
39016/1-369 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA
NCGM2/1-369 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA
138244/1-253 SGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA
PA14/1-369 SGGLPAGAVSAQGSQAVMNALNAAQGRITYMSPDFAAPTLAGLDDATKVAQVRGVSPAPA
PA7/1-369 SGGLPSGAVSAQGSQAVMNTLNAAEGRITYMSPDFAAPTLAGLDDATKVAQIRGVSPAPA
GP56/1-87 ------
P.extremaustralis/1-369 GNGVPAGAVAATGSQGVMTTLNATDGGITYMSPDYAATTLAGLDDATKVAKVAGVSPAPA
MDR1/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT
39016/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT
NCGM2/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT
138244/1-253 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT
PA14/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATANPNDPSVRPYPTSGYPILGFT
PA7/1-369 NVSAAIGAVTPPTTAQRSDPN------NWVPVFAATASATDPSVRPYPTTGYPILGFT
GP56/1-87 ------
P.extremaustralis/1-369 NVSAAIGQIPAPTEA-RADGSFIDATNQDNWVPVFAATAS-GPATFA-YPTTGYPILGFT
MDR1/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN
39016/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN
NCGM2/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN
138244/1-253 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTN
PA14/1-369 NLIFSQCYANATQTQQVRDFFTRHYGATANNDTAITNHRFVPLPASWKLAVRQSFLTSTN
PA7/1-369 NLIFSQCYADATQTQQVRDFFTRHYGASVNNDAAITNHRFVPLPASWKLAVRQSFLTSTN
GP56/1-87 ------FVPLPDSWK------
P.extremaustralis/1-369 NLIFSQCYADPTQTTQVQDFFLQHFGAFNNNDAAINDNRFVPLPSNWKQAITDTFATVSS
MDR1/1-369 NLYIGHSNVCNGIGRPL
39016/1-369 NLYIGHSNVCNGIGRPL
NCGM2/1-369 NLYIGHSNVCNGIGRPL
138244/1-253 NLYIGHSNVCNGIGRPL
PA14/1-369 NLYIGHSNVCNGIGRPL
PA7/1-369 NLYIGHTNVCNGIGRPL
GP56/1-87 ------
P.extremaustralis/1-369 GQGIGNTSVCNGIGRPL
Online resource 4 (continued)
MDR sub-family (group 5) – genes
39016/1-1190 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA
NCGM2/1-1113 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA
138244/1-1116 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA
MDR1/1-1179 ATGGCCGAAATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA
PA14/1-1116 ATGGCCGACATCAACGGCGGTGGCGCCACCCTGCCGCAACAGCTGTACCA
PA7/64-1179 ATGGCCACCGTCAATGGCGGCGGCGCTACCCTGCCGCAGCAGCTGTACCA
P.extremaustralis/1-1118 ATGGCTGACATCAACGGCGGTGGTGCCACCCTGCCTCAGCAGCTGTACCA
***** **** ***** ** ** ******** ** ***********
39016/1-1190 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTAGGCA
NCGM2/1-1113 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTAGGCA
138244/1-1116 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTAGGCA
MDR1/1-1179 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGCAGGCA
PA14/1-1116 GGAGCCCGGCGTCCTGACCGCCGGCTTTGCCGCCTACATCGGCGTAGGCA
PA7/64-1179 GGAGCCCGGCGTCCTGACCGCCGGCTTCGCCGCCTACATCGGCGTCGGCA
P.extremaustralis/1-1118 GACTCCGGGCGTGCTGACGGCCGGTTTCGCCCAATACATCGGCGTGGGCA
* ** ***** ***** ***** ** *** ********** ****
39016/1-1190 GCGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC
NCGM2/1-1113 GCGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC
138244/1-1116 GCGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC
MDR1/1-1179 GTGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC
PA14/1-1116 GTGGCAACGGCAAGGCGGCCTTCCTGAACAACGACTACACCAAGTTCGTC
PA7/64-1179 GCGGCAACGGCAAGGCTGCCTTCCTGAACAACGACTACACCAAGTTCGTC
P.extremaustralis/1-1118 GCGGTAATGGCAAGGCCGCCTTCCTGACCAACGACTACACCAAGTTCGTG
* ** ** ******** ********** *********************
39016/1-1190 GCCGGCACTACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT
NCGM2/1-1113 GCCGGCACTACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT
138244/1-1116 GCCGGCACCACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT
MDR1/1-1179 GCCGGCACTACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT
PA14/1-1116 GCCGGCACCACCAACAAGAACGTGCATTGGGCTGGTAGCGACTCCAAACT
PA7/64-1179 GCCGGCACCACCGACAAGAACGTGCACTGGGCTGGCAGCGACTCCAAGCT
P.extremaustralis/1-1118 GCCGGTGTGAGCAACAAGAACGTGCACTGGGCCGGTAGCGATTCCAAGCT
***** * * ************* ***** ** ***** ***** **
39016/1-1190 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT
NCGM2/1-1113 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT
138244/1-1116 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT
MDR1/1-1179 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT
PA14/1-1116 GAGCAAGACCAACGAAACCAACCCCTATCTGAGCGCCCATGGCTCCGCCT
PA7/64-1179 GAACCCGACCAGCGAAACCAGCCCCTACCTGGCCGCCCATGGCGCCGCCT
P.extremaustralis/1-1118 GAGCGCGACTGAACTGTCGA---CCTACGCCACCAACAAACAACCGACCT
** * *** * * **** * * * * ***
39016/1-1190 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG
NCGM2/1-1113 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG
138244/1-1116 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG
MDR1/1-1179 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCGGTCGCTCTGCCG
PA14/1-1116 GGGGTCCGCTGATCCAGGTGCCGTCGGTAGCCACTTCTGTCGCTCTGCCG
PA7/64-1179 GGGGTCCGCTGATCCAGGTTCCCTCGGTCGCCACCTCGGTCGCCATTCCG
P.extremaustralis/1-1118 GGGGCAAGCTGATCCAGGTGCCATCGGTGGCGACTTCGGTTGCCATTCCG
**** ************ ** ***** ** ** ** ** ** * ***
39016/1-1190 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT
NCGM2/1-1113 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT
138244/1-1116 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT
MDR1/1-1179 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT
PA14/1-1116 TTCAACAAGTCAGGTAGCAACGCCGTCAACTTCGCAGACGTGAACACCCT
PA7/64-1179 TTCAACAAGGCAGGCAGCAACGCCGTCAACTTCGCCGACGTGAACACCCT
P.extremaustralis/1-1118 TTCAACAAGGCCGGTACTGCAGCGGTTAACCT---GAGCGTTAACCAACT
********* * ** * ** ** *** * *** *** **
39016/1-1190 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT
NCGM2/1-1113 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT
138244/1-1116 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT
MDR1/1-1179 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT
PA14/1-1116 TTGCGGTGTCTTCTCCGGCCGTCTGACCGATTGGAGTCAGATTCCTGGCT
PA7/64-1179 CTGCGGCGTGTTCTCCGGCCGCCTGACCGACTGGAGCCAGATCCCCGACT
P.extremaustralis/1-1118 GTGCGGCGTGTTCTCCGGTCGCCTGACCAACTGGAACCAGATCACGGGTT
***** ** ******** ** ****** * **** ***** * * *
39016/1-1190 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC
NCGM2/1-1113 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC
138244/1-1116 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC
MDR1/1-1179 CGGGCCGTAGCGGCGCCATCACAGTGGCCTACCGTTCCGAGAGCAGCGGC
PA14/1-1116 CGGGCCGTAGCGGCGCCATCACAGTGGTCTACCGTTCCGAGAGCAGCGGC
PA7/64-1179 CGGGCCGCACCGGCGCCATCACCGTGGTCTACCGTTCGGAAAGCAGCGGT
P.extremaustralis/1-1118 CCGGCCGCACGGGCGCGATCAAAGTCGTTTACCGCAGTGAATCCAGCGGT
* ***** * ***** **** ** * ***** ** ******
39016/1-1190 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT
NCGM2/1-1113 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT
138244/1-1116 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT
MDR1/1-1179 ACTACCGAACTGTTCACCCGCTTCCTCAACGCCTCTTGCTCCAGCGCCCT
PA14/1-1116 ACCACCGAACTGTTCACCCGCTTCCTCAACGCCTCCTGCTCCAGCACCCT
PA7/64-1179 ACCACCGAGCTGTTCACCCGCTTCCTCAACGCTTCCTGCTCCAGCGCCAC
P.extremaustralis/1-1118 ACTACCGAACTCTTCACCCGCTTCCTGAACGCCAAGTGC---AGCGAAGC
** ***** ** ************** ***** *** ***
39016/1-1190 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG
NCGM2/1-1113 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG
138244/1-1116 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG
MDR1/1-1179 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAACAGCTTCTCCG
PA14/1-1116 CGAAGGTGGCACCTTCGCCATCACCACCAGCTTCGGTAGCAGCTTCTCCG
PA7/64-1179 CGAGGGCGGTACCTTCGCCATCACCACCAGCTTCGGCAGCAGCTTCTCTG
P.extremaustralis/1-1118 CAAAG------CGTTCGCCATCACCACCACCTTCTCGTCGAGCTACGGCA
* * * * **************** **** **** *
39016/1-1190 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG
NCGM2/1-1113 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG
138244/1-1116 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG
MDR1/1-1179 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG
PA14/1-1116 GCGGCCTGCCGGCTGGCGCCGTATCGGCCCAGGGCAGCCAGGCCGTGATG
PA7/64-1179 GCGGCCTGCCGTCCGGCGCCGTCTCGGCCCAGGGCAGCCAGGCCGTGATG
P.extremaustralis/1-1118 ACGGCGTGCCGGCCGGTGCCGTTGCCGCGACCGGCAGCCAAGGCGTGATG
**** ***** * ** ***** * ** ******** * *******
39016/1-1190 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT
NCGM2/1-1113 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT
138244/1-1116 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT
MDR1/1-1179 AATACGCTCAACGCCGCCGAAGGCCGCATCACCTATATGAGCCCGGACTT
PA14/1-1116 AATGCGCTCAACGCCGCCCAAGGCCGCATCACCTACATGAGCCCGGACTT
PA7/64-1179 AACACGCTCAACGCCGCCGAAGGCCGCATCACCTACATGAGCCCGGACTT
P.extremaustralis/1-1118 ACTACGCTGAACGCTACGGACGGTGGTATCACCTACATGAGCCCGGACTA
* **** ***** * * ** * ******** *************
39016/1-1190 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG
NCGM2/1-1113 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG
138244/1-1116 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG
MDR1/1-1179 CGCCGCGCCGACGCTGGCCGGCCTCGACGACGCCACCAAGGTCGCCCAGG
PA14/1-1116 CGCCGCGCCGACCCTGGCCGGTCTCGACGACGCCACCAAGGTCGCCCAGG
PA7/64-1179 CGCCGCGCCGACCCTGGCCGGTCTCGACGATGCCACCAAGGTCGCCCAGA
P.extremaustralis/1-1118 CGCCGCGACTACTTTGGCCGGTCTGGATGACGCCACCAAAGTCGCCAAGG
******* * ** ******* ** ** ** ******** ****** **
39016/1-1190 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC
NCGM2/1-1113 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC
138244/1-1116 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC
MDR1/1-1179 TGCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCAGCCATCGGTGCC
PA14/1-1116 TTCGCGGCGTATCCCCGGCGCCGGCCAACGTTTCGGCGGCCATCGGCGCC
PA7/64-1179 TTCGCGGCGTATCGCCGGCTCCGGCCAACGTCTCGGCCGCCATCGGCGCC
P.extremaustralis/1-1118 TTGCCGGTGTTTCGCCTGCGCCTGCCAACGTGTCGGCCGCTATCGGTCAG
* *** ** ** ** ** ** ******** ***** ** *****
39016/1-1190 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------
NCGM2/1-1113 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------
138244/1-1116 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------
MDR1/1-1179 GTTACCCCGCCGACCA----CCGCACAG------CGTTCA------
PA14/1-1116 GTTACTCCGCCGACTA----CTGCCCAG------CGTTCC------
PA7/64-1179 GTTACCCCGCCGACCA----CCGCCCAA------CGCTCC------
P.extremaustralis/1-1118 ATCCCTGCGCCAACCGAAGCCCGCGCCGATGGCTCGTTCATCGACGCCAC
* * **** ** * ** * ** **
39016/1-1190 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG
NCGM2/1-1113 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG
138244/1-1116 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG
MDR1/1-1179 -GACCCGAACAACTGGGTGCCGGTCTTCGCCGCCACTGCCAGTGCGACCG
PA14/1-1116 -GATCCGAACAACTGGGTACCGGTCTTCGCTGCCACCGCCAACCCCAACG
PA7/64-1179 -GATCCGAACAACTGGGTGCCGGTCTTCGCCGCCACCGCCAGCGCGACCG
P.extremaustralis/1-1118 TAACCAGGACAACTGGGTGCCGGTATTTGCCGCCACCGCAAGT------G
* * * ********** ***** ** ** ***** ** * *
39016/1-1190 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC
NCGM2/1-1113 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC
138244/1-1116 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC
MDR1/1-1179 ATCCGAGCGTCCGTCCGTATCCGACCACCGGCTATCCGATCCTCGGCTTC
PA14/1-1116 ACCCGAGCGTGCGTCCGTATCCGACCAGCGGCTACCCGATCCTCGGCTTC
PA7/64-1179 ATCCGAGCGTACGTCCGTACCCCACCACCGGCTACCCGATCCTCGGCTTC
P.extremaustralis/1-1118 GTCCAGCTACCTTCGCCTACCCAACGACCGGCTACCCGATCCTGGGCTTC
** * ** ** ** * ****** ******** ******
39016/1-1190 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA
NCGM2/1-1113 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA
138244/1-1116 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA
MDR1/1-1179 ACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCTACCCAGACCCAGCA
PA14/1-1116 ACCAACCTGATCTTCAGCCAGTGCTACGCCAACGCCACCCAGACCCAGCA
PA7/64-1179 ACCAACCTGATCTTCAGCCAGTGCTACGCGGACGCCACCCAGACCCAGCA
P.extremaustralis/1-1118 ACTAACCTGATCTTCAGCCAGTGCTATGCCGACCCAACCCAGACCACCCA
** *********************** ** ** * ********* **
39016/1-1190 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACA
NCGM2/1-1113 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACA
138244/1-1116 GGTGCGCGACTTCTTCACCCGCCACTACGGCGCCAGCGTCAACAACGACA
MDR1/1-1179 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACA
PA14/1-1116 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCACCGCCAACAACGACA
PA7/64-1179 GGTGCGCGACTTCTTCACCCGTCACTACGGCGCCAGCGTCAACAACGACG
P.extremaustralis/1-1118 AGTGCAGGACTTCTTCCTGCAGCACTTCGGTGCATTCAATAACAACGACG
**** ********* * **** *** ** * *********
39016/1-1190 CTGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT
NCGM2/1-1113 CTGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT
138244/1-1116 CTGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT
MDR1/1-1179 CCGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT
PA14/1-1116 CCGCGATCACCAACCATCGCTTCGTGCCGCTGCCGGCTTCCTGGAAGCTT
PA7/64-1179 CCGCGATCACCAACCATCGCTTCGTGCCGCTGCCTGCTTCCTGGAAGCTC
P.extremaustralis/1-1118 CCGCCATCAACGACAACCGCTTCGTACCGCTGCCATCCAACTGGAAACAA
* ** **** * ** * ******** ******** * ****** *
39016/1-1190 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA
NCGM2/1-1113 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA
138244/1-1116 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA
MDR1/1-1179 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACCTGTACATCGGCCA
PA14/1-1116 GCCGTACGCCAGTCGTTCCTGACCTCCACCAACAACTTGTACATCGGCCA
PA7/64-1179 GCGGTGCGTCAGTCGTTCCTGACTTCCACCAACAACCTGTACATCGGCCA
P.extremaustralis/1-1118 GCGATCACCGACACCTTCGCGACTGTTTCCAGTGGTCAGGGCATCGGCAA
** * * * *** *** *** * ******* *
39016/1-1190 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA
NCGM2/1-1113 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA
138244/1-1116 TTCCAACGTCTGCAACGGTATCGGCCGTCCGCTCTAA
MDR1/1-1179 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA
PA14/1-1116 TTCCAACGTCTGCAACGGCATCGGCCGTCCGCTCTAA
PA7/64-1179 CACCAACGTCTGCAACGGCATTGGCCGTCCGCTCTGA
P.extremaustralis/1-1118 CACCAGCGTCTGCAACGGCATCGGTCGTCCGCTGTAA
*** ************ ** ** ******** * *
Online resource 4 (continued)
BBc sub-family (group 6) - proteins
BBc/1-363 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
human_CAI/1-281 DINGGGATLPQPLYQTSGVLTAGFAPYI------LAFLNNDYSQFGTGTKNVHWA---
Leishmania/1-318 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
human_CW626261/25-218 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
X-DING-CD4_cDNA/1-124 DINGGGATLPQPLYQTSGVLTAGFAPYIGVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
potato_gene/1-207 ------GVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
Arabidopsis_Col/1-53 ------GVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
tobacco_1b/1-53 ------GVGSGKGKLAFLNNDYSQFGTGTKNVHWAGSD
tobacco_1a/1-53 ------GVGSGKGKLAFLNNDYSQFGTGTENVHWAGSD
BBc/1-363 SKLTSTELSTYASTKQATWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT
human_CAI/1-281 --LTSTELSTYASTK------LIQVPSVATSVAIPFNK--TNAVDLSVDQLCGVFSG-IT
Leishmania/1-318 SKLTSTELSTYASTKQAAWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT
human_CW626261/25-218 SKLTSTELSTYASTKQATWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT
X-DING-CD4_cDNA/1-124 SKLTSTELSTYASTKQATWGKLIQVPSVATSVAIPFNKPGTNAVDLSVDQLCGVFSGRIT
potato_gene/1-207 SKLTSTELSTYASTKQAAWGKLIQVPSVATSVAIPFNKAGTNAVDLSVDQLCGVFSGRIT
Arabidopsis_Col/1-53 SKLTSTELSTYASTKQAAWGK------
tobacco_1b/1-53 SKLTSTELSTYASTKQAAWGK------
tobacco_1a/1-53 SKLTSTELSTYSSTKQAAWGK------
BBc/1-363 TWNQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPPGA
human_CAI/1-281 TWDQLPATGRTGNIVVVYRNEA------ESKKFVVTTNFADSFGVPAGA
Leishmania/1-318 TWDQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPAGA
human_CW626261/25-218 TWNQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPPGA
X-DING-CD4_cDNA/1-124 TWNQ------
potato_gene/1-207 TWNQLPATGRTGNIVVVYRNEASGTTELFTRFLAAKCVNESKKFVVTTNFADSFGVPPGA
human_TC105368/1-65 NESKKFVVTTNFADSFGVPAGA
BBc/1-363 VPAVTSQGVMTALNAGDGRITYMSPDYAAPTLAGLDDATKVAKVAGVSPAPDNVSAAIAA
human_CAI/1-281 VPAVTSQGVMDALN-----ITYMSPDYAAPTLAGL------DNVSAAIAA
Leishmania/1-318 VPAVTSQGVMDALNAGDGRITYMSPDYAAPTLAGLDDATKVAKVAGVSPAPDNVSAAIAA
human_CW626261/25-218 VPAVTSQGVMTKLN------
potato_gene/1-207 VPAVTSQGVMDALNAGDGRITYMSPDYAAPTLAGLDDATKVAKVAGVSPAPDNVS-----
human_TC105368/1-65 VPAVTSEGVMDALSAGDGRITYMSPDYAAPTLAGLDDATIYWL
monkey/1-49 NLIFSQCYADATQTSQ
BBc/1-363 VPVPAAANVANQNAWVPVFAAAADANDPSVVPYPSSGYPILGFTNLIFSQCYADATQTSQ
human_CAI/1-281 VPVPAAANVA-QNAWVPVFA------DPSVVPYPSTGYPILGFTNLIFSQCYADATQTSQ
Leishmania/1-318 VPVPAAANVALQNAWVPVFAAAADANDPSVVPYPSTGYPILGFTNLIFSQCYADATQTSQ
monkey/1-49 VRAFFTRHYGASALNTNDNAIKANRFVPLPTAW
BBc/1-363 VRAFFTRHFGASALNSNDNAIKANRFVPLPAAWKAAITSNFVTATSALSIGKSDVCNAIG
human_CAI/1-281 VRAFFTRHYGASALN--DNAIKANRFVPLPTAWKAAITSNFVTATSALSI------
Leishmania/1-318 VRAFFTRHYGASALYTYD------
BBc/1-363 RPL-
Online resource 4 (continued)
BBc sub-family (group 6) – genes
Leishmania/1-1013 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC
BBc6R8/1-1164 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC
X-DING-CD4_cDNA/1-382 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC
human_CW626261/1-693 ATGGCAGACATCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCAGACCTCCGGC
tobacco_1a/1-159 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG
potato_gene/1-622 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG
Leishmania/1-1013 GTATTGACTGCCGGTTTCGCCCCGTACATCGGTGTGGGCAGCGGCAAAGGCAAACTGGCG
tobacco_1b/1-159 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG
Arabidopsis_Col/1-159 ------GGTGTGGGCAGCGGCAAAGGCAAACTGGCG
BBc6R8/1-1164 GTATTGACCGCCGGTTTCGCCCCGTACATCGGCGTGGGCAGCGGCAAGGGCAAACTGGCG
X-DING-CD4_cDNA/1-382 GTATTGACAGCCGGTTTCGCCCCGTACATCGGTGTGGGCAGCGGCAAAGGCAAACTGGCG
human_CW626261/1-693 GTATTGACCGCTGGTTTCGCCCCGTACATCGGCGTGGGCAGCGGCAAGGGCAAACTGGCG
tobacco_1a/1-159 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCGAGAACGTGCACTGGGCGGGC
potato_gene/1-622 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCAAGAACGTGCACTGGGCGGGC
Leishmania/1-1013 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTGCACTGGGCGGGC
tobacco_1b/1-159 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCAAGAACGTGCACTGGGCGGGC
Arabidopsis_Col/1-159 TTCCTGAACAACGACTACAGCCAGTTCGGGACCGGCACCAAGAACGTGCACTGGGCGGGC
BBc6R8/1-1164 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTTCACTGGGCGGGC
X-DING-CD4_cDNA/1-382 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTGCACTGGGCGGGC
human_CW626261/1-693 TTCCTGAACAACGACTACAGCCAGTTCGGCACCGGCACCAAGAACGTTCACTGGGCGGGC
tobacco_1a/1-159 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACTCCTCCACCAAACAAGCCGCC
potato_gene/1-622 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC
Leishmania/1-1013 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC
tobacco_1b/1-159 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC
Arabidopsis_Col/1-159 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCCGCC
BBc6R8/1-1164 AGCGACTCGAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCGACC
X-DING-CD4_cDNA/1-382 AGCGACTCCAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCGACC
human_CW626261/1-693 AGCGACTCGAAGCTGACCTCGACTGAACTGTCCACCTACGCCTCCACCAAACAAGCGACC
tobacco_1a/1-159 TGGGGCAAG------
potato_gene/1-622 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCATTCAACAAG
Leishmania/1-1013 TGGGGCAAGCTGATTCAAGTGCCTTCGGTAGCCACTTCGGTTGCCATTCCATTCAACAAG
tobacco_1b/1-159 TGGGGCAAG------
Arabidopsis_Col/1-159 TGGGGCAAG------
BBc6R8/1-1164 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCATTCAACAAG
X-DING-CD4_cDNA/1-382 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCATTCAACAAG
human_CW626261/1-693 TGGGGCAAGCTGATTCAAGTGCCTTCGGTCGCCACTTCGGTAGCCATTCCATTCAACAAG
potato_gene/1-622 GCCGGTACCAACGCAGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC
Leishmania/1-1013 GCCGGTACCAACGCGGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC
BBc6R8/1-1164 GCCGGTACCAACGCAGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC
X-DING-CD4_cDNA/1-382 CCCGGTACCAACGCGGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC
human_CW626261/1-693 GCCGGTACCAACGCGGTTGACCTGAGCGTTGATCAACTGTGCGGCGTGTTCTCGGGCCGC
potato_gene/1-622 ATCACCACCTGGAACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC
Leishmania/1-1013 ATCACCACCTGGGACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC
BBc6R8/1-1164 ATCACCACGTGGAACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC
X-DING-CD4_cDNA/1-382 ATCACCACCTGGAACCAAC------
human_CW626261/1-693 ATCACCACCTGGAACCAACTCCCGGCTACCGGTCGCACCGGTAACATCGTGGTGGTTTAC
potato_gene/1-622 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCGGCCAAGTGCGTC
Leishmania/1-1013 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCAGCCAAGTGCGTC
BBc6R8/1-1164 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCGGCCAAGTGCGTC
human_CW626261/1-693 CGTAACGAAGCCAGTGGCACCACCGAGCTGTTCACCCGTTTCCTGGCGGCCAAGTGCGTC
human_TC105368/1-198 ------TC
potato_gene/1-622 AATGAGTCCAAGAAATTTGTGGTCACTACCAACTTCGCGGACAGCTTCGGCGTGCCACCA
Leishmania/1-1013 AATGAGTCCAAGAAGTTTGTGGTCACCACCAACTTTGCAGACAGCTTCGGCGTGCCGGCA
BBc6R8/1-1164 AATGAGTCCAAGAAATTTGTGGTCACCACCAACTTCGCGGACAGCTTCGGCGTGCCACCA
human_CW626261/1-693 AATGAGTCCAAGAAATTTGTGGTCACCACCAACTTCGCGGACAGCTTCGGCGTGCCACCA
human_TC105368/1-198 AATGAGTCCAAGAAGTTTGTGGTCACCACCAACTTTGCAGACAGCTTCGGCGTGCCGGCT
potato_gene/1-622 GGCGCCGTGCCTGCCGTCACCAGCCAAGGCGTGATGGACGCGCTGAACGCCGGTGATGGC
Leishmania/1-1013 GGTGCCGTGCCTGCCGTCACCAGCCAAGGCGTGATGGACGCGCTGAACGCCGGTGATGGT
BBc6R8/1-1164 GGCGCCGTACCTGCCGTGACCAGCCAAGGCGTGATGACCGCGCTGAACGCCGGTGATGGT
human_CW626261/1-693 GGCGCCGTGCCTGCCGTGACCAGCCAAGGCGTGATGACCAAGCTGAACGA
human_TC105368/1-198 GGTGCCGTGCCTGCCGTTACCAGCGAAGGCGTGATGGACGCGCTGAGCGCCGGTGATGGT
potato_gene/1-622 CGCATCACTTATATGAGCCCGGACTATGCCGCGCCTACCTTGGCTGGCCTGGATGACGCC
Leishmania/1-1013 CGCATCACTTACATGAGCCCGGACTATGCCGCTCCTACCTTGGCTGGCCTGGATGACGCC
BBc6R8/1-1164 CGTATCACCTACATGAGCCCGGACTATGCCGCGCCTACCTTGGCTGGCCTGGATGACGCC
human_TC105368/1-198 CGCATCACTTACATGAGCCCGGACTATGCCGCTCCTACCTTGGCTGGCCTGGATGACGCC
potato_gene/1-622 ACCAAAGTGGCCAAGGTCGCCGGTGTTTCTCCAGCCCCTGATAACGTTTCCG------
Leishmania/1-1013 ACCAAAGTGGCCAAGGTCGCCGGTGTTTCTCCAGCCCCTGACAACGTTTCCGCTGCCATT
BBc6R8/1-1164 ACCAAAGTGGCCAAGGTCGCCGGTGTTTCTCCAGCCCCTGATAACGTTTCCGCCGCCATC
human_TC105368/1-198 ACCATATATTGGCTCA
Leishmania/1-1013 GCTGCTGTGCCAGTACCGGCGGCTGCCAACGTCGCCCTGCAGAACGCCTGGGTACCAGTA
BBc6R8/1-1164 GCTGCTGTGCCCGTACCGGCTGCTGCCAACGTCGCCAACCAGAACGCCTGGGTACCGGTG
Leishmania/1-1013 TTTGCTGCTGCTGCCGACGCCAATGACCCAAGCGTCGTGCCTTACCCAAGCACCGGCTAT
BBc6R8/1-1164 TTTGCTGCCGCCGCCGATGCCAACGACCCAAGCGTCGTGCCTTACCCAAGCAGCGGTTAT
Leishmania/1-1013 CCAATCCTGGGCTTCACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCCACCCAAACC
BBc6R8/1-1164 CCGATCCTGGGCTTCACCAACCTGATCTTCAGCCAGTGCTACGCCGACGCCACCCAGACC
monkey/1-149 AACCTGATCTTCAGCCAGTGCTACGCCGACGCCACCCAAACC
Leishmania/1-1013 TCGCAAGTGCGTGCGTTCTTCACCCGTCACTACGGTGCCAGCGCCCTCTACACCTACGAC
BBc6R8/1-1164 TCGCAAGTGCGTGCATTCTTCACCCGTCACTTCGGTGCCAGCGCCTTGAACAGCAACGAC
monkey/1-149 TCGCAAGTGCGTGCGTTCTTCACCCGTCACTACGGTGCCAGCGCCCTCAACACCAACGAC
Leishmania/1-1013 TAACGTCTTTCAGGCCTACCGTTTTGTGTCGCTGTCATCTGTCTGGATAGACG------
BBc6R8/1-1164 AACGCGATCAAGGCCAACCGCTTCGTGCCACTGCCAGCTGCCTGGAAAGCCGCGATCACC
monkey/1-149 AACGCCATCAAGGCCAACCGCTTTGTGCCGCTGCCAACTGCTTGGAA
BBc6R8/1-1164 AGCAACTTCGTTACCGCAACCAGCGCGCTGAGCATCGGCAAGTCTGATGTCTGCAACGCC
BBc6R8/1-1164 ATCGGTCGTCCGCTGTAA
Online resource 4 (continued)
Wood sub-family (group 7) - proteins
B.rapa/1-37 ------AAFLNNDYTK------
R.centenum/1-76 ------AAFLNNDYTK------
Pf5/1-368 DINGGGATLPQPLYQTAGVLTAGFAPYIGVGSGNGKAAFLNNDYTKFVAGTT-GKNVHWA
Wayne1/1-351 ------GVLTAGFAPYIGVGSGNGKAAFLNNDYTKFVAG-TTGKNVHWA
P.brassicacearum/1-372 DVNGGGATLPQPLYQTSGVLTAGFAPYIGVGSGAGKSAFLTNDYTKFVPGDTSGKKVHWA
Wood1R/1-372 DVNGGGATLPQPLYQTSGVLTAGFAPYIGVGSGAGKSAFLTNDYTKFVPGDTSGKKVHWA
F113/1-366 DVNGGGATLPQPLYQTSGVLTAGFAPYIGVGSGAGKSAFLNNDYTKFVSGNTS-KNVHWA
PAMC25886/1-383 DVNGGGATLPQPLYQTAGVLTAGFAPYIGVGSGNGKAAFLNNDYSKLDATVT-GKNVHWA
B.rapa/1-37 -----LSAAELLAYK------LIQVPSVATSVAIPFNK------
R.centenum/1-76 ------LIQVPSVATSVAVPFNK------
Pf5/1-368 GSDSKLSAAELKGYEDNHQAAWGKLIQVPSVATSVAVPFNKAGSTA-VD-LSVNDLCGVF
Wayne1/1-351 GSDSKLSAAELKGYEDNHQAAWGKLIQVPSVATSVAVPFNKAGSTA-VD-LSVNDLCGVF
P.brassicacearum/1-372 GSDSKLSATELSTYVSAHGAAWGPLIQVPSVATSVAIPFNKTGTAN-VD-LSVNQLCGVF
Wood1R/1-372 GSDSKLSATELSTYVSAHGAAWGPLIQVPSVATSVAIPFNKTG-TANVD-LSVNQLCGVF
F113/1-366 GSDSKLSQAELDGYVANHGAAWGPLIQVPSVATSVAIPFNMTG-TANVD-LSVNQLCGVF
PAMC25886/1-383 GSDSKLSSTELSNYVSAHGSAWGPLIQVPSVATSVAIPFNKAGVAGKTVNLSVNDLCGVF
B.rapa/1-37 ------
R.centenum/1-76 ------
Pf5/1-368 SGRVSKWEQLPTSGRTGAITVVYRNESSGTTELFTRFLNAKCAET---GTFAVTTNFASS
Wayne1/1-351 SGRVSKWEQLPTSGRTGAITVVYRNESSGTTELFTRFLNAKCAET---GTFAVTTNFASS
P.brassicacearum/1-372 SGRLTDWSQITGSGRTGAITVVYRAESSGTSELFTRFLNAKCAET---GTFAITTNFASS
Wood1R/1-372 SGRLTDWSQITGSGRTGAITVVYRAESSGTSELFTRFLNAKC---AETGTFAITTNFASS
F113/1-366 SGRLTDWSQITGSGRTGAITVVYRSDLSGTTELFTRFLNAKC---AETGTFAITTNFANS
PAMC25886/1-383 SGRLTTWNLIPDSGRTGPITVIYRKENSGTTELFTRFLNAKCSPALEGGTFAVTQAFGSS
B.rapa/1-37 ------
R.centenum/1-76 ------ITYMSPDYAAPTLAGLDDAT
Pf5/1-368 YSGGLPAGAVAAVTSQGVMDALNAGDGR------ITYMSPDYAAPTLAGLDDAT
Wayne1/1-351 YSGGLPAGAVAAVTSQGVMDALNAGDGR------ITYMSPDYAAPTLAGLDDAT
P.brassicacearum/1-372 YSGGLPASAVSATGSQAVMTALNAAQGR------ITYMSPDYAATTLAGLDDAT
Wood1R/1-372 YSGGLPASAVSATGSQAVMTALNAAQGR------ITYMSPDYAATTLAGLDDAT
F113/1-366 YSGGVPAGAVFASGSANVM------TALNAAQGRITYMSPDYAATTLAGLDDAT
PAMC25886/1-383 FSGGLPAGALNPLQQANPAGGFYADTSAGVMSTLNAADGRITYMSPDYAAATLAGLDDAT
B.rapa/1-37 ------
R.centenum/1-76 KVAV------
Pf5/1-368 KVAKVAGVSPAPANVSSAIAAVAVPDTTVRGDQNLWVPVFTSQANIDAN-PSDKSLRLYP
Wayne1/1-351 KVAKVAGVSPAPANVSSAIAAVAVPDTTVRGDQNLWVPVFTSQANIDAN-PSDKSLRLYP
P.brassicacearum/1-372 KVARVGGLSPAPANVSVAINAVPVPAAADRSNPNAWVPVFTSQKVIDETIPADPSLRLYP
Wood1R/1-372 KVARVGGLSPAPANVSVAINAVPVPAAADRSNPNAWVPVFTSQKVIDETIPADPSLRLYP
F113/1-366 KVARVAGVSPAPANVSAAIAAVAVPAAANRANPNAWVPVFAATTNPNDPSVVAYPATGYP
PAMC25886/1-383 KVATVAGVSPAPGNVSAAIGAVAVPAIANRTLPNNWVPVFAATTSASDPSVVAYPSTGYP
B.rapa/1-37 ------
R.centenum/1-76 ------
Pf5/1-368 TSGYPILGFTNLIFSQCYADANQTSQVRAFFSRHY-GALVNN-DTAINNNRFVPLPAAWK
Wayne1/1-351 TSGYPILGFTNLIFSQCYADANQTSQVRAFFSRHY-GALVNN-DTAINNNRFVPLPAAWK
P.brassicacearum/1-372 TTGYPILGFTNVIFSQCYANAAQTTQVRDFFTRHYNGTAANSNDAAITANRFVPLPGAWK
Wood1R/1-372 TTGYPILGFTNVIFSQCYANAAQTTQVRDFFTRHYNGTAANSNDAAITANRFVPLPGAWK
F113/1-366 ILGF-----TNVIFSQCYANAAQSTQVRDFFTRHYGAVAANNNDAAITANRFVPLPTTWK
PAMC25886/1-383 ILGF-----TNVVFSQCYANADQTSQVRTFFTRHYNTNAFSSNDTAIRNNRFVPLPTTWK
B.rapa/1-37 ------
R.centenum/1-76 ----DSFVTASSGLSIGNASVCNAIGRPL
Pf5/1-368 TAVRDSFVTASSGLSIGNASVCNAIGRPL
Wayne1/1-351 TAVRDSFVTASSGLSIGNASVCNAIGRPL
P.brassicacearum/1-372 SAIRGSFLTATNAQSIGNTNVCNGIGRPL
Wood1R/1-372 SAIRGSFLTATNAQSIGNTNVCNGIGRPL
F113/1-366 NAIRGSFLTTTSAQSIGNTNVCNGIGRPL
PAMC25886/1-383 TAINDTFLSAGSDLSIGKSNICNGIGRPL
Online resource 4 (continued)
Wood sub-family (group 7) - genes
Pf5/1-1179 ATGGCTGATATCAACGGCGGTGGCGCAACCCTGCCACAACCGCTGTACCA
Wayne1/1-1056 ------
PAMC25886/1-1158 ATGGCTGACGTCAACGGCGGCGGCGCCACCCTGCCACAACCGCTGTACCA
P.brassicacearum/1-1191 ATGGCTGATGTCAACGGCGGTGGTGCTACCTTGCCTCAGCCGCTGTACCA
Wood1R/1-1125 ATGGCTGATGTCAACGGCGGTGGTGCTACCTTGCCTCAGCCGCTGTACCA
F113/1-1107 ATGGCTGATGTCAATGGCGGTGGTGCTACCTTGCCTCAGCCGCTGTACCA
Pf5/1-1179 GACCGCTGGCGTACTGACCGCCGGCTTCGCTCCCTACATCGGCGTAGGCA
Wayne1/1-1056 ------GGCGTACTGACCGCCGGCTTCGCTCCCTACATCGGCGTAGGCA
PAMC25886/1-1158 GACTGCCGGCGTATTGACTGCCGGTTTCGCCCCGTACATCGGCGTGGGCA
P.brassicacearum/1-1191 GACCTCCGGTGTACTGACTGCCGGTTTCGCCCCTTACATCGGCGTGGGCA
Wood1R/1-1125 GACCTCCGGTGTACTGACTGCCGGTTTCGCCCCTTACATCGGCGTGGGCA
F113/1-1107 GACCTCCGGCGTACTGACTGCCGGTTTTGCCCCATACATCGGCGTGGGCA
** *** **** ***** ** ** ** *********** ****
Pf5/1-1179 GCGGCAACGGCAAGGCTGCCTTCCTGAACAACGACTACACCAAGTTCGTG
Wayne1/1-1056 GCGGCAACGGCAAGGCTGCCTTCCTGAACAACGACTACACCAAGTTCGTG
PAMC25886/1-1158 GCGGCAACGGCAAGGCTGCTTTCCTGAACAACGACTACAGCAAGCTGGAC
P.brassicacearum/1-1191 GCGGTGCTGGCAAGTCGGCTTTCCTGACCAACGACTACACCAAGTTCGTG
Wood1R/1-1125 GCGGTGCTGGCAAGTCGGCTTTCCTGACCAACGACTACACCAAGTTCGTG
F113/1-1107 GCGGTGCTGGCAAGTCGGCTTTCCTGAACAACGACTACACCAAGTTCGTA
**** ****** * ** ******* *********** **** * *
Pf5/1-1179 GCCGGCACCACCG---GCAAGAACGTGCACTGGGCCGGTAGCGATTCCAA
Wayne1/1-1056 GCCGGCACCACCG---GCAAGAACGTGCACTGGGCCGGTAGCGATTCCAA
PAMC25886/1-1158 GCCACCGTCACCG---GCAAGAACGTGCACTGGGCAGGCAGCGATTCCAA
P.brassicacearum/1-1191 CCTGGCGACACCAGCGGCAAGAAAGTGCACTGGGCTGGTAGCGACTCCAA
Wood1R/1-1125 CCTGGCGACACCAGCGGCAAGAAAGTGCACTGGGCTGGTAGCGACTCCAA
F113/1-1107 TCTGGCAACACCA---GCAAGAACGTGCACTGGGCGGGTAGTGATTCGAA
* * **** ******* *********** ** ** ** ** **
Pf5/1-1179 GCTCAGCGCAGCAGAGCTCAAAGGTTATGAAGACAATCACCAAGCGGCCT
Wayne1/1-1056 GCTCAGCGCAGCAGAGCTCAAAGGTTATGAAGACAATCACCAAGCGGCCT
PAMC25886/1-1158 GCTGTCTTCTACCGAGCTGAGCAACTACGTGTCTGCCCATGGTTCTGCCT
P.brassicacearum/1-1191 GCTCAGCGCCACTGAACTGAGCACCTACGTCAGTGCCCACGGTGCCGCCT
Wood1R/1-1125 GCTCAGCGCCACTGAACTGAGCACCTACGTCAGTGCCCACGGTGCCGCCT
F113/1-1107 GCTCAGCCAGGCTGAACTGGATGGCTACGTTGCCAACCACGGTGCCGCCT
*** * ** ** ** * ** * ****
Pf5/1-1179 GGGGCAAGCTGATCCAGGTGCCTTCGGTGGCCACTTCGGTTGCCGTTCCA
Wayne1/1-1056 GGGGCAAGCTGATCCAGGTGCCTTCGGTGGCCACTTCGGTTGCCGTTCCA
PAMC25886/1-1158 GGGGCCCGCTGATCCAAGTGCCTTCGGTGGCCACTTCGGTTGCCATTCCG
P.brassicacearum/1-1191 GGGGTCCATTGATCCAGGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCG
Wood1R/1-1125 GGGGTCCATTGATCCAGGTGCCTTCGGTCGCCACTTCGGTTGCCATTCCG
F113/1-1107 GGGGTCCATTGATCCAGGTGCCTTCGGTTGCCACTTCGGTTGCGATTCCA
**** ******* *********** ************** ****
Pf5/1-1179 TTCAACAAGGCTGGTTCCACCG------CCGTTGACCTGAGCGTCAATGA
Wayne1/1-1056 TTCAACAAGGCGGGTTCCACCG------CCGTTGACCTGAGCGTCAATGA
PAMC25886/1-1158 TTCAACAAGGCCGGTGTGGCGGGTAAAACCGTCAACCTCAGTGTCAACGA
P.brassicacearum/1-1191 TTCAACAAGACCGGCACTGCCA------ACGTCGACCTGAGCGTCAACCA
Wood1R/1-1125 TTCAACAAGACCGGCACTGCCA------ACGTCGACCTGAGCGTCAACCA
F113/1-1107 TTCAACATGACCGGCACTGCCA------ACGTCGATCTGAGCGTCAACCA
******* * * ** * *** * ** ** ***** *
Pf5/1-1179 CCTGTGCGGCGTGTTCTCGGGGCGCGTTTCCAAGTGGGAGCAGCTTCCGA
Wayne1/1-1056 CCTGTGCGGCGTGTTCTCGGGGCGCGTTTCCAAGTGGGAGCAGCTTCCGA
PAMC25886/1-1158 TCTGTGCGGTGTATTCTCGGGTCGTCTGACTACCTGGAACCTGATCCCGG
P.brassicacearum/1-1191 ACTGTGCGGCGTGTTCTCCGGCCGTCTGACCGACTGGAGCCAGATCACTG
Wood1R/1-1125 ACTGTGCGGCGTGTTCTCCGGCCGTCTGACCGACTGGAGCCAGATCACTG
F113/1-1107 GCTGTGCGGCGTGTTCTCTGGTCGTCTGACTGACTGGAGCCAGATCACCG
******** ** ***** ** ** * * *** * * * *
Pf5/1-1179 CTTCGGGCCGTACCGGCGCCATCACCGTGGTTTACCGCAATGAAAGCAGC
Wayne1/1-1056 CTTCGGGCCGTACCGGCGCCATCACCGTGGTTTACCGCAATGAAAGCAGC
PAMC25886/1-1158 ACTCCGGCCGTACCGGCCCGATCACCGTGATCTATCGCAAAGAAAACAGC
P.brassicacearum/1-1191 GTTCGGGCCGTACCGGCGCGATCACCGTGGTTTACCGTGCCGAGAGCAGC
Wood1R/1-1125 GTTCGGGCCGTACCGGCGCGATCACCGTGGTTTACCGTGCCGAGAGCAGC
F113/1-1107 GTTCTGGCCGTACTGGTGCGATCACTGTGGTTTACCGCAGCGACCTCAGT
** ******** ** * ***** *** * ** ** ** ***
Pf5/1-1179 GGCACCACCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCC-----
Wayne1/1-1056 GGCACCACCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCC-----
PAMC25886/1-1158 GGCACCACCGAGCTGTTCACGCGCTTCCTGAACGCCAAGTGCAGCCCGGC
P.brassicacearum/1-1191 GGCACCTCCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCT-----
Wood1R/1-1125 GGCACCTCCGAGCTGTTCACCCGCTTCCTGAACGCCAAGTGCGCT-----
F113/1-1107 GGCACCACAGAATTGTTCACTCGCTTCTTGAACGCCAAGTGCGCT-----
****** * ** ******* ****** **************
Pf5/1-1179 ----GAGACCGGCACCTTCGCCGTGACCACCAACTTCGCTTCCAGCTACT
Wayne1/1-1056 ----GAGACCGGCACCTTCGCCGTGACCACCAACTTCGCTTCCAGCTACT
PAMC25886/1-1158 CCTGGAAGGCGGCACCTTCGCCGTGACTCAGGCCTTCGGCAGCAGCTTCT
P.brassicacearum/1-1191 ----GAAACCGGCACCTTTGCCATCACCACCAACTTCGCTTCCAGCTACA
Wood1R/1-1125 ----GAAACCGGCACCTTTGCCATCACCACCAACTTCGCTTCCAGCTACA
F113/1-1107 ----GAAACCGGCACCTTTGCCATCACCACCAACTTCGCTAACAGCTACA
** ********* *** * ** ***** ***** *
Pf5/1-1179 CGGGCGGCCTGCCTGCCGGTGCGGTTG--CCGCTGTTACCAGCCAA----
Wayne1/1-1056 CGGGCGGCCTGCCTGCCGGTGCGGTTG--CCGCTGTTACCAGCCAA----
PAMC25886/1-1158 CCGGCGGCTTGCCGGCCGGCGCACTTAACCCATTGCAACAAGCCAACCCG
P.brassicacearum/1-1191 GTGGTGGTCTGCCAGCCAGCGCCGTAT--CCGCTACTGGCAGCCAG----
Wood1R/1-1125 GTGGTGGTCTGCCAGCCAGCGCCGTAT--CCGCTACTGGCAGCCAG----
F113/1-1107 GCGGTGGCGTACCGGCCGGCGCCGTTT--TCGCCTCCGGCAGTGCG----
** ** * ** *** * ** * * **
Pf5/1-1179 ------GGCGTCATGGACGCGCTGAA
Wayne1/1-1056 ------GGCGTCATGGACGCGCTGAA
PAMC25886/1-1158 GCCGGTGGTTTCTACGCCGACACCAGCGCTGGCGTAATGAGCACGCTGAA
P.brassicacearum/1-1191 ------GCTGTAATGACAGCGTTGAA
Wood1R/1-1125 ------GCTGTAATGACAGCGTTGAA
F113/1-1107 ------AACGTCATGACTGCGCTCAA
** *** ** * **
Pf5/1-1179 CGCAGGCGACGGTCGCATCACCTACATGAGCCCGGACTACGCCGCGCCGA
Wayne1/1-1056 CGCTGGCGACGGTCGCATCACCTACATGAGCCCGGACTACGCCGCGCCGA
PAMC25886/1-1158 CGCTGCTGACGGCCGCATCACTTACATGAGCCCGGATTACGCGGCCGCTA
P.brassicacearum/1-1191 CGCTGCCCAAGGTCGTATCACCTACATGAGCCCGGACTATGCCGCTACTA
Wood1R/1-1125 CGCTGCCCAAGGTCGTATCACCTACATGAGCCCGGACTATGCCGCTACTA
F113/1-1107 CGCAGCCCAAGGCCGTATCACCTACATGAGCCCGGATTACGCGGCAACTA
*** * * ** ** ***** ************** ** ** ** * *
Pf5/1-1179 CCCTGGCCGGCCTGGACGACGCTACCAAGGTGGCCAAGGTGGCTGGCGTT
Wayne1/1-1056 CCCTGGCCGGCCTGGACGACGCTACCAAGGTGGCCAAGGTGGCTGGCGTT
PAMC25886/1-1158 CCCTGGCCGGTCTGGATGACGCCACCAAGGTGGCAACCGTCGCCGGTGTT
P.brassicacearum/1-1191 CCCTGGCCGGTCTGGATGACGCCACCAAGGTCGCTCGTGTTGGCGGTCTT
Wood1R/1-1125 CCCTGGCCGGTCTGGATGACGCCACCAAGGTCGCTCGTGTTGGCGGTCTT
F113/1-1107 CTCTGGCAGGTCTGGACGACGCCACCAAGGTTGCCCGCGTCGCCGGTGTT
* ***** ** ***** ***** ******** ** ** * ** **
Pf5/1-1179 TCCCCGGCGCCTGCCAACGTTTCCTCGGCTATCGCCGCGGTTGCCGTGCC
Wayne1/1-1056 TCCCCGGCGCCTGCCAACGTTTCCTCGGCTATCGCCGCGGTTGCCGTGCC
PAMC25886/1-1158 TCGCCTGCTCCAGGCAACGTGTCGGCGGCCATCGGCGCGGTGGCTGTACC
P.brassicacearum/1-1191 TCCCCAGCACCGGCCAACGTGTCGGTAGCGATCAATGCCGTTCCCGTTCC
Wood1R/1-1125 TCCCCAGCACCGGCCAACGTGTCGGTAGCGATCAATGCCGTTCCCGTTCC
F113/1-1107 TCCCCTGCTCCAGCCAACGTGTCGGCCGCGATTGCAGCTGTTGCCGTGCC
** ** ** ** * ****** ** ** ** ** ** * ** **
Pf5/1-1179 TGATACCACTGTTCGTGGCGACCAGAACCTGTGGGTTCCTGTCTTCACTT
Wayne1/1-1056 TGATACCACTGTTCGTGGCGACCAGAACCTGTGGGTTCCTGTCTTCACTT
PAMC25886/1-1158 TGCCATCGCCAACCGTACCCTGCCCAACAACTGGGTCCCAGTGTTTGC--
P.brassicacearum/1-1191 GGCTGCTGCCGACCGGTCAAACCCGAATGCCTGGGTTCCTGTTTTCACTT
Wood1R/1-1125 GGCTGCTGCCGACCGGTCAAACCCGAATGCCTGGGTTCCTGTTTTCACTT
F113/1-1107 TGCTGCTGCCAACCGTGCCAACCCGAACGCCTGGGTGCCAGTGTTCGCC-
* * ** * ** ***** ** ** ** *
Pf5/1-1179 CCCA---GGCCAAC-----ATCGACGCCAACCCAAGCGACAAGAGCCTGC
Wayne1/1-1056 CCCA---GGCCAAC-----ATCGACGCCAACCCAAGCGACAAGAGCCTGC
PAMC25886/1-1158 ------GGCTACC-----ACCAGCGCTAGC----GACCCAAGCGTCGTC
P.brassicacearum/1-1191 CTCAAAAAGTGATTGATGAAACAATTCCGGCT---GATCCTAGC--TTGC
Wood1R/1-1125 CTCAAAAAGTGATTGATGAAACAATTCCGGCT---GATCCTAGC--TTGC
F113/1-1107 ------GCAACC-----ACCAACCCTAAC----GATCCAAGCG-TTGT
* * * * * * * * **
Pf5/1-1179 GCCTGTACCCAACCAGCGGTTACCCAATCCTGGGCTTCACCAACCTGATC
Wayne1/1-1056 GCCTGTACCCAACCAGCGGTTACCCAATCCTGGGCTTCACCAACCTGATC
PAMC25886/1-1158 GCCT--ACCCAAGCACCGGTTACCCAATCCTGGGCTTCACCAACGTGGTG
P.brassicacearum/1-1191 GCCTCTACCCAACCACCGGTTATCCGATCCTGGGCTTCACCAACGTGATC
Wood1R/1-1125 GCCTCTACCCAACCACCGGTTATCCGATCCTGGGCTTCACCAACGTGATC
F113/1-1107 GGCT-TACCCAGCCACCGGCTATCCGATCCTGGGCTTCACCAACGTGATC
* ** ***** ** *** ** ** ****************** ** *
Pf5/1-1179 TTCAGCCAGTGCTACGCCGACGCTAACCAGACTTCGCAAGTACGGGCTTT
Wayne1/1-1056 TTCAGCCAGTGCTACGCCGACGCTAACCAGACTTCGCAAGTACGGGCTTT
PAMC25886/1-1158 TTCAGCCAGTGCTACGCCAACGCTGACCAGACCTCCCAGGTCCGCACGTT
P.brassicacearum/1-1191 TTCAGCCAGTGCTACGCCAATGCTGCACAAACCACTCAGGTGCGTGATTT
Wood1R/1-1125 TTCAGCCAGTGCTACGCCAATGCTGCACAAACCACTCAGGTGCGTGATTT
F113/1-1107 TTCAGCCAGTGCTACGCCAACGCCGCCCAAAGCACCCAGGTGCGTGATTT
****************** * ** ** * * ** ** ** **
Pf5/1-1179 CTTCAGCCGTCACTACGGTGCCCTGGTG------AACAACGACACCGCCA
Wayne1/1-1056 CTTCAGCCGTCACTACGGTGCCCTGGTG------AACAACGACACCGCCA
PAMC25886/1-1158 CTTCACCCGTCATTACAACACCAACGCGTTCAGCAGCAACGATACGGCTA
P.brassicacearum/1-1191 CTTCACCCGTCACTACAACGGCACCGCTGCCAACAGCAACGACGCGGCGA
Wood1R/1-1125 CTTCACCCGTCACTACAACGGCACCGCTGCCAACAGCAACGACGCGGCGA
F113/1-1107 CTTCACCCGCCACTACGGTGCAGTCGCTGCCAACAACAACGATGCCGCCA
***** *** ** *** * * ****** * ** *
Pf5/1-1179 TCAACAACAACCGCTTCGTGCCTCTGCCAGCTGCCTGGAAAACCGCAGTA
Wayne1/1-1056 TCAACAACAACCGCTTCGTGCCTCTGCCAGCTGCCTGGAAAACCGCAGTA
PAMC25886/1-1158 TCCGCAACAACCGCTTCGTGCCACTGCCAACCACCTGGAAAACCGCAATC
P.brassicacearum/1-1191 TCACTGCCAACCGCTTCGTTCCACTGCCTGGCGCTTGGAAATCTGCCATC
Wood1R/1-1125 TCACTGCCAACCGCTTCGTTCCACTGCCTGGCGCTTGGAAATCTGCCATC
F113/1-1107 TCACTGCCAACCGCTTCGTGCCGCTGCCAACGACCTGGAAAAACGCCATC
** ************ ** ***** * ****** ** *
Pf5/1-1179 CGTGACTCGTTCGTCACTGCCTCCAGCGGCCTGAGCATCGGTAACGCCAG
Wayne1/1-1056 CGTGACTCGTTCGTCACTGCCTCCAGCGGCCTGAGCATCGGTAACGCCAG
PAMC25886/1-1158 AACGACACCTTCCTCAGCGCTGGCAGCGACCTGAGCATCGGCAAGTCGAA
P.brassicacearum/1-1191 CGTGGCAGCTTCCTGACCGCTACCAACGCTCAAAGCATCGGCAACACCAA
Wood1R/1-1125 CGTGGCAGCTTCCTGACCGCTACCAACGCTCAAAGCATCGGCAACACCAA
F113/1-1107 CGTGGCAGCTTCCTGACCACTACCAGCGCTCAAAGCATCGGCAACACCAA
* * *** * * * ** ** * ******** ** * *
Pf5/1-1179 CGTCTGCAACGCCATCGGCCGTCCGCTGTAA
Wayne1/1-1056 CGTCTGCAACGCCATCGGCCGTCCGCTGTAA
PAMC25886/1-1158 CATCTGCAACGGTATTGGTCGTCCGCTGTAA
P.brassicacearum/1-1191 CGTGTGCAATGGCATCGGTCGTCCGCTGTAA
Wood1R/1-1125 CGTGTGCAATGGCATCGGTCGTCCGCTGTAA
F113/1-1107 CGTCTGCAACGGCATCGGTCGTCCGCTGTAA
* * ***** * ** ** ************
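The asterisk lines printed under each block of the gene alignment appear to follow the usual ClustalW-style convention, in which a column is marked when every sequence carries the same nucleotide. The following minimal sketch, assuming that convention, recomputes the marker line for the final block above. The function name `conservation_line` and the dictionary `block` are illustrative only and are not part of the original material. Because runs of spaces in the marker lines seem to have been collapsed in this text version, the recomputed line may differ from the printed one in spacing.

```python
def conservation_line(seqs, gap="-"):
    """Return a marker string with '*' under each fully conserved column.

    Assumes ClustalW-style marking: a column is conserved only when all
    sequences share the same residue and none of them has a gap there.
    """
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    marks = []
    for column in zip(*(s.upper() for s in seqs)):
        first = column[0]
        conserved = first != gap and all(c == first for c in column)
        marks.append("*" if conserved else " ")
    return "".join(marks)


# Final block of the Wood sub-family gene alignment, copied from above.
block = {
    "Pf5": "CGTCTGCAACGCCATCGGCCGTCCGCTGTAA",
    "Wayne1": "CGTCTGCAACGCCATCGGCCGTCCGCTGTAA",
    "PAMC25886": "CATCTGCAACGGTATTGGTCGTCCGCTGTAA",
    "P.brassicacearum": "CGTGTGCAATGGCATCGGTCGTCCGCTGTAA",
    "Wood1R": "CGTGTGCAATGGCATCGGTCGTCCGCTGTAA",
    "F113": "CGTCTGCAACGGCATCGGTCGTCCGCTGTAA",
}

line = conservation_line(list(block.values()))
print(line)
print(line.count("*"), "of", len(line), "columns conserved")
```

For this block the sketch marks 24 of the 31 columns as conserved, which matches the number of asterisks printed under the block.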