Supplementary Figure and Tables

Supplementary Figure and Tables

Supplementary figure and tables

Figure S1Marker-trait associations across genes in the vicinity of PHYC.

Table S1 Linkage mapping families

TableS2Primers and PCR conditions used to amplify sequences of the PHYC region for association mapping and signatures of selection studies

Table S3 BLASTN results of pearl millet sequences

Table S4 Comparison of results of the mixed model at PHYC and Pg7830

Table S5Neutrality tests applied to genes in the PHYC region in cultivated and wild pearl millet

Table S6 Effects estimates in the the inbred panel data*

Table S7 Effects estimates in linkage mapping data*

* Tables S6 and S7 are given as independent Excel files.











































Figure S1Marker-trait associations across genes in the vicinity of PHYC.Associationswere fitted on the panel of inbreds using a mixed model. The p-value of the marker effect (Wald test) is given at each position (P-value was decimal log transformed). Trait and gene codes are given at the top of each graph. The horizontal line represents the 0.05 p-value threshold.

Table S1 Linkage mapping families

Family / Female parent / Male parent
A7 / PE8151-AF2-P2 (Wild, Niger) / PE5887-AF2-P2 (Cultivated, Burkina Faso)
C1 / PE8151-AF2-P2 (Wild, Niger) / PE1205-AF2-P3 (Cultivated, Burkina Faso)
D1 / PE8504-AF3-P1 (Wild, Senegal) / PE1205-AF2-P3 (Cultivated, Burkina Faso)

The code of the two parents, the type (wild/cultivated) and the country of origin is given for each family.

Table S2.Primers and PCR conditions used to amplify sequences of the PHYC region for association mapping and signatures of selection studies.

Gene fragment / Forward primer sequence / Reverse primer sequence / Tm
(°C) / Size (bp) / Experiment
PHYC gene
PHYC6b / AGCACCGTCTCCGCCTACC / CTTGCAGGGAGCCGCGTAA / 55 / 1573 / A / S
PHYC6bF1 / CGAGGTTATCTCCGAGTGTAGG / TCCACTCTAAGCATGAACCAA / 55 / 1697 / A / S
PHYC7 / TCGGGACCTTTCAGATAACG / GATTTGGACAGTTTGGCACTTA / 55 / 746 / A / S
PHYC8 / AGGTGCAGAAAATGTCTGAG / ATATACCCCCTTCGTACCAT / 57 / 781 / A
PHYC9 / GGTATCCCCATAGTCACCAG / TCTACGGACCTAGAGGGAGT / 55 / 822 / A / S
PHYC10 / TCCGTGTGACTAAGGGTATC / CGTGGGTTACAGTTACAGG / 55 / 884 / A / S
PHYC4 / AGATTGCTCATYTRGAGTTCA / CGTGCCRCTCATCGTYTTC / 55 / 834 / A / S
Genes surrounding PHYC
Pg7830 / TGCTTGACGCCTTGTACTTC / GCCATGATGACAGTGCTCGTG / 57 / 807 / A / S
Pg7840 / TTCAGTTGTCCCTTGCAGTG / GTAGGCAGCAGTCTGGAAGG / 55 / 807 / A / S
Pg7870 / ATCATCAAGCAGATCGAGA / GGCACGTACTGCGGGTA / 55 / 858 / A / S
Pg7878 / TCGCACTGACCGCATGTGAAA / TCTCAAGCGGCTAAAGCAAGCA / 57 / 923 / A / S
Pg7880 / AAGGTCAGGATCGGCAGC / TGCACTAAAAGCGAAAGCAC / 60 / 913 / A / S
Random genes
PgAY620961 / CCGCCCTCGTCTTCTCAT / GTCACCGACGATGCCAAT / 55 / 207 / S
PgAY741536 / ACCTGCCTTCCTTTCTAGCC / CGCCGTGTGGGTAAACTACT / 55 / 814 / S
PgFLIE_3 / GTTGCCTCATCCACTTCCAT / GCACCCAGCTGAAAGAAAAG / 55 / 606 / S
PgPRSC1_015 / ATTGAGCCCCAGCATTTGAG / TCCATAATAAGGGCCCAAAG / 55 / 391 / S
PgPRSC1_021 / GTACCTCTCACCAAAGCTTG / TGTTCAGGAAGGATTACTGG / 55 / 682 / S
PgPRSC1_045 / AACGACCCCTTCATCACCAC / GAATCTCCTCAGGGTTCCTG / 55 / 371 / S
PgPRSC4_062 / CCGTAACTTGTAAGAGGAGAG / TGGAACTTCTTTGCAGTGAG / 55 / 420 / S
PgSRSC11_006 / AGCTGGCCGTCTTCTACG / AGTATCACGATCATCAGAGC / 55 / 619 / S
PgSRSC2_002 / GGAGGAAATCCAGAAGAAAAC / GACACCGTCTCATACTGATG / 55 / 692 / S
PgSRSC3_010 / CTTCTAGATGCTGGTTGGAC / GCAGATCAAATTGCCTCCATC / 50 / 569 / S
PgSRSC3_027 / AAGAGCTACCTCGTCAATGG / CAACCCTGCTTTATGAGCATC / 55 / 320 / S
PgSRSC4_014 / CAAGATCCGGAGGCAGATC / ACCAGATTGGAAAGATCACG / 50 / 315 / S
PgSRSC5_016 / TGGGAGCTGGGTTGTGTTTG / CCTTTTGAGCCAGCAAAGG / 55 / 385 / S
PgSRSC6_018 / TTCTGCAGCAAGTGCATTTG / CAGTTGGATACTTGTAGCAG / 55 / 549 / S
PgSRSC7_006 / CATAGCTTATCTTGCCATCC / CAAGTTCAAGAAGCATGCAC / 50 / 471 / S
PgSRSC7_014 / GCAACGTCAAGGAGTCCATC / CCAAAAGGCACAGTATCCTG / 48 / 437 / S
PgSRSC7_023 / AAGTGGCGGTGAAGCTCATC / GGAAGCAGACAAATAAGGTG / 55 / 541 / S
PgSRSC8_021 / CATCGGGTTCAACTTCAT / GAACCATCAACATGATCACA / 55 / 519 / S
PgSRSPY_3 / AGCTCGTTTCGTTATCGTAG / TGGGGTCTCCAAGTATCAT / 55 / 433 / S
PgSRSPY_4 / CCGTCTCCACCTCTATGTC / GCCGACATCTCTCCTGAC / 55 / 308 / S

Gene name, forward and reverse primer sequences and melting temperature (Tm) for polymerase chain reaction (PCR) amplification for each of the genes and fragment size are presented. The data from these genes have been used for the detection of selection analysis (S), association mapping analysis (A) or both analyses.

Table S3 BLASTN results for pearl millet and sorghum sequences (or: Comparison of BLASTN results in pearl millet and sorghum sequences)

Pearl millet locus / Size (bp) / Sorghum gene / E-value / Annotation / GO biological_process
Pg7830 / 807 / Sb01g007830 / 0.00044 / - / Cellular potassium homeostasis
Pg7840 / 807 / Sb01g007840 / 4.4e-114 / - / Chiasma assembly
Pg7870 / 858 / Sb01g007870 / 0.023 / - / Metal ion transport
Pg7878 / 923 / Sb01g007878 / 8.3e-94 / - / -
Pg7880 / 913 / Sb01g007880 / 4.0e-13 / - / Cellular metal ion homeostasis
PHYC / 6115 / Sb01g007850 (PHYC) / 0 / Phytochrome C / red, far-red light phototransduction

E-value of top BLASTN hits in pearl millet sequenced fragments and sorghum genes. The size of the sequenced fragments of pearl millet genes is given. We named the novel sequences of pearl millet using the corresponding gene index of the sorghum homolog preceded by the prefix Pg which refers to Pennisetum glaucum. Sorghum genes were named according to GRAMENE nomenclature (species name, chromosome index, gene index). Annotation corresponds to the annotation of the sorghum gene (the sign “-“ isdenotes uncharacherized protein). The gene-ontology (GO) of the biological process category is given.

Table S4 Comparison of results of the mixed model at PHYC and Pg7830

Sampling / Site / N / Effect estimate / P-value / LK
Analysis with different subsets / PHYC5004C>A / 87 / 4.22 ±1.24 / 6.93 x 10-04 / -1403.14
Pg7830767T>G / 77 / 4.32 ±1.13 / 1.32 x 10-04 / -1278.95
Analysis with the same subset / PHYC5004C>A / 75 / 5.54 ±1.3 / 2.07 x 10-05 / -1227.78
Pg7830767T>G / 75 / 4.24 ±1.15 / 2.36 x 10-04 / -1229.80

A mixed model was fitted to link polymorphisms with the flowering phenotype scored in 9 trials. The results of one marker each at PHYC and Pg7830 are compared with respect to the method used to handle missing sequence data. In the available dataset, entries with missing data differed between markers. In the first case, the analysis was performed after filtering entries with missing data for each marker on its own. In the second case, the dataset was filtered to choose only entries with valid sequence data for both markers (using the same subset). The number N of inbred in the final subset, the effect estimate (coefficient ±standard error, in days), the p-value for marker effect significance (Wald test) and the log-likelihood (LK) of the model are given. Based on different subsets, the Pg7830 marker yielded the highest level of significance and a higher effect estimate. Based on the same subset of inbreds, the situation was reversed and the PHYC marker yielded the highest level of significance and a higher effect estimate.

Table S5 Neutrality tests applied to the genes in the PHYC region in cultivated and wild pearl millet

For each gene, nucleotide diversity (π) for wild and cultivated samples, nucleotide polymorphism (θ) for wild (θw) and cultivated samples (θc), Tajima’s D for wild and cultivated samples, Fay and Wu’s H for wild and cultivated samples, and differentiation (FST) between wild and cultivated sample are shown. For each statistic (Tajima's D , Fay and Wu's H, FST) a p-value (P) is given based on a model of pearl millet domestication. P corresponds to the rank of the observed statistic value with respect to the expected distribution obtained by the EGGF model, divided by the number of simulations. The number of simulations (n) used to calculate the rank of FST for a given gene is shown. Simulations that shared similar θw per gene ± 0.1 were used. For Tajima’s D and Fay and Wu’s H, bilateral tests were used: * P<0.025 or P>0.975, *** P<0.0005 or P>0.9995. For FST, a unilateral test was used: * P>0.95. NA: outgroup sequence not available. Data from PHYC6b, PHYC6bF1, PHYC7, PHYC9, PHYC10, PHYC4 are from Clotault et al. (2012).