3. ARTICLE 3 - THE FLORAL DEVELOPMENT OF THE ALLOTETRAPLOID Coffea
3.3. Results
3.3.2 The majority of 21-PHAS loci correspond to disease resistance proteins triggered by miR482
Plant phasiRNAs can be processed from long noncoding RNAs (lncRNAs) or protein coding transcripts that, after a precise cleavage guided by a miRNA, generate secondary siRNAs by the activity of DICER-like enzymes. We used small RNA libraries from vegetative and reproductive organs to identify loci producing phasiRNAs in Coffea arabica.
We identified 173 21-PHAS loci, most of them from disease resistance (R) genes (Figure 1 - second internal cycle, green dots; Supplemental Table 2). We found miRNA triggering at least 51 21-PHAS, including 23 triggered by miR482 family members. In addition, 15 21-PHAS loci were found to be triggered by a putative novel MIR family, the candidate miR245889. Similarity based analysis suggests that this candidate diverged from the miR482 superfamily (Supplemental Figure 2).
Figure 1 Genome-wide distribution of MIRs, PHAS and PHAS-like loci in the allopolyploid Coffea arabica. The outermost cycle represents chromosomes from each parental ancestor;
from C. eugenioides (ceu) or C. canephora (ccp). First inner cycle; Chromosomal coordinates of miRNA precursors, black dots are conserved families while violet dots are putative novel.
Second inner cycle; green dots point out the chromosomal coordinates of 21-PHAS loci. Third inner cycle; point out the chromosomal coordinates of 24-PHAS loci. Innermost cycle; red dots point out the chromosomal coordinates of 24-PHAS-like loci
Our prediction of loci with phasing patterns allowed us to identify two candidate tasiRNA loci. Their miRNA target sites were similar to miR390 and miR828. Further manual analysis of those loci revealed that they are the respective orthologs of TAS3 and TAS4 (Supplemental figure 3). We did not find any orthologs for TAS1 or TAS2 in accordance with the lack of its conserved trigger, miR173. We also identified two DICER-like 2 (DCL2), one in chromosome 6e and another in 9c, as loci generating phasiRNAs. This phasing of DCL2 has been described before in Fabaceae 44, and Solanaceae 45. We were not able to identify a
1ccp
1ceu
2ccp
2ceu
3ccp
3ceu
4ccp
4ceu
5ceu
5ccp 6ccp
6ceu 7ccp
7ceu 8ceu 8ccp 9ccp
9ceu 10ceu
10ccp
11 ccp
11
ceu 0 6 12 18 24 30 36 42 48
0 6
12 18
24 30
36 42
48 0
6 12
18 24
30 36
42 48
54 60
66 0
6 12
18 24
30 36
42 48 54 60 66
0 6 12 18 24
30 36
0 6 12 18 24 30 36
0 6 12 18 24 30 36
0 6 12 18 24 30 36 42 0 6 12 18 24 30 36 0 6 18 12
24
30
3642
0
612
18
24
3036
42
48
54
06
12 18 24 36 30 42 48 0 6 12 18 24 30 36 0 6 12 18 24 30 0 6 12 18 24 30 36 42 0 6 12 18 24 30 36 0 6 12 18 24 30 36 0
6 12 18
24 30
0 6
12 18
24 30
36 0
6 12
18 24
30 36
42 0
6 12
18 24
30
36 0 6 12 18 24 30 36 42 MIR9557MIR585MIR10186MIR845MIR5608MIR
64 40MIR159MIR902MIR482MIR403MIR403MIR403MIR1030MIR1
72 MIR1139MIR20625MIR6476MIR1514MIR8
175MIR
319M
IR319MIR
167M
IR
167
MIR 24 044
M IR2 673 M IR 57
82
MIR
7
494
MIR7494 MIR1
035 MIR 845 M IR 25 4202
MIR 56 40
MIR 171 M IR4 03 MIR4 03 MIR8 001 MIR3 91 MIR8 37 M IR1 72 MIR2 68 555
MIR8 45 MIR1 59 MIR5 183
MIR 390 M Ir390
MIR 159 MIR 167 MIR1 67 MIR 167 MIR 27 1876
MIR 115
31
MIR8 767 MIR94 76
MIR 780
MIR 5524 MIR 319 MIR319 MIR11 71 MIR28354 MIR172 MIR 6281 MIR52 04 MIR399 MIR399 MIR 399 MIR39 9 MIR399 MIR33682 MIR39 9 MIR399 MIR399 MIR6224 MIR12150 MIR2673 MIR11479 MIR285302 MIR166 MIR166 M
IR6283 MIR9674 M IR53
76 MIR5741 MIR
2592 MIR1037 MIR
5020 MIR414 MIR 167 MIR167 M
IR5776 MIR164 M
IR164 MIR1918 MIR408 MIR408 MIR773 MIR396 MIR
5658 MIR166 MIR479 MIR482 MIR5658 MIR319 MIR1171 MIR2
76652 MIR393
MIR172 MIR6 478 MIR172 MIR172 MIR6282 MIR858 MI
R12150 MIR1023b MIR7744 MIR283715 MIR
6283 MIR
11479 MIR36740 MIR166 MIR10411 MIR
2594 MIR
7839 MIR2949 MIR398 MIR396 MIR398 MIR11293 MIR11553 MIR845 MIR11447 MIR477 MIR167MIR305549 MIR164 MIR6478 MIR828 MIR166 MIR166
MIR11108 MIR167 MIR167 MIR11078 MIR166 MIR166 MIR7122 MIR1122 MIR399 MIR6257 MIR482 MIR1858 MIR1858 MIR393 MIR171 MIR8746 MIR6470 MIR6470 MIR398
MIR167 MIR167 MIR5218 MIR6105 MIR482 MIR1858 MIR1171 MIR4246 MIR1319 MIR9555 MIR9722 MIR164 MIR6287 MIR396 MIR1446 MIR171 MIR156 MIR162 MIR162 MIR159 MIR1533 MIR5246 MIR169 MIR169 MIR5293 MIR10186 MIR7502 MIR5042 M
IR11 MIR81443 55 M
IR26 19 M IR53
02 MIR96
73 MIR164 MIR6287 M
IR164 M
IR4237 MIR MIR396 1446 MIR MIR171 M156 IR162 M IR162 M IR333194 M IR159 M IR169 M IR169 M IR169 MIR
169 MIR
11515 M IR2
629 M IR34
6958 MIR
9386 M IR3 5713
6 M IR 171 M IR53
76 M IR 39 7 M IR1 85 8 M IR 209 0 M IR 56 48 M IR 10 59 82 M IR 2662 M IR 9386 M IR 7509 M IR 3629 M IR 1075 M IR 171 M IR8 60 M IR
1
88
5
M IR
160
M
IR
160
MIR857
M
IR 26
69
MIR8658
MIR1858
MIR8
60
3
MIR80
05
MIR870
MIR6439
MIR390
MIR2111 MIR397
MIR5819MIR482
MIR166
MIR166
MIR396
MIR396MIR396MIR171
MIR
156
MIR11069
MIR9756 MIR5201
MIR6459
MIR395 MIR395
MIR3
95
MIR3630 MIR137981 MIR6 269 MIR76 97 MIR169
MIR2
679
M
IR
1
67
MIR7124
MIR168
M
IR1
221
M IR
3
90
M IR 2111 MIR530 MIR
2919 MIR
396 MIR396 M IR396 MIR
156 MIR7
984
MIR3 73 056
MIR1 10 45 MIR8
45
MIR 37 41 44 M IR 7494 MIR
395 MIR
395 MIR
395 MIR
395 MIR
5649 MIR
5998 MIR
893 MIR1
69 M IR 87 22 MIR
77 57 MIR1
69
MIR 389
618 MIR39
1 MIR
408 MIR
856 MIR
2661 MIR
36 27 MIR
615 6 MIR3627 MIR4227 MIR
753 0 MIR
169 MIR
7764 MIR
8033 MIR482 MIR16
9 MIR15 6 MIR169 MIR171 MIR17
1 MIR1103 1 MIR16
1566 MIR2873 MIR8170 MIR156 MIR2592 MIR16
9 MIR6459 MIR5185 MIR1018 MIR11342 6 MIR169 MIR156 MIR171 M
IR171 MIR4
249 MIR11076 MIR4
416 MIR4
12506 MIR228 MIR8
014 MIR168 MI MIR3R16893 MIR
4400 MIR2919 MIR MIR34419198 MIR MIR6135 11079 MIR11079 MIR3951 MIR105MIR39934 MIR MIR513MIR47721189 MIR477 MIR166 MIR11493 MIR173045MIR5716MIR7MIR1MIR87632169 MIR10MIR2275MIR168MIR168988 MIR182879MIR9653MIR1520MIR1510 MIR187006MIR1919MIR1919MIR1919 MIR187924MIR156MIR156 MIR190770MIR399MIR399MIR319MIR319MIR477MIR477 MIR1037 MIR171 MIR160 MIR851 MIR195364MIR2275MIR160MIR395MIR395
MIR395MIR395 MIR5368 MIR171 MIR43
3246 MIR2275 MIR10187 MIR8763 MIR443691 MIR1134 MIR395 MIR395 MIR395 MIR395 MIR393
MIR482 MIR2118 MIR11340
MIR45 2328 MIR452404
MIR452316 MIR12111
MIR10187 MIR394 M
IR11449 M
IR5529 M IR9722
M IR6143
M IR396 MIR 4243 M
IR1 1178 MIR 7701
M IR4
82 M
IR5 532 M IR169
MIR 394 M IR18
85 M IR9568
M IR 29 26 M IR5
30 M IR8 30 M IR 109
88 M IR1 510 M IR6
15 6 M IR 977 3 M IR 111 08 M IR60
24 M IR 10 982 M IR7 82 2 M IR3 19 M IR2 454 91 M IR 482 M IR 48 2 M IR4 82 M IR 482 M IR 48 2 M IR 24 57 47 M IR 482 M IR 245 88 9 M IR3 98 M IR2 47400 M IR 39
5
M
IR2
484 21 M IR 536 M IR 144
2
M IR
60
26 M IR8 35 M IR7776M IR
1
10 91MIR413M
IR8 72
6MIR3
19
MIR482M
IR4
82MIR482963M
IR4
82MIR482MIR483
07
5MIR3
98
MIR483810MIR1515MIR482MIR536 21PHAS21PHAS21PHAS21PHAS21PHAS21PHAS21PHAS21PHAS
2 1PHAS
21PHAS 21PHAS
21PHAS 2 1PHAS
21PHAS 21PHAS
21PHAS 21PHAS
21PHAS 21PHAS
21PHAS 21
PHAS 21PHAS
21 PHAS 21PHAS
21PHAS 21PHAS
21PHAS 21PHAS
21PHAS
21PHAS 21PHAS
21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS
21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS
21PHAS 21PHAS 21PHAS
21PH 21PHAS 21PHASAS 21PH 21PHAS 21P AS
HAS 21PH
AS 21P
HAS 21PH
AS 21PHA
S 21P
HAS 21PHAS 21 PHAS 21
PHA S 21P
HAS 21P
HAS 21PHAS 21P HAS 21PHAS
21PHAS
21PHAS 21PHAS 2 1PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21P
HAS 21PHAS 21PHAS 21PHAS 21
PHAS 21
PHAS 21
PHAS 21PHAS 21PHAS 21PHAS 21
PHAS 21
PHAS 21PHAS 21PHAS 21
PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS 21PHAS
21PHAS 21PHAS
21PH AS 21P HAS 21PH AS
21 PH AS 21 PHAS
21P HAS 21PH AS 21 PHAS
21PH AS 21P HAS 21 PHAS 21PHAS
21
PHAS 21
PHAS21PHAS21PHAS21PHAS21PHAS
21
PHAS21PHAS21P
HAS21PHAS
specific trigger of DCL2 phasing. It is possible that another small RNA is performing this task.
3.3.4 24-PHAS loci are not triggered by any expressed miRNA
The presence of 24-nt phasiRNAs was recently reported in many eudicots, and in those reports, these small RNAsare highly enriched in reproductive organs, specifically anthers 46–48. Because several phased siRNA annotation methods can frequently mistake heterochromatic siRNAs with 24-PHAS loci 49 we manually evaluated all putative PHAS loci with the IGV genome browser. Doing so, we were able to identify 189 24-PHAS (Figure 1 - third internal cycle with yellow dots; Supplemental Table 3). Of those loci, 56 overlap with annotated protein coding genes (PCG) that were mostly identified as ―uncharacterized proteins‖ by the similarity based local alignment tool BLASTP 50. That way, these putative protein-coding loci could be lncRNAs that were misannotated. Meanwhile, other loci seem to be bonafide PCGs that, due to their proximity to the 24-PHAS loci, may be transcribed and processed as such. In addition, 58 of those phasiRNAs overlapped with transposable elements (TEs) such as Gypsy (34%) and hAT (22%).
Unexpectedly, we were unable to find any apparent miRNA triggers of the loci with the 24-nt phasing pattern. miR2275, broadly present in eudicots, is the usual trigger of 24-nt phasiRNA biogenesis 47. However, it is known to be absent in some lineages such Brassicales, Caryophyllales, Cucurbitaceas and Lamiales. In Coffea arabica, we found four putative miR2275 precursor loci in the genome, but without evidence of expression in the analyzed organs. Something similar was previously reported for the Solanales tomato and petunia in which 24-nt reproductive phasiRNAs were found to be abundant in meiotic anthers, nevertheless, no apparent miRNA target site was present 47. MEME analysis of the 189 Cofea arabica 24-PHAS loci showed that 181 of those loci contain 24-nt rich-A motifs, with statistical significance 51 (exemplified in the Supplemental Figure 4). However, no expressed miRNA (or any of the evaluated small RNA) has complementarity to those putative target sites once miRNAs tend to be poor in A-U stretches. It is also possible that due to the relatively weak pairing of adenine and thymine, with two hydrogen bonds instead of three from the guanine and cytosine, the double strand of DNA can be easily separated allowing transcription and further processed into double-stranded RNA. Then, further degradation in the precise 24-nt pattern by a yet unknown mechanism can take place.
3.3.5 Hundreds of loci present a 24 nt phasing pattern but could not be included in the