Aethionema arabicum genome annotation using pacbio full‐length transcripts provides a valuable resource for seed dormancy and brassicaceae evolution research

ORCID
0000-0002-6489-5566
Affiliation
University of Marburg
Fernandez‐Pozo, Noe;
ORCID
0000-0003-4213-4907
Affiliation
University of Marburg
Metz, Timo;
ORCID
0000-0003-0955-9241
Affiliation
Royal Holloway University of London
Chandler, Jake O.;
GND
1014065593
ORCID
0000-0003-1919-6077
Affiliation
Matthias Schleiden Institute/Genetics
Gramzow, Lydia;
ORCID
0000-0002-2048-1628
Affiliation
Austrian Academy of Sciences
Mérai, Zsuzsanna;
ORCID
0000-0001-7325-0527
Affiliation
Université Paris‐Saclay
Maumus, Florian;
ORCID
0000-0002-7757-4809
Affiliation
Austrian Academy of Sciences
Mittelsten Scheid, Ortrun;
GND
1027426107
ORCID
0000-0003-4854-8692
Affiliation
Matthias Schleiden Institute/Genetics
Theißen, Günter;
ORCID
0000-0001-6777-6565
Affiliation
Wageningen University
Schranz, M. Eric;
ORCID
0000-0002-6045-8713
Affiliation
Royal Holloway University of London
Leubner‐Metzger, Gerhard;
ORCID
0000-0002-0225-873X
Affiliation
University of Marburg
Rensing, Stefan A.

Aethionema arabicum is an important model plant for Brassicaceae trait evolution, particularly of seed (development, regulation, germination, dormancy) and fruit (development, dehiscence mechanisms) characters. Its genome assembly was recently improved but the gene annotation was not updated. Here, we improved the Ae. arabicum gene annotation using 294 RNA‐seq libraries and 136 307 full‐length PacBio Iso‐seq transcripts, increasing BUSCO completeness by 11.6% and featuring 5606 additional genes. Analysis of orthologs showed a lower number of genes in Ae. arabicum than in other Brassicaceae, which could be partially explained by loss of homeologs derived from the At‐α polyploidization event and by a lower occurrence of tandem duplications after divergence of Aethionema from the other Brassicaceae. Benchmarking of MADS‐box genes identified orthologs of FUL and AGL79 not found in previous versions. Analysis of full‐length transcripts related to ABA‐mediated seed dormancy discovered a conserved isoform of PIF6‐β and antisense transcripts in ABI3 , ABI4 and DOG1 , among other cases found of different alternative splicing between Turkey and Cyprus ecotypes. The presented data allow alternative splicing mining and proposition of numerous hypotheses to research evolution and functional genomics. Annotation data and sequences are available at the Ae . arabicum DB ( https://plantcode.online.uni‐marburg.de/aetar_db ).

Significance Statement Improved gene annotation of Aethionema arabicum using long‐read transcript sequencing provides a plethora of full‐length isoforms and an important resource for Brassicaceae evolution studies.

Cite

Citation style:
Could not load citation form.

Rights

License Holder: Copyright © 2021 John Wiley & Sons Ltd and the Society for Experimental Biology

Use and reproduction:
This publication is with permission of the rights owner freely accessible due to an Alliance licence and a national licence (funded by the DFG, German Research Foundation) respectively.