The genome was annotated at the Universities of Padova and Verona. Gene prediction was supported by RNA sequencing of twelve samples derived from eight organs. In total, 78,311 genes were predicted and functionally annotated in Coffea arabica.

Repetitive sequences

Repeats were masked using the Coffea canephora repeat set. The set was manually curated to keep only true biological repeats, excluding purely sequence-based repeats, e.g. paralogous genes.
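
As an illustration only, the idea of masking with a curated repeat set can be sketched in a few lines of Python. This is not the pipeline used for this genome: the function names, the interval format (0-based, half-open), the repeat and gene-family identifiers, and the choice of soft masking (lowercasing) are all assumptions made for the example.

```python
from typing import Iterable, List, Set, Tuple

Hit = Tuple[int, int, str]  # (start, end, library entry name); hypothetical format


def curate(hits: Iterable[Hit], excluded: Set[str]) -> List[Tuple[int, int]]:
    """Drop hits whose library entry was flagged during curation,
    e.g. entries that turned out to be paralogous genes."""
    return [(start, end) for start, end, name in hits if name not in excluded]


def soft_mask(sequence: str, repeats: Iterable[Tuple[int, int]]) -> str:
    """Lowercase the bases inside each half-open interval [start, end)."""
    masked = list(sequence.upper())
    for start, end in repeats:
        for i in range(max(start, 0), min(end, len(masked))):
            masked[i] = masked[i].lower()
    return "".join(masked)


# Toy data: one transposon-like hit and one hit that is really a gene family.
hits = [(2, 6, "LTR_Gypsy_1"), (10, 14, "gene_family_A")]
kept = curate(hits, excluded={"gene_family_A"})
masked = soft_mask("ACGTACGTACGTACGTAC", kept)
print(masked)
```

Only the interval from the retained library entry is lowercased; the region matching the excluded gene family stays unmasked and therefore remains visible to gene prediction.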