* After removing genes for which 50% of their coding region is attributable to any combination of RepeatMasker TEs or 20-mers of copy number over 10. This TE-likely gene set can be found in our ftp site.