Hi Jen,As the title, I have a [fasta] file that obtained from a [gtf] file,
>cuff102.1atcgtaaagggcgat>cuff103.1gtcgttgactNNNNNNNNgtc
and I want to get the output like this to filter the sequences that contain any
not[ATCG] character?
>cuff102.1atcgtaaagggcgat
I have a large of sequences to filter. I thought a way that firstly convert the
file to [interval] file, and secondly SELECT the line not matching the patten
/\t[ATCGatcg]*[^ATCGatcg]/.Am I right? Or there is a one-step way ?

Advertising

___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/
To search Galaxy mailing lists use the unified search at:
http://galaxyproject.org/search/mailinglists/