Hi all,
I have a set of automated gene prediction co-ordinates, - the file consists of the
id of each gene followed by its co-ordinates, line by line in a contiguous
manner.
I need to go through these, identify the first instance of each prediction which is
a gene "within" the previous gene and throw those out. Are there any bioperl
scripts that do that?
e.g:
>1:[300:700]
>2:[1200:900]
>3:[1300:1800]
>4: [1600:1900]
>5: [1700:2000]
#4 is a one such example I'd like to find and eliminate.
TIA,
-Nandita