Scansite:
##########
a)
Regular Expression: FS[^PW][LIVCAT][^PD]
Sequences Matching Search Criteria: 118745
b)
Regular Expression: [DNQGHRK][^GP][LIVMC][DENQSTAGC]
c)
Regular Expression: [DNQGHRK][^GP][LIVMC][DENQSTAGC].*FS[^PW][LIVCAT][^PD]
Sequences Matching Search Criteria: 108718
d)
Regular Expression: ([DNQGHRK][^GP][LIVMC][DENQSTAGC]){2}.*(FS[^PW][LIVCAT][^PD]){2}
Sequences Matching Search Criteria: 69
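
Note on the four queries: the sketch below shows how patterns a) to d) behave under Python's `re` module, which is an assumption, since the original script's language is not stated. The two toy sequences and the helper `matching_queries` are hypothetical. It illustrates that `{2}` in query d) only matches two copies of a motif lying directly next to each other.

```python
import re

# Motifs copied from the Scansite queries above.
ATP = r"FS[^PW][LIVCAT][^PD]"
CALCIUM = r"[DNQGHRK][^GP][LIVMC][DENQSTAGC]"

PATTERNS = {
    "a": re.compile(ATP),
    "b": re.compile(CALCIUM),
    "c": re.compile(CALCIUM + r".*" + ATP),
    "d": re.compile(r"(" + CALCIUM + r"){2}.*(" + ATP + r"){2}"),
}

def matching_queries(sequence):
    """Return the labels of all queries whose pattern occurs in the sequence."""
    return [label for label, pattern in PATTERNS.items() if pattern.search(sequence)]

# Hypothetical toy sequences, not taken from any database.
tandem = "DALDDALDKKFSALAFSALA"   # both motifs duplicated back to back
spaced = "DALDKDALDKKFSALAFSALA"  # calcium motifs separated by one residue

print(matching_queries(tandem))  # ['a', 'b', 'c', 'd']
print(matching_queries(spaced))  # ['a', 'b', 'c']
```

The second sequence contains two calcium-binding motifs, but they are not adjacent, so query d) does not count it.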
Script:
########
Searching for regular expressions in database...
Total number of proteins in database: 522018 - so this may take a while...
a) How many proteins do you obtain searching for the ATP-binding domain?
122189
b) How many proteins do you obtain searching for the Calcium-binding domain?
515491
c) How many proteins do you obtain that possess such a Calcium-binding domain followed (anywhere)
by an ATP-binding domain?
111829
d) How many proteins do you obtain that possess a duplicated Calcium-binding domain anywhere in
front of a duplicated ATP-binding domain?
74
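
Note on the counting: a minimal sketch of how such per-protein counts could be produced, assuming the database is available as FASTA text. This is not the original script; `read_fasta`, `count_matching_proteins` and the toy records are hypothetical names and data. Each protein is counted once, however many times the pattern occurs in it.

```python
import re

ATP = r"FS[^PW][LIVCAT][^PD]"
CALCIUM = r"[DNQGHRK][^GP][LIVMC][DENQSTAGC]"

def read_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)

def count_matching_proteins(fasta_text, pattern):
    """Count proteins whose full sequence contains at least one match."""
    regex = re.compile(pattern)
    return sum(1 for _, seq in read_fasta(fasta_text) if regex.search(seq))

# Hypothetical toy database with three records.
TOY_FASTA = """>toy1
MKDALD
KKFSALA
>toy2
MFSALAK
>toy3
MPPPP
"""

print(count_matching_proteins(TOY_FASTA, ATP))                    # 2
print(count_matching_proteins(TOY_FASTA, CALCIUM))                # 1
print(count_matching_proteins(TOY_FASTA, CALCIUM + ".*" + ATP))   # 1
```

The sequence lines of each record are joined before matching. In `toy1` the two motifs sit on different lines, and `.` does not match a newline by default, so query c) would miss this record if the lines were searched separately.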