Motif detection with the PCPM macro is a recently added feature in MASIA. It automatically identifies subtle motifs in a protein family.
Methodology:
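The relative entropy (eq. 1) is written here in its standard form:

$$R_i = \sum_b P_{i,b} \log \frac{P_{i,b}}{Q_{i,b}} \qquad (1)$$

where $P_{i,b}$ is the observed fraction of the component $i$ in the bin $b$ and $Q_{i,b}$ is the corresponding background frequency. High relative entropy values indicate a significant difference between the observed frequency distribution in a column and the a priori background distribution.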
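As an illustration of the relative entropy score described above, here is a minimal Python sketch. It assumes the standard sum over bins of observed fraction times the log of the ratio of observed to background, with a natural logarithm. The function name `relative_entropy` and its arguments `observed` and `background` are hypothetical and are not taken from MASIA.

```python
import math


def relative_entropy(observed, background):
    """Relative entropy of one column for one component.

    observed   -- observed fraction per bin (hypothetical input, sums to 1)
    background -- background frequency per bin (same length as observed)
    Assumes the natural logarithm; bins with zero observed fraction add 0.
    """
    if len(observed) != len(background):
        raise ValueError("observed and background need the same number of bins")
    total = 0.0
    for p, q in zip(observed, background):
        if p > 0:
            if q <= 0:
                raise ValueError("background must be positive where observed is")
            total += p * math.log(p / q)
    return total


# A column that matches the background scores zero.
uniform = [0.25, 0.25, 0.25, 0.25]
print(relative_entropy(uniform, uniform))

# A column concentrated in a single bin scores ln(4) against a uniform background.
print(relative_entropy([1.0, 0.0, 0.0, 0.0], uniform))
```

Under these assumptions, a column whose values all fall into one of four equally likely bins scores ln 4, about 1.39, while a column that mirrors the background scores 0.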
MOTIF LIST
# Motif a: 61 62 63 65 66 67 68 69 70 71 72 73
# Motif b: 88 89 90 91 92 93 94 95 97
# Motif c: 124 125 126 127 128 129 130 133 135 138
# Motif d: 144 146 149 151 153 154 155 157 160 161 164 166
# Motif e: 170 172 173 174
# Motif f: 180 183 184 187
# Motif g: 203 204 205 206 207 208 209 210 211 212 213 214
# Motif h: 230 231 234 235 236 239 242
# Motif i: 246 248 250 252 253
# Motif j: 263 264 265 266 268 269
# Motif k: 273 276 278 279 280 281 282 283 285 287 290
# Motif l: 305 306 307 308 309 310 311
# Motif 0: 77 83 104 112 113 218 222 225 257 294
MOTIF DETAILS
APE_H._sapiens  PKRGKKGAVAEDGDELRTEPEAKKSKTAAKKNDKEAAGEGPALYEDPPDH
Motifs
APE_H._sapiens  KTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAPDILCLQETKCSE
Motifs                    aaaaaaaaaaaaa              bbbbbbbbbb
APE_H._sapiens  NKLPAELQELPGLSHQYWSAPSDKEGYSGVGLLSRQCPLKVSYGIGDEEH
Motifs                                 ccccccccccccccc     ddddddd
APE_H._sapiens  DQEGRVIVAEFDSFVLVTAYVPNAGRGLVRLEYRQRWDEAFRKFLKGLAS
Motifs          dddddddddddddddd   eeeee     ffffffff
APE_H._sapiens  RKPLVLCGDLNVAHEEIDLRNPKGNKKNAGFTPQERQGFGELLQAVPLAD
Motifs            gggggggggggg               hhhhhhhhhhhhh   iiiii
APE_H._sapiens  SFRHLYPNTPYAYTFWTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIR
Motifs          iii         jjjjjjj   kkkkkkkkkkkkkkkkkk
APE_H._sapiens  SKALGSDHCPITLYLAL
Motifs              lllllll
INPUT PARAM
Relative Entropy-Cutoff (R): 1.25
Relative Length-Cutoff (L): 4
Relative Gap-Cutoff (G): 2
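The following Python sketch shows one way the length and gap cutoffs could turn a set of significant columns into motifs. It is an interpretation that happens to agree with the sample motif list, not a description of the PCPM code. It assumes that columns passing the entropy cutoff R are already known, that G is the largest number of non-significant columns allowed between two neighbouring members of a motif, and that L is the smallest number of significant columns a motif may contain. Under that reading, columns that do not end up in any motif correspond to "Motif 0". The name `group_motifs` and its parameters are hypothetical.

```python
def group_motifs(significant, min_len=4, max_gap=2):
    """Group significant column numbers into motifs.

    significant -- column numbers that passed the entropy cutoff (assumed given)
    min_len     -- assumed meaning of L: minimum significant columns per motif
    max_gap     -- assumed meaning of G: maximum non-significant columns
                   allowed between neighbouring members of a motif
    Returns (motifs, leftovers); leftovers are columns in runs shorter than min_len.
    """
    motifs, leftovers, run = [], [], []

    def flush():
        # Keep the current run as a motif if it is long enough.
        if not run:
            return
        if len(run) >= min_len:
            motifs.append(list(run))
        else:
            leftovers.extend(run)

    for pos in sorted(set(significant)):
        # Start a new run when too many columns were skipped.
        if run and pos - run[-1] - 1 > max_gap:
            flush()
            run = []
        run.append(pos)
    flush()
    return motifs, leftovers


# A few columns taken from the sample output.
motifs, leftovers = group_motifs([144, 146, 149, 151, 170, 172, 173, 174, 218, 222, 225])
print(motifs)     # [[144, 146, 149, 151], [170, 172, 173, 174]]
print(leftovers)  # [218, 222, 225]
```

With L = 4 and G = 2, feeding in every column number from the sample motif list, including those under "Motif 0", reproduces motifs a to l and leaves exactly the "Motif 0" columns ungrouped.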
Created and maintained by Venkat Mathura, 2003