********************************************************************************
MEME - Motif discovery tool
********************************************************************************
MEME version 2.2 (Release date: 1998/05/05 20:35:42)

For further information on how to interpret these results or to get
a copy of the MEME software please access http://www.sdsc.edu/MEME.

This file may be used as input to the MAST algorithm for searching
sequence databases for matches to groups of motifs.  MAST is available
for interactive use and downloading at http://www.sdsc.edu/MEME.
********************************************************************************

********************************************************************************
REFERENCE
********************************************************************************
If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan,
"Fitting a mixture model by expectation maximization to discover
motifs in biopolymers", Proceedings of the Second International
Conference on Intelligent Systems for Molecular Biology, pp. 28-36,
AAAI Press, Menlo Park, California, 1994.
********************************************************************************

********************************************************************************
TRAINING SET
********************************************************************************
DATAFILE= adh.s (deleted by web version of MEME)
ALPHABET= ACDEFGHIKLMNPQRSTVWY
Sequence name            Weight Length  Sequence name            Weight Length
-------------            ------ ------  -------------            ------ ------
2BHD_STREX               1.0000    255  3BHD_COMTE               1.0000    253
ADH_DROME                1.0000    255  AP27_MOUSE               1.0000    244
BA72_EUBSP               1.0000    249  BDH_HUMAN                1.0000    343
BPHB_PSEPS               1.0000    275  BUDC_KLETE               1.0000    241
DHES_HUMAN               1.0000    327  DHGB_BACME               1.0000    262
DHII_HUMAN               1.0000    292  DHMA_FLAS1               1.0000    270
ENTA_ECOLI               1.0000    248  FIXR_BRAJA               1.0000    278
GUTD_ECOLI               1.0000    259  HDE_CANTR                1.0000    906
HDHA_ECOLI               1.0000    255  LIGD_PSEPA               1.0000    305
NODG_RHIME               1.0000    245  RIDH_KLEAE               1.0000    249
YINL_LISMO               1.0000    248  YRTP_BACSU               1.0000    238
CSGA_MYXXA               1.0000    166  DHB2_HUMAN               1.0000    387
DHB3_HUMAN               1.0000    310  DHCA_HUMAN               1.0000    276
FABI_ECOLI               1.0000    262  FVT1_HUMAN               1.0000    332
HMTR_LEIMA               1.0000    287  MAS1_AGRRA               1.0000    476
PCR_PEA                  1.0000    399  RFBB_NEIGO               1.0000    346
YURA_MYXXA               1.0000    258
********************************************************************************

********************************************************************************
EXPLANATION OF RESULTS
********************************************************************************
For each motif that it discovers in the training set, MEME prints the
following information:

  Summary Line

    This line gives the width (`width') and expected number of
    occurrences in the training set (`sites') of the motif.  MEME
    numbers the motifs consecutively from one as it finds them.  MEME
    usually finds the most statistically significant motifs first.
    Each motif describes a pattern of a fixed width--no gaps are
    allowed in MEME motifs.  MEME estimates the number of places the
    motif occurs in the training set.  This need not be an integer
    value.

  Simplified Motif Letter-probability Matrix

    MEME motifs are represented by letter-probability matrices that
    specify the probability of each possible letter appearing at each
    possible position in an occurrence of the motif.  In order to make
    it easier to see which letters are most likely in each of the
    columns of the motif, the simplified motif shows the letter
    probabilities multiplied by 10 rounded to the nearest integer.
    Zeros are replaced by ":" (the colon) for readability.

  Information Content Diagram

    The information content diagram provides an idea of which
    positions in the motif are most highly conserved.  Each column
    (position) in a motif can be characterized by the amount of
    information it contains (measured in bits).  Highly conserved
    positions in the motif have high information; positions where all
    letters are equally likely have low information.  The diagram is
    printed so that each column lines up with the same column in the
    simplified motif letter-probability matrix above it.

    Summing the information content for each position in the motif
    gives the total information content of the motif (shown in
    parentheses to the left of the diagram).  This gives a measure of
    the usefulness of the motif for database searches.  For a motif to
    be useful for database searches, it must as a rule contain at
    least log_2(N) bits of information where N is the number of
    sequences in the database being searched.  For example, to
    effectively search a database containing 100,000 sequences for
    occurrences of a single motif, the motif should have an IC of at
    least 16.6 bits.  Motifs with lower information content are still
    useful when a family of sequences shares more than one motif since
    they can be combined in multiple motif searches (using MAST).

  Multilevel Consensus Sequence

    The multilevel consensus sequence corresponding to the motif is an
    aid in remembering and understanding the motif.  It is calculated
    from the motif letter-probability matrix as follows.
    Separately for each column of the motif, the letters in the
    alphabet are sorted in decreasing order by the probability with
    which they are expected to occur in that position of motif
    occurrences.  The sorted letters are then printed vertically with
    the most probable letter on top.  Only letters with probabilities
    of 0.2 or higher at that position in the motif are printed.  As an
    example, the multilevel consensus sequence of motif 2 in the
    sample output is:

      Multilevel           LITGAASGIG
      consensus             V  GS
      sequence                  G

    This multilevel consensus sequence says several things about the
    motif.  First, the most likely form of the motif can be read from
    the top line as LITGAASGIG.  Second, only the letter L has
    probability 0.2 or higher in position 1 of the motif, both I and V
    have probability 0.2 or higher in position 2, etc.  Third, a rough
    approximation of the motif can be made by converting the
    multilevel consensus sequence into the Prosite signature
    L-[IV]-T-G-[AG]-[ASG]-S-G-I-G.  The multilevel consensus sequence
    is printed so that each column lines up with the same column in
    the simplified motif and information content diagrams above it.

  Motif in BLOCKS or FASTA format

    For use with the BLOCKS (http://www.blocks.fhcrc.org/blocks)
    tools, MEME prints the sites in the sequences which were used to
    construct the motif in BLOCKS format.  The sites reported are, for
    the different model types:

      OOPS     position with highest z_i in each sequence,
      ZOOPS    position with highest z_i > 0.5 in each sequence,
      TCM      all positions with z_i > 0.5,

    where z_i is the probability that an occurrence of the motif
    starts at position i in the sequence given the sequence and the
    motif model.  If you include the -print_fasta switch on the
    command line, MEME prints the motif sites in FASTA format instead
    of BLOCKS format.

  Possible Examples of the Motif

    As a further aid in understanding the motif, MEME displays a list
    of possible occurrences of the motif in the training set.
    This list is made by converting the motif letter-probability
    matrix into a position-dependent scoring matrix (log-odds matrix)
    and using that to compute a match score between each position in
    the training set and the motif.  All positions which score above a
    threshold score are listed.  (The threshold score is chosen by
    MEME such that the expected number of non-motif positions listed
    in error will equal the number of actual motif positions not
    listed.)  The format of the list is sequence name, starting
    position of the (putative) occurrence, match score of the
    position, and the actual sequence including the ten positions
    before and after the motif occurrence (`site').

  Position-dependent Scoring Matrix

    The position-dependent scoring matrix corresponding to the motif
    is printed for use by database search programs such as MAST.  This
    matrix is a log-odds matrix calculated by taking the log (base 2)
    of the ratio p/f at each position in the motif where p is the
    probability of a particular letter at that position in the motif,
    and f is the average frequency of that letter in the non-redundant
    database as of 9/22/96.  The scoring matrix is printed
    "sideways"--columns correspond to the letters in the alphabet (in
    the same order as shown in the simplified motif) and rows
    correspond to the positions of the motif, position one first.  The
    scoring matrix is preceded by a line starting with "log-odds
    matrix:" and containing the length of the alphabet, width of the
    motif, number of characters in the training set and the scoring
    threshold used in the list of possible motif examples.

  Motif Letter-probability Matrix

    The motif itself is a position-dependent letter-probability matrix
    giving, for each position in the pattern, the probabilities of
    each possible letter occurring there.
The letter-probability matrix is printed "sideways"--columns correspond to the letters in the alphabet (in the same order as shown in the simplified motif) and rows corresponding to the positions of the motif, position one first. The motif is preceded by a line starting with "letter-probability matrix:" and containing the length of the alphabet, width of the motif and number of characters in the training set. ******************************************************************************** ******************************************************************************** MOTIF 1 width = 9 sites = 29.5 ******************************************************************************** Simplified A ::1::::8: motif letter- C ::::::::: probability D :8::::::: matrix E ::::::::: F ::::::::: G ::1:::::9 H ::::::::: I 2:212:::: K ::::::::: L 3:18::::: M ::::::::: N :::::89:: P ::::::::: Q ::::::::: R ::::::::: S ::::::::: T ::::::::: V 3:3:7:::: W ::::::::: Y ::::::::: bits 6.2 5.6 5.0 4.4 Information 3.7 content 3.1 * ** (22.0 bits) 2.5 * ** * 1.9 * ****** 1.2 ** ****** 0.6 ********* 0.0 --------- Multilevel VDVLVNNAG consensus L sequence -------------------------------------------------------------------------------- Motif 1 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 1 width=9 seqs=33 2BHD_STREX ( 81) VDGLVNNAG 0.999998 3BHD_COMTE ( 81) LNVLVNNAG 0.999998 ADH_DROME ( 86) VDVLINGAG 0.999967 AP27_MOUSE ( 77) VDLLVNNAA 0.999995 BA72_EUBSP ( 86) LDVMINNAG 0.999998 BDH_HUMAN ( 138) MWGLVNNAG 0.998039 BPHB_PSEPS ( 79) IDTLIPNAG 0.999538 BUDC_KLETE ( 80) FNVIVNNAG 0.999914 DHES_HUMAN ( 84) VDVLVCNAG 0.999995 DHGB_BACME ( 87) LDVMINNAG 0.999993 DHMA_FLAS1 ( 198) VDVTGNNTG 0.995902 ENTA_ECOLI ( 73) LDALVNAAG 0.99954 FIXR_BRAJA ( 112) LHALVNNAG 0.9999 GUTD_ECOLI ( 82) VDLLVYSAG 0.998643 HDE_CANTR ( 396) IDILVNNAG 0.999733 HDHA_ECOLI ( 89) VDILVNNAG 1 NODG_RHIME ( 81) VDILVNNAG 1 RIDH_KLEAE ( 89) LDIFHANAG 0.974636 
YINL_LISMO ( 83) VDAIFLNAG 0.957081 YRTP_BACSU ( 84) IDILINNAG 1 CSGA_MYXXA ( 13) VDVLINNAG 1 DHB2_HUMAN ( 161) LWAVINNAG 0.976358 DHB3_HUMAN ( 125) IGILVNNVG 0.999908 DHCA_HUMAN ( 83) LDVLVNNAG 1 FVT1_HUMAN ( 115) VDMLVNCAG 0.999978 HMTR_LEIMA ( 103) CDVLVNNAS 0.999993 MAS1_AGRRA ( 320) IDGLVNNAG 0.999997 PCR_PEA ( 165) LDVLINNAA 0.999974 YURA_MYXXA ( 90) LDLVVANAG 0.996835 // ---------------------------------------------------------------------- Possible examples of motif 1 in the training set ---------------------------------------------------------------------- Sequence name Start Score Site ------------- ----- ----- --------- 2BHD_STREX 81 28.80 VAYAREEFGS VDGLVNNAG ISTGMFLETE 3BHD_COMTE 81 25.99 MAAVQRRLGT LNVLVNNAG ILLPGDMETG ADH_DROME 86 22.33 LKTIFAQLKT VDVLINGAG ILDDHQIERT AP27_MOUSE 77 24.36 TEKALGGIGP VDLLVNNAA LVIMQPFLEV BA72_EUBSP 86 26.39 VGQVAQKYGR LDVMINNAG ITSNNVFSRV BDH_HUMAN 138 23.46 PFEPEGPEKG MWGLVNNAG ISTFGEVEFT BPHB_PSEPS 79 18.60 ASRCVARFGK IDTLIPNAG IWDYSTALVD BUDC_KLETE 80 20.97 VEQARKALGG FNVIVNNAG IAPSTPIESI DHES_HUMAN 84 25.67 AARERVTEGR VDVLVCNAG LGLLGPLEAL DHGB_BACME 87 26.39 VQSAIKEFGK LDVMINNAG MENPVSSHEM DHMA_FLAS1 198 16.36 ILVNMIAPGP VDVTGNNTG YSEPRLAEQV ENTA_ECOLI 73 21.90 CQRLLAETER LDALVNAAG ILRMGATDQL FIXR_BRAJA 112 23.67 EVKKRLAGAP LHALVNNAG VSPKTPTGDR GUTD_ECOLI 82 17.17 SRGVDEIFGR VDLLVYSAG IAKAAFISDF HDE_CANTR 92 20.90 VETAVKNFGT VHVIINNAG ILRDASMKKM HDE_CANTR 396 29.32 IKNVIDKYGT IDILVNNAG ILRDRSFAKM HDHA_ECOLI 89 30.18 ADFAISKLGK VDILVNNAG GGGPKPFDMP NODG_RHIME 81 30.18 GQRAEADLEG VDILVNNAG ITKDGLFLHM RIDH_KLEAE 89 16.02 LQGILQLTGR LDIFHANAG AYIGGPVAEG YINL_LISMO 83 14.64 VELAIERYGK VDAIFLNAG IMPNSPLSAL YRTP_BACSU 84 27.41 VAQVKEQLGD IDILINNAG ISKFGGFLDL CSGA_MYXXA 13 28.94 AFATNVCTGP VDVLINNAG VSGLWCALGD DHB2_HUMAN 161 19.62 KVAAMLQDRG LWAVINNAG VLGFPTDGEL DHB3_HUMAN 125 18.63 HIKEKLAGLE IGILVNNVG MLPNLLPSHF DHCA_HUMAN 83 30.23 RDFLRKEYGG LDVLVNNAG IAFKVADPTP FVT1_HUMAN 115 24.21 IKQAQEKLGP VDMLVNCAG MAVSGKFEDL 
HMTR_LEIMA 103 24.02 VAACYTHWGR CDVLVNNAS SFYPTPLLRN MAS1_AGRRA 320 27.94 VTAAVEKFGR IDGLVNNAG YGEPVNLDKH PCR_PEA 165 23.97 VDNFRRSEMP LDVLINNAA VYFPTAKEPS YURA_MYXXA 90 18.59 IRALDAEAGG LDLVVANAG VGGTTNAKRL ---------------------------------------------------------------------- log-odds matrix: alength= 20 w= 9 n= 9732 bayes= 8.36143 -2.723 0.818 -5.202 -4.536 -0.081 -4.430 -3.513 1.560 -4.216 1.814 0.701 -4.123 -3.142 -3.846 -4.065 -3.572 -2.723 2.425 -3.272 -3.032 -3.441 -3.841 4.023 -1.204 -4.313 -2.395 -0.889 -4.226 -4.009 -4.570 -3.882 -0.220 -4.683 -3.546 -3.936 -3.347 -3.589 -4.008 0.407 -3.867 0.768 -2.342 -4.756 -4.189 -2.319 0.376 -3.154 1.758 -3.870 0.289 0.915 -3.146 -4.229 -3.491 -3.204 -3.190 -0.583 2.420 -3.040 -2.688 -3.383 -2.597 -5.066 -4.331 -0.585 -5.089 -3.668 -0.081 -4.098 3.045 1.107 -4.392 -4.287 -3.383 -3.691 -4.232 -1.453 -0.397 -3.403 -3.428 -1.371 -1.894 -3.821 -3.572 -1.090 -1.955 -0.469 1.476 -3.564 -2.234 -1.835 -3.701 -3.611 -3.536 -3.266 -2.883 -1.899 3.383 -3.659 -3.686 -1.879 -0.980 -2.231 -4.187 -3.807 -3.564 -0.891 -3.306 -3.238 -2.757 -3.337 4.193 -2.276 -2.750 -3.524 -1.976 -2.703 -3.785 -3.339 -1.694 -2.462 -0.914 -2.251 -4.175 -3.832 -2.393 -0.967 -3.338 -3.261 -4.255 -3.363 4.217 -4.025 -2.767 -3.555 -1.435 -2.573 -3.805 -3.363 -3.579 3.476 -1.137 -3.874 -3.535 -3.304 -2.080 -2.096 -2.825 -3.544 -3.127 -2.263 -3.592 -4.599 -3.533 -3.494 -1.160 -1.173 -0.795 -3.504 -3.301 -0.693 -3.833 -3.137 -3.879 -4.963 3.663 -3.647 -3.381 -3.716 -5.287 -4.212 -2.849 -4.518 -4.155 -3.653 -1.554 -4.108 -4.474 -4.196 -4.466 letter-probability matrix: alength= 20 w= 9 n= 9732 0.011079 0.032027 0.001405 0.002686 0.038075 0.003217 0.001965 0.165996 0.003148 0.322393 0.037510 0.002644 0.005741 0.002833 0.003100 0.006206 0.008996 0.345603 0.001382 0.003994 0.006738 0.001268 0.841019 0.027063 0.002026 0.013178 0.012107 0.003008 0.003632 0.003860 0.001565 0.039558 0.001972 0.003488 0.003392 0.007254 0.004938 0.004001 0.017693 0.002240 0.124613 
0.003583 0.001915 0.003418 0.008072 0.089979 0.002520 0.190298 0.004000 0.112023 0.043486 0.005206 0.002703 0.003625 0.005633 0.008085 0.039660 0.344486 0.001622 0.005070 0.007012 0.003002 0.001544 0.003098 0.026859 0.002037 0.001765 0.053207 0.003416 0.756862 0.049680 0.002194 0.002597 0.003906 0.004018 0.003928 0.021704 0.048873 0.001261 0.003037 0.028279 0.004887 0.003660 0.005242 0.018923 0.017879 0.016206 0.156603 0.004947 0.019489 0.006463 0.003543 0.004148 0.003514 0.005397 0.010004 0.015927 0.671293 0.001056 0.002539 0.019897 0.009211 0.011025 0.003422 0.002878 0.005862 0.012093 0.005691 0.006199 0.013569 0.002282 0.842856 0.010465 0.006059 0.004511 0.018762 0.009126 0.004671 0.001318 0.010104 0.013281 0.009641 0.010872 0.003450 0.002829 0.013203 0.011474 0.005567 0.006102 0.004802 0.002241 0.857099 0.003112 0.005987 0.004414 0.027304 0.009986 0.004603 0.001296 0.002735 0.813839 0.008259 0.003529 0.005378 0.004079 0.016397 0.005247 0.007944 0.005014 0.010498 0.004806 0.003821 0.002090 0.003521 0.004605 0.033034 0.026342 0.037103 0.001176 0.003317 0.045249 0.001274 0.005880 0.004238 0.001291 0.878126 0.001790 0.005402 0.004451 0.002349 0.001245 0.006394 0.002211 0.002287 0.004126 0.025137 0.003445 0.002896 0.000728 0.001479 Time 265.17 secs. 
******************************************************************************** MOTIF 2 width = 10 sites = 30.5 ******************************************************************************** Simplified A ::::531::: motif letter- C ::::1::::: probability D :::::1:::: matrix E :::::::::: F :::::::::: G :::93219:9 H :::::::::: I 25::::::6: K ::::::2::: L 51::::::2: M :::::::::: N :::::1:::: P :::::::::: Q ::::::1::: R ::::::1::: S ::1:133::: T ::8::::::: V 14::::::1: W :::::::::: Y :::::::::: bits 6.2 5.6 5.0 4.4 Information 3.7 content 3.1 * * (22.0 bits) 2.5 ** * * 1.9 **** *** 1.2 ****** *** 0.6 ********** 0.0 ---------- Multilevel LITGAASGIG consensus V GS sequence G -------------------------------------------------------------------------------- Motif 2 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 2 width=10 seqs=33 2BHD_STREX ( 10) IITGGARGLG 0.99996 3BHD_COMTE ( 10) LVTGGASGVG 0.999917 AP27_MOUSE ( 11) LVTGAGKGIG 0.999999 BA72_EUBSP ( 10) IITGGTRGIG 0.999975 BDH_HUMAN ( 59) LVTGCDSGFG 0.99945 BPHB_PSEPS ( 9) LITGGASGLG 0.999985 BUDC_KLETE ( 6) LVTGAGQGIG 0.999998 DHES_HUMAN ( 6) LITGCSSGIG 0.999966 DHGB_BACME ( 11) VITGSSTGLG 0.999966 DHII_HUMAN ( 38) IVTGASKGIG 0.999824 DHMA_FLAS1 ( 18) IVTGAAGGIG 0.999851 ENTA_ECOLI ( 9) WVTGAGKGIG 0.999983 FIXR_BRAJA ( 40) LLTGASRGIG 0.999995 GUTD_ECOLI ( 6) VVIGGGQTLG 0.736715 HDE_CANTR ( 326) LITGAGAGLG 0.754927 HDHA_ECOLI ( 15) IITGAGAGIG 0.999995 LIGD_PSEPA ( 10) FITGGASGAG 0.999613 NODG_RHIME ( 10) LVTGASGAIG 0.999815 RIDH_KLEAE ( 18) AITGAASGIG 0.999989 YINL_LISMO ( 9) IITGASSGIG 0.999992 YRTP_BACSU ( 10) LITGGGRGIG 0.999998 DHB2_HUMAN ( 86) LVTGGDCGLG 0.998512 DHB3_HUMAN ( 52) VITGAGDGIG 0.999985 DHCA_HUMAN ( 8) LVTGGNKGIG 0.999996 FABI_ECOLI ( 10) LVTGVASKLS 0.759929 FVT1_HUMAN ( 36) VVTGGSSGIG 0.999992 HMTR_LEIMA ( 10) LVTGAAKRLG 0.998781 MAS1_AGRRA ( 249) LVSGSNRGVG 0.999622 PCR_PEA ( 90) VITGASSGLG 0.999949 RFBB_NEIGO ( 10) LVTGGAGFIG 0.997348 // 
----------------------------------------------------------------------- Possible examples of motif 2 in the training set ----------------------------------------------------------------------- Sequence name Start Score Site ------------- ----- ----- ---------- 2BHD_STREX 10 26.31 MNDLSGKTV IITGGARGLG AEAARQAVAA 3BHD_COMTE 10 26.80 TNRLQGKVA LVTGGASGVG LEVVKLLLGE AP27_MOUSE 11 29.42 MKLNFSGLRA LVTGAGKGIG RDTVKALHAS BA72_EUBSP 10 26.05 MNLVQDKVT IITGGTRGIG FAAAKIFIDN BDH_HUMAN 59 23.21 AAEPVGSKAV LVTGCDSGFG FSLAKHLHSK BPHB_PSEPS 9 27.43 MKLKGEAV LITGGASGLG RALVDRFVAE BUDC_KLETE 6 28.63 MQKVA LVTGAGQGIG KAIALRLVKD DHES_HUMAN 6 29.17 ARTVV LITGCSSGIG LHLAVRLASD DHES_HUMAN 138 9.18 DMKRRGSGRV LVTGSVGGLM GLPFNDVYCA DHGB_BACME 11 21.20 MYKDLEGKVV VITGSSTGLG KSMAIRFATE DHII_HUMAN 38 28.72 RPEMLQGKKV IVTGASKGIG REMAYHLAKM DHMA_FLAS1 18 28.07 RPGRLAGKAA IVTGAAGGIG RATVEAYLRE ENTA_ECOLI 9 27.95 MDFSGKNV WVTGAGKGIG YATALAFVEA ENTA_ECOLI 238 8.83 DLASHITLQD IVVDGGSTLG A FIXR_BRAJA 40 26.01 RVDRGEPKVM LLTGASRGIG HATAKLFSEA GUTD_ECOLI 6 13.88 MNQVA VVIGGGQTLG AFLCHGLAAE HDE_CANTR 12 25.82 SPVDFKDKVV IITGAGGGLG KYYSLEFAKL HDE_CANTR 326 25.94 PTVSLKDKVV LITGAGAGLG KEYAKWFAKY HDHA_ECOLI 15 27.64 DNLRLDGKCA IITGAGAGIG KEIAITFATA LIGD_PSEPA 10 22.04 MKDFQDQVA FITGGASGAG FGQAKVFGQA NODG_RHIME 10 23.61 MFELTGRKA LVTGASGAIG GAIARVLHAQ RIDH_KLEAE 18 26.93 MNTSLSGKVA AITGAASGIG LECARTLLGA YINL_LISMO 9 29.53 MTIKNKVI IITGASSGIG KATALLLAEK YRTP_BACSU 10 29.26 MQSLQHKTA LITGGGRGIG RATALALAKE DHB2_HUMAN 86 24.30 ELLPVDQKAV LVTGGDCGLG HALCKYLDEL DHB3_HUMAN 52 26.58 SFLRSMGQWA VITGAGDGIG KAYSFELAKR DHCA_HUMAN 8 27.72 SSGIHVA LVTGGNKGIG LAIVRDLCRL FABI_ECOLI 10 13.09 MGFLSGKRI LVTGVASKLS IAYGIAQAMH FVT1_HUMAN 36 28.02 KPLALPGAHV VVTGGSSGIG KCIAIECYKQ HMTR_LEIMA 10 22.19 MTAPTVPVA LVTGAAKRLG RSIAEGLHAE MAS1_AGRRA 249 18.64 TVEIHQSPVI LVSGSNRGVG KAIAEDLIAH PCR_PEA 90 26.50 GKKTLRKGNV VITGASSGLG LATAKALAES RFBB_NEIGO 10 23.35 MQTEGKKNI LVTGGAGFIG SAVVRHIIQN YURA_MYXXA 120 10.57 WERVRGIIDT NVTGAAATLS 
AVLPQMVERK ----------------------------------------------------------------------- log-odds matrix: alength= 20 w= 10 n= 9699 bayes= 8.30776 -1.008 -2.463 -4.948 -4.279 -0.280 -4.299 -3.260 1.751 -3.948 2.460 -1.078 -2.307 -4.250 -3.462 -3.724 -3.436 -2.642 1.123 0.982 -2.638 -2.911 -1.681 -4.766 -4.539 -1.308 -4.618 -3.763 3.003 -4.201 -0.819 -1.429 -4.036 -4.654 -4.055 -3.506 -3.777 -2.858 2.687 -3.678 -3.293 -3.213 -2.941 -4.192 -4.674 -4.414 -4.892 -3.848 -1.339 -3.826 -4.495 -2.867 -2.655 -4.512 -3.502 -3.844 -0.326 3.837 -1.800 -4.391 -4.716 -1.898 -3.885 -2.774 -3.888 -4.951 3.732 -3.617 -4.986 -3.717 -4.780 -4.221 -2.834 -4.535 -4.147 -3.656 -3.103 -4.125 -4.483 -4.201 -4.519 2.760 1.684 -5.396 -5.187 -4.828 2.191 -4.891 -4.527 -5.197 -3.839 -3.796 -4.746 -5.010 -4.115 -4.970 -0.091 -3.246 -1.187 -5.083 -5.218 1.936 -4.179 0.423 -3.099 -5.266 1.772 -2.953 -5.302 -3.167 -2.449 -4.458 0.636 -4.500 -3.090 -3.694 1.777 -0.729 -4.470 -5.414 -4.517 0.001 0.586 -0.424 -1.460 -3.647 0.587 -1.680 -3.275 1.393 -3.382 -2.476 -1.801 -3.191 0.598 1.480 1.889 -0.564 -3.079 -3.715 -2.981 -1.382 -3.889 -3.146 -3.887 -1.237 3.625 -3.647 -4.963 -1.820 -5.395 -4.202 -2.726 -4.533 -4.159 -1.369 -3.105 -1.636 -4.443 -4.195 -4.475 -1.611 -3.245 -4.585 -4.577 -0.846 -4.873 -4.256 3.440 -4.171 1.037 -1.127 -4.146 -4.921 -4.205 -4.399 -4.000 -3.097 0.720 -3.893 -3.429 -2.576 -3.891 -3.117 -3.893 -4.957 3.717 -3.651 -4.954 -3.388 -5.359 -4.222 -2.820 -4.533 -4.121 -3.647 -1.482 -3.919 -4.456 -4.201 -4.519 letter-probability matrix: alength= 20 w= 10 n= 9699 0.036389 0.003295 0.001676 0.003212 0.033171 0.003523 0.002341 0.189407 0.003791 0.504598 0.010928 0.009309 0.002663 0.003699 0.003926 0.006819 0.009516 0.140139 0.026347 0.005249 0.009729 0.005666 0.001901 0.002682 0.016273 0.002824 0.001652 0.451240 0.003181 0.052001 0.008567 0.002809 0.002013 0.002452 0.004569 0.005382 0.008192 0.414491 0.001043 0.003334 0.007892 0.002366 0.002831 0.002442 0.001890 0.002335 0.001557 0.022245 
0.004126 0.004066 0.003161 0.007316 0.002222 0.003597 0.003613 0.058868 0.849116 0.018478 0.000636 0.001244 0.019624 0.001229 0.007565 0.004210 0.001302 0.921355 0.001827 0.001776 0.004447 0.003338 0.001237 0.006464 0.002186 0.002300 0.004117 0.008589 0.003406 0.002878 0.000725 0.001425 0.495450 0.058361 0.001229 0.001711 0.001419 0.316587 0.000756 0.002440 0.001595 0.006411 0.001660 0.001717 0.001573 0.002352 0.001656 0.069282 0.006261 0.028268 0.000394 0.000878 0.279939 0.001003 0.069376 0.007277 0.001047 0.236695 0.002896 0.001427 0.006514 0.016793 0.001049 0.071599 0.002240 0.004785 0.004011 0.252872 0.035832 0.002904 0.000313 0.001427 0.073198 0.027268 0.038552 0.022664 0.003216 0.104109 0.006997 0.005814 0.153611 0.008795 0.004146 0.013225 0.005550 0.061686 0.144772 0.273439 0.040182 0.007618 0.001016 0.004140 0.028072 0.001226 0.005845 0.004213 0.017086 0.855575 0.001790 0.001805 0.016568 0.002179 0.001253 0.006965 0.002190 0.002281 0.020093 0.008578 0.019121 0.002960 0.000729 0.001470 0.023945 0.001916 0.002156 0.002612 0.022403 0.002366 0.001174 0.610941 0.003247 0.188206 0.010562 0.002604 0.001673 0.002210 0.002459 0.004612 0.006945 0.106035 0.000898 0.003035 0.012267 0.001224 0.005964 0.004196 0.001297 0.911756 0.001786 0.001816 0.005588 0.002235 0.001236 0.006525 0.002189 0.002343 0.004144 0.026423 0.003928 0.002933 0.000725 0.001425 Time 506.83 secs. Stopped because nmotifs = 2 reached. CPU: ghidorah ******************************************************************************** DEBUG INFORMATION ******************************************************************************** This information can also be useful in the event you wish to report a problem with the MEME software. 
model:   mod= zoops    nmotifs= 2    chi= 1
width:   minw= 8    maxw= 57    shorten= yes
lambda:  minsites= 0    maxsites= 33
theta:   prob= 1    spmap= pam    spfuzz= 120
em:      prior= megap    b= 99960    maxiter= 50    distance= 0.001
data:    n= 9996    N= 33
strands: w53
sample:  seed= 0    seqfrac= 1
LRT:     adj= root
Dirichlet mixture priors file: prior30.plib
Letter frequencies:
A 0.111 C 0.012 D 0.050 E 0.055 F 0.036 G 0.090 H 0.018 I 0.057 K 0.052
L 0.092 M 0.027 N 0.041 P 0.041 Q 0.029 R 0.049 S 0.064 T 0.057 V 0.083
W 0.010 Y 0.027
Non-redundant database letter frequencies:
A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058
L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064
W 0.013 Y 0.033
Effective length of alphabet = 20
Entropy of dataset (bits) = -4.11

meme adh.s -mod zoops -protein -nostatus -nmotifs 2 -gcg
********************************************************************************
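
The calculations described under EXPLANATION OF RESULTS can be checked by
hand against the motif 1 output above.  The following is a minimal Python
sketch, not part of MEME: the function names (log_odds_row, consensus_column,
min_useful_ic) are hypothetical, and the background frequencies are the
rounded non-redundant database values printed in the debug section, so the
recomputed scores should agree with the printed log-odds row only to within
a few hundredths of a bit.

```python
import math

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"

# Non-redundant database letter frequencies, in ALPHABET order, copied from
# the debug section (printed there rounded to three decimals).
NR_FREQ = [0.073, 0.018, 0.052, 0.062, 0.040, 0.069, 0.022, 0.056, 0.058,
           0.092, 0.023, 0.046, 0.051, 0.041, 0.052, 0.074, 0.059, 0.064,
           0.013, 0.033]

# First row (motif position 1) of the motif 1 letter-probability matrix.
MOTIF1_POS1 = [0.011079, 0.032027, 0.001405, 0.002686, 0.038075, 0.003217,
               0.001965, 0.165996, 0.003148, 0.322393, 0.037510, 0.002644,
               0.005741, 0.002833, 0.003100, 0.006206, 0.008996, 0.345603,
               0.001382, 0.003994]

# First row of the motif 1 log-odds matrix as printed in this file.
MOTIF1_POS1_LOGODDS = [-2.723, 0.818, -5.202, -4.536, -0.081, -4.430, -3.513,
                       1.560, -4.216, 1.814, 0.701, -4.123, -3.142, -3.846,
                       -4.065, -3.572, -2.723, 2.425, -3.272, -3.032]


def log_odds_row(probs, background):
    """Return log2(p/f) for each letter at one motif position."""
    return [math.log2(p / f) for p, f in zip(probs, background)]


def consensus_column(probs, alphabet=ALPHABET, cutoff=0.2):
    """Return letters with probability >= cutoff, most probable first."""
    kept = [(p, letter) for p, letter in zip(probs, alphabet) if p >= cutoff]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return "".join(letter for _, letter in kept)


def min_useful_ic(num_sequences):
    """Rule-of-thumb minimum information content, log_2(N) bits."""
    return math.log2(num_sequences)


if __name__ == "__main__":
    recomputed = log_odds_row(MOTIF1_POS1, NR_FREQ)
    for letter, ours, printed in zip(ALPHABET, recomputed, MOTIF1_POS1_LOGODDS):
        print(f"{letter}  recomputed {ours:7.3f}  printed {printed:7.3f}")
    # Column 1 of the motif 1 multilevel consensus, top letter first.
    print("consensus column 1:", consensus_column(MOTIF1_POS1))
    print("min IC for 100,000 sequences:", round(min_useful_ic(100000), 1))
```

Any remaining difference between the recomputed and printed scores is
consistent with the background frequencies being shown here to only three
decimal places; the sketch makes no claim about how MEME computes them
internally.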