ID short_identifier; BLOCK AC block_number; distance from previous block = (min,max) DE description BL xxx motif; width=w; seqs=s; 99.5%=n1; strength=n2 sequence_id (offset) sequence_segment sequence_weight . . . //
AC line contains the block number, a seven-character group number for sequences from which the block was made, followed by a letter (A-Z) indicating the order of the block in the sequences. If the group has only one block, the letter is omitted. If the block was made from InterPro group IPRnnnnnn, the block number is IPBnnnnnna. If the block was converted from Terri Attwood's Prints Database the block number is PRnnnnna. min,max = minimum,maximum number of amino acids from previous block for sequences in this block. For the first block in the group, the distance from the beginning of the sequences.
DE line contains a description of the group of sequences from which the block was made. If the block was taken from InterPro, it will be a slightly edited version of the InterPro description.
BL line contains information about the block:
xxx = the amino acids in the spaced triplet found by MOTIF upon
which the block is based.
w = width of the sequence segments (columns) in the block.
s = number of sequence segments (rows) in the block.
n1 = raw calibration score; 99.5th percentile score of true
negative sequences. Raw search scores are normalized by
dividing by this score and multiplying by 1000.
n2 = median normalized score of known true positive sequences as
documented in InterPro.
Following the BL line are lines for each sequence with a segment in the block. The segments may be clustered with clusters separated by blank lines. Each segment line contains a sequence identifier, the offset from the beginning of the sequence to the block in parentheses, the sequence segment, and a weight for the segment. The weights are normalized so that the most distant segment has a weight of 100.
// line terminates a block entry.
Current Blocks Database Release
>JC2395 NVSDVNLNK---YIWRTAEKMK---ICDAKKFARQHKIPESKIDEIEHNSPQDAAE---- -------------------------QKIQLLQCWYQSHGKT--GACQALIQGLRKANRCD IAEEIQAM >KPEL_DROME MAIRLLPLPVRAQLCAHLDAL-----DVWQQLATAVKLYPDQVEQISSQKQRGRS----- -------------------------ASNEFLNIWGGQYN----HTVQTLFALFKKLKLHN AMRLIKDY >FASA_MOUSE NASNLSLSK---YIPRIAEDMT---IQEAKKFARENNIKEGKIDEIMHDSIQDTAE---- -------------------------QKVQLLLCWYQSHGKS--DAYQDLIKGLKKAECRR TLDKFQDM
CLUSTAL W(1.60) multiple sequence alignment JC2395 NVSDVNLNK---YIWRTAEKMK---ICDAKKFARQHKIPESKIDEIEHNSPQDAAE---- KPEL_DROME MAIRLLPLPVRAQLCAHLDAL-----DVWQQLATAVKLYPDQVEQISSQKQRGRS----- FASA_MOUSE NASNLSLSK---YIPRIAEDMT---IQEAKKFARENNIKEGKIDEIMHDSIQDTAE---- JC2395 -------------------------QKIQLLQCWYQSHGKT--GACQALIQGLRKANRCD KPEL_DROME -------------------------ASNEFLNIWGGQYN----HTVQTLFALFKKLKLHN FASA_MOUSE -------------------------QKVQLLLCWYQSHGKS--DAYQDLIKGLKKAECRR JC2395 IAEEIQAM KPEL_DROME AMRLIKDY FASA_MOUSE TLDKFQDM
// 1 50 JC2395 NVSDVNLNK. ..YIWRTAEK MK...ICDAK KFARQHKIPE SKIDEIEHNS KPEL_DROME MAIRLLPLPV RAQLCAHLDA L.....DVWQ QLATAVKLYP DQVEQISSQK FASA_MOUSE NASNLSLSK. ..YIPRIAED MT...IQEAK KFARENNIKE GKIDEIMHDS 51 100 JC2395 PQDAAE.... .......... .......... .....QKIQL LQCWYQSHGK KPEL_DROME QRGRS..... .......... .......... .....ASNEF LNIWGGQYN. FASA_MOUSE IQDTAE.... .......... .......... .....QKVQL LLCWYQSHGK 101 JC2395 T..GACQALI QGLRKANRCD IAEEIQAM KPEL_DROME ...HTVQTLF ALFKKLKLHN AMRLIKDY FASA_MOUSE S..DAYQDLI KGLKKAECRR TLDKFQDM
[Blocks home]