SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000099923 from Mus musculus 63_37 (longest transcript per gene)

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSMUSP00000099923
Domain Number 1 Region: 84-270
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.4e-28
Family Clostridium neurotoxins, the second last domain 0.031
Further Details:      
 
Domain Number 2 Region: 357-482,575-643
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 0.0000000000000308
Family Reprolysin-like 0.067
Further Details:      
 
Domain Number 3 Region: 517-684
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 0.000000000000567
Family Reprolysin-like 0.076
Further Details:      
 
Domain Number 4 Region: 1435-1499
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.000000000292
Family Complement control module/SCR domain 0.0052
Further Details:      
 
Domain Number 5 Region: 1372-1444
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000000264
Family Complement control module/SCR domain 0.003
Further Details:      
 
Domain Number 6 Region: 1311-1376
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.0000000459
Family Complement control module/SCR domain 0.0051
Further Details:      
 
Domain Number 7 Region: 1241-1314
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000334
Family Complement control module/SCR domain 0.0044
Further Details:      
 
Weak hits

Sequence:  ENSMUSP00000099923
Domain Number - Region: 902-973
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0108
Family Fibronectin type III 0.0087
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMUSP00000099923   Gene: ENSMUSG00000028370   Transcript: ENSMUST00000102859
Sequence length 1653
Comment pep:known chromosome:NCBIM37:4:64785208:65018543:1 gene:ENSMUSG00000028370 transcript:ENSMUST00000102859
Sequence
MRLWSWVLRLGLLSAALGCGLAERPRRVRRDPRAVRPPRPAAGPATCATRAARGRRASPP
PPPGGAWEAVRVPRRRQQRAARGAEEPSPPSRALYFSGRGEQLRLRADLELPRDAFTLQV
WLRAEGGQKSPAVITGLYDKCSYTSRDRGWVMGIHTTSDQGNRDPRYFFSLKTDRARKVT
TIDAHRSYLPGQWVHLAATYDGRLMKLYMNGAQVATSAEQVGGIFSPLTQKCKVLMLGGS
ALNHNFRGHIEHFSLWKVARTQREIVSDMETRGLHTPLPQLLLQENWDNVKRTWSPMKDG
NSPQVEFSNAHGFLLDTNLEPPLCGQTLCDNTEVISSYNQLPSFRQPKVVRYRVVNIYDD
HHENPTVSWQQIDFQHQQLAEAFQHYNISWELEVLNINSSSLRHRLILANCDISKIGDEK
CDPECNHTLTGHDGGDCRQLRYPAFMKKQQNGVCDMDCNYERFNFDGGECCDPDITDVTK
TCFDPDSPHRQSIRKRAHVVEESWLPHGKQKAKKRKRTRAYLDVNELKNILRLDGSTHLN
IFFANSSEEELAGVATWPWDKEALMHLGGIVLNPSFYGIPGHTHTMIHEIGHSLGLYHIF
RGISEIQSCSDPCMETEPSFETGDLCNDTNPAPKHKFCGDPGPGNDTCGFHGFFNTPYNN
FMSYADDDCTDSFTPNQVSRMHCYLDLVYQSWQPSRKPAPVALAPQVVGHTMDSVMLEWF
PPIDGHFFERELGSACDLCLEGRILVQYAFNASSPMPCGPSGHWSPREAEGHPDVEQPCK
SSVRTWSPNSAVNPHTVPPACPEPQGCYLELEFRYPLVPESLTIWVTFVSSDWDSSGAVN
DIKLLTISGKNISLGPQNVFCDIPLTIRLRDVGEEVYGIQIYTLDEHLEIDAAMLTSTVD
SPLCLQCKPLQYKVLRDPPLLEDVASLLHLNRRFMDTDLKLGSVYQYRIITISGNEESEP
SPAAIYTHGSGYCGDGVIQKDQGEECDDMNKVNGDGCSLFCKQEVSFNCIDEPSRCYFHD
GDGMCEEFEQKTSIKDCGVYTPQGFLDQWASNASVSHQDQQCPGWVVIGQPAASQVCRTK
VIDLSEGISQHAWYPCTITYPYYHLPQTTFWLQTYFSQPMVAAAVIIHLVTDGTYYGDQK
QETISVQLLDTKDQSHDLGLHVLSCRNNPLIIPVVHDLSQPFYHSQAVHVSFSSPLVAIS
GVALRSFDNFDPVTLSSCQRGETYSPAEQSCVHFACQAADCPELAVGNASLNCSSNHHYH
GAQCTVSCQTGYVLQIQRDDELIKSQVGPSITVTCTEGKWNKQVACEPVDCGIPDHHHVY
AASFSCPEGTTFGRRCSFQCRHPAQLKGNNSFLTCMEDGLWSFPEALCELMCLAPPPVPN
ADLQTARCRENKHKVGSFCKYKCKPGYHVPGSSRKSKKRAFKTQCTQDGSWQEGTCVPVT
CDPPPPKFHGLYQCTNGFQFNSECRIKCEDSDASQGRGSNIIHCRKDGTWSGSFHVCREM
QGQCSAPNQLNSNLKLQCPDGYAIGSECAISCLDHNSESIILPVNLTVRDIPHWMNPTRV
QRIVCTAGLQWYPHPALIHCVKGCEPFMGDNYCDAINNRAFCNYDGGDCCTSTVKTKKVT
PFPMSCDLQNDCACRDPEAQEHNRKDLRGYSHG
Download sequence
Identical sequences ENSMUSP00000099923 10090.ENSMUSP00000099923

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]