SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000021932 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000021932
Domain Number 1 Region: 84-258
Classification Level  Classification                                   E-value
Superfamily           Concanavalin A-like lectins/glucanases           1.3e-18
Family                Clostridium neurotoxins, the second last domain  0.054
 
Domain Number 2 Region: 413-642
Classification Level  Classification                                  E-value
Superfamily           Metalloproteases ("zincins"), catalytic domain  0.0000000000000506
Family                Reprolysin-like                                 0.063
 
Domain Number 3 Region: 1396-1457
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.000000000542
Family                Complement control module/SCR domain  0.0044
 
Domain Number 4 Region: 1274-1340
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.00000000834
Family                Complement control module/SCR domain  0.0033
 
Domain Number 5 Region: 1335-1404
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.0000000114
Family                Complement control module/SCR domain  0.0029
 
Domain Number 6 Region: 651-691,869-924
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  0.000000717
Family                Fibronectin type III  0.0054
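
The region strings in the strong hits can be parsed mechanically, which is useful because Domain 6 is discontinuous (two segments) and the domains are not listed in sequence order. The following is a minimal sketch, not part of the SUPERFAMILY tooling: the function names `parse_region` and `region_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
def parse_region(region):
    """Split a region string like '651-691,869-924' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments):
    """Count residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# Strong hits transcribed from the listing: domain number -> region string.
STRONG_HITS = {
    1: "84-258",
    2: "413-642",
    3: "1396-1457",
    4: "1274-1340",
    5: "1335-1404",
    6: "651-691,869-924",
}

# Reorder the domains by the start of their first segment.
by_position = sorted(STRONG_HITS, key=lambda n: parse_region(STRONG_HITS[n])[0][0])

for number in by_position:
    segments = parse_region(STRONG_HITS[number])
    print(number, segments, region_length(segments))
```

Sorted this way, the domains run 1, 2, 6, 4, 5, 3 along the protein, whereas the listing itself appears to be ordered by superfamily E-value.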
 
Weak hits

Sequence:  ENSGACP00000021932
Domain Number - Region: 1218-1277
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.011
Family                Complement control module/SCR domain  0.0092
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000021932   Gene: ENSGACG00000016615   Transcript: ENSGACT00000021973
Sequence length 1613
Comment pep:known_by_projection group:BROADS1:groupXIV:5090581:5156041:-1 gene:ENSGACG00000016615 transcript:ENSGACT00000021973 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MKLWTSPWVSCLVILLLCFGSECGTVRRKGRSKRELVRIREAKATIPGACATRLPRGKRS
LPGLERRVLRQRRRSSLAEENSPDRGKAVYFTGRGDQLRLKPGVEIPRGNFTLEMWIKPE
GGQRSPTVIAGLYDKCFYASSDRGWAAWHQALVNMGTVTLDFFFSLKTDRAAQTTPTLAI
YPTKWAHVAVTYDGVYMKLFSTAPKWCQSGAIGHLTKKCKVLMIGGNALNHNYRGEVERV
CLWRQARGQRQIVRHMQGHEDTQDLPQLVIRETFEYPGRKWLTVKDGSFPKPDLGGGGRL
LDTTLDPPTCGQTVCDNVEVIKNYNHLWTFRRPKKVRYRVINVWDDARMKPTVSDHQISL
QHQQLNDAFSPYNITWELSIYNVTNSSLRNRLILANCDISKVGDDVCDPECNHPLTGFDA
GDCMSGFRSRCPEHKQGNGVCDPECNWENFFYDLGDCCNPNVTDVTKTCFNRSSPHKAYL
DVKELKEILRLNGSTHLNVFFANSSDEDLAGVATWPWDKEALTHLGGIVLNPSFYGTFGH
TDTMVHEIGHSLGLYHVFRGISEIESCNDACLETDPSMETGDLCADTNPTPKYKGCHDPE
PGNETCGCRHFTHTPFNNYMSYADDACTDSFTLNQVARMHCYLDLIYQTWQPASKPPPVP
MSPQVVEQHHNSITLEWFPPISGHFYEREVGSVCDKCTEGGVLLQYASNSSSPRPCAPSG
HWSPREAEGPPDVEQPCEPSVRTWSPNAGIEQGVVGLSECPLQGCMLQLEFTYPLVPDSL
TVWVTFFSPEETALPAIHNILLLTVGGGTISLGPSNVFCDTPLTLKLDTKEEVYGVQFFT
MEQHLEIDATLLASKPGNKLCKDCQPLRYRLLRQPHFTHAQYGLMLNEPTRRYTDRDVAP
HVVYTYQVQTISARSESEPSPPLIHELGAPYCGDGRIQSSKGEECDDMNSMNGDGCSSQC
KKEAFFNCVEEPSLCYYYDGDGVCEDFEQETGVRDCGLYTPNGFLDQWASAVEVSHEEKL
YCSGDVTAGYPAVTKTCQSKVFDLSDGVSQYAWFPCREAHKSTWGYPNYWLKAHFSHPMV
AAAVIIHLAADGTGYLDQTQCNITVQLVDTKEGIHSLGEWRLSCRTNPLVIPVSHDLSVA
FYHTKAILVMFASRLVAISGVGLRSFQSFDPITISGCQSNEIYNPTGQSCVYYSREGIHS
HKLLSRMALLSALALARHFYKGDRCSLPCKKKQGLARQASREDDIMNIQSATAVTITCAN
GKWNKQVSCEPVDCGLPDKYHVHPALFDFPEGTTYGKKSTFQCREPAQLVGTNNSLTCLE
DGLWSFPEALCELRCPAPPVVPNAVLQTKRCNDTGLKVGTLCKYNCRPGYHVSNKPKRRA
FKRQCTEDGSWLEGACEPVTCDSPPPIFHGSYRCTDGFRYDSVCKLNCSDAAESQGAGSN
AIRCKKDGTWTGSFRLCPHSKGQCSLPQNLHHSLQYSCKQGHGIEECELTCREGNNDVVI
LPSNMTLENVLIEHWRNPHRVKSIVCTMGLKWYPHPELLHCIKGCEPFMGDNYCDSVNNR
AFCNYDGGDCCQSTVKTTKVIPFPMSCDIRDECSCRDLNAFENRKDEHLHSLG
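
To pull the residues of an assigned domain out of a sequence like the one above, the region coordinates can be applied as slices. This is a rough sketch under the assumption that regions are 1-based and inclusive; `extract_region` is a hypothetical helper, and the region used in the demonstration is made up for illustration and is not one of the assignments on this page.

```python
def extract_region(sequence, region):
    """Return the residues covered by a region string such as '5-10,21-25'.

    Assumes 1-based inclusive coordinates; segments are concatenated in order.
    """
    pieces = []
    for part in region.split(","):
        start, end = (int(x) for x in part.strip().split("-"))
        if not 1 <= start <= end <= len(sequence):
            raise ValueError(f"segment {part!r} is outside the sequence")
        # Convert the 1-based inclusive range to a 0-based half-open slice.
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


# First 60 residues of the sequence above; the region is hypothetical.
first_line = "MKLWTSPWVSCLVILLLCFGSECGTVRRKGRSKRELVRIREAKATIPGACATRLPRGKRS"
print(extract_region(first_line, "5-10,21-25"))
```

For a real domain, the full 1613-residue sequence would be joined into one string (line breaks removed) before slicing, so that the coordinates line up.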
Identical sequences G3PWE3
ENSGACP00000021932 ENSGACP00000021932 69293.ENSGACP00000021932
