SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000018900 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000018900
Domain Number 1 Region: 6-183
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.26e-25
Family Clostridium neurotoxins, the second last domain 0.046
Further Details:      
 
Domain Number 2 Region: 370-507,540-559
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 0.000000000000227
Family Astacin 0.072
Further Details:      
 
Domain Number 3 Region: 1256-1329
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.0000000195
Family Complement control module/SCR domain 0.0039
Further Details:      
 
Domain Number 4 Region: 1194-1259
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.0000000514
Family Complement control module/SCR domain 0.0037
Further Details:      
 
Domain Number 5 Region: 1320-1378
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000032
Family Complement control module/SCR domain 0.0038
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000018900
Domain Number - Region: 1129-1198
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.0221
Family Complement control module/SCR domain 0.0079
Further Details:      
 
Domain Number - Region: 789-859
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0609
Family Fibronectin type III 0.012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000018900   Gene: ENSGACG00000014317   Transcript: ENSGACT00000018938
Sequence length 1532
Comment pep:known_by_projection group:BROADS1:groupIII:3816709:3866221:-1 gene:ENSGACG00000014317 transcript:ENSGACT00000018938 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
PPSWMTALYFGGRREQLKVTPAAGLELPRDKFSLELWVKPEGGQSNPAVIAGVFDNCSHS
LSEKGWSVGIRTVEPASAKDARFYFTLRTDRAVKATTVYSHQRYRANAWTHLMATYNGHN
MRLYVDSAQVGESSFQAGNLYSPFMKACRSLFLGSNQSDEGHSFRGLIGGVVLWGYARSH
GELLNKPLLNNKNKPLLEMWADFTKVEELWTPYKVGLHPTIITAPLPEEEVVSPFLPPPC
GLTPCDNNEIIFGYNNNFQLRATKRVRYRVVNLSDDDGGNPTVSEDQIELQHRALIEAFQ
PYNVSLDLSVHTARNSSFRQRFILSNCRIGKIGNRQCDPECDHPRTGHDGGDCLRLGPCY
NWKRQDGVCDMECNSIHYDYDDGDCCDPEVTDVFKTCFDPESPDRAYMSVKELKEELHLS
GSDMLNVFFASNSVREELAGAATWPWAKEALTHQGGMVLNPSYFGTKGHHNTMIHEMGHI
LGLYHVFKGVSERDSCDDPCQETTPSMETGDLCADTAPTTKSKACQDPGAVRDTCGLTTY
NDTPYTNYMSYTDDNCTNHFTPNQVARMHCYLDLVYQNWLSEEQLAPIPLAPIVTDQSPD
SVSIYWLPPMRGAFYQRCNKELGLVCGDCETDGVFHQYASEATSPRICDTSGYWTPEEAV
GPPDVEQPCDPSLQAWSPELTLYDTNVTSPCPDTEGCTLTLKFHHPIVPHTLTLWVTYVS
SNNPALADVELITATGKSINLGPQHIFCDMPFTLRLDTRGLAVAAVKLCTFDEKMEVDAA
MLSSGPSSPLCSRCRPLLYQIQRQPPFAGQTPPPQTRQTFTDSSVRQGVQYQYTIQVEAD
GLLSDPSSPLLYTHGQSYCGDGFIQGAEECDDSNLLDRDGCSKSCQKETDFNCNGEPSRC
YVFDGDGVCEEFERGSSVRDCGYFTPLGYTDQWASAAAASHQDPNRCPAHAVTGEPSLTK
LCRYQHLEVSGKLPTDAWFPCTAQFDTNELEQSFWLKVGFVHPGVAASVIVYLASDGSWS
GEQCRRTATILLSDTTGKNHSLGTHDLSCHQNPLVVNVTHDLSRPFFITASVILLFSSPS
VAVGGVALRTSCHFSTFAQTGCASQGGLSHNYLLNSCLRQPCQLDSCAPLEIAHASVRCT
GGGESSQCLVQCHRGFSVNVLNSKGTRPHQVHGELQLECFHGAWDRLVSCQPVDCGMPDQ
SHVYHAIFSCPGGTTFGKQCSFTCGPPAILQGDSDRLVCLEDGLWDFPEAYCKIECPEVP
NLPDAKLLTADCLASGHDIGSACRYKCNPGFYVVGSLKTKTPRKYFKLECLQEGQWEETT
CEPISCPAPPDVFQGMYTCTNSLYYDTVCTLQCPDATENRVIRCTKEGDWSAEFTMCSTL
QGSCSPPADLNSVEYSCDPRTDVGAFCYPTCIGAADMDLRDPAVLPNATTVDSLKHWMLP
TEVQSIVCTGMMKWYPDPQNIHCIQSCEPFGGDGWCDTINNRAYCQYDGGDCCPSTLSTR
KVIQFGADCNQDECTCRDPDAEENKSRTKDSG
Download sequence
Identical sequences G3PMR7
ENSGACP00000018900 69293.ENSGACP00000018900 ENSGACP00000018900

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]