SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000000602 from Gorilla gorilla 69_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000000602
Domain Number 1 Region: 812-1067
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 3.21e-89
Family Eukaryotic proteases 0.000000036
Further Details:      
 
Domain Number 2 Region: 390-548
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 8.39e-38
Family MAM domain 0.0089
Further Details:      
 
Domain Number 3 Region: 277-381
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 3.93e-18
Family Spermadhesin, CUB domain 0.0022
Further Details:      
 
Domain Number 4 Region: 59-164
Classification Level Classification E-value
Superfamily SEA domain 1.83e-16
Family SEA domain 0.0071
Further Details:      
 
Domain Number 5 Region: 688-726
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000314
Family LDL receptor-like module 0.0014
Further Details:      
 
Domain Number 6 Region: 569-681
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.0000000000393
Family Spermadhesin, CUB domain 0.0084
Further Details:      
 
Domain Number 7 Region: 732-822
Classification Level Classification E-value
Superfamily SRCR-like 0.000000000379
Family Scavenger receptor cysteine-rich (SRCR) domain 0.015
Further Details:      
 
Domain Number 8 Region: 228-268
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000654
Family LDL receptor-like module 0.0018
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000000602   Gene: ENSGGOG00000000610   Transcript: ENSGGOT00000000616
Sequence length 1068
Comment pep:novel chromosome:gorGor3.1:21:6583577:6714272:-1 gene:ENSGGOG00000000610 transcript:ENSGGOT00000000616 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGSKRSVSSRHHSLSSYEIMFAALFAILVVLCAGLIAVSCLTIKESQRGAALGQSHEARG
TFKITSGVTYNPNLQDKLSVDFKVLAFDLQQMIDEIFLSSNLKNEYKNSRVLQFENGSII
VIFDLFFAQWVSDENVKEELIQGLEANKSSQLVTFHIDLNSVDILASLENFSVVSPATTS
ADKLTTTSHLATPGTCGQSRTKVALGSGAPMRSIRTAKFMKVGGNVSVECLPGSSPCTDA
LTCIKADLFCDGEVNCPDGSDEDNKMCATVCDGRFLLTGSSGSFQATHYPKPSETSVVCQ
WIIRVNQGLSIKLSFDDFNTYYTDILDIYEGVGSSKILRASIWETNPGTIRIFSNQVTAT
FLIESDESDYVGFNATYTAFNSSELNNYEKINCNFEDGFCFWVQDLNDDNEWERIQGSTF
SPFTGPNFDHTFGNASGFYISTPTGPGGRQERVGLLSLPLDPTLEPACLSFWYHMYGENV
RKLSINISNDQNMEKTVFQKEGNYGDNWNYGQVTLNETVKFKVAFNAFKNKILSDIALDD
ISLTYGICNGSLYPEPTLVPTPPPELPTDCGGPFELWEPNATFSSMNFPNSYPNLAFLCL
TSLQFHRTKTMRFHFRSLDWERLEGWQELRNLKFGSFILITAVYTGPGPVKDVFSTTNRM
TVLLITNDVLARGGFKANFTTGYHLGIPDPCKEDHFQCKNGECVPLVNLCDGHLHCEDGS
DEADCALVIIGKKDNNGLVQFRIQSIWHTACAENWTTQISNDVCQLLGLGSGNSSMPIFS
TDGGPFVKLNTAPDGHLILTPSQQCLQDSLIRLQCNLKSCGKKLAAQDITPKIVGGSNAK
EGAWPWVVGLYYGGRLLCGASLVSSDWLVSAAHCVYGRNLEPSKWTAILGLHMKSNLTSP
QTVPRLIDEIVINPHYNRRRKDNDIAMMHLEFKVNYTDYIQPICLPEENQVFPPGRNCSI
AGWGTVVYQAGTTANILQEADVPLLSNEKCQQQMPEYNITENMICAGYEEGGIDSCQGDS
GGPLMCQENNRWFLAGVTSFGYKCALPNRPGVYARVSRFTEWIQSFLH
Download sequence
Identical sequences ENSGGOP00000000602 ENSGGOP00000000602

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]