SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000017896 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000017896
Domain Number 1 Region: 6-143
Classification Level  Classification                                   E-value
Superfamily           Concanavalin A-like lectins/glucanases           1.12e-18
Family                Clostridium neurotoxins, the second last domain  0.045
 
Domain Number 2 Region: 235-355,420-487
Classification Level  Classification                                  E-value
Superfamily           Metalloproteases ("zincins"), catalytic domain  0.0000000000000763
Family                Reprolysin-like                                 0.067
 
Domain Number 3 Region: 1280-1343
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.000000000375
Family                Complement control module/SCR domain  0.0055
 
Domain Number 4 Region: 1216-1288
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.00000000389
Family                Complement control module/SCR domain  0.0032
 
Domain Number 5 Region: 1154-1224
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.0000000195
Family                Complement control module/SCR domain  0.004
 
Domain Number 6 Region: 1085-1158
Classification Level  Classification                        E-value
Superfamily           Complement control module/SCR domain  0.00000389
Family                Complement control module/SCR domain  0.0039
 
Weak hits

Sequence:  ENSGGOP00000017896
Domain Number - Region: 541-577,756-810
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  0.00073
Family                Fibronectin type III  0.0084
 
Domain Number - Region: 1424-1453
Classification Level  Classification  E-value
Superfamily           Notch domain    0.0432
Family                Notch domain    0.0048
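
The region strings in the assignments above can list more than one segment (for example 235-355,420-487 for Domain Number 2), so a single domain may be split across the sequence. The following is a minimal Python sketch of how such strings could be parsed and measured. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function names parse_region and region_length are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
def parse_region(region):
    """Turn a region string such as '235-355,420-487' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Count residues covered by a region.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return sum(end - start + 1 for start, end in parse_region(region))


# Strong-hit regions copied from the assignment listing, keyed by domain number.
strong_hits = {
    1: "6-143",
    2: "235-355,420-487",
    3: "1280-1343",
    4: "1216-1288",
    5: "1154-1224",
    6: "1085-1158",
}

lengths = {number: region_length(region) for number, region in strong_hits.items()}

for number, region in strong_hits.items():
    print(f"Domain {number}: {region} covers {lengths[number]} residues")
```

Under the inclusive-coordinate assumption, Domain Number 2 covers 121 + 68 = 189 residues across its two segments.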
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000017896   Gene: ENSGGOG00000007107   Transcript: ENSGGOT00000033784
Sequence length 1497
Comment pep:known_by_projection chromosome:gorGor3.1:9:98758213:99007723:1 gene:ENSGGOG00000007107 transcript:ENSGGOT00000033784 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
QRSPAVITGLYDKCSYTSRDRGWVVGIHTISDQDNKDPRYFFSLKTDRARQVTTINAHRS
YLPGQWVYLAATYDGQFMKLYVNGAQVATSGEQVGGIFSPLTQKCKVLMLGGSALNHNYR
GYIEHFSLWKVARTQREILSDMETHGAHTALPQLLLQENWDNVKHAWSPMKDGSSPKVEF
SNAHGFLLDTSLEPPLCGQTLCDNTEVIASYNQLPSFRQPKVVRYRVVNLYEDDHKNPTV
TREQVDFQHLQLAEAFKHYNISWELDVLEVSNSSLRRRLILANCDISKIGDENCDPECNH
TLTGHDGGDCRHLRHPAFVKKQQNGVCDMDCNYERFNFDGGECCDPEITNVTQTCFDPDS
PHRAYLDVNELKNILKLDGSTHLNIFFAKSSEEELAGVATWPWDKEALMHLGGIVLNPSF
YGMPGHTHTMIHEIGHSLGLYHVFRGISEIQSCSDPCMETEPSFETGDLCNDTNPAPKHK
SCGDPGPGNDTCGFHSFFNTPYNNFMSYADDDCTDSFTPNQVARMHCYLDLVYQGWQPSR
KPAPVALAPQVVGHTTDSVTLEWFPPIDGDFFERELGSACHLCLEGRILVQYASNASSPM
PCSPSGHWSPREAEGHPDVEQPCKSSVRTWSPNSAVNPHTVPPACPEPQGCYLELEFLYP
LVPESLTIWVTFVSTDWDSSGAVNDIKLLTVSGKNISLGPQNVFCDVPLTIRLWDVGEEV
YGIQIYTLDEHLEIDAAMLTSTADTPLCLQCKPLKYKVVRDPPLQVDVASILHLNRKFVD
MDLNLGSVYQYWVITISGTEESEPSPAVTYIHGSGYCGDGIIQKDQGEQCDDMNKINGDG
CSLFCRQEVSFNCIDEPSRCYFHDGDGVCEEFEQKTSIKDCGVYTPQGFLDQWASNASVS
HQDQQCPGWVIIGQPAASQVCRTKVIDLSEGISQHAWYPCTISYPYSQLAQTTFWLRAYF
SQPMVAAAVIVHLVTDGTYYGDQKQETISVQLLDTKDQSHDLGLHVLSCRNNPLIIPVVH
DLSQPFYHSQAVRVSFSSPLVAISGVALRSFDNFDPVTLSSCQRGETYSPAEQSCVHFAC
EKTDCPELAVENASLNCSSSDRYHGAQCTVSCRTGYVLQIRRDDELIKSQTGPSVTVTCT
EGKWNKQVACEPVDCSIPDHHQVYAASFSCPEGTTFGSQCSFQCRHPAQLKGNNSLLTCM
EDGLWSFPEALCELLCLAPPPVPNADLQTARCRENKHKVGSFCKYKCKPGYHVPGSSRKS
KKRAFKTQCTQDGSWQEGACVPVTCDPPPPKFHGLYQCTNGFQFNSECRIKCEDSDASQG
LGSNVIHCRKDGTWNGSFHVCQEMQGQCSVPNQLNSNLKLQCPDGYAIGSECATSCLDHN
SESIILPMNVTVRDIPHWLNPTRVERVVCTAGLKWYPHPALIHCVKGCEPFMGDNYCDAI
NNRAFCNYDGGDCCTSTVKTKKVTPFPMSCDLQGDCACRDPQAQEHSRKDLRGYSHG
Identical sequences ENSGGOP00000017896
