SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000006962 from Gorilla gorilla 69_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000006962
Domain Number 1 Region: 19-159
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.5e-19
Family Clostridium neurotoxins, the second last domain 0.045
Further Details:      
 
Domain Number 2 Region: 251-371,436-503
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 0.0000000000000763
Family Reprolysin-like 0.067
Further Details:      
 
Domain Number 3 Region: 1296-1359
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.000000000389
Family Complement control module/SCR domain 0.0055
Further Details:      
 
Domain Number 4 Region: 1232-1304
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000000403
Family Complement control module/SCR domain 0.0032
Further Details:      
 
Domain Number 5 Region: 1170-1240
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.0000000195
Family Complement control module/SCR domain 0.004
Further Details:      
 
Domain Number 6 Region: 1101-1174
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000389
Family Complement control module/SCR domain 0.0039
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000006962
Domain Number - Region: 557-593,772-826
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000738
Family Fibronectin type III 0.0084
Further Details:      
 
Domain Number - Region: 1440-1469
Classification Level Classification E-value
Superfamily Notch domain 0.0432
Family Notch domain 0.0048
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000006962   Gene: ENSGGOG00000007107   Transcript: ENSGGOT00000007148
Sequence length 1513
Comment pep:novel chromosome:gorGor3.1:9:98757824:99005863:1 gene:ENSGGOG00000007107 transcript:ENSGGOT00000007148 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MRLWSWVLHLGLLSAAQRSPAVITGLYDKCSYTSRDRGWVVGIHTISDQDNKDPRYFFSL
KTDRARQVTTINAHRSYLPGQWVYLAATYDGQFMKLYVNGAQVATSGEQVGGIFSPLTQK
CKVLMLGGSALNHNYRGYIEHFSLWKVARTQREILSDMETHGAHTALPQLLLQENWDNVK
HAWSPMKDGSSPKVEFSNAHGFLLDTSLEPPLCGQTLCDNTEVIASYNQLPSFRQPKVVR
YRVVNLYEDDHKNPTVTREQVDFQHLQLAEAFKHYNISWELDVLEVSNSSLRRRLILANC
DISKIGDENCDPECNHTLTGHDGGDCRHLRHPAFVKKQQNGVCDMDCNYERFNFDGGECC
DPEITNVTQTCFDPDSPHRAYLDVNELKNILKLDGSTHLNIFFAKSSEEELAGVATWPWD
KEALMHLGGIVLNPSFYGMPGHTHTMIHEIGHSLGLYHVFRGISEIQSCSDPCMETEPSF
ETGDLCNDTNPAPKHKSCGDPGPGNDTCGFHSFFNTPYNNFMSYADDDCTDSFTPNQVAR
MHCYLDLVYQGWQPSRKPAPVALAPQVVGHTTDSVTLEWFPPIDGDFFERELGSACHLCL
EGRILVQYASNASSPMPCSPSGHWSPREAEGHPDVEQPCKSSVRTWSPNSAVNPHTVPPA
CPEPQGCYLELEFLYPLVPESLTIWVTFVSTDWDSSGAVNDIKLLTVSGKNISLGPQNVF
CDVPLTIRLWDVGEEVYGIQIYTLDEHLEIDAAMLTSTADTPLCLQCKPLKYKVVRDPPL
QVDVASILHLNRKFVDMDLNLGSVYQYWVITISGTEESEPSPAVTYIHGSGYCGDGIIQK
DQGEQCDDMNKINGDGCSLFCRQEVSFNCIDEPSRCYFHDGDGVCEEFEQKTSIKDCGVY
TPQGFLDQWASNASVSHQDQQCPGWVIIGQPAASQVCRTKVIDLSEGISQHAWYPCTISY
PYSQLAQTTFWLRAYFSQPMVAAAVIVHLVTDGTYYGDQKQETISVQLLDTKDQSHDLGL
HVLSCRNNPLIIPVVHDLSQPFYHSQAVRVSFSSPLVAISGVALRSFDNFDPVTLSSCQR
GETYSPAEQSCVHFACEKTDCPELAVENASLNCSSSDRYHGAQCTVSCRTGYVLQIRRDD
ELIKSQTGPSVTVTCTEGKWNKQVACEPVDCSIPDHHQVYAASFSCPEGTTFGSQCSFQC
RHPAQLKGNNSLLTCMEDGLWSFPEALCELLCLAPPPVPNADLQTARCRENKHKVGSFCK
YKCKPGYHVPGSSRKSKKRAFKTQCTQDGSWQEGACVPVTCDPPPPKFHGLYQCTNGFQF
NSECRIKCEDSDASQGLGSNVIHCRKDGTWNGSFHVCQEMQGQCSVPNQLNSNLKLQCPD
GYAIGSECATSCLDHNSESIILPMNVTVRDIPHWLNPTRVERVVCTAGLKWYPHPALIHC
VKGCEPFMGDNYCDAINNRAFCNYDGGDCCTSTVKTKKVTPFPMSCDLQGDCACRDPQAQ
EHSRKDLRGYSHG
Download sequence
Identical sequences G3QW20
ENSGGOP00000006962 ENSGGOP00000006962

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]