SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000023748 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000023748
Domain Number 1 Region: 84-288
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 5.63e-40
Family Reprolysin-like 0.0055
Further Details:      
 
Domain Number 2 Region: 386-442
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000000196
Family TSP-1 type 1 repeat 0.00036
Further Details:      
 
Domain Number 3 Region: 1070-1134
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000262
Family TSP-1 type 1 repeat 0.0037
Further Details:      
 
Domain Number 4 Region: 741-808
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000798
Family TSP-1 type 1 repeat 0.0059
Further Details:      
 
Domain Number 5 Region: 1022-1076
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000876
Family TSP-1 type 1 repeat 0.0085
Further Details:      
 
Domain Number 6 Region: 1299-1409
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.00000523
Family Spermadhesin, CUB domain 0.015
Further Details:      
 
Domain Number 7 Region: 952-1016
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000115
Family TSP-1 type 1 repeat 0.0043
Further Details:      
 
Domain Number 8 Region: 1193-1278
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.0000327
Family Spermadhesin, CUB domain 0.0096
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000023748
Domain Number - Region: 687-746
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0017
Family TSP-1 type 1 repeat 0.0097
Further Details:      
 
Domain Number - Region: 903-955
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0366
Family TSP-1 type 1 repeat 0.008
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000023748   Gene: ENSGGOG00000012541   Transcript: ENSGGOT00000031480
Sequence length 1431
Comment pep:known_by_projection chromosome:gorGor3.1:9:117040474:117080585:1 gene:ENSGGOG00000012541 transcript:ENSGGOT00000031480 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MHQRHPRARCPPLCVAGFLACGFLLGCWGPSHFQQSCLQALEPQAVSSYLSPGAPLKGRP
PSPGFQRQRQRQRQRQRRAAGGILHLELLVAVGPDVFQAHQEDTERYVLTNLNIGAELLR
DPSLGAQFRVHLVKMVILTEPEGAPNITANLTSSLLSVCGWSRTINPEDDTDPGHADLVL
YITRFDLELPDGNRQVRGVTQLGGACSPTWSCLITEDTGFDLGVTIAHEIGHSFGLEHDG
APGSGCGPSGHVMASDGAAPRAGLAWSPCSRRXXXPCRSAGRARCVWDPPRPQPGSAGHP
PDAQPGLYYSANEQCRVAFGPKAVACTFAREHLDMCQALSCHTDPLDQSSCSRLLVPLLD
GTECGVEKWCSKGRCRSLVELTPIAAVHGRWSSWGPHSPCSRSCGGGVVTRRRQCNNPRP
AFGGRACVGADLQAEMCNTQACEKTQLEFMSEQCARTDGQPLRSSPGGASFYHWGAAVPH
SQGDALCRHMCRAIGESFIMKRGDSFLDGTRCMPSGPREDGTLSLCVSGSCRTFGCDGRM
DSQQVWDRCQVCGGDNSTCSPRKGSFTAGRAREYVTFLTVTPNLTSVYIANHRPLFTHLA
VRIGGRYVVAGKMSISPNTTYPSLLEDGRVEYRVALTEDRLPRLEEIRIWGPLQEDADIQ
VYRRYGEEYGNLTRPDITFTYFQPKPRQAWVWAAVRGPCSVSCGAGLRWVNYSCLDQARK
ELVETVQCQGSQQPPAWPEACVLEPCPPYWAVGDFGPCSASCGGGLRERPVRCVEAQGSL
LKTLPPARCRAGAQQPAVVLETCNPQPCPARWEVSEPSSCTSAGGAGLALENETCVPGAD
GLEAPVTEGPGSVDEKLPAPEPCVGMSCPPGWGHLDATSAGEKAPSPWGSIRTGAQAAHV
WTPAAGSCSVSCGRGLMELRFLCMDSALRVPVQEELCGLASKPGSRQEVCQAVPCPARWR
YKLAACSVSCGGGVVRRILYCARAHGEDDGEEILLDTQCQGLPRPEPQEACSLEPCPPRW
KVMSLGPCSASCGLGTARRSVACVQLDQGQDVEVDEAACVALVRPQASVPCLIADCTYRW
HVGTWMECSVSCGDGIQRRRDTCLGPQAQAPVPADFCQHLPKPVTVRGCWAGPCVGQGMP
SLVPHEEAAAPGRTTATPAGASLEWSQAQALLFSPAPQPQRLLPGPQENSAQSSACGRQH
LEPTGTIDMRGPGQADCAVAIGRPLGEVVTLRVLESSLNCSAGDMLLLWGRLTWRKMCRK
LLDMTFSSKTNTLVVRQRCGRPGGGVLLRYGSQLAPETFYRECDMQLFGPWGEIVSPSLS
PATSNAGGCRLFINVAPHARIAIHALATNMGAGTEGANASYISIRDTHSLRTTAFHGQQV
LYWESESSQAEMEFSEGFLKAQASLRGQYWTLQSWVPEVQDPQSWKGKEGT
Download sequence
Identical sequences ENSGGOP00000023748 ENSGGOP00000012247

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]