SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000012247 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000012247
Domain Number 1 Region: 71-252
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 7.09e-38
Family Reprolysin-like 0.0055
Further Details:      
 
Domain Number 2 Region: 372-428
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000000196
Family TSP-1 type 1 repeat 0.00036
Further Details:      
 
Domain Number 3 Region: 1059-1123
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000262
Family TSP-1 type 1 repeat 0.0037
Further Details:      
 
Domain Number 4 Region: 727-797
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000249
Family TSP-1 type 1 repeat 0.0063
Further Details:      
 
Domain Number 5 Region: 1011-1065
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000863
Family TSP-1 type 1 repeat 0.0085
Further Details:      
 
Domain Number 6 Region: 1288-1398
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.00000523
Family Spermadhesin, CUB domain 0.015
Further Details:      
 
Domain Number 7 Region: 941-1005
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000115
Family TSP-1 type 1 repeat 0.0043
Further Details:      
 
Domain Number 8 Region: 1182-1267
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.0000327
Family Spermadhesin, CUB domain 0.0096
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000012247
Domain Number - Region: 673-732
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0017
Family TSP-1 type 1 repeat 0.0097
Further Details:      
 
Domain Number - Region: 892-944
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0366
Family TSP-1 type 1 repeat 0.008
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000012247   Gene: ENSGGOG00000012541   Transcript: ENSGGOT00000012601
Sequence length 1420
Comment pep:known_by_projection chromosome:gorGor3.1:9:117040474:117080585:1 gene:ENSGGOG00000012541 transcript:ENSGGOT00000012601 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MHQRHPRARCPPLCVAGFLACGFLLGCWGPSHFQQQSCLQALEPQAVSSYLSPGAPLKGR
PPSPGFAGGILHLELLVAVGPDVFQAHQEDTERYVLTNLNIGAELLRDPSLGAQFRVHLV
KMVILTEPEGAPNITANLTSSLLSVCGWSRTINPEDDTDPGHADLVLYITRFDLELPDGN
RQVRGVTQLGGACSPTWSCLITEDTGFDLGVTIAHEIGHSFGLEHDGAPGSGCGPSGHVM
ASVAWSPCSRRRSLPLAYTHPCILLAGRARCVWDPPRPQPGSAGHPPDAQPGLYYSANEQ
CRVAFGPKAVACTFAREHLDMCQALSCHTDPLDQSSCSRLLVPLLDGTECGVEKWCSKGR
CRSLVELTPIAAVHGRWSSWGPHSPCSRSCGGGVVTRRRQCNNPRPAFGGRACVGADLQA
EMCNTQACEKTQLEFMSEQCARTDGQPLRSSPGGASFYHWGAAVPHSQGDALCRHMCRAI
GESFIMKRGDSFLDGTRCMPSGPREDGTLSLCVSGSCRTFGCDGRMDSQQVWDRCQVCGG
DNSTCSPRKGSFTAGRAREYVTFLTVTPNLTSVYIANHRPLFTHLAVRIGGRYVVAGKMS
ISPNTTYPSLLEDGRVEYRVALTEDRLPRLEEIRIWGPLQEDADIQVYRRYGEEYGNLTR
PDITFTYFQPKPRQAWVWAAVRGPCSVSCGAGLRWVNYSCLDQARKELVETVQCQGSQQP
PAWPEACVLEPCPPYWAVGDFGPCSASCGGGLRERPVRCVEAQGSLRSLLKTLPPARCRA
GAQQPAVVLETCNPQPCPARWEVSEPSSCTSAGGAGLALENETCVPGADGLEAPVTEGPG
SVDEKLPAPEPCVGMSCPPGWGHLDATSAGEKAPSPWGSIRTGAQAAHVWTPAAGSCSVS
CGRGLMELRFLCMDSALRVPVQEELCGLASKPGSRQEVCQAVPCPARWRYKLAACSVSCG
GGVVRRILYCARAHGEDDGEEILLDTQCQGLPRPEPQEACSLEPCPPRWKVMSLGPCSAS
CGLGTARRSVACVQLDQGQDVEVDEAACVALVRPQASVPCLIADCTYRWHVGTWMECSVS
CGDGIQRRRDTCLGPQAQAPVPADFCQHLPKPVTVRGCWAGPCVGQGMPSLVPHEEAAAP
GRTTATPAGASLEWSQAQALLFSPAPQPQRLLPGPQENSAQSSACGRQHLEPTGTIDMRG
PGQADCAVAIGRPLGEVVTLRVLESSLNCSAGDMLLLWGRLTWRKMCRKLLDMTFSSKTN
TLVVRQRCGRPGGGVLLRYGSQLAPETFYRECDMQLFGPWGEIVSPSLSPATSNAGGCRL
FINVAPHARIAIHALATNMGAGTEGANASYISIRDTHSLRTTAFHGQQVLYWESESSQAE
MEFSEGFLKAQASLRGQYWTLQSWVPEVQDPQSWKGKEGT
Download sequence
Identical sequences ENSGGOP00000012247

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]