SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A068WC79 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  A0A068WC79
Domain Number 1 Region: 147-306,331-373
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 3.11e-30
Family Reprolysin-like 0.0036
Further Details:      
 
Domain Number 2 Region: 471-527
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000131
Family TSP-1 type 1 repeat 0.00037
Further Details:      
 
Domain Number 3 Region: 1251-1325
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000968
Family TSP-1 type 1 repeat 0.0033
Further Details:      
 
Domain Number 4 Region: 1407-1438
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000301
Family TSP-1 type 1 repeat 0.0049
Further Details:      
 
Domain Number 5 Region: 969-1010
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000432
Family TSP-1 type 1 repeat 0.0039
Further Details:      
 
Domain Number 6 Region: 1324-1357
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000131
Family TSP-1 type 1 repeat 0.0087
Further Details:      
 
Weak hits

Sequence:  A0A068WC79
Domain Number - Region: 1142-1179
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.017
Family TSP-1 type 1 repeat 0.011
Further Details:      
 
Domain Number - Region: 1558-1608
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0235
Family TSP-1 type 1 repeat 0.012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) A0A068WC79
Sequence length 1830
Comment (tr|A0A068WC79|A0A068WC79_ECHGR) A disintegrin and metalloproteinase with {ECO:0000313|EMBL:CDS17702.1} OX=6210 OS=Echinococcus granulosus (Hydatid tapeworm). GN=EgrG_001046600 OC=Cyclophyllidea; Taeniidae; Echinococcus.
Sequence
MYIRHNAVLVLFAILCLLIPLCDGFKQGFLPLSSLLHPTDGLTHGYGDIDNHQTMLVRLE
AAGALADMKLPERVLKRPRELHPVMMTLSKRNTVTMQRHWKVEQASYTNGESNSDGGDAD
RHWDERMQINMTAGESVPTVSFKAAEENTIELLVVVTRKMHLILKGRLTEYLVATFGAVA
QNFKHCSLKASVKISIVDIILLDSNFARREGLEDWSNKNHEEVMGKFCHWVNRIRRPTMN
WDSAILLNVGNFKTTALGVAHYQAMCSRESSCLVVVDRGFGTADIIAHEIGHQLGAKHDF
EVGSECGLEEQPSRKRAVQKLSRRTPDVDDLTIQRDTIMSGILYFELYPFRWSACSRENI
QFFLSRSDSACLRTQNSQRTAFTSKQLTQHMGQHKPGLRFTLNEQCALAMRQRGARFCGH
SSPVCKQLHCYDVQRGMCLPVEAPWAEGSRCGYKRWCVMGQCLLQGETISPIDGGWSAWE
EWGTCSRSCGGGVQFSRRECTSPEPQNGGEHCLGTNVRVRSCNIQNCPDSVDLRQQLCDK
VGQKLSKHLQAFKPSIGGANACKLVCLDGTKEVAHNESLPDGTPCYAPGTDICIKGRCWQ
AGCDKMLGSRMERDNCNVCGGDNSTCYAVSGEFHHSDALAPGAKPVGLTIAVHIPQGVTN
AYIKKVSRRSTPYSADAYDDFMILIFEELKTRIRRGETREPFAGAELYYSGSRGKEEIVH
IKGQINKNVNILIRVENKNTKLPLPDVEYTYYVSKDASDHLRFDPSGLAVHHYNDPLLRF
LRSPRKGAPAEELNEENVEPNPPLEPSKPEHSSQTIKEPIGFEWKMDEVPKECTSCSGTV
QSHASCYPVFSSQEASKHYGIHPLHPLPPRYCGESLRPSPVTRNCADYCGVRWSWREVNA
SDYTVARSTCSVRCGEGLTEVHFTAICEEKIPEKDTLHTIWRESSLGAYACIKAELGEPP
EDRVVTNRCTGDCRPLHWVFSDWDKCSERCGAGTKQRSVTCVDDVSNHWPLTECLQHIST
VVVGKLNGVDGYVGTELAECFEVDACGGQFMWITTPWSECRSTASYSDMGSLCHSALRRH
LDGDSQGLPSTSAFTGFQTRTSRCVLRSASGEDVKHEAPQHYCERAHALKPPEQQACTAE
LTCYRWTTVHFSQCSTNCGAGQRVGQLSCEQISLSRGARPNVTPVGETDCLYRLGSNISL
QIDSRRENPTVYIVESGSSEEREALQRVPYLRPLPFTPGKPLLLACSNPPCHSRSLEWAV
SEWSLCSATCGMGYQRRSLRCILVERHTGAPLKSEAVEVEVPGSECTARGLARPADMQLC
EAPPCIKWSPGEWSECKGTCESGTQERYIRCIRQKSLYTTSPKSVTWSSFEAPLNVSSTM
IEQEVDPAECKYQSKPAEKRSCLLASDCPFWFQGPWSSCSSTCGAGRRFRHVDCRFPNGT
VLYVFGRLARDRSILQMSSRRRRNLITESRKHLNTIRCLEPRPVDKTECQLKPCQEDRSF
WWPVVSSTCNAKGCIRGYQQRELQCLTPSFQPVDPKECVYSRKPLKMIPCTLQECKTYRW
RTDTWTRCPYACRKHSRYRNVRCVDDLGEEYADHLCQAHLRPDSWSICPDACPDLPISCW
DQKQRHPASSDGLYELAVHRQVVRIYCSDMESSYPREYLPLHHLNYASIWHFKEASEQNC
SSTNVYYKGKVTDTNVTSGTPSENQSLELSEFPADSVTVFKKLRINLNSLHVDIFDGRFS
ETLGTRFVPYASAKSCSRGTSQLGKFLINLKGTGFTVSKETQWRTSRLQSFGHVARTGDS
MIVMGSCGGDCGGCWPDPLLKLEAADAAER
Download sequence
Identical sequences A0A068WC79
EgrG_001046600.1

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]