SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSHAP00000018016 from Sarcophilus harrisii 76_7.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSHAP00000018016
Domain Number 1 Region: 2709-2874
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.42e-36
Family Laminin G-like module 0.000000169
Further Details:      
 
Domain Number 2 Region: 2878-3059
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.15e-34
Family Laminin G-like module 0.0000000536
Further Details:      
 
Domain Number 3 Region: 2270-2453
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 4.03e-31
Family Laminin G-like module 0.0014
Further Details:      
 
Domain Number 4 Region: 2451-2640
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.28e-27
Family Laminin G-like module 0.022
Further Details:      
 
Domain Number 5 Region: 2084-2261
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.35e-23
Family Laminin G-like module 0.001
Further Details:      
 
Domain Number 6 Region: 750-802
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000335
Family Laminin-type module 0.01
Further Details:      
 
Domain Number 7 Region: 800-854
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000117
Family Laminin-type module 0.002
Further Details:      
 
Domain Number 8 Region: 1355-1406
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000865
Family Laminin-type module 0.013
Further Details:      
 
Domain Number 9 Region: 853-899
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000215
Family Laminin-type module 0.0043
Further Details:      
 
Domain Number 10 Region: 995-1043
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000297
Family Laminin-type module 0.01
Further Details:      
 
Domain Number 11 Region: 223-282
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000117
Family Laminin-type module 0.033
Further Details:      
 
Domain Number 12 Region: 692-735
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000167
Family Laminin-type module 0.0073
Further Details:      
 
Domain Number 13 Region: 1412-1464
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000234
Family Laminin-type module 0.017
Further Details:      
 
Domain Number 14 Region: 350-407
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000195
Family EGF-type module 0.081
Further Details:      
 
Domain Number 15 Region: 902-951
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000363
Family Laminin-type module 0.013
Further Details:      
 
Domain Number 16 Region: 1462-1503
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000642
Family Laminin-type module 0.0058
Further Details:      
 
Domain Number 17 Region: 280-336
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000014
Family Laminin-type module 0.022
Further Details:      
 
Domain Number 18 Region: 25-108
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0000255
Family APC10-like 0.053
Further Details:      
 
Domain Number 19 Region: 949-992
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000586
Family Laminin-type module 0.012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSHAP00000018016   Gene: ENSSHAG00000015297   Transcript: ENSSHAT00000018165
Sequence length 3062
Comment pep:known_by_projection scaffold:DEVIL7.0:GL856948.1:1749691:2283893:1 gene:ENSSHAG00000015297 transcript:ENSSHAT00000018165 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MYCKLVEHVPGQPVRNPQCRTCNQNSSFAYQRHPITNAIDGKNTWWQSPSIKNGIEYHYV
TITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDGFEYKPWQYHAITDTECLTRYGIYP
RTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARYIRLR
FQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDPLTNK
SRCECEHNTCGDSCDQCCPGFHQKPWKAGTFLTKTECEACNCHGKAEECYYDENVASRNL
SLNIHGKYIGGGVCINCTQNTTGINCETCIDGFFRPKGILPNYPRPCQPCYCDPIGSLNE
ICVKDENHARRGLFPGSCHCKHGFGGVRCDRCARGYTGYPDCVPCNCSGIGSTNEDPCIG
PCYCKENVEGRDCSHCKVGFFNLQENNRKGCEECFCSGVSDRCQSSRWTYSSINDMDGWY
LTDVAGLLRITPHQDSSGQSQQLSINSLDAKRELPRIYYWSAPEPYLGNRVTAAGGQLKF
TISYDLIGEDDLENVLQLMIILEGNGLQISTTQEEIYLPPSKEYTHVVLLSSELFNVHDT
GSRISRKEFMTILANVKRLLIQATYILGMDAIFRLNSVSLESAVPYPTDGELADAVELCQ
CPPGYSGSSCESCWPGHRRVNGTIFGGICEPCQCFGHAEACDDITGECVGCKDHTGGPYC
NRCLPGFYGDPTKGTTEDCHPCACPLNIPSNNFSPTCHLDWSRELICDECPVGYTGPRCE
RCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKPGTSGRYCERCADGY
FGDAVDAKNCQPCRCNLNGSFSEICHTQTGQCECKPNVQGLRCDECQPQTFGLQSSQGCV
PCNCNSFGSKSFDCEESGQCWCQPGVAGKKCDRCAHGFFNFQEGGCTSCECAHLGNNCDP
NTGKCICPPNTIGDKCDKCAPNSWGHSIIHGCKACNCSLVGSFDFQCNINTGQCSCHPEF
SGRKCSECNLGHWNYPHCDVCDCFLAGTDASTCSSETGKCACLNQSGQCTCKVNVEGVHC
DRCRPGKFGLDAKNPLGCSSCYCFGTTTQCTEAKGLIRMWVTLTPEQTILPLVDEALQHT
TTKGIAFQNPEIVAEMEQVREDLHLEPFYWKLPEQFEGKKLMAYGGKLRYTIYFEAREET
GFFTYNPQVIIRGGTPAHARIITRHMAAPLIGQLTRHEIEMTENEWKYYGDDPRTSRMVT
REDFLDVLYDIHYVLIKATYGNVIRQSRISEISLEVAEQGVISALSPRARLIEKCDCPLG
YSGFSCEACMPGFYRLPSESGGPRPGPSLGACIPCQCNGHSNTCDPETSICQNCQHHTAG
DFCERCAIGYYGTVKGLPNDCRQCACPLITSSNNFSPSCISEGLNDYRCTACPRGYEGQY
CERCAFGYTGSPSSPGGSCQECECDPYGSLPVPCDPVTGYCTCRPGATGQKCDGCKPGHA
RDGMECVFCGDECTGLLLSDLARLEQMAVSINLTGPLPAPYKMLYNFENVTQELKHLLSP
QRAPERLLQLAESNLNTLVTEMDELLTRATKVTADGEQTGQDAERTNMRAKILEGIVKEI
VQDAEDVNEKAIKLNETLGAQDKALEKNLQELQQEIDQMMAELRQKNLDMQNEVAQDELV
AAEALLRKVKKLFGESRGKNEELEKELRDKLTDYKSKVDDAQDLLREATEKIKEADRLTE
TNQKNMTTLEKKRQAVESGKQEAENTLKEANDILDEAHHLADEINSVIDLVKDIQKKLPE
TSEELKDKTDDLSQKIKNRRLPEKVVQAEDHAAQLNESSAVLDGILEEAKNISFNATAAF
KAYSNIKDYIDEAERVAKEAKGHANEATQLASGPQSSLKDDAKGSLQKSFRVLNEAKRLA
NGVKENDDNLKGMKNRLENANEKNGDLLKGLNDTLGKLSAIPIDTAAKLQAVKDKARQAN
ATAKDVLAQIKDLNQNLAGLRNNYNKLADDVSKTNAVVKDPAKNIADADATVKNLEQEAD
RLIDKLKPIKELQDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYRPEIKKGSY
NNIIVNVKTAVADNLLFYLGSAKFTDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDS
YWYRIEASRTGRNGTISVRALDGPKASIIPSAYNSISPPGYTILDVDANAMLFVGGLTGK
LKKADAVRVITFTGCMGETYFDSKPIGLWNFRDIEGDCKGCTVSPQVEDSEGTIQFDGEG
YALVSRPIRWYPNISTVIFKFRTFSSSALLMYLATRDLKDFMSVELDDGHIKVSYDLGSG
IASVISNQNHNDGKWKSFTLSRIQKQANVSIVDIETNQEENIATTSSGNNFGLDLKADDK
IYFGGLPTLRNLRPEVNLKKYAGCLKDIEISRTPYNILSSPDYVGVTKGCSLENVYIVSF
PKPGFVELPPVSLDVGTEINLSFSTKNESGIILLGSSGTNNSPRRKRRQTGQAYYAIFLN
KGRLEVHLSTGVRTMRKIVVKPELIPFHDGREHSIHVERTRGMFTVQVDEDRKHMQNLTV
EQTITIKKLFVGGSPLTYQPPPLRTIPPFEGCIWNLVINSIPMDFAQPVSFKNADIGRCI
DHKPGEGEVEVTPDQIVFPTDSTLEEDKETTPAFPKFPSPPPTPILVHGPCAADTEPAFL
IGSKQFGLSRNSHIAIAFDDTKVKNSLTIEFEIRTEAESGLMFYMARINHADFATVQLKN
GMAYFSYDLGSGNTSTMIATKINDGQWHKIKISRTKQEGILLVDGASNRTTSPKKADILD
VVGMLYVGGLPINYTTRRIGPVVYSIDGCMKNLKMTEAPADLENPTSSFNVGSCYVNAQK
GTYFDGTGFAKAVGAYKVGLDLLVEFEFRTTRTTGVLLGISSQKMDGMGIELVDEKLMFH
VDNGAGRFTAIYDAGVPGALCDGEWHKVTANKIKHRIELTVDGHQVGAQSPNTASTSADT
NDPVFVGGYPDGLNQFGLTINIPFRGCIRSLKLTKGTSKALEINFAKALELKGVQPISCP
AN
Download sequence
Identical sequences G3WRG1
ENSSHAP00000018016 ENSSHAP00000018016

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]