SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000017689 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000017689
Domain Number 1 Region: 1784-2010
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.33e-35
Family Laminin G-like module 0.0057
Further Details:      
 
Domain Number 2 Region: 2037-2230
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.74e-32
Family Laminin G-like module 0.0021
Further Details:      
 
Domain Number 3 Region: 1286-1422
Classification Level Classification E-value
Superfamily Cadherin-like 1.83e-32
Family Cadherin 0.0027
Further Details:      
 
Domain Number 4 Region: 862-970
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-25
Family Cadherin 0.00065
Further Details:      
 
Domain Number 5 Region: 746-860
Classification Level Classification E-value
Superfamily Cadherin-like 3.85e-25
Family Cadherin 0.00046
Further Details:      
 
Domain Number 6 Region: 211-320
Classification Level Classification E-value
Superfamily Cadherin-like 5.42e-25
Family Cadherin 0.0017
Further Details:      
 
Domain Number 7 Region: 964-1077
Classification Level Classification E-value
Superfamily Cadherin-like 1.96e-24
Family Cadherin 0.00095
Further Details:      
 
Domain Number 8 Region: 425-533
Classification Level Classification E-value
Superfamily Cadherin-like 5.85e-24
Family Cadherin 0.0008
Further Details:      
 
Domain Number 9 Region: 1070-1182
Classification Level Classification E-value
Superfamily Cadherin-like 8.28e-24
Family Cadherin 0.00047
Further Details:      
 
Domain Number 10 Region: 314-434
Classification Level Classification E-value
Superfamily Cadherin-like 9.57e-20
Family Cadherin 0.0034
Further Details:      
 
Domain Number 11 Region: 529-648
Classification Level Classification E-value
Superfamily Cadherin-like 2.36e-19
Family Cadherin 0.0044
Further Details:      
 
Domain Number 12 Region: 1178-1305
Classification Level Classification E-value
Superfamily Cadherin-like 1.34e-16
Family Cadherin 0.0029
Further Details:      
 
Domain Number 13 Region: 119-223
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000017
Family Cadherin 0.0033
Further Details:      
 
Domain Number 14 Region: 10-104
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000002
Family Cadherin 0.0044
Further Details:      
 
Domain Number 15 Region: 638-746
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000471
Family Cadherin 0.0078
Further Details:      
 
Domain Number 16 Region: 1398-1509
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000657
Family Cadherin 0.0082
Further Details:      
 
Domain Number 17 Region: 1505-1606
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000458
Family Cadherin 0.021
Further Details:      
 
Domain Number 18 Region: 2275-2316
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000905
Family EGF-type module 0.016
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000017689
Domain Number - Region: 2234-2265
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0798
Family EGF-type module 0.076
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000017689   Gene: ENSGACG00000013379   Transcript: ENSGACT00000017724
Sequence length 2418
Comment pep:novel group:BROADS1:groupXIX:17709350:17755283:-1 gene:ENSGACG00000013379 transcript:ENSGACT00000017724 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
DWYLNKVKLKITDVNDNVPEWNMEPYPYLAVVSPEAPAGAFVYQLWARDGDEGKSGEVEY
FLSDGGDGCFAVDKKTGQVVTTGLVLQRDREYLLSVVALDGPGSRSAPAMLSVVAGARAP
QFTNTTYAIAIPENTPEGQPFVVVFALSFQQQPISYSLLINPSSLFSIRQETGEISLTRT
LDYESDQRRYLLMVRASEEPGSLSTATEVQVLITDENDCVPEFLQSIYSVDGVPETVTTA
TSLLQVLATDCDSGSNAELTYYTLSPDFSISPHGTVFPAGRLDYERPNHVYEFVVMAMDR
GDEPHSGTATVRVRMANTNDEAPEFSQPVYRTFVSEDAGPNTLVATVLAKDPDGDGISYS
ITAGNQEGNFVIDSQKGLIRLRSAPLPELQGLEYVLNVTATDDNASGGLHPLASTARVIV
GVDDVNNNKPVFEECQKYREQASVLENQSTGTFVLQVHAKDADEGANGKVKYGLMHRDST
MPAFRIHPDTGVIATARRFDRERQREYSITVTATDWAEEPLIGICQLTVQILDQNDNSPK
FENLRYEYFLREDTSVGTSFLRVAAHDDDFGTNAAVTYSMSLEQPEYLQVNPVTGWIYVN
QPISQRTYITRDIIATDGGERNTSVELAVTITNVKNQPPQWEKESYSVVIPENAARDTPI
VTIKATSQLGDPRVTYNLEDGMVPETNMPVRFYLSPNREDSSASILVSEPLDYETTPVFS
LRVRAQNVAAVPLAAFTTVYVNVTDVNDNVPFFTSSIYEASVTEGAQIGTLVLQVSANDK
DLGLNGEITYSLLSDSSGDHHLFRVDPKLGLIYTEAVFDREARSSYLLEVQSVDGQESAR
PGKNKQPNSDTAYVRVFISDVNDNKPVFAQRLYEVGVDENADVGLAVVTVSANDEDEGAN
AKLRYQITSGNKGGVFDIEPEVGTIFIAQPLDYEEQKRYKLLVLASDGKWEDYAAVVVTV
VNKNDEAPVFSMNEYYGSITEELDGSPVFVLQVTATDPDKDADQGAIRYSIHGQGAESHF
VINDITGEMYAQRTMDREERAVWRFVVMATDEEGEGLTGFTDVIINVWDINDNAPTFTCA
PDNCHSSVAENSPPGTFVVEMTAADRDDAAVGQNAILTYRISENVRVANDADLFVIDSST
GTLSLAAEGLDREFADSHRLVVEARDGGGMIGKATVTVAVTDINDHVPKFKQDRCGARVP
ESADEDAAVLELSAVDPDAGTYGQLAFSVVAGDAEQRFYVVGHRTEKTATLRLKKKLDFE
KPGEQRFNLTLKVEDSDFSSLIHCQILVEDVNDNAPVFTPSSRLLPPLPEDVTVGTSVVQ
VVASDLDSGLNGDILYSISPRSDPHGHFTVSRAGLVTVAKPLDRETVAGYEVVVMATDRG
NPPLTGTVTVRVPLLDVNDNGPELEAPYSPVLWENSPAPQVVWLNRSSTLLRVVDRDSSE
HGPPFSLSLPSLYSIHFHLQDHGNGSATLTALRRFDRERQREFHLPVILIDGGEPPMTAT
ATLTITIGDQNDNAHQAGEKDVYVHTRKGRLANAALGKVYAPDPDDWDNKTYTLETSAAK
YFGLNQSSGVLTIKPNTPAGSYWLRVGVSDGVWPDVFSGVRVHVRELEEKSILLSASLRL
TGITARGFIDPHVEGKSRLETFWDFLSGALSVRPGSVNIFSIADREERSVDVHFYVLTDN
GYMRPEKLHSVLAAHKTKLQALLHANVSQVQVDECVRAACQTAGGCSTRLSVADTPTLVD
SGALSLASVKVTSSAVCGCAAREMNRQPCSSYRINPCLNGGTCVDTQGGYRCHCPPQLEG
PECQQTRLSFLGNGYAWFPPIRPCFDSHLSLEFMTAEDDGLLLYAGPLATLLPGDGEDYM
AIELIGGTPSLKINHGSGTLVLQLTNNIGVTDRRWHRLDVRSNSKEVRFTLDRCSSAIIM
ETEGVDSWAVTEDRSSCEIRGVTPNRDKYLNGSQVLQIGGVNNNISYEYPQLQHTHYTGC
IRNLVVDSKLYDLGSPAESSNTVAGCSLIDDHCSNMERSSPCGKRGRCHGQWGSFSCLCE
PGFTGPQCDQGAPEFSFDGRSHVQFHLLWSLPARQTRVQLGIRTRAAFGVVLSLLSREQN
EYLRLEVIQGLLAVFYNLGDGDYNLTLPHYPLGDGEWHEAELDRYGREFTLRLDGGGGRR
EVTASLGRSQEIIIDPSVVMLGNSFPSGLNRSFLGCLRDVRFNGRSVPLGRDQPTEGLQV
ITSQGVSVGCYSEACRKHHCSPPLVCVDLWRHPECRCPAGHMTKESSLGKVCVYTLCASR
PCRHGTCVAHSPSRYSCRCSEGYRGRHCEVTLAMYHDADSTSLSLSSMFAISICVMAFLV
LMLGLFLYSCWRRHKGLKEGVYHVSAHHGDWEDIRENVLNYDEEGGGEQDQNAFNMVELQ
RSLQPSPAQSLRYSYPQS
Download sequence
Identical sequences G3PJB0
69293.ENSGACP00000017689 ENSGACP00000017689 ENSGACP00000017689

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]