SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSDARP00000102839 from Danio rerio 76_9

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSDARP00000102839
Domain Number 1 Region: 1794-2021
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.37e-37
Family Laminin G-like module 0.0047
Further Details:      
 
Domain Number 2 Region: 2046-2238
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.72e-33
Family Laminin G-like module 0.005
Further Details:      
 
Domain Number 3 Region: 1294-1429
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-29
Family Cadherin 0.0008
Further Details:      
 
Domain Number 4 Region: 860-981
Classification Level Classification E-value
Superfamily Cadherin-like 1.23e-27
Family Cadherin 0.00092
Further Details:      
 
Domain Number 5 Region: 750-864
Classification Level Classification E-value
Superfamily Cadherin-like 8.14e-25
Family Cadherin 0.00065
Further Details:      
 
Domain Number 6 Region: 968-1081
Classification Level Classification E-value
Superfamily Cadherin-like 3.93e-24
Family Cadherin 0.0014
Further Details:      
 
Domain Number 7 Region: 429-537
Classification Level Classification E-value
Superfamily Cadherin-like 1.01e-22
Family Cadherin 0.0016
Further Details:      
 
Domain Number 8 Region: 1074-1189
Classification Level Classification E-value
Superfamily Cadherin-like 1.27e-22
Family Cadherin 0.00024
Further Details:      
 
Domain Number 9 Region: 215-324
Classification Level Classification E-value
Superfamily Cadherin-like 3.28e-22
Family Cadherin 0.0025
Further Details:      
 
Domain Number 10 Region: 533-652
Classification Level Classification E-value
Superfamily Cadherin-like 3.4e-20
Family Cadherin 0.0053
Further Details:      
 
Domain Number 11 Region: 320-438
Classification Level Classification E-value
Superfamily Cadherin-like 1.57e-19
Family Cadherin 0.002
Further Details:      
 
Domain Number 12 Region: 1191-1305
Classification Level Classification E-value
Superfamily Cadherin-like 8.85e-17
Family Cadherin 0.0024
Further Details:      
 
Domain Number 13 Region: 123-233
Classification Level Classification E-value
Superfamily Cadherin-like 1.96e-16
Family Cadherin 0.0053
Further Details:      
 
Domain Number 14 Region: 10-108
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000414
Family Cadherin 0.0026
Further Details:      
 
Domain Number 15 Region: 641-757
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000127
Family Cadherin 0.0022
Further Details:      
 
Domain Number 16 Region: 1405-1518
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000144
Family Cadherin 0.0074
Further Details:      
 
Domain Number 17 Region: 1514-1614
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000157
Family Cadherin 0.023
Further Details:      
 
Domain Number 18 Region: 2282-2319
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000152
Family EGF-type module 0.019
Further Details:      
 
Weak hits

Sequence:  ENSDARP00000102839
Domain Number - Region: 2241-2274
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0178
Family EGF-type module 0.031
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSDARP00000102839   Gene: ENSDARG00000078088   Transcript: ENSDART00000109511
Sequence length 2532
Comment pep:novel chromosome:Zv9:7:48878970:49154377:-1 gene:ENSDARG00000078088 transcript:ENSDART00000109511 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
DWYTVRVDLTVMDVNDNAPEWMMVPFPYLSVVSAISPPNTLVYKLQARDGDEGINGEVEY
FLSDGHTTGGDGRFEVDRKNGHVRTTGLPLQRDREYLLTVVAADRQGSRSPPAVLSIIAG
PRAPQFTNVSYTIPIPENTPEGQPFLVTPAVSFQKQPVTYSLLINPSSLFSIQPETGEIS
LTRSIDYESDQHRYLLLVRASENQDSLSSAAEVRVIITDENDCVPEFLQSIYSKDGVPET
VTTATSLLQELASDCDSEQNAEITYYTLSPDFIISAHGTIFPAGPLDYERQNHLYEFVVM
AVDKGEVPRTGTATVRLRMANVNDEPPVFSQPVYRTFVSEDAGPNTLVATVLAKDPDGDG
ITYKITAGNDEGNFVIDSQKGLIRLRSSPSPRLQGLKYMLNVTATDDNASGGPQSLSAVA
QVIVGVDDVNNNKPVFEKCAEYKEKASVLENKPAGSFVLQVHADDADEGANGKVTYGFMH
KDSTVPAFSIDPETGVIVTAIKFDRESQREYAVTVTATDQAADPLIGICQLNILILDEND
NNPKFENLRYEYFLREDTMIGTSFLRVAAHDDDYSTNAAITYSMSEEQPEYLQVNPLTGW
VYVNQPISQRTYISRQIIATDGGNRSSSVELAVTITNVKNQPPQWEKDKYEVVIPENTVR
DTPVVTIKATSPLGDPRVTYNLEDGLVPESNMPVRFYLTPNREDGSASILVAEPLDYETT
RNFMLRVRAQNVAPVPLAAFTTVHINITDVNDNVPFFTSSIYEASVTEGVEIGTLVLQVS
ANDLDLGLNGKISYSLLNDRSGDFQYFRIDPELGTIFTEAVFDRETKGSYLLEVKSVDGW
ESARPGKHGQPNSDTAYVRIFISDVNDNKPVFAQPVYEVDVDEDADVGSTVLTVSANDED
EGANAKLRYQITSGNVGGVFDMEPEVGTIFIAQPLDYEQNKLYKLHVLASDGKWEHYATV
IVTIVNKNDEAPVFSVNEYYGSVTEELDGSPVFVLQVTASDPDKDADQEALRYSLHGQGA
ESEFIIDEVTGKIYAQRTLDREVRAVWRFVVLATDEGGEGLTGFTDVIINVWDINDNAPI
FACAPDSCHGDVAENSPPGTSVMEMTATDLDDAAVGQNAMLAYRIVGNAALNGANNGADM
FNINPATGTVSVSMSGLDREQIDSYVLVVEARDGGGMIGSATATIHVTDVNDHIPRFLDR
SCFVRIPESSEPNTAVIELAAEDADAGENGQLTFSVVAGDPEQKFYMVSHRQEQRGTLRL
KKRLDYERPGEQRFNLTIKVEDMQYSTLLHCTLEIEDCNDHVPVFIPHFLQLPAIREDVA
PGTSLASVAASDLDLGLNREITYTIAAESDPYHLFSVDQSGLVTVASELDREKVAQHHLV
ILAADHGTPPLTGTATIQMALLDVNDNGPEFEVAYAPVVWENVLGPQVVRLNQTSMLLRA
VDRDSVENGSPFSFFVPLEYRYSNDFHLQDNGNDTATITALRAFDRERQKEFLLPVIMTD
SGSPPKTVTNTLTITIGDENDHAHTAGQNHLFIYLLTGRMPTNVLGKVYSPDPDDWDNKT
YAFEGHVPNYFILNKRTGFLIIKENAPPGSYEFQVRVSDGVWPDAVSSVKVNVRELRDDA
IYNSASLRLAGTDISATEFMERRGNLKSRYELLGDFLSDMLSVGVDDINIFSLMEVRDRT
LDVRFSVHSTPFLRAERVHGYLAAHKQKLQSFLQVNVTQVHVDECVDADCRGGGGCTSHL
SVTDKPAVVDSGSMALVSVTVEATAVCSCSAREHLHQSCSIYPRNPCHNGGVCVDTQSGY
RCQCPAQFEGPECQQTKHSFHGNGYAWFPPIRPCFESHLSLEFITEVADGLLLYSGPLSQ
LQPWEPEDFMAIELIDGTPTLKINHGSGTLVLQLPGNVNVADRRWHRLDVRSNSKEVRFT
LDRCAGATIMEMEGVGSWLTTEDHTSCEVTGITPNMDKHLNVTQVLQLGGVNENLPYIYP
QLQHKHFTGCIRNLIVDSKYYDLGSPADSSGSAPGCVMTDSSCVNMGFPSCGTRGRCHGE
WGSFSCQCIPGYSGHQCEKEVPEYSFDGRSHIHFQLAFTLPARHTQVQVLVRTRKRSSSI
LSLLSKEQNEYLRLEIYQGLLSVFYNLGDGDFNLTMPSYRLDNGEWHDIHLDRHDNELTL
RLDGGGGRREVTGSPGRSREIVIDPAVVMLGNSFPSGHNRSFQGCMRDLRLNGRFIALDS
QARDGVSVVSSQGVSLGCFSDSCRKNLCSPPFTCVDLWRVHECRCPTGHMVKVNSTGKFC
VYTMCASRPCHKGTCVAQSPSKFTCHCPEGYRGRHCEVTLAIYRDDVGLSFSSLFAICIC
FMALLVLLLGIFLYTRWRSYKGLKEGVYHVSAHHDGWEDIRENVLNYDEEGGGEEDQNAY
DMAELQKSLQPSPGSATIRRDVPPCRSPAQGQTANPSRAAQLARKSLSFSSQDLARYLCE
IIRDADQHPETAPFDSLQVFSTEGGGSLAGSLSSFSSAGLEEGMAAGHECLKEWGPRFEK
LKALYERAEGSD
Download sequence
Identical sequences ENSDARP00000102839 ENSDARP00000102839

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]