SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSDNOP00000016708 from Dasypus novemcinctus 76_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSDNOP00000016708
Domain  Region               Superfamily                             Superfamily E-value  Family                         Family E-value
1       1241-1469            Concanavalin A-like lectins/glucanases  5.81e-40             Laminin G-like module          0.0031
2       285-397              Cadherin-like                           4.28e-29             Cadherin                       0.00091
3       390-498              Cadherin-like                           2.43e-28             Cadherin                       0.00039
4       595-702              Cadherin-like                           3.71e-28             Cadherin                       0.0011
5       704-809              Cadherin-like                           4e-28                Cadherin                       0.00063
6       1493-1689            Concanavalin A-like lectins/glucanases  5.28e-27             Laminin G-like module          0.015
7       179-283              Cadherin-like                           5.71e-26             Cadherin                       0.00079
8       74-177               Cadherin-like                           1.71e-24             Cadherin                       0.0014
9       810-911              Cadherin-like                           1.71e-24             Cadherin                       0.0014
10      498-599              Cadherin-like                           8.85e-22             Cadherin                       0.0011
11      913-1015             Cadherin-like                           0.0000000000214      Cadherin                       0.0074
12      1168-1258,1458-1506  Growth factor receptor domain           0.00000000706        Growth factor receptor domain  0.02
13      1693-1726            EGF/Laminin                             0.000000911          EGF-type module                0.012
14      1822-1861            EGF/Laminin                             0.00000619           Laminin-type module            0.0094
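
The region strings above can be turned into residue counts with a few lines of code. The following is a minimal sketch, not part of the SUPERFAMILY server: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive at both ends and that a comma separates the segments of a discontinuous domain (as in domain 12).

```python
def parse_region(region):
    """Split a region string such as '1168-1258,1458-1506' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Count residues covered by a region, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Two region strings copied from the strong hits listed above.
examples = {
    "Domain 1": "1241-1469",
    "Domain 12": "1168-1258,1458-1506",
}

for label, region in examples.items():
    print(label, parse_region(region), region_length(region))
```

Keeping the segments as a list of pairs, rather than collapsing them to one span, preserves the fact that domain 12 is assigned to two separate stretches of the sequence.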
 
Weak hits

Sequence:  ENSDNOP00000016708
Domain  Region               Superfamily                               Superfamily E-value  Family                         Family E-value
-       1728-1774            EGF/Laminin                               0.000208             EGF-type module                0.011
-       1871-1923            Hormone receptor domain                   0.000837             Hormone receptor domain        0.0069
-       2334-2540            Family A G protein-coupled receptor-like  0.0513               Rhodopsin-like                 0.05
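
This page does not state the cutoff that separates strong hits from weak hits. The sketch below assumes a superfamily E-value cutoff of 1e-4, which is consistent with every value listed here but is an assumption rather than a documented setting; the constant and function names are hypothetical.

```python
# Assumed cutoff: not stated on this page, chosen because it separates
# the listed strong hits from the listed weak hits.
STRONG_CUTOFF = 1e-4


def classify_hit(superfamily_evalue, cutoff=STRONG_CUTOFF):
    """Label a hit 'strong' if its superfamily E-value is below the cutoff."""
    return "strong" if superfamily_evalue < cutoff else "weak"


# Superfamily E-values copied from this page, keyed by region.
evalues = {
    "1822-1861": float("0.00000619"),  # last strong hit (domain 14)
    "1728-1774": float("0.000208"),    # first weak hit
    "2334-2540": float("0.0513"),      # last weak hit
}

for region, evalue in evalues.items():
    print(region, evalue, classify_hit(evalue))
```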
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSDNOP00000016708   Gene: ENSDNOG00000023675   Transcript: ENSDNOT00000025090
Sequence length 2813
Comment pep:known_by_projection scaffold:Dasnov3.0:JH576698.1:248119:395392:-1 gene:ENSDNOG00000023675 transcript:ENSDNOT00000025090 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
AARFPALAALPTSPACCCAPGPGCPRLLCALRRVAVRGQRALQGAAATPGPSPSRPQAGA
AARRARRGASDSTSPQFPLPSYQVSVPENEPAGTAVIELRAHDPDAGEAGRLSYQMEALF
DERSNGYFLIDAATGAVSTTCALDRETKDTHVLRVSAVDHGSPRRSATTYLTVTVSDTND
HSPVFEQSEYRERVRENLEVGYEVLTIRATDGDAPSNANMRYRLLQGAGGVFEIDSRSGV
VRTRAAVDREAAAEYQLLVEANDQGRQPGPRSATATVHIAVEDENDNYPQFSEKRYVVQV
PEDVAVNTPVLRVQATDRDQGANAAVHYSIVSGNLKGQFYLHALSGSLDVINPLDFEAIR
EYTLRVKAQDGGRPPLINASGLVSVRVLDVNDNAPIFVSSPFQAAVLENVPLGHSVLHIQ
AVDADAGENARLRYRLVDTAAAAPDFPFRIHNSTGWITVCAELDREAVEQYSFGVEALDH
GSPPMSSSASVSVTVLDVNDNDPAFTQPMYELRLNEDAAVGSSVLTLRARDRDANSVITY
QLTGGNTRNRFALSSQSGGGLITLALPLDYKQERQYVLAVTASDGTRSHTAQVFINVTDA
NTHRPVFQSSHYTVSVSEDRPVGTSVATIGATDEDTGENARITYVLEDPVPQFRIHPDTG
TVYTTAELDYEDQAAYTLAITARDNGIPRKSDTTSLEILVLDANDNAPRFLREFYQGSVF
EDAPPSTSVLQLSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVA
VYNLRALAVDRGSPAPLSASVEVQVAVLDINDNPPVFERDELELLVEENSPVGSVVARIR
AHDPDEGPNAQIMYQIVEGNVPEVFQLDLLSGDLRALAELDFEVRREYVLVVQATSAPLV
SRATVHLRLLDQNDNPPVLPDFQILFNNYVTNKSNSFPGGVIGRVPAHDPDFSDSLRYSF
LRGNELRLLLLDPATGELQLSRDLDNNRPLEALMEVSVTDGLHSVTALCTLRVTIITDDM
LTNSITVRLENMSQEQFLSPLLALFVEGVATVLSTPRDHIFVFNVQNDTDVRANILNVSF
SALLRGGARGQFLPSEDLQERIYLNRTLLTTISAQRVLPFDDNICLREPCENYMKCVSVL
KFDSSAPFISSTTVLFRPIHPINGLRCRCPPGFTGDYCETEIDLCYSSPCGANGRCRSRE
GGYTCECLEDYTGEHCEVDARSGRCAAGVCRNGGTCVNLLVGGFHCVCPPGGYETPYCEV
TARSFPPQSFVTFRGLRQRFHFTLSLTFATQDRNALLLYNGRFNEKHDFVALEIVDEQVQ
LTFSAGPGETTTMVAPQVPGGVSDGQWHSVQVQYYNKPNIGRLGLPHGPSGDKVAVVTVD
DCDTAVAVRFGSYVGNYSCAAQGTQSGSKKSLDLTGPLLLGGVPNLPEDFPVHNRQFVGC
MRNLSIDGQRLDMAGFIANNGTRAGCAAQRNFCAGISCQNGGTCVNKWNTYLCKCPLRFG
GKNCEQAMPHPQRFSGESIVSWSDLAITISVPWYLGLMFRTRQEGGVLVEAAAGAGCKLH
LEILNSYVQFEVSHGRDVASMTLSRSRVTDGEWHHLLIELKSAKEGKDIKYLAVMTLDYG
MDQNTVQVWSQLPGLKMRSLVIGGVSEDKISVQRGFRGCLQGVRVGETSTNVASLDMDDA
LKVRVRDGCDVDDPCASSPCPPNSLCRDTWDGFSCVCAPGFFGRKCMDVCHLNPCEHVAA
CVRAPGSPRGYACECAPGSYGLYCENKIDLPCPKGWWGTPVCGPCHCAVGRGFDPDCNKT
SGQCRCKENHYKPPEQDACLPCACFPQGSLSRACDTDTGQCACRPGVIGRQCNRCDSPFA
EVTALGCEVIYSGCPKAFEAGIWWPQTKFGQPAAVPCPKGSVGNAVRHCSAEKGWLPPEL
FNCTHFSFLDLKAMNEKLSRNETRMDGDGAVRLAKALRNATRRSAVFGSDVRAAYQLLCR
VLQHESEQQGFDLAATRDAGFHEDVVGTGSALLAPENRAAWEQIQRGEGGAAQLLRRFEA
YFSNVARNVRKTYLQPFVIVTANMILAVDVFDKANFTGARVPRFELVHEEYPEELEASVI
FPADFFRPPGRRDGPVGGPTGRRPAPQTARPGPGTTTEAPFRRRRRHPSDPGQSAVALVL
IYRTLGQLLPEHYDPDRRSLRLPSRPVINTAVVSAAVYSEGAVLPSPLARPVLVESALLE
TEERSKPVCVFWNHSLTPGGTGGWSARGCELLWRSRSRVACQCGHVASFAVLMDVSRREH
GDVLPLKTVTYTAMALSLAALLVAFVLLALARALRSNLLSIRRSLIMALFSSQLVFVVGI
DRTENPVLCTVIAIVLHYVCMSAFAWAFVEGLHVYRMLTEARNIDAGPMRFYYVVGWGLP
AVITGLAVGLDPQGYGNPDFCWLSLRDTLIWSFAGPVGTVIVVHTVIFILSAKVSCQRKH
HHYERARVVSALRTAFGLLLLLGAAWLLALLTVHSDALAFHYLFAVLSCLQGLFVLLFHC
VLNGVVRKHLKGVLAGRKLHPDDSAATRATLLTRSLNCNNTYSEEPNVYRTALGESTASL
DSTARDEGAQKLTVASGPARGGHGEPDPSLARRSSKKPHGHDSDSDSELSLDEQSSSYAS
SHSSDSEDDGVAAEDKWDPAQAHARGPVHSTPKVDLGANHLGAGWPDGSLAGSXAAGPRS
RGEVTAGGGRGGRGPRGRAGGAGAGAGRRPPVPAGILKNRAAYPPPLPEQGALGRREKLA
DGERSPGSSRASSLGSRDRVTVKNPPRELGRGPLNGVAMSVRTGSAHDGSDSE
Identical sequences ENSDNOP00000016708
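
The Comment line in the protein record above is a space-separated list of key:value fields. The following is a minimal sketch of how it could be split into a dictionary. It assumes that the location field follows the pattern assembly:name:start:end:strand, which matches the value shown here but is not described on this page; the function names are hypothetical.

```python
def parse_comment(comment):
    """Split 'key:value key:value ...' into a dict, splitting on the first colon only."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(location):
    """Parse 'assembly:name:start:end:strand' (assumed layout) into typed fields."""
    assembly, name, start, end, strand = location.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


# Comment line copied from the protein record above.
comment = (
    "pep:known_by_projection "
    "scaffold:Dasnov3.0:JH576698.1:248119:395392:-1 "
    "gene:ENSDNOG00000023675 transcript:ENSDNOT00000025090 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

fields = parse_comment(comment)
location = parse_location(fields["scaffold"])
print(fields["gene"], fields["transcript"], location)
```

Splitting each token on its first colon only keeps the scaffold value intact, so its internal colons can be handled separately by parse_location.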
