SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000019142 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSNLEP00000019142

Domain  Region     Superfamily                             Superfamily E-value  Family                 Family E-value
1       1319-1545  Concanavalin A-like lectins/glucanases  7.39e-41             Laminin G-like module  0.0022
2       349-461    Cadherin-like                           3.85e-29             Cadherin               0.0009
3       782-887    Cadherin-like                           2.57e-28             Cadherin               0.00054
4       673-780    Cadherin-like                           4.71e-28             Cadherin               0.0012
5       449-579    Cadherin-like                           2.43e-27             Cadherin               0.00081
6       243-347    Cadherin-like                           1.16e-26             Cadherin               0.00045
7       883-997    Cadherin-like                           5.42e-24             Cadherin               0.0015
8       576-677    Cadherin-like                           3.57e-22             Cadherin               0.0012
9       141-241    Cadherin-like                           3.85e-18             Cadherin               0.0042
10      1569-1762  Concanavalin A-like lectins/glucanases  0.00000000000000218  Laminin G-like module  0.034
11      991-1093   Cadherin-like                           0.000000000857       Cadherin               0.0097
12      1259-1298  EGF/Laminin                             0.00000000993        EGF-type module        0.011
13      1895-1935  EGF/Laminin                             0.00000067           Laminin-type module    0.013
14      1766-1799  EGF/Laminin                             0.00000141           EGF-type module        0.013
 
Weak hits

Sequence: ENSNLEP00000019142

Domain  Region     Superfamily                             Superfamily E-value  Family                 Family E-value
-       1801-1847  EGF/Laminin                             0.000149             EGF-type module        0.009
-       1238-1266  EGF/Laminin                             0.00145              EGF-type module        0.065
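
The strong hits above are listed by E-value, not by position along the protein. As a rough illustration only, the Python sketch below sorts them from N- to C-terminus and collapses consecutive hits to the same superfamily, which gives a text version of the domain architecture. The regions and superfamily names are transcribed from this page; the names `STRONG_HITS`, `n_to_c`, `overlap`, and `summarize` are hypothetical helpers introduced for the example and are not part of SUPERFAMILY. Regions are assumed to be 1-based and inclusive.

```python
from itertools import groupby

# Strong hits transcribed from the assignment above:
# (domain number, start residue, end residue, superfamily).
STRONG_HITS = [
    (1, 1319, 1545, "Concanavalin A-like lectins/glucanases"),
    (2, 349, 461, "Cadherin-like"),
    (3, 782, 887, "Cadherin-like"),
    (4, 673, 780, "Cadherin-like"),
    (5, 449, 579, "Cadherin-like"),
    (6, 243, 347, "Cadherin-like"),
    (7, 883, 997, "Cadherin-like"),
    (8, 576, 677, "Cadherin-like"),
    (9, 141, 241, "Cadherin-like"),
    (10, 1569, 1762, "Concanavalin A-like lectins/glucanases"),
    (11, 991, 1093, "Cadherin-like"),
    (12, 1259, 1298, "EGF/Laminin"),
    (13, 1895, 1935, "EGF/Laminin"),
    (14, 1766, 1799, "EGF/Laminin"),
]


def n_to_c(hits):
    """Order hits from N- to C-terminus by start residue."""
    return sorted(hits, key=lambda hit: hit[1])


def overlap(left, right):
    """Residues shared by two inclusive regions; 0 if they do not overlap."""
    return max(0, min(left[2], right[2]) - max(left[1], right[1]) + 1)


def summarize(ordered):
    """Collapse consecutive hits to the same superfamily into (name, count)."""
    return [(name, len(list(run)))
            for name, run in groupby(ordered, key=lambda hit: hit[3])]


ordered = n_to_c(STRONG_HITS)
summary = summarize(ordered)

# Neighbouring hits whose assigned regions share at least one residue.
overlaps = {
    (a[0], b[0]): overlap(a, b)
    for a, b in zip(ordered, ordered[1:])
    if overlap(a, b) > 0
}

for name, count in summary:
    print(f"{count} x {name}")
```

Under these assumptions the strong hits read, from the N-terminus, as nine Cadherin-like domains, one EGF/Laminin domain, two Concanavalin A-like lectins/glucanases domains, and two further EGF/Laminin domains. Several neighbouring Cadherin-like regions share a few boundary residues, which `overlaps` records. The weak hits are left out of the sketch.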
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by the dcGO Predictor.


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000019142   Gene: ENSNLEG00000015792   Transcript: ENSNLET00000020111
Sequence length 2902
Comment pep:known_by_projection supercontig:Nleu1.0:GL397440.1:412152:591773:-1 gene:ENSNLEG00000015792 transcript:ENSNLET00000020111 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
XRARAHHPGCGARAWLCGTGAWLCGALCFPVPGGGAAAQHSALAAPTTLPACSCPPRPGP
CCPGRPICLPPGSSVRLRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPE
ARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAHYTIEGEEERVSYYM
EGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVK
DTNDHSPVFEQSEYRERVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNE
SSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVEDENDNYPQFSEQNY
VVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDF
EDVQKYSLSIKAQDGGRPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPV
VHIQAVDADSGENARLHYRLVDTASAFLGGGSAGPKNPAPTLDFPFQIHNSSGWITVCAE
LDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGS
SVLTLQARDRDANSVITYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTA
SDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIATLSANDEDTGENARI
TYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILD
ANDNAPQFLWDFYQGSIFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIE
PTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTILDINDNAPMFEKDEL
ELFVEENNPVGSVVAKIHANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDF
EVRREYVLVVQATSAPLVSRATVHLLLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVI
GRIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNRPLEALMEVSVSDGI
HSVTAFCTLRVTIITDDMLTNSITVRLENMSQEKFLSPLLTLFVEGVAAVLSTTKDDVFV
FNVQNDTDVSSNILNVTFSALLPGGVRGQFFPSEDLQEQIYLNRTLLTTISTQRVLPFDD
NICLREPCENYMKCVSVLRFDSSAPFLSSTTVLFRPIHPINGLRCRCPPGFTGDYCETEI
DLCYSNPCGANGRCRSREGGYTCECFEDFTGEHCEVDARSGRCANGVCKNGGTCVNLLIG
GFHCVCPPGEYERPYCEVTTRSFPPRSFVTFRGLRQRFHFTISLTFATQERNGLLLYNGR
FNEKHDFIALEIVDEQVQLTFSAGETTTTVAPKVPSGVSDGRWHSVQVQYYNKPNIGHLG
LPHGPSGEKMAVVTVDDCDTTMAVRFGKDIGNYSCAAQGTQTGSKKSLDLTGPLLLGGVP
NLPEDFPVHNRQFVGCMRNLSVDGKNVDMAGFIANNGTREGCAARRNFCDGRRCQNGGTC
VNRWNMYLCECPLRFGGKNCEQGEWLASSIPLVTEAWEALLHDVPGTTGWMRHRGEAQTG
LQATSGGPTSFRLQILNNYLQFEVSHSPSDVESVMLSGLRVTDGEWHHLLIELKNVKEDS
EMKHLVTMTLDYGMDQNRADIGGMLPGLTIRSMVVGGASEDKVSVRRGFRGCMQGVRMGG
TPTNVATLNMNNALKVRVKDGCDVEDPCTSSPCPRNSHCHDAWEHYSCVCDKGYLGINCV
DACHLNPCENMGACVRSPGSPQGYVCECGPSHYGPYCENKLDLPCPRGWWGNPVCGPCHC
AVSKGFDPDCNKTNGQCQCKENYYKPPAQDTCLPCDCFPHGSHSRTCDMATGQCACKPGV
IGRQCNRCDNPFAEVTTLGCEVIYNGCPKAFEAGIWWPQTKFGQPAAVPCPKGSVGNAVR
HCSGEKGWLPPELFNCTTISFVDLRAMNEKLSRNETQVDGAGALRLVRALRNATQHTGTL
FGNDVRTAYQLLGRVLQHESWQQGFDLAATQDADFHEDVIHSGSALLAPATRAVWEQIQR
SEGGTAQLLRHLEGYFSNVARNVRRTYLRPFVIVTANMILAVDIFDKFNFTGARVPRFDA
IHEEFPRELESSVSFPADFFKPPEEKEGPLLRPAGRRTTPQTTRPGPGTEREAPISRWRR
HPDDAGQFAIALVIIYRTLGQLLPERYDPDRRSLRLPHRPIINTPMVSTLVYSEGAPLPR
PLERPVLVEFALLEVEERTKPVCVFWNHSLAVGGTGGWSARGCELLSRNRTHVTCQCSHT
ASFAVLMDISRRENGEVLPLKIVTYAAVSLSLAALLVAFVLLSLVRTLRSNLHSIHKHLD
AALFLSQLVFVIGINQTENPFLCTVVAILLHYIYMSTFAWTLVESLHVYRMLTEARNIDT
GPMRFYYVVGWGIPAIVTGLAVGLDPQGYGNPDFCWLSLQDTLIWSFAGPIGAVIIVSMG
PSWESGRPPTRPHHQRGRKGASSLLRTAFLLLLLISATWLLGLLAVNRDALSFHYLFSIF
SGLQGPFVLLFHCVLNQEVRKHLKGVLGGRKLHLEDSTTTRATLLTRSLNCNTTFGDGPD
MLRTDLGESTASLDSTVRDEGIQKLSVSSGLARGSHGEPDTSLMPRSSKDPPGHDSDSDS
ELSLDEQSSSYASSHSSDSEDDGVGAEEKWDPARGAVHSTPKGDAVANHVPAGWPDQSLA
ESDSEDPSGKPRLKVETKVSVELHRQEQGSHRGEYPPDQESGGAARLASSQPPEQRKGIL
KNKVTYPPPLTLTEQTLKGRLREKLADCEQSPTSSRTSSLGSGGPDCAITVKSPGREPGR
EHLNGVAMNVCTGSAQADGSDS
Identical sequences ENSNLEP00000019142
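
The Comment line in this record packs several identifiers into space-separated `key:value` tokens. The sketch below is one possible way to unpack it in Python, not an official parser. The function names `parse_comment` and `parse_location`, and the field labels `assembly`, `seq_region`, and `strand`, are assumptions made for illustration, based on reading the location token as assembly, sequence region name, start, end, and strand in that order.

```python
# Comment line copied from the record above.
COMMENT = (
    "pep:known_by_projection "
    "supercontig:Nleu1.0:GL397440.1:412152:591773:-1 "
    "gene:ENSNLEG00000015792 transcript:ENSNLET00000020111 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split the comment into a dict, cutting each token at its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Unpack 'assembly:region:start:end:strand' (assumed field order)."""
    assembly, seq_region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "seq_region": seq_region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["supercontig"])

# Length of the genomic interval, treating start and end as inclusive.
span = location["end"] - location["start"] + 1

print(fields["gene"], fields["transcript"], location["seq_region"], span)
```

The lookup key `supercontig` is specific to this record; other records may name a different coordinate system in that position, so a more general parser would need to handle that case.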
