SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000008113 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSNLEP00000008113
Domain Number 1 Region: 1437-1662
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.63e-41
Family Laminin G-like module 0.00095
Further Details:      
 
Domain Number 2 Region: 539-651
Classification Level Classification E-value
Superfamily Cadherin-like 1.18e-28
Family Cadherin 0.00045
Further Details:      
 
Domain Number 3 Region: 896-1008
Classification Level Classification E-value
Superfamily Cadherin-like 6.57e-28
Family Cadherin 0.00091
Further Details:      
 
Domain Number 4 Region: 322-425
Classification Level Classification E-value
Superfamily Cadherin-like 3e-25
Family Cadherin 0.00094
Further Details:      
 
Domain Number 5 Region: 1002-1102
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-24
Family Cadherin 0.00081
Further Details:      
 
Domain Number 6 Region: 427-537
Classification Level Classification E-value
Superfamily Cadherin-like 2.57e-24
Family Cadherin 0.00096
Further Details:      
 
Domain Number 7 Region: 793-895
Classification Level Classification E-value
Superfamily Cadherin-like 3.57e-24
Family Cadherin 0.0015
Further Details:      
 
Domain Number 8 Region: 1679-1889
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 4.75e-24
Family Laminin G-like module 0.0044
Further Details:      
 
Domain Number 9 Region: 639-711
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000131
Family Cadherin 0.0022
Further Details:      
 
Domain Number 10 Region: 1376-1416
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000205
Family EGF-type module 0.0082
Further Details:      
 
Domain Number 11 Region: 1104-1206
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000585
Family Cadherin 0.01
Further Details:      
 
Domain Number 12 Region: 725-799
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000314
Family Cadherin 0.0048
Further Details:      
 
Domain Number 13 Region: 1923-1969
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000303
Family EGF-type module 0.011
Further Details:      
 
Domain Number 14 Region: 1887-1921
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000173
Family EGF-type module 0.023
Further Details:      
 
Domain Number 15 Region: 2017-2056
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000812
Family Laminin-type module 0.0086
Further Details:      
 
Weak hits

Sequence:  ENSNLEP00000008113
Domain Number - Region: 2045-2119
Classification Level Classification E-value
Superfamily Hormone receptor domain 0.00366
Family Hormone receptor domain 0.0057
Further Details:      
 
Domain Number - Region: 1357-1380
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00749
Family EGF-type module 0.071
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000008113   Gene: ENSNLEG00000006656   Transcript: ENSNLET00000008500
Sequence length 3202
Comment pep:known_by_projection supercontig:Nleu1.0:GL397269.1:36874252:36900680:-1 gene:ENSNLEG00000006656 transcript:ENSNLET00000008500 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MMARRPPWRGLRGRSIPILLLLLLSLFPLSQEELGGGGHQGWDPGLATTTGPRTHIGGGA
LALCPESPGVREDGGPGLGVREPIFVGLRGRRQSARNSRGPPEQPNEELGIEHGIQTLGS
RERETGQGPGSVLYWRPEVSSCGRTGPLQRGSLSPGALSSGVPGSGNNSPLPSDFLVRHH
GPKLVSSQRNAGTGARKRVGTARCCGELWATGSKGQGERATTSGAERTAPRRNCLPRASG
SGPELDSAPRTARTAPASGSAPRESRTAPEPAPKRMRSRGLFRRRFLPQRPGPRPPGLPA
RPEARKITSANRARFRRAANRHPQFPQYNYQTLVPENEAAGTAVLRVVAQDPDAGEAGRL
VYSLAALMNSRSLELFSIDPQSGLIRTAAALDRESMERHYLRVTAQDHGSPRLSATTMVA
VTVADRNDHSPVFEQAQYRETLRENVEEGYPILQLRATDGDAPPNANLRYRFVGPPAARA
AAAAAFEIDPRSGLISTSGRVDREHMESYELVVEASDQGQEPGPRSATVRVHITVLDEND
NAPQFSEKRYVAQVREDVRPHTVVLRVTATDRDKDANGLVHYNIISGNSRGHFAIDSLTG
EIQVVAPLDFEAEREYALRIRAQDAGRPPLSNNTGLASIQVVDINDHIPIFVSTPFQVSV
LENAPLGHSVIHIQAVDADHGENARLEYSLTVAPDTPFVINSTGWVSVSGPDRESCGQRD
RSNRDANSAISYQITGGNTRNRFAISTQGGVGLVTLALPLDYKQERYFKLVLTASDRALH
DHCYVHINITDANTHRPVFQSAHYSVSVNEDRRVGSTIVVISASDDDVGENARITYLLED
NLPQFRIDADSGAITLQAPLDYEDQVTYTLAITARDNGIPQKADTTYVEVMVNDVNDNAP
QFVASHYTGLVSEDAPPFTSVLQISATDRDAHANGRVQYTFQNGEDGDGDFTIEPTSGIV
RTVRRLDREAVSVYELTAYAVDRGVPPLRTPVNIQVMVQDVNDNAPVFPAEEFEVRVKEN
SIVGSVVAQITAVDPDEGPNAHIMYQIVEGNIPELFQMDIFSGELTALIDLDYEARQEYV
IVVQATSAPLVSRATVHVRLVDQNDNSPVLNNFQILFNNYVSNRSDTFPSGIIGRIPAYD
PDVSDHLFYSFERGNELQLLVVNQTSGELRLSRKLDNNRPLVASMLVTVTDGLHSVTAQC
VLRVVIITEELLANSLTVRLENMWQERFLSPLLGRFLEGVAAVLATPAEDVFIFNIQNDT
DVGGTVLNVSFSALAPRRAGAGAAGPWFSSEELQEQLYVRRAALAARSLLDVLPFDDNVC
LREPCENYMKCVSVLRFDSSAPFLASASTLFRPIQPIAGLRCRCPPGFTGDFCETELDLC
YSNPCRNGGACARREGGYTCVCRPRFTGEDCELDTEAGHCVPGVCRNGGTCTDAPNGGFR
CQCPAGGAFEGPRCEVAARSFPPSSFVMFRGLRQRFHLTLSLSFATVQQSGLLFYNGRLN
EKHDFLALELVAGQVRLTYSTGESNTVVSPTVPGGLSDGQWHTVHLRYYNKPRTDALGGA
QGPSKDKVAVLSVDDCDVAVALQFGAEIGNYSCAAAGVQTSSKKSLDLTGPLLLGGVPNL
PENFPVSHDFIGCMRDLHIDGRRVDMAAFVANNGTVAGCQAKLHFCDSGPCKNSGYCSER
WGGFSCDCPVGFGGKDCRLTMAHPHHFRGNGTLSWNFGSDMAVSVPWYLGLAFRTRATQG
VLMQVQAGPHSTLLCQLDRGLLSVTVTRGSGRASHLLLDQVTVSDGRWHDLRLELQEEPG
GRRGYHVLMVSLDFSLFQDTVPVGSELQGLKVKQLHVGGLPPGSAEEAPQGLVGCIQGVW
LGSTPSGSPALLPPSHRVNAEPGCVVTNACASGPCPPHADCRDLWQTFSCTCWPGYYGPG
CVDACLLNPCQNQGSCRHLPGAPHGYTCDCVGGYFGHHCEHRMDQQCPRGWWGSPTCGPC
NCDVHKGFDPNCNKTNGQCHCKEFHYRPRGSDSCLPCDCYPVGSTSRSCAPHSGQCPCRP
GALGRQCNSCDSPFAEVTASGCRVLYDACPKSLRSGVWWPQTKFGVLATVPCPRGALGAA
VRLCDEAQGWLEPDLFNCTSPAFRELSLLLDGLELNKTALDTMEAKKLAQRLREVTGHTD
HYFSQDVRVTARLLAHLLAFESHQQGFGLTATQDAHFNENLLWAGSALLAPETGDLWAAL
GQRAPGGSPGSAGLVRHLEEYAATLARNMELTYLNPMGLVTPNIMLSIDRMEHPSSPRGA
RRYPRYHSNLFRGQDAWDPHTHVLLPSQSPRPSPSEVLPTSSSMENSTTSSVVPPPAPPE
PEPGISIIILLVYRTLGGLLPAQFQAERRGARLPQNPVMNSPVVSVAVFHGRNFLRGILE
SPISLEFRLLQTVNRSKAICVQWDPPGLAEQHGVWTARDCELVHRNGSHAQCRCSRTGTF
GVLMDASPRERLEGDLELLAVFTHVVVAVSVAALLVCTAVAILHYFFLSFAWLFVQGLHL
YMQVEPRNVDRGAMRFYHALGWGVPAVLLGLAVGLDPEGYGNPDFCWISVHEPLIWSFAG
PVVLVIVMNGTMFLLAARTSCSTGQREAKKTSALTLRSSFLLLLLVSASWLFGLLAVNHS
ILAFHYLHAGLCGLQGLAVLLLFCVLNADARAAWTPACLGRKAAPEEARPAPGTGPGAYN
NTALFEESGLIRITLGASTVSSVSSARSGRTQDQDSQRGRSYLRDNVLVRHGSAADHTDH
SLQAHAGPTDLDVAMFHRDAGADSDSDSDLSLEEERSLSIPSSESEDNGRTRGRFQRPLR
RAAQSERLLTHPKDVDGNDLLSYWPALGECEAAPCALQTWGSERRLGLDTSKDAANNNQP
DPALTSGDEASLGRAQCQRKGILKKKEPYPLVPQTRGEEMSWCRAATLGHRAVPAASYGR
IYLGGTGSLSQPASRYSSREQLDLLLRRQLSRERLEEAPAPVLRPLSRPGSQECMDAAPG
RLEPRDRGSTLPRRQPPRDYPGAMAGRFGSRDALDLGAPREWLSRLPPPRRTRDLDPQPP
PLPLSPQRQLSRDPLLPSRPLDSLSRSSNSREQLDQMPSRHPSREALGPPPQLLRAREDP
VSGPSHGPSTEQLDILSSILASFNSSALSSVQSSSTPSGPHTTATPSATASVLGPSTPRS
ATSHSISELSPDSEVPRSEGHS
Download sequence
Identical sequences ENSNLEP00000008113

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]