SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000005153 from Gorilla gorilla 69_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000005153
Domain Number 1 Region: 1319-1546
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.02e-39
Family Laminin G-like module 0.0028
 
Domain Number 2 Region: 256-385
Classification Level Classification E-value
Superfamily Cadherin-like 1.7e-31
Family Cadherin 0.0012
 
Domain Number 3 Region: 372-484
Classification Level Classification E-value
Superfamily Cadherin-like 5.14e-31
Family Cadherin 0.00084
 
Domain Number 4 Region: 788-900
Classification Level Classification E-value
Superfamily Cadherin-like 1.34e-28
Family Cadherin 0.00069
 
Domain Number 5 Region: 477-582
Classification Level Classification E-value
Superfamily Cadherin-like 6.85e-28
Family Cadherin 0.001
 
Domain Number 6 Region: 1569-1762
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.44e-27
Family Laminin G-like module 0.028
 
Domain Number 7 Region: 888-1003
Classification Level Classification E-value
Superfamily Cadherin-like 6.71e-26
Family Cadherin 0.0013
 
Domain Number 8 Region: 157-267
Classification Level Classification E-value
Superfamily Cadherin-like 6.81e-26
Family Cadherin 0.0017
 
Domain Number 9 Region: 679-783
Classification Level Classification E-value
Superfamily Cadherin-like 1.83e-25
Family Cadherin 0.0019
 
Domain Number 10 Region: 582-683
Classification Level Classification E-value
Superfamily Cadherin-like 1.44e-21
Family Cadherin 0.0015
 
Domain Number 11 Region: 996-1098
Classification Level Classification E-value
Superfamily Cadherin-like 4.14e-10
Family Cadherin 0.0077
 
Domain Number 12 Region: 1246-1336,1533-1582
Classification Level Classification E-value
Superfamily Growth factor receptor domain 3.3e-08
Family Growth factor receptor domain 0.016
 
Domain Number 13 Region: 1895-1935
Classification Level Classification E-value
Superfamily EGF/Laminin 3.85e-07
Family Laminin-type module 0.0094
 
Domain Number 14 Region: 1801-1847
Classification Level Classification E-value
Superfamily EGF/Laminin 1.73e-06
Family EGF-type module 0.013
 
Domain Number 15 Region: 1766-1800
Classification Level Classification E-value
Superfamily EGF/Laminin 5.36e-06
Family EGF-type module 0.011
 
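The strong hits above are listed by ascending superfamily E-value, not by position in the protein. As a rough illustration, the sketch below re-sorts them from N-terminus to C-terminus and counts domains per superfamily. The region strings and superfamily names are copied from the listing; the variable and function names (STRONG_HITS, parse_region, first_start) are made up for this example and are not part of any SUPERFAMILY tool.

```python
from collections import Counter

# (domain number, region string, superfamily) copied from the strong hits listing.
STRONG_HITS = [
    (1, "1319-1546", "Concanavalin A-like lectins/glucanases"),
    (2, "256-385", "Cadherin-like"),
    (3, "372-484", "Cadherin-like"),
    (4, "788-900", "Cadherin-like"),
    (5, "477-582", "Cadherin-like"),
    (6, "1569-1762", "Concanavalin A-like lectins/glucanases"),
    (7, "888-1003", "Cadherin-like"),
    (8, "157-267", "Cadherin-like"),
    (9, "679-783", "Cadherin-like"),
    (10, "582-683", "Cadherin-like"),
    (11, "996-1098", "Cadherin-like"),
    (12, "1246-1336,1533-1582", "Growth factor receptor domain"),
    (13, "1895-1935", "EGF/Laminin"),
    (14, "1801-1847", "EGF/Laminin"),
    (15, "1766-1800", "EGF/Laminin"),
]


def parse_region(region):
    """Turn '1246-1336,1533-1582' into [(1246, 1336), (1533, 1582)]."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def first_start(hit):
    """Start coordinate of the first segment of a hit."""
    return parse_region(hit[1])[0][0]


# Domains in sequence order (N-terminus to C-terminus).
ordered = sorted(STRONG_HITS, key=first_start)
order = [number for number, _region, _superfamily in ordered]

# Number of strong-hit domains assigned to each superfamily.
superfamily_counts = Counter(superfamily for _n, _r, superfamily in STRONG_HITS)

for number, region, superfamily in ordered:
    print(f"{region:>20}  domain {number:>2}  {superfamily}")
```

Read in sequence order, the assignments form a run of Cadherin-like domains, followed by the Growth factor receptor, Concanavalin A-like and EGF/Laminin domains. Domain 12 is discontinuous: its two segments flank domain 1.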
Weak hits

Sequence:  ENSGGOP00000005153
Domain Number - Region: 2393-2610
Classification Level Classification E-value
Superfamily Family A G protein-coupled receptor-like 0.0687
Family Rhodopsin-like 0.015
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000005153   Gene: ENSGGOG00000005252   Transcript: ENSGGOT00000005287
Sequence length 2871
Comment pep:novel chromosome:gorGor3.1:1:112086470:112113188:1 gene:ENSGGOG00000005252 transcript:ENSGGOT00000005287 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LLLLPPPLLGDQVGPCRSLGSRGRGSSGACAPMGWLCPSSASNLWLYTSRCRDAGTELTG
HLVPHHDGLRVWCPESGAHIPLPPAPEGCPWSCRLLGIGGHLSPQGKLTLPQEHPCLKAP
RLRCQSCKLAQAPGLRAGEGSPEESLGGRRKRNVNTAPQFQPPSYQATVPENQPAGTPVA
SLRAIDPDEGEAGRLEYTMDALFDSRSNQFFSLDPITGAVTTAEELDRETKSTHVFRVTA
QDHGMPRRSALATLTILVTDTNDHDPVFEQQEYKESLRENLEVGYEVLTVRATDGDAPPN
ANILYRLLEGSGGSPSEVFEIDPRSGVIRTRGPVDREEVESYQLTVEASDQGRDPGPRST
TAAVFLSVEDDNDNAPQFSEKRYVVQVREDVTPGSPVLRVTASDRDKGSNALVHYSIMSG
NARGQFYLDAQTGALDVVSPLDYETTKEYTLRVRAQDGGRPPLSNVSGLVTVQVLDINDN
APIFVSTPFQATVLESVPLGYLVLHVQAIDADAGDNARLEYRLAGVGHDFPFTINNGTGW
ISVAAELDREEVDFYSFGVEARDHGTPALTASASVSVTVLDVNDNNPTFTQPEYTVRLNE
DAAVGTSVVTVSAVDRDAHSVITYQITSGNTRNRFSITSQSGGGLVSLALPLDYKLERQY
VLAVTASDGTRQDTAQIVVNVTDANTHRPVFQSSHYTVNVNEDRPAGTTVVLISATDEDT
GENARITYFMEDSIPQFRIDADTGAVTTQAELDYEDQVSYTLAITARDNGIPQKSDTTYL
EILVNDVNDNAPQFLRDSYQGSVYEDVPPFTSVLQISATDRDSGLNGRVFYTFQGGDDGD
GDFIVESTSGIVRTLRRLDRENVAQYVLRAYAVDKGMPPARTPMEVTVTVLDVNDNPPVF
EQDEFDVFVEENSPIGLAVARVTATDPDEGTNAQIMYQIVEGNIPEVFQLDIFSGELTAL
VDLDYEDRPEYVLVIQATSAPLVSRATVHVRLLDRNDNPPVLGNFEILFNNYVTNRSSSF
PGGAIGRVPAHDPDISDSLTYSFERGNELSLVLLNASTGELKLSRALDNNRPLEAIMSVL
VSDGVHSVTAQCALRVTIITDEMLTHSITLRLEDMSPERFLSPLLGLFIQAVAATLATPP
DHVVVFNVQRDTDAPGGHILNVSLSVGQPFLPSEDLQERLYLNRSLLTAISAQRVLPFDD
NICLREPCENYMRCVSVLRFDSSAPFIASSSVLFRPIHPVGGLRCRCPPGFTGDYCETEV
DLCYSRPCGPHGRCRSREGGYTCLCRDGYTGEHCEVSARSGRCTPGVCKNGGTCVNLLVG
GFKCDCPSGDFEKPYCQVTTRSFPAHSFITFRGLRQRFHFTLALSFATKERDGLLLYNGR
FNEKHDFVALEVIQEQVQLTFSAGESTTTVSPFVPGGVSDGQWHTVQLKYYNKPLLGQTG
LPQGPSEQKVAVVTVDGCDTGVALRFGSVLGNYSCAAQGTQGGSKKSLDLTGPLLLGGVP
DLPESFPVRMRQFVGCMRNLQVDSRHIDMADFIANNGTVPGCPAKKNVCDSNTCHNGGTC
VNQWDAFSCECPLGFGGKSCAQEMANPQHFLGSSLVAWHGLSLPISQPWHLSLMFRTRQA
DGVLLQAITRGRSTITLQLREGHVMLSVEGTGLQASSLRLEPGRANDGDWHHAQLALGAS
GGPGHAILSFDYGQQRAEGNLGPRLHGLHLSNITVGGIPGPAGGVARGFRGCLQGVRVSD
TLEGVNSLDPSHGESINVEQGCSLPDPCDSNPCPANSYCSNDWDSYSCSCDPGYYGDNCT
NVCDLNPCEHQSVCTRKPSAPHGYTCECPPNYLGPYCETRIDQPCPRGWWGHPTCGPCNC
DVSKGFDPDCNKTSGECHCKENHYRPPGSPTCLLCDCYPTGSLSRVCDPEDGQCPCKPGV
IGRQCDRCDNPFAEVTTNGCEVNYDSCPRAIEAGIWWPRTRFGLPAAAPCPKGSFGTAVR
HCDEHRGWLPPNLFNCTSITFSELKGFAERLQRNESGLDSGRSQQLALLLRNATQHTAGY
FGSDVKVAYQLATRLLAHESAQRGFGLSATQDVHFTENLLRVGSALLDAANKRHWELIQQ
TEGGTAWLLHHYEAYASALAQNMRHTYLSPFTIVTPNIVISVVRLDKGNFAGAKLPRYEA
LRGEQPPDLETTVILPESVFRETPPVVRPAGPGEAQEPEELARRQRRHPELSQGEAVASV
IIYRTLAGLLPHNYDPDKRSLRVPKRPIINTPVVSISVHDDEELLPRALDKPVTVQFRLL
ETEERTKPICVFWNHSILVSGTGGWSARGCEVVFRNESHVSCQCNHMTSFAVLMDVSRRE
NGEILPLKTLTYVALGVTLAALLLTFFFLTLLRILRSNQHGIRRNLTAALGLAQLVFLLG
INQADLPFACTVIAILLHFLYLCTFSWALLEALHLYRALTEVRDVNTGPMRFYYMLGWGV
PAFITGLAVGLDPEGYGNPDFCWLSIYDTLIWSFAGPVAFAVSMSVFLYILAARASCAAQ
RQGFEKKGPVSGLQPSFAVLLLLSATWLLALLSVNSDTLLFHYLFATCNCIQGPFIFLSY
VVLSKEVRKALKLACSRKPSPDPALTTKSTLTSSYNCPSPYADGRLYQPYGDSAGSLHST
SRSGKSQPSYIPFLLREESTLNPGQGPPGLGDPGSLFLEGQDQQHDPDTDSDSDLSLEDD
QSGSYASTHSSDAAFPGEQGWDSLLGPGAERLPLHSTPKDGGPGPGKAPWPGDFGTTAKE
SSGNGAPEERLRENGDALSREGSLGPLPGSSAQPHKGILKKKCLPTISEKSSLLRLPLEQ
CTGSSRGSSASEGQSLQEQLNGVMPIAMSIKAGTVDEDSSGSEFLFFNFLH
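
The Comment line above packs several fields into space-separated key:value tokens, with the chromosome token carrying further colon-separated subfields. The sketch below splits it apart. It assumes the subfields are assembly, chromosome name, start, end and strand, in that order; this is inferred from the values and not stated on this page. The function names are hypothetical.

```python
# Comment string copied from the record above.
COMMENT = (
    "pep:novel chromosome:gorGor3.1:1:112086470:112113188:1 "
    "gene:ENSGGOG00000005252 transcript:ENSGGOT00000005287 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split 'key:value' tokens; only the first colon separates key from value."""
    fields = {}
    for token in comment.split():
        key, _sep, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(location):
    """Split the chromosome value into its parts (field order is an assumption)."""
    assembly, name, start, end, strand = location.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])

# Genomic span covered by the transcript, counting both end coordinates.
span = location["end"] - location["start"] + 1

print(fields["gene"], fields["transcript"], location, span)
```

Under that assumption, the transcript lies on chromosome 1 of assembly gorGor3.1 and spans 26,719 bases.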
Identical sequences ENSGGOP00000005153
