SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000024019 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000024019

Domain  Region           Superfamily                             Superfamily E-value  Family                         Family E-value
1       2700-2864        Concanavalin A-like lectins/glucanases  1.98e-36             Laminin G-like module          0.0000000639
2       2869-3050        Concanavalin A-like lectins/glucanases  5.44e-35             Laminin G-like module          0.0000000307
3       2268-2455        Concanavalin A-like lectins/glucanases  1.21e-30             Laminin G-like module          0.0014
4       2451-2643        Concanavalin A-like lectins/glucanases  5.24e-27             Laminin G-like module          0.021
5       2083-2259        Concanavalin A-like lectins/glucanases  3.45e-24             Laminin G-like module          0.0014
6       815-867          EGF/Laminin                             0.00000000053        Laminin-type module            0.013
7       865-919          EGF/Laminin                             0.000000000795       Laminin-type module            0.0025
8       410-550,723-738  Growth factor receptor domain           0.00000000361        Growth factor receptor domain  0.0075
9       1420-1471        EGF/Laminin                             0.00000000363        Laminin-type module            0.014
10      1477-1529        EGF/Laminin                             0.000000024          Laminin-type module            0.017
11      757-809          EGF/Laminin                             0.0000000335         Laminin-type module            0.0044
12      287-346          EGF/Laminin                             0.000000128          Laminin-type module            0.034
13      918-964          EGF/Laminin                             0.00000019           Laminin-type module            0.0049
14      1060-1108        EGF/Laminin                             0.000000642          Laminin-type module            0.012
15      967-1016         EGF/Laminin                             0.00000184           Laminin-type module            0.011
16      344-416          EGF/Laminin                             0.0000307            Laminin-type module            0.021
17      1527-1567        EGF/Laminin                             0.0000335            Laminin-type module            0.0088
18      1014-1057        EGF/Laminin                             0.0000446            Laminin-type module            0.012
19      89-168           Galactose-binding domain-like           0.0000964            APC10-like                     0.038
 
Weak hits

Sequence:  ENSGGOP00000024019

Domain  Region     Superfamily                                                 Superfamily E-value  Family                                                      Family E-value
-       1800-1905  ADP-ribosylation                                            0.00017              ADP-ribosylating toxins                                     0.029
-       1120-1163  EGF/Laminin                                                 0.000232             Laminin-type module                                         0.018
-       1377-1406  EGF/Laminin                                                 0.00145              Laminin-type module                                         0.076
-       1615-1845  Methyl-accepting chemotaxis protein (MCP) signaling domain  0.00392              Methyl-accepting chemotaxis protein (MCP) signaling domain  0.0035
-       1963-2071  BAR/IMD domain-like                                         0.0241               BAR domain                                                  0.041
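
The Region values listed above are residue ranges on the protein, and one strong hit (410-550,723-738) is split across two segments. A minimal Python sketch of how such strings could be parsed follows. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names are illustrative and not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '410-550,723-738' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def overlap(region_a: str, region_b: str) -> int:
    """Number of residues shared by two regions (summed over segment pairs)."""
    shared = 0
    for a_start, a_end in parse_region(region_a):
        for b_start, b_end in parse_region(region_b):
            shared += max(0, min(a_end, b_end) - max(a_start, b_start) + 1)
    return shared


split_domain_length = region_length("410-550,723-738")   # strong hit 8
adjacent_overlap = overlap("2268-2455", "2451-2643")      # strong hits 3 and 4
```

Under that assumption, neighbouring assignments such as strong hits 3 and 4 share a few boundary residues, so summing domain lengths slightly overstates the covered portion of the sequence.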
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000024019   Gene: ENSGGOG00000004906   Transcript: ENSGGOT00000024034
Sequence length 3053
Comment pep:known_by_projection chromosome:gorGor3.1:6:129416644:130090403:1 gene:ENSGGOG00000004906 transcript:ENSGGOT00000024034 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPGAAGVLLLLLLSGGLGGVQAQRPPQQRQSQAHQQRGLFPAVLNLASNALITTNATCGE
KGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPNQRHPITNAIDGKNTWWQSPSIKNGIE
YHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDVEYKPWQYHAVTDTECLTLY
NIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARY
IRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDP
ATNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDENVA
RRNLSLNIHGKYIGGGVCINCTQNTAGVNCETCIDGFFRPKGVSPNYPRPCQPCHCDPIG
SLNEVCVKDEKHARRGLAPGSCHCKTGFGGVSCDRCARGYTGYPDCKACNCSGLGSKNED
PCFGPCICKENVEGGDCSRCKSGFFNLQEDNWKGCDECFCSGVSNRCQSSYWTYGKIQDM
SGWYLTDLSGRIRVAPQQDDLDSPQQISISKAEARQALPHSYYWSAPAPYLGNKLPAVGG
QLTFTISYDLEEEEEDTEHVLQLMIILEGNDLSISTAQDEVYLHPSEEHTNVLLLKEESF
TLHGTHFPVSRKEFMTVLANLKRVLLQITYSFGMDAIFRLSSVNLESAVSYPTDGSIAAA
VEVCQCPPGYTGSSCESCWPRHRRVNGTIFGGICEPCQCFGHAESCDDVTGECLNCKDHT
GGPYCDKCLPGFYGEPTKGTSEDCQPCACPLNIPSNNFSPTCHLDRSLGLICDGCPVGYT
GPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKPGTTGRYCEL
CADGYFGDAVDAKNCQPCRCNAGGSFSEVCHSQTGQCECRANVQGQRCDKCKAGTFGLQS
ARGCVPCNCNSFGSKSFDCEESGQCWCQPGVTGKKCDRCAHGYFNFQEGGCTACECSHLG
NNCDPKTGRCICPPNTVGEKCSKCAPNTWGHSITTGCKACNCSTVGSLDFQCNVNTGQCN
CHPKFSGAKCTECSQGHWNYPRCNLCDCFLPGTDAATCDSETKKCSCSDQTGQCTCKVNV
EGIHCDRCRPGKFGLDAKNPLGCSSCYCFGTTTQCSEAKGLIRTWVTLKAEQTILPLVDE
ALQHTTTKGIVFQHPEIVAHMDLMREDLHLEPFYWKLPEQFEGKKLMAYGGKLKYAIYFE
AREETGFSTYNPQVIIRGGTPTHARIIIRHMAAPLIGQLTRHEIEMTEKEWKYYGDDPRV
HRTVTREDFLDILYDIHYILIKATYGNFMRQSRISEISMEVAEQGRRTAMTPPADLIEKC
DCPLGYSGLSCEACLPGFYRLRSQPGGRTPGPTLGTCVPCQCNGHSSLCDPETSICQNCQ
HHTAGDFCERCALGYYGIVKGLPNDCQQCACPLISSSNNFSPSCVAEGLDDYRCTACPRG
YEGQYCERCAPGYTGSPGSPGGSCQECECDPYGSLPVPCDPVTGFCTCRPGATGRKCDGC
KHWHAREGWECVFCGDECTGLLLGDLARLEQMVMSINLTGPLPAPYKMLYGLENMTQELK
ATKVTADGEQTGQDAERTNARAKSLGEFIKELARDAEAVNEKAIKLNETLGTRDEAFERN
LEGLQKEIDQMIKELRRKNLETQKEIAEDELVAAEALLKKVKKLFGESRGENEEMEKDLR
EKLADYKNKVDDAWDLLREATDKIREANRLFAVNQKNMTALEKKKEAVESGKRQIENTLK
EGNDILDEANRLADEINSMIDYVEDIQTKLPPMSEELNDKIDDLSQEIKDRKLAEKVSQA
ESHAAQLNDSSAVLDGILDEAKNISFNATAAFKAYSNIKDYIDEAEKVAKEAKDLAHEAT
KLATGPRGLLKEDAKGSLQKSFRILNEAKKLANDVKENDTAAKLQAVKVKARQANDTAKD
VLAQIKELHQNLDGLKKNYNKLADSVAKTNAVVKDPSKNSKIFADADATVKNLEQEADRL
IDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYKPEIKKGSYNN
IVVNVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYW
YRIVASRTGRNGTISVRALDGPKASIVPSTYHSTSPPGYTILDVDANAMLFVGGLTGKLK
KADAVRVITFTGCMGETYFDNKPIGLWNFREKEGDCKGCTVSPQVEDSEGTIQFDGEGYA
LVSRPIRWYPNISTVMFKFRTFSSSALLMYLATRDLRDFMSVELTDGHIKVSYDLGSGMA
SVVSNQNHNDGKWKSFTLSRIQKQANISIVDIDTNQEENIATSSSGNNFGLDLKADDKIY
FGGLPTLRNLRLFNRPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGVTKGCSLENVYTV
SFPKPGFVELSPVPIDVGTEINLSFSTKNESGIILLGSGGTPAPPRRKRRQTGQAYYAIL
LNRGRLEVHLSTGARTMRKIVIRPEPNLFHDGREHSVHVERTRGIFTVQVDENRRYMQNL
TVEQPIEVKKLFVGGAPPEFQPSPLRNIPPFEGCIWNLVINSVPMDFARPVSFKNADIGR
CAHQKLREDEDGAAPAEIVIQPEPVPTPAFPTPTPVLTHGPCAAESEPALLIGSKQFGLS
RNSHIAIAFDDTKVKNRLTIELEVRTEAESGLLFYMARINHADFATVQLRNGLPYFSYDL
GSGDTHTMIPTKINDGQWHKIKIMRSKQEGILYVDGASNRTISPKKADILDVVGMLYVGG
LPINYTTRRIGPVTYSIDGCVRNLHMAEAPADLEQPTSSFHVGTCFANAQRGTYFDGTGF
AKAVGGFKVGLDLLVEFEFRTTRTTGVLLGISSQKMDGMGIEMIDEKLMFHVDNGAGRFT
AVYDAGVPGHLCDGQWHKVTANKIKHHIELTVDGNQVEAQSPNPASTSADTNDPVFVGGF
PDDLKQFGLTTSIPFRGCIRSLKLTKGTGKPLEVNFAKALELRGVQPVSCPAN
Identical sequences ENSGGOP00000024019
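
The Comment line above packs several colon-delimited fields into one string. The sketch below shows one way to split it in Python. The interpretation of the `chromosome` field as assembly, sequence name, start, end and strand follows the usual Ensembl FASTA header layout and is an assumption here, as are the helper names.

```python
COMMENT = (
    "pep:known_by_projection chromosome:gorGor3.1:6:129416644:130090403:1 "
    "gene:ENSGGOG00000004906 transcript:ENSGGOT00000024034 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> dict[str, str]:
    """Map each whitespace-separated token to key -> value, splitting at the first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict[str, object]:
    """Split 'assembly:name:start:end:strand' (assumed field order) into typed parts."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])
# Genomic span of the locus, assuming inclusive coordinates
span = location["end"] - location["start"] + 1
```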
