SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000024019 from Gorilla gorilla 69_3.1


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000024019
Domain Number 1 Region: 2736-2900
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.98e-36
Family Laminin G-like module 0.0000000639
 
Domain Number 2 Region: 2895-3058
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.86e-32
Family Laminin G-like module 0.000000301
 
Domain Number 3 Region: 2304-2491
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.77e-31
Family Laminin G-like module 0.0014
 
Domain Number 4 Region: 2487-2679
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.24e-27
Family Laminin G-like module 0.021
 
Domain Number 5 Region: 2119-2295
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.45e-24
Family Laminin G-like module 0.0014
 
Domain Number 6 Region: 778-830
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000053
Family Laminin-type module 0.013
 
Domain Number 7 Region: 828-882
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000795
Family Laminin-type module 0.0025
 
Domain Number 8 Region: 1385-1436
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000363
Family Laminin-type module 0.014
 
Domain Number 9 Region: 373-513,686-702
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.0000000138
Family Growth factor receptor domain 0.007
 
Domain Number 10 Region: 1442-1494
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000024
Family Laminin-type module 0.017
 
Domain Number 11 Region: 720-772
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000335
Family Laminin-type module 0.0044
 
Domain Number 12 Region: 250-309
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000128
Family Laminin-type module 0.034
 
Domain Number 13 Region: 881-927
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000019
Family Laminin-type module 0.0049
 
Domain Number 14 Region: 1025-1073
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000642
Family Laminin-type module 0.012
 
Domain Number 15 Region: 930-970
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000753
Family Laminin-type module 0.011
 
Domain Number 16 Region: 979-1022
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000181
Family Laminin-type module 0.012
 
Domain Number 17 Region: 307-379
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000307
Family Laminin-type module 0.021
 
Domain Number 18 Region: 1492-1532
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000335
Family Laminin-type module 0.0088
 
Domain Number 19 Region: 52-131
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0000964
Family APC10-like 0.038
 
Weak hits

Sequence:  ENSGGOP00000024019
Domain Number - Region: 1798-1903
Classification Level Classification E-value
Superfamily ADP-ribosylation 0.000173
Family ADP-ribosylating toxins 0.029
 
Domain Number - Region: 1085-1128
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000232
Family Laminin-type module 0.018
 
Domain Number - Region: 1342-1371
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00148
Family Laminin-type module 0.076
 
Domain Number - Region: 1999-2092
Classification Level Classification E-value
Superfamily Tropomyosin 0.085
Family Tropomyosin 0.015
 
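The Region fields in the hit lists above appear to be 1-based, inclusive residue ranges, with commas separating the segments of a discontinuous domain (for example 373-513,686-702 for domain 9). As a minimal sketch under that assumption, the following Python parses a region string and counts the residues it covers. The function names parse_region and region_length are illustrative only and are not part of any SUPERFAMILY tool.

```python
from __future__ import annotations


def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '373-513,686-702' into (start, end) pairs.

    Assumes 1-based, inclusive coordinates and comma-separated segments.
    """
    segments = []
    for part in region.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"segment start exceeds end: {part!r}")
        segments.append((start, end))
    return segments


def region_length(region: str) -> int:
    """Count the residues covered by all segments of a region string."""
    return sum(end - start + 1 for start, end in parse_region(region))


if __name__ == "__main__":
    # Region strings copied from the hit lists above.
    for region in ("2736-2900", "373-513,686-702", "1999-2092"):
        print(region, parse_region(region), region_length(region))
```

Under this reading, domain 9 covers 141 + 17 = 158 residues across its two segments, and domain 1 (2736-2900) covers 165.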

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Protein sequence

External link(s) Protein: ENSGGOP00000024019   Gene: ENSGGOG00000004906   Transcript: ENSGGOT00000024034
Sequence length 3058
Comment pep:novel chromosome:gorGor3.1:6:129416644:130090403:1 gene:ENSGGOG00000004906 transcript:ENSGGOT00000024034 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
GLFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPNQRH
PITNAIDGKNTWWQSPSIKNGIEYHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERS
LDDVEYKPWQYHAVTDTECLTLYNIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHIS
LINGRPSADDPSPELLEFTSARYIRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYS
VKDISVGGMCICYGHARACPLDPATNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLT
KTECEACNCHGKAEECYYDENVARRNLSLNIHGKYIGGGVCINCTQNTAGVNCETCIDGF
FRPKGVSPNYPRPCQPCHCDPIGSLNEVCVKDEKHARRGLAPGSCHCKTGFGGVSCDRCA
RGYTGYPDCKACNCSGLGSKNEDPCFGPCICKENVEGGDCSRCKSGFFNLQEDNWKGCDE
CFCSGVSNRCQSSYWTYGKIQDMSGWYLTDLSGRIRVAPQQDDLDSPQQISISKAEARQA
LPHSYYWSAPAPYLGNKLPAVGGQLTFTISYDLEEEEEDTEHVLQLMIILEGNDLSISTA
QDEVYLHPSEEHTNVLLLKEESFTLHGTHFPVSRKEFMTVLANLKRVLLQITYSFGMDAI
FRLSSVNLESAVSYPTDGSIAAAVEVCQCPPGYTGSSCESCWPRHRRVNGTIFGGICEPC
QCFGHAESCDDVTGECLNCKDHTGGPYCDKCLPGFYGEPTKGTSEDCQPCACPLNIPSNN
FSPTCHLDRSLGLICDGCPVGYTGPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPG
SCDSLSGSCLICKPGTTGRYCELCADGYFGDAVDAKNCQPCRCNAGGSFSEVCHSQTGQC
ECRANVQGQRCDKCKAGTFGLQSARGCVPCNCNSFGSKSFDCEESGQCWCQPGVTGKKCD
RCAHGYFNFQEGGCTGEACECSHLGNNCDPKTGRCICPPNTVGEKCSKCAPNTWGHSITT
GCKACNCSTVGSLDFQCNVNTGQCNCHPKFSGAKCTECSQGHWNYPRCNLCDCFLPGTDA
ATCDSETKKCSCSDQTGQCTCKVNVEGIHCDRCRPGKFGLDAKNPLGCSSCYCFGTTTQC
SEAKGLIRTWVTLKAEQTILPLVDEALQHTTTKGIVFQHPEIVAHMDLMREDLHLEPFYW
KLPEQFEGKKLMAYGGKLKYAIYFEAREETGFSTYNPQVIIRGGTPTHARIIIRHMAAPL
IGQLTRHEIEMTEKEWKYYGDDPRVHRTVTREDFLDILYDIHYILIKATYGNFMRQSRIS
EISMEVAEQGRRTAMTPPADLIEKCDCPLGYSGLSCEACLPGFYRLRSQPGGRTPGPTLG
TCVPCQCNGHSSLCDPETSICQNCQHHTAGDFCERCALGYYGIVKGLPNDCQQCACPLIS
SSNNFSPSCVAEGLDDYRCTACPRGYEGQYCERCAPGYTGSPGSPGGSCQECECDPYGSL
PVPCDPVTGFCTCRPGATGRKCDGCKHWHAREGWECVFCGDECTGLLLGDLARLEQMVMS
INLTGPLPAPYKMLYGLENMTQELKHLLSPQRAPERLIQLAEGNLNTLVTEMNELLTRAT
KVTADGEQTGQDAERTNARAKSLGEFIKELARDAEAVNEKAIKLNETLGTRDEAFERNLE
GLQKEIDQMIKELRRKNLETQKEIAEDELVAAEALLKKVKKLFGESRGENEEMEKDLREK
LADYKNKVDDAWDLLREATDKIREANRLFAVNQKNMTALEKKKEAVESGKRQIENTLKEG
NDILDEANRLADEINSMIDYVEDIQTKLPPMSEELNDKIDDLSQEIKDRKLAEKVSQAES
HAAQLNDSSAVLDGILDEAKNISFNATAAFKAYSNIKDYIDEAEKVAKEAKDLAHEATKL
ATGPRGLLKEDAKGSLQKSFRILNEAKKLANDVKGQNHNDVRSLAWNLTTDTLNNNGTIQ
NLLTQKDNLSSVYNINTAAKLQAVKVKARQANDTAKDVLAQIKELHQNLDGLKKNYNKLA
DSVAKTNAVVKDPSKNKIIADADATVKNLEQEADRLIDKLKPIKELEDNLKKNISEIKEL
INQARKQANSIKVSVSSGGDCIRTYKPEIKKGSYNNIVVNVKTAVADNLLFYLGSAKFID
FLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYWYRIVASRTGRNGTISVRALDGPKA
SIVPSTYHSTSPPGYTILDVDANAMLFVGGLTGKLKKADAVRVITFTGCMGETYFDNKPI
GLWNFREKEGDCKGCTVSPQVEDSEGTIQFDGEGYALVSRPIRWYPNISTVMFKFRTFSS
SALLMYLATRDLRDFMSVELTDGHIKVSYDLGSGMASVVSNQNHNDGKWKSFTLSRIQKQ
ANISIVDIDTNQEENIATSSSGNNFGLDLKADDKIYFGGLPTLRNLSMKARPEVNLKKYS
GCLKDIEISRTPYNILSSPDYVGVTKGCSLENVYTVSFPKPGFVELSPVPIDVGTEINLS
FSTKNESGIILLGSGGTPAPPRRKRRQTGQAYYAILLNRGRLEVHLSTGARTMRKIVIRP
EPNLFHDGREHSVHVERTRGIFTVQVDENRRYMQNLTVEQPIEVKKLFVGGAPPEFQPSP
LRNIPPFEGCIWNLVINSVPMDFARPVSFKNADIGRCAHQKLREDEDGAAPAEIVIQPEP
VPTPAFPTPTPVLTHGPCAAESEPALLIGSKQFGLSRNSHIAIAFDDTKVKNRLTIELEV
RTEAESGLLFYMARINHADFATVQLRNGLPYFSYDLGSGDTHTMIPTKINDGQWHKIKIM
RSKQEGILYVDGASNRTISPKKADILDVVGMLYVGGLPINYTTRRIGPVTYSIDGCVRNL
HMAEAPADLEQPTSSFHVGTCFANAQRGTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRT
TGVLLGISSQKMDGMGIEMIDEKLMFHVDNGAGRFTAVYDAGVPGHLCDGQWHKVTANKI
KHHIELTVDGNQVEAQSPNPASTSADTNDPVFVGGFPDDLKQFGLTTSIPFRGCIRSL
Identical sequences ENSGGOP00000024019 ENSGGOP00000004819
