SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000004286 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000004286
Domain Number 1 Region: 2535-2821
Classification Level Classification E-value
Superfamily BEACH domain 5.1e-122
Family BEACH domain 0.0000000462
Further Details:      
 
Domain Number 2 Region: 2914-3101,3145-3182
Classification Level Classification E-value
Superfamily WD40 repeat-like 1.41e-30
Family WD40-repeat 0.001
Further Details:      
 
Domain Number 3 Region: 2388-2513
Classification Level Classification E-value
Superfamily PH domain-like 1.48e-19
Family PreBEACH PH-like domain 0.015
Further Details:      
 
Domain Number 4 Region: 472-660,850-923
Classification Level Classification E-value
Superfamily ARM repeat 0.0000000000108
Family Armadillo repeat 0.037
Further Details:      
 
Domain Number 5 Region: 53-125,170-281,332-614,827-915
Classification Level Classification E-value
Superfamily ARM repeat 0.000000537
Family PBS lyase HEAT-like repeat 0.092
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000004286
Domain Number - Region: 1068-1225
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0272
Family Glycosyl hydrolases family 16 0.061
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000004286   Gene: ENSGGOG00000004358   Transcript: ENSGGOT00000004391
Sequence length 3184
Comment pep:known_by_projection chromosome:gorGor3.1:10:60459508:60765099:1 gene:ENSGGOG00000004358 transcript:ENSGGOT00000004391 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MEAEDLSKAEDRNEDPGSKNEGQLAAVQPDVPHGGQSSSPTALWDMLERKFLEYQQLTHK
SPIERQKSLLSLLPLFLKAWEHSVGIICFPSLQRLAEDVSDQLAQQLQKALVGKPAEQAR
LAAGQLLWWKGDMDQDGYLLLKSVYVLTGTDSETLGRVAESGLPALLLQCLYLFFVFPLD
KDELFESDLQVQKMFVQMLLNICSDSQGLEGLLSGSELQSLLIATTCLREHSCCFWKEPT
FCVLRAISKAQNLSIIQYLQATDCVRLSLQNLSRLADTLPAPEVSEAVSLILGFVKDSYP
VSSALFLEFENSEGYPLLLKVLLRYDGLTQSEVDPHLEELLGLVVWLTTCGRSELKVFDS
ITYPQLEGFKFHHEASGVTVKNLQAFQVLQNVFHKASDSVLCIQVLSAIRTMWAWNARNF
FLLEWTLQPISQFVEIMPLKPAPVQEHFFQLLEALVFELHYVPHEILRKVQHLIKESPGP
SCTLMALQSILSIAGGDPLFTDIFRDSGLLGLLLAQLRKQAKIMRKSGNKVSTPGVQDPE
RELTCVMLRTVVTLLKGSVRNAVVLKDHGMVPFIKIFLDDECYREASLSILEQLSAINAE
EYMSIIVGALCSSTQGELQLKLDLLKSLLRILVTPKGRAAFRVSSGFNGLLSLLSDLEGS
LQEPPLQAWGAVSPRQTLELVLYTLCAVSAALHWDPVNGYFFRRNGLFEKLAEDLCLLGC
FGALEEEGNLLRSWVDTKARPFADLLGTAFSSSGSLPPRIQSCLQILGFLDSMASGTLHL
RGDLKESLRTKQGPVVDVQKGETGSGPQRNFKQWPDLEERMDEGDAAIMHPGVVCIMVRL
LPRLYHEDHPQLSEEIQCSLASHIQSLVKSEKNRQVMCEAGMLGTLMASCHRALVTSGSP
LHSRLIRIFEKLASQAIEPDVLRQFLGLGIASSLSATTKILDSSHAHRGNPGCSGSQTAQ
GSAEGPWPAAPDAGLHPGVTQAPQPLGESQDSTTALQTALSLISMTSPRNLQPQRAALAP
SFVEFDMSVEGYGCLFIPTLSTVMGTSTEYSVSGGIGTGATRPFPPPGGLTFSCWFLISR
HGAATEGHPLRFLTLVRHLARTEQPFVCFSVSLCPDDLSLVVSTEEKEFQPLDVMEPEDD
SEPSAGCQLQVRCGQLLACGQWHHLAVVVTKEMKRHCTVSTCLDGQVIGSAKMLYIQALP
GPFLSMDPSAFVDVYGYIATPRVWKQKSSLIWRLGPTYLFEEAISMETLEVINKLGPRYC
GNFQAVHGQGEDLDSEATPFVAEERVSFGLHIASSSITSVVDIRNAYNEVDSRLIAKEMN
ISSRDNAMPVFLLRNCAGHLSGSLRTIGAVAVGQLGVRVFHSSPAASSLDFIGGPAILLG
LISLATDDHTMYAAVKVLHSVLTSNAMCDFLMQHICGYQIMAFLLRKKASLLNHRIFQLI
LSVAGTVELGFRSSAITNTGIFQHILCNFELWMNTADNLELSLFSHLLEILQSPREGPRN
AKAAHQAQLIPKLIFLFNEPSLIPSKIPTIIGILACQLRGHFSTQDLLRIGLFVVYTLKP
SSVNERQICMDGALDPSLPAGSQTSGKTIWLRNQLLEMLLSVISSPQLHLSSESKEEMFL
KLGPDWFLLLLQGHLHASTTVLALKLLLHFLASPSLRTRFRDGLCAGSWVERSTEGVDIV
MDNLKSQSPLPEQSPCLLPGFRVLNDFLAHHVHIPEVYLIVSTFFLQTPLTELTDGPKDS
LDAMLQWLLQRHHQEEVLQAGLCTEGALLLLEMLKATMSQPLAGSEDGAWAQTFPASVLQ
FLSLVHRTYPQDPAWRAPEFLQTLAIAAFPLGAQKGAGAESTRNTSSLEAAAEGDSTVEG
LQAPTKAHPARRKLREFTQLLLRELLLGASSPKQWLPLEVLLEASPDHATSQQKRDFQSE
VLLSAMELFHMTSGGDAAMFRDGKEPQPSAEAAAAPSLANISCFTQKLVEKLYSGMFSAD
PRHILLFILEHIMVVIETASSQRDTVLSTLYSSLNKVILYCLSKPQQSLSECLSLLSILG
FLQEHWDVVFATYNSNVSFLLCLMHCLLLLNERSYPEGFGLEPKPRMSTYHQVFLSPNED
VKEKREDLPSLSDVQHNIQKTVQTLWQQLVAQRQQTLEDAFKIDLSVKPGEREVKIEEVT
PLWEETMLKAWQHYLASEKKSLASRSNVAHHSKVTLWSGSLSSAMKLMPGRQAKDPECKT
EDFVSCIENYRRRGQELYASLYKDHVQRRKCGNIKAANAWARIQEQLFGELGLWSQGEET
KPCSPWELDWREGPARMRKRIKRLSPLEALSSGRHKESQDKNDHISQTNAENQDELTPRE
AEGELDEVGVDCTQLTFFPALHESLHSEDFLELCRERQVILQELLDKEKVTQKFSLVIVQ
GHLVSEGVLLFGHQHFYICENFTLSPTGDVYCTRHCLSNISDPFIFNLCSKDRSTDHYSC
QCHSYADMRELRQARFLLQDIALEIFFHNGYSKFLVFYNNDRSKAFKSFCSFQPSLKGKA
ASEDPLNLRRYPGSDRTMLQKWQKRDISNFEYLMYLNTAAGRTCNDYMQYPVFPWVLADY
TSETLNLANPKTFRDLSKPMGAQTKERKLKFIQRFKEVEKAEGDMTVQCHYYTHYSSAII
VASYLVRMPPFTQAFCALQGGSFDVADRMFHSVKSTWESASRENMSDVRELTPEFFYLPE
FLTNCNGVEFGCMQDGTVLGDVQLPPWADGDPRKFISLHRKALESDFVSANLHHWIDLIF
GYKQQGPAAVDAVNIFHPYFYGDRMDLSSITDPLIKSTILGFVSNFGQVPKQLFTKPHPA
RTAAGKPLPGKDVSTPVSLPGHPQPFFYSLQSLRPSQVTVKDMYLFSLGSESPKGAIGHI
VPTEKTILAVERNKVLLPPLWNRTFSWGFDDFSCCLGSYGSDKVLMTFENLAAWGRCLCA
VCPSPTMIVTSGTSTVVCVWELSMTKGRPRGLRLQQALYGHTQAVTCLAASVTFSLLVSG
SQDCTCILWDLDHLTHVTRLPTHREGISAITISDVSGTIVSCAGAHLSLWNVNGQPLASI
TTAWGPEGAITCCCLMEGPAWDASQIIITGSQDGMVRVWKTEDVKMSVPGQPAGEEPPAQ
PPSPRGHKWEKNLALSRELDVSIALTGKPSKTSPAVTALAVSRNQTKLLVGDERGRIFCW
SADG
Download sequence
Identical sequences ENSGGOP00000004286 ENSGGOP00000004286

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]