SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000014539 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSGGOP00000014539

Domain Number 1  Region: 213-320
Classification Level  Classification              E-value
Superfamily           Anthrax protective antigen  1.83e-19
Family                Anthrax protective antigen  0.012

Domain Number 2  Region: 1671-1751
Classification Level  Classification                            E-value
Superfamily           E set domains                             3.55e-17
Family                E-set domains of sugar-utilizing enzymes  0.021

Domain Number 3  Region: 1932-2019
Classification Level  Classification         E-value
Superfamily           E set domains          1.23e-16
Family                Other IPT/TIG domains  0.042

Domain Number 4  Region: 1755-1837
Classification Level  Classification         E-value
Superfamily           E set domains          0.0000000000000056
Family                Other IPT/TIG domains  0.017

Domain Number 5  Region: 998-1075
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.000000000000016
Family                E-set domains of sugar-utilizing enzymes  0.012

Domain Number 6  Region: 1080-1160
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.0000000000000318
Family                E-set domains of sugar-utilizing enzymes  0.037

Domain Number 7  Region: 1841-1926
Classification Level  Classification                                                  E-value
Superfamily           E set domains                                                   0.00000000000034
Family                NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain  0.029

Domain Number 8  Region: 1502-1584
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.00000000000331
Family                E-set domains of sugar-utilizing enzymes  0.026

Domain Number 9  Region: 3096-3217,3246-3387
Classification Level  Classification     E-value
Superfamily           Pectin lyase-like  0.000000000016
Family                Galacturonase      0.077

Domain Number 10  Region: 907-985
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.0000000000268
Family                E-set domains of sugar-utilizing enzymes  0.02

Domain Number 11  Region: 112-174
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.0000000000467
Family                E-set domains of sugar-utilizing enzymes  0.043

Domain Number 12  Region: 1404-1482
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.000000000182
Family                E-set domains of sugar-utilizing enzymes  0.038

Domain Number 13  Region: 1589-1664
Classification Level  Classification         E-value
Superfamily           E set domains          0.00000000038
Family                Other IPT/TIG domains  0.049

Domain Number 14  Region: 1172-1229
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.00000000344
Family                E-set domains of sugar-utilizing enzymes  0.02

Domain Number 15  Region: 1247-1343
Classification Level  Classification            E-value
Superfamily           Cupredoxins               0.00000234
Family                Plastocyanin/azurin-like  0.063
 
Weak hits

Sequence: ENSGGOP00000014539

Domain Number -  Region: 2151-2250,2334-2373,2408-2523
Classification Level  Classification     E-value
Superfamily           Pectin lyase-like  0.0022
Family                Chondroitinase B   0.088

Domain Number -  Region: 1-82
Classification Level  Classification                            E-value
Superfamily           E set domains                             0.0747
Family                E-set domains of sugar-utilizing enzymes  0.074
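
The region and E-value columns above mix two notations: some regions are single ranges while others list several comma-separated segments, and some E-values are in scientific notation while others are plain decimals. The following is a minimal Python sketch for reading both. It assumes that region coordinates are 1-based and inclusive, which this page does not state, and the function names (parse_region, region_length, format_evalue) are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '3096-3217,3246-3387' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, assuming 1-based inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in parse_region(region))


def format_evalue(text: str) -> str:
    """Render an E-value with three significant digits, whatever notation it arrived in."""
    return f"{float(text):.3g}"


if __name__ == "__main__":
    # Domain 9 is reported as two discontinuous segments.
    print(parse_region("3096-3217,3246-3387"))
    print(region_length("3096-3217,3246-3387"))
    # Domain 15 is reported as a plain decimal.
    print(format_evalue("0.00000234"))
```

Summing per-segment lengths, rather than taking the last end minus the first start, keeps the gap between discontinuous segments out of the domain length.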
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000014539   Gene: ENSGGOG00000014869   Transcript: ENSGGOT00000014952
Sequence length 4085
Comment pep:known_by_projection chromosome:gorGor3.1:8:108733971:108877420:1 gene:ENSGGOG00000014869 transcript:ENSGGOT00000014952 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
TLITIQGRIFTDVYGSNIALSSNGKNVRILRVYVGGMPCDLLIPQSDSLYGLKLDHPNGD
MGSMVCKTTGTFIGHHNVSFILDNDYGRSFPQKMAYFVSSLNKIAMFQTYAEVTMIFPSQ
GSIRGGTTLTISGRFFDQTDFPVRVLVGGEPCDILNVTENSICCKTPPKPHILKTVYPGG
RGLKLEVWNNSRPVRLEEILEYNEKTPGYMGASWVDSASYIWLMEQDTFVARFSGFLVAP
DSDVYRFYIKGDDRYAIYFSQTGLPEDKVRIAYHSANANSYFSSPTQRSDDIHLQKGKEY
YIEILLQEYRLSAFVDVGLYQYRNVYTEQQTGDAVNEEQVIKSQSTIIQEVQVITLENWE
TTNAINEVQKIKVTSPCVEANSCSLYQYRLIYNMEKTVFLSADASEFILQSALNDLWSIK
PDTVQVIRTQNPQSYVYMVTFISTRGDFDLLGYEVVEGNNVTLDITEQTKGKPNLETFTL
NWDGIASKPLTLWSSEAEFQGAVEEMVSTKCPPQIANFEEGFVVKYFRDYETDFNLEHIN
RGQKTAETDAYCGRYSLKNPAVLFDSADVKPNRRPYGDILLFPYNQLCLAYKGFLANYIG
LKFQYQDNNKITRSTDTQFTYNFAYGNNWTYTCIDLLDLVRTKYTGTNISLQRISLHKAS
ESQSFYVDVVYIGHTSTISTLDEMPKRRLPALANKGIFLEHFQVNQTKTNGPTMTNQYSV
TMTSYNCSYNIPMMAVSFGQIITHETENEFVYRGNNWPGKSKIRIQRIQAASPPLSGSFD
IQAYGHILKGLPAAVSAADLQFALQSLEGMGRISVTREGTCAGYAWNIKWRSTCGKQNLL
QINDSNIIGEKANMTVTRIKEGGLFRQHVLGDLLRTPSQQPQVEVYVNGIPAKCSGDCGF
TWDSNITPLVLATSPSQGSYEEGTILTIVGSGFSPSSAVTVSVGPVGCSLLSVDEKELKC
QILNGSAGHAPVAVSMADVGLAQNVGGEEFYFVYQSQISHVWPDSGSIAGGTLLTLSGFG
FNENSKVLVGNETCNVIEGDLNRITCRTPKKTEGTVDISVTTNGFQATARDAFSYNCLQT
PIITDFSPKVRTILGEVNLTIKGYNFGNELTQNMAVYVGGKTCQILHWNFTDIRCLLPKL
SPGKHDIYVEVRNWGFASTRDKLNSSIQYVLEVTSMFPQRGSLFGGTEITVRGFGFSTIP
AENTVLLGSIPCNVTSSSENVIKCILHSTGNIFRITNNGKDSVHGLGYAWSPSVLNVSVG
DTVAWHWQTHPFLRGIGYRIFSVSSPGSVIYDGKGFTSGRQKSTSGSFSYQFTSPGIHYY
SSGYVDEAHSIFLQGVINVLPAETRHIPLHLFVGSSEATYAYGGPENLHLGSSVAGCLAT
EPLCGLNNTRVKNSKRLLFEVSSCFSPSISNITPSSGTVNELITIIGHGFSNLPCANKVT
IGSYPCVIEESSEDSITCHIDPQNSMDVGIRETVTLTVYNLGIAINTLSNEFDRRFVLLP
NIDLVLPNAGSTTGMTRVTIKGSGFAVSSAGVKVLMGHFPCKVLSVNYTAIECETSPAAQ
QLVDVDLLIHGVPAQCQENCTFSYLESITPYITGVFPNSIIGSVKVLIEGEGLGTVLEDI
AVFIGNQQFRAIEVNENNITALVTPLPVGHHSVSVVVGSKGLALGKLTVSSPPVASLSPT
SGSIGGGTTLVITGNGFYPGNTTVTIGDEPCQIISINPNEVYCRTPAGTTGMVDVKIFVN
TIAYPPLLFTYALEDTPFLRGIIPSRGPPGTEIEITGSNFGFEILEISVMINNIQCNVTM
ANDSVLQCIVGDHAGGTFPVMMHHKTKGSAMSTVVFEYPLNIQNINPSQGSFGGGQTMTV
TGTGFNPQNSIILVCGSECAIDRLRSDYTTLLCEIPSNNGTGAEQACEVSVVNGKDLSQS
MTPFTYAVSLTPLITAVSPKRGSTAGGTRLTVVGSGFSENIEDVHVTIAEAKCDVEYSNK
THIICMTDAHTLSGWAPVCVHIRGVGMAKLDNADFLYVDAWSSNFSWGGKSPPEEGSLVV
ITKGQTILLDQSTPILKMLLIQGGTLIFDEADIELQAENILITDGGVLQIGTETSPFQHK
AVITLHGHLRSPELPVYGAKTLAVREGILDLHGVPVPVIWTRLAHTAKAGERILILQEAV
TWKPGDNIVIASTGHRHSQGENEKMTIASVSPDGINITLSNPLNYTHLGITVTLPDGTLF
EARAEVGILTRNILIRGSDNVEWNNKIPACPDGFDTGEFATQTCLQGKFGEEIGSDQFGG
CVMFHAPVPGANMVTGRIEYVEVFHAGQAFRLGRYPIHWHLLGDLQFKSYVRGCAIHQAY
NRAVTIHNTHHLLVERNIIYDIKGGAFFIEDGIEHGNILQYNLAVFVQQSTSLLNDDVTP
AAFWVTNPNNTIRHNAVAGGTHFGFWYRMNNHPDGPSYDRNICQKRVPLGEFFNNTVHSQ
GWFGMWIFEEYFPMQTGSCTSTVPAPAIFNSFTTWNCQKGAEWVNGGALQFHNFVMVNNY
EAGIETKRILAPYVGGWGETNGAVIKNAKIVGHLDELGMGSAFCTAKGLVLPFSEGLTVS
SVHFMNFDRPNCVALGVTSISGVCNDRCGGWSAKFVDVQYSHTPNKAGFRWEHEMVMIDV
DGSLTGHKGHTVIPHSSLLDPSHCTQEAEWSIGFPGSVCDASVSFHRLAFNQPSPVSLLE
KDVVLSDSFGTSIIPFQKKRLTHMSGWMALIPNANHINWYFKGVDHITNISYTSTFYGFK
EEDYVIISHNFTQNPDMFNIIDTRNGSSNPLNWNTSKNGDWHLEANTSTLYYLVSGRNDL
HQSQLISGNLDPDVKDVVINFQAYCCILQDCFPVHPPSRKPIPKERPATYNLWSNDSFWQ
SSRENNYTVPHPGANVIIPEGTWIVADIDMPSMERLIIWGVLELEDKYNVGAAESSYREV
ILNATYISLQGGRLIGGWEDNPFKGDLKIVLRGNHTTPDWALPEGPNQGAKVLGVFGELD
LHGIPRSIYKTKLSETALAGSKVLSLMDAVDWQEGEEIVITTTSYDFHQTETRSIVKILH
DHKILILNDSLSYTHFAEKYHVPGTGESYTLAADVGILSRNIKIVGEDYPGWSEDSFGAR
VLVGSFTENMMTFKGNARISNVEFYHSGQEGFRDSTDPRYAVTFLNLGQIQEHGSSYIRG
CAFHHGFSPAIGVFGTDGLDIDDNIIHFTVGEGIRIWGNANRVRGNLIALSVWPGTYQNR
KDLSSTLWHAAIEINRGTNTVLQNNVVAGFGRAGYRIDGEPCPGQFNPVEKWFDNEAHGG
LYGIYMNQDGLPGCSLIQGFTIWTCWDYGIYFQTTESVHIYNVTLVDNGMAIFPMIYMPA
AISHKISSKNVQIKSSLIVGSSPGFNCSDVLTNDDPNIELTAAHRSPRSPSEGGRSGICW
PTFASAHNMAPRKPHAGIMSYNAISGLLDISGSTFVGFKNVCSGETNVIFITNPLNEDLQ
HPIHVKNIKLVDTTEQSKIFIHRPDISKVNPSDCVDMVCDAKRKSFLRDIDGSFLGNAGS
VIPQAEYEWDGNSQVGIGDYRIPKAMLTFLNGSRIPVTEKAPHKGIIRDSTCKYLPEWQS
YQCFGMEYAMMVIESLDPDTETRRLSPVAIMGNGYVDLINGPQDHGWCAGYTCQRRLSLF
HSIVALNKSYEVYFTGTSPQNLRLMLLNVDHNKAVLVGIFFSTLQRLDVYVNNLLVCPKT
TIWNAQQKHCELNNHLYKDQFLPNLDSTVLGENYFDGTYQMLYLLVKGTIPVEIHTATVI
FVSFQLPVATEDDFYTSHNLVKNLALFLKIPSDKIRISKIRGKSLRRKRSMGFIIEIEIG
DPPIQFLSNGTTGQMQLSELQEIAGSLGQAVILGNISSILGFNISSMSITNPLPSPSDSG
WIKVTAQPVERSAFPVHHVAFVSSLLVITQPVAAQPGQPFPQQPSVKATDSDGNCVSVGI
TALTLRAILKDSNNNQVNGLSGNTTIPFSSCWANYTDLTPLRTGKNYKIEFILDNVVGVE
SRTFSLLAESVSSSGSSSSSNSKASTVGTYAQIMTVVISCLIGRMWLLEIFMAAVSTLNI
TLRSY
Identical sequences ENSGGOP00000014539
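
The Comment line in the protein record packs several fields into space-separated key:value tokens. The following is a minimal Python sketch for unpacking it. It assumes the location token follows the layout coord_system:assembly:name:start:end:strand, which is inferred from the record itself rather than stated on this page, and the function names are hypothetical.

```python
def parse_comment(comment: str) -> dict[str, str]:
    """Split a comment line into a dict, cutting each token at its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict[str, object]:
    """Unpack 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    comment = (
        "pep:known_by_projection "
        "chromosome:gorGor3.1:8:108733971:108877420:1 "
        "gene:ENSGGOG00000014869 transcript:ENSGGOT00000014952 "
        "gene_biotype:protein_coding transcript_biotype:protein_coding"
    )
    fields = parse_comment(comment)
    location = parse_location(fields["chromosome"])
    print(fields["gene"], location["name"], location["start"], location["end"])
```

Using partition on the first colon, rather than split on every colon, keeps the multi-part location value intact until it is unpacked separately.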
