SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSTOP00000013727 from Ictidomys tridecemlineatus 76_2

Domain assignment details

Strong hits

Sequence:  ENSSTOP00000013727
Domain Number 1 Region: 372-479
Classification Level Classification E-value
Superfamily Anthrax protective antigen 4.32e-19
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 1830-1910
Classification Level Classification E-value
Superfamily E set domains 4.96e-16
Family E-set domains of sugar-utilizing enzymes 0.025
 
Domain Number 3 Region: 2089-2178
Classification Level Classification E-value
Superfamily E set domains 0.00000000000000747
Family Other IPT/TIG domains 0.054
 
Domain Number 4 Region: 3192-3221,3255-3376,3405-3519
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000000000000134
Family Galacturonase 0.079
 
Domain Number 5 Region: 1239-1319
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000224
Family E-set domains of sugar-utilizing enzymes 0.022
 
Domain Number 6 Region: 1914-1997
Classification Level Classification E-value
Superfamily E set domains 0.000000000000035
Family Other IPT/TIG domains 0.019
 
Domain Number 7 Region: 1157-1234
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000382
Family E-set domains of sugar-utilizing enzymes 0.015
 
Domain Number 8 Region: 2000-2086
Classification Level Classification E-value
Superfamily E set domains 0.00000000000057
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.028
 
Domain Number 9 Region: 1066-1142
Classification Level Classification E-value
Superfamily E set domains 0.0000000000012
Family E-set domains of sugar-utilizing enzymes 0.026
 
Domain Number 10 Region: 1661-1743
Classification Level Classification E-value
Superfamily E set domains 0.00000000000624
Family E-set domains of sugar-utilizing enzymes 0.062
 
Domain Number 11 Region: 272-335
Classification Level Classification E-value
Superfamily E set domains 0.000000000165
Family E-set domains of sugar-utilizing enzymes 0.032
 
Domain Number 12 Region: 1563-1629
Classification Level Classification E-value
Superfamily E set domains 0.00000000035
Family E-set domains of sugar-utilizing enzymes 0.022
 
Domain Number 13 Region: 1748-1825
Classification Level Classification E-value
Superfamily E set domains 0.00000000266
Family E-set domains of sugar-utilizing enzymes 0.054
 
Domain Number 14 Region: 1331-1387
Classification Level Classification E-value
Superfamily E set domains 0.00000000934
Family E-set domains of sugar-utilizing enzymes 0.018
 
Domain Number 15 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.0000000342
Family E-set domains of sugar-utilizing enzymes 0.026
 
Domain Number 16 Region: 33-124
Classification Level Classification E-value
Superfamily E set domains 0.0000000478
Family E-set domains of sugar-utilizing enzymes 0.032
 
Domain Number 17 Region: 2350-2404,2456-2683
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.00000179
Family Galacturonase 0.071
 
Domain Number 18 Region: 1406-1499
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000335
Family Plastocyanin/azurin-like 0.047
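
The domain records above follow a regular layout, so they can be read into a small data structure for sorting or length calculations. The following is a minimal sketch, not part of the SUPERFAMILY tooling: the `Domain` class, the `parse_domains` function and the regular expressions are hypothetical names introduced here, and the parser assumes each record has one "Domain Number ... Region: ..." line followed by one "Superfamily" row and one "Family" row, as in this listing. Regions are treated as 1-based inclusive residue ranges, which is an assumption about the report's convention. The sample text copies domains 4 and 18 from the listing.

```python
import re
from dataclasses import dataclass


@dataclass
class Domain:
    number: int
    segments: list          # list of (start, end) tuples, assumed 1-based inclusive
    superfamily: str
    sf_evalue: float
    family: str
    fam_evalue: float

    @property
    def length(self):
        # Total residues covered, summed over all segments of a split domain.
        return sum(end - start + 1 for start, end in self.segments)


# "Domain Number 4 Region: 3192-3221,3255-3376,3405-3519"
HEADER = re.compile(r"^Domain Number (\d+) Region: ([\d,\-]+)$")
# "Superfamily E set domains 4.96e-16": the last token is the E-value.
ROW = re.compile(r"^(Superfamily|Family) (.+) (\S+)$")


def parse_domains(text):
    domains, current = [], None
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            segments = [tuple(int(x) for x in seg.split("-"))
                        for seg in header.group(2).split(",")]
            current = {"number": int(header.group(1)), "segments": segments}
            continue
        row = ROW.match(line)
        if row and current is not None:
            level, name, evalue = row.groups()
            # float() accepts both "4.32e-19" and "0.0000000000000134".
            current[level] = (name, float(evalue))
            if level == "Family":  # the Family row closes a record
                domains.append(Domain(
                    number=current["number"],
                    segments=current["segments"],
                    superfamily=current["Superfamily"][0],
                    sf_evalue=current["Superfamily"][1],
                    family=name,
                    fam_evalue=float(evalue),
                ))
                current = None
    return domains


SAMPLE = """\
Domain Number 4 Region: 3192-3221,3255-3376,3405-3519
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000000000000134
Family Galacturonase 0.079

Domain Number 18 Region: 1406-1499
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000335
Family Plastocyanin/azurin-like 0.047
"""

domains = parse_domains(SAMPLE)
# Order domains by where they start on the protein rather than by E-value.
by_position = sorted(domains, key=lambda d: d.segments[0][0])
for d in by_position:
    print(d.number, d.superfamily, d.segments, d.length)
```

Sorting by the first segment's start position gives the N- to C-terminal order of the domains, which the listing itself does not show because it is ordered by superfamily E-value.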
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSTOP00000013727   Gene: ENSSTOG00000015273   Transcript: ENSSTOT00000015321
Sequence length 4246
Comment pep:known_by_projection scaffold:spetri2:JH393280.1:35488829:35624465:-1 gene:ENSSTOG00000015273 transcript:ENSSTOT00000015321 gene_biotype:protein_coding transcript_biotype:protein_coding
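
The comment line packs several fields into colon-delimited tokens. The sketch below splits it into a dictionary, assuming the layout used in Ensembl peptide FASTA headers: the first token is `sequence_type:status`, the second is `coord_system:assembly:region:start:end:strand`, and the remaining tokens are simple `key:value` pairs. The function name `parse_comment` and the dictionary keys are hypothetical, chosen here for illustration only.

```python
COMMENT = (
    "pep:known_by_projection "
    "scaffold:spetri2:JH393280.1:35488829:35624465:-1 "
    "gene:ENSSTOG00000015273 transcript:ENSSTOT00000015321 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    tokens = comment.split()
    # First token: sequence type and annotation status.
    seq_type, status = tokens[0].split(":", 1)
    # Second token: genomic location (assumed six colon-separated parts).
    coord_system, assembly, region, start, end, strand = tokens[1].split(":")
    record = {
        "seq_type": seq_type,
        "status": status,
        "coord_system": coord_system,
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    # Remaining tokens are plain key:value pairs.
    for token in tokens[2:]:
        key, _, value = token.partition(":")
        record[key] = value
    return record


info = parse_comment(COMMENT)
# Length of the genomic interval, treating start and end as inclusive.
span = info["end"] - info["start"] + 1
print(info["gene"], info["region"], info["strand"], span)
```

A strand of -1 would mean the gene lies on the reverse strand of the scaffold under this reading of the location token.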
Sequence
MGHLWLLGTWGLWALLLRAADPHTDDSEVIPKVMEVLPKYGSINGATRLTIKGEGFSQAN
QFDFGVDNAELGNSVQLVSSFRSITCDVEKDSSHSTQITCYTRAMPEDSYTVRVSVDGIP
IAENNTCKGHINSWACSFNAKSFRTPTIMSITPLSGTPGTLITIQGRIFTDVYGSNTALS
SNGKNVRILRVYNGGMPCELLIPQSDNLYGLKLDHPNGDIGSMTCKITGTYIGHHNVSFI
LDSDYGRSFPQKMAYFVSSLNKISMFQTYAEVTTVSPSKGSTGGGTTLTISGRFFDQTDL
PVKVLVGGQTCDILNITENSIYCKTPPKPPILKTVYPGGRGLKLEVWNNSRPLHLEEILG
YNEKTPGYMGASWVDSASYAWPMGQDTFVARFSGFLVAPDSDVYRFYIKGDDRYAIYFSQ
TGLPEDKVRIAYHSANANSYFSSPTQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGLYQ
FRNVYTEQQTRDAINEEQVIKSQSTIIQEVQVITLENWETTNATNEVQKITVTSPCVGAN
SCSFHQYRLIYNMEKTILLPADASGSLMQSALNDLWSVKPDTVQVIRKRNLQSFIYTITF
VSTRGDFDLLGYEVLEGNNVTLDITEQTKGKPSLETFTLNWDGIASVPLTPASSEVEFQA
AVEEMVSTKCPPQIAHFEEGFVVKYFRDYETYFDLEHINRGQKTAETDAYCGRYSLKNPA
VLFDSTDVKPNRSPYGNILLFPYNQLCLAYKGFLKNYIGLKFQYQDNGKITRSADIQFTY
NFAYGNNWTYTCIDLLDLIQTKYEGTDFSLQRISLQKASESQFFYVDVVYIGQTSTISTL
VEMPKRRLPALANKGIFLKHFQVNQSKINGSTITIQYFIIMTSYNCSHNIPLMAVSFGQV
ITNETENESVYRGKNWPGKSKIHIQRIQEASPPISGTFDIYAYGHVLKGIPAAVSAADLQ
FALQSLEEVGQVSVTQEGTCAGYSWSIKWRSTCGRQNLLQVNDSNITGEKANVTVAKVRE
GGLFRRHILGDLLRTPSQKPQVQVYVNGIPSKCSGDCRFTWDPMSTPLISATSPSQGSYE
DSTILTIAGSGFSPSSAVSVSVGPTSCSLLSVNENEIKCQILNGSAGHVPVAVSIADVGL
AQNVEGEGFHFIYESRISHIWPDSGSLAGGTLLTVSGFGFSENSKVLVGNETCSVTEGNL
NKITCRTPKRIEGTVDISVITNGFQTTAKDVFSYNCLQTPVITDFSPKVRTILGDVNLTI
KGYNFGNELTQNMVVYVGGKPCQVLHWNFTDIRCLLPTLSPGKHDIHVEVRNWGFASTRD
KLNASIWYILKVTNMFPQRGSLYGGTEITVLGFGFSTIPMKNTVLLGSFPCNVTSSSENV
IKCILHSTGNVFRITNNGEDLVHGLGYAWSPSVLNVSVGDTVTWHWQAHPFLRGIGYRVF
SVSSPGSVIYDGRGFTNGRQKSLSGSFSYQFTSPGIHYYSSGYVDEANSTSLQGVINVLP
AQTRHIPLHLFVGSTEATYAQGGPENLHLESSVAGCLATEPLCGLNNIKVKNSNRPFFEL
SSCNSPSISNITPSTGTVNELITISGHGFSNLTCANKVTIGSYPCVVEKSSENSIMCHID
PQNSMDVGIREIVTLIVYNLGTAINTLSKEFDRRFVLLPSIDMVLPSAGSTTGMTRVTIK
GSGFSASSAGVEVFMGHFPCKVLTVNYTVIECETSPAPQQLVHVDLLIHGVPAQCQGNCS
FSYSESITPYITGIFPNSIEGSVNVLIEGEGLGTVLEEIAVFIGNQQFRVTHVNEKNITV
LMTSLPAGPHSLSVVVGSKGLALGNLTVSSPAVASVSPKSGSIGGGTILRITGNGFYPGN
TTVTVGERPCQIVFVNSSEVYCSTPPGRAGKVDLKIFVNAITYPSLSFNYSLEDTPFLRG
IVPDRGLPGTEIEITGSNFGFDISEISVMLGDIQCNVTTVNDSMLQCVTGAHAGGTFPVL
MHHKTKGSAVSTVEFEYPLHIQNIHPTQGSFGGGRTMTVTGTGFNSQNSIVLVCGSECAI
DRLRSDYTTLLCEIPPHDGRGPEQACEVSVVNGKDSSLSVTPFTYMTSLTPLITEISPRR
GSTAGGTRLTVRGSGFSENTQDVHITIAEARCNVEFSNRTHIFCTTEAHTPSGWAPVHVN
IRDIGRATLDNADFLYVDAWSSNYSWGGKSPPEEGSLAIITKGQIILLDQSTPILKMLLI
QGGTLIFDEADIELQAENILITDGGTLQIGTEASPFQHKAVITLHGHLRSPELPVYGAKT
LAVREGVLDLHGLPVSVVWTRLAHTAKAGERILILQEAVTWKEGDKIVIASTGHRHSQRE
NEERTIESVSTDGKNITITNPLNYTHLGITVTLPDGTLFEARAEVGILTRNILIRGSDNI
EWNNKIPACPDGFDTGEFATQTCLQGKFGEEIGSDEFGGCIMFHAPLPDSNMVTGRIEYV
EVFHAGQAFRLGRYPIHWHLLGDLQFKSYVRGCAIHQTYNRAVTIHNTHHLLVERNIIYD
IKGGAFFIEDGIEHGNILQYNLAVFVQQSTSLLNDDVTPAAFWVTNPNNTIRHNAAAGGT
HFGFWYRMNDHPDGPSYDQNICQKRVPLGEFFNNTVHSQGWFGLWIFEEYFPMQTGSCTS
SVPVPAIFNSLTTWNCQKGAEWVNGGALQFHNFVMVNNYEAGIETKRILAPYIGGWGETN
GAVIKSAKIVGHLDELGMGSAFCTSKGLVLPFSEGLTVSSLHFMNFDRPNCVALGVTSIT
GVCNDRCGGWSAKFVDIQYFYTPNKAGFRWEHEAVLIDVDGSLTGHKGYTVIPHSSLLDP
SHCIQEDQWSIGFPGSVCDASVSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIVPFQKKRL
THMSGWMALIPNANHINWYFKGVDHVTNISYTSTFYGFKEEDYVIISHNFTQNPDMFNVI
DMRNGSSNPLNWNTSKNGDWHLEANTSTLYYLVSGRRDLHPIEPISGTLDPDVKDVTINF
QAYCCILQDCFPVHPPSRTPTPRKRPATYNLWSNDSFWQSSQENNYTIPYPGANVVIPEG
TWIVADTDMPPMERLIIWGVLELEDKSNAGPAGPSYRRVVLNATYISVQGGRLIGGWEDN
PFKGELQIILRGNHSTPEWAFPEGPNQGAKVLGVFGELDLHGHPHSIYRTKLSETAEAGS
KVLSLAEAVDWQEGEEIVITTTSYDLHQTETRSIVKILHGNKILILNDTLAYTHLAERYQ
VPGTDQSYSLAADVGILTRNIKIIGEDYPGWIKDSFGARILVSSFTGNMMTFKGNARISN
VEFYHSGQDGYRDSTDPRYAVTFLNLGQIEDHGSSYIRGCAFHHGFSPAIGVFGTDGLDI
DDNIIHFTVGEGIRIWGDANRVRGNLVALSIWPGTYQNRKDLSSTLWHAAIEINKGTNTV
LQNNVVAGFGRAGYRIDGEPCSRKSNPMENWFGNEAHGGLYGIYMNQDGLPGCSLIQGFT
IWTCWDYGIYFQTTESVRIYNVTLVDNGMAIFSMIYMPAAVSHKISSKTVQVKSSLIVGS
SPEFNCSDVLTNDDPNIELTAAHRSSRPPSGGRSGICWPTFASAHNMSPRKPHAGIMSYN
AISGLLDVSDSTFIGFKNVCSGETNVIFITNPLNEDLQHPIHVKNIQLVDTIEQSKIFIH
RPDISKVNPSDCVDMVCDAKRKSFLKDIDGSFLGNSGSVIPQAEYEWNGNSQLGIGDYRI
PKVMLTFLNGSRIPITEKAPYKGIIRDSTCKYIPEWQSYQCFGMEYAMMVIESLDTDTET
RRLSPVAIISSGYVDLINGPQDHGWCAGYTCQRRLSLFHSIVALNKSYEVYFTGTSPQNL
RLMLLNVDQNKAVLVGIFYSTLQRLDVYVNNSLVCPKNTVWNTQQKYCKLNKHLHTEQFL
PHLDSTVLGENYFDRTYQMLYLLVKGNIPVEIYTATVIFVSFQLPAITEDDFYSSHNLVR
NLVLFLKIPSDKIRVSKILRGENMRRKRSTGGTIELEIGDPPTQFLSNDTTGQMQLYELQ
EIASSLGKATILGKTSSILGFNISSMSITNPIPSPKDPGWIKVTAKPVERSAFPVHHVAF
VSSLLVIAQPVAAQPGQPFSQQPSVKAVDSDGNCVSVGITSLTLKAMLKDSSNNLISGLS
GNTTIPFSSCWANYTDLTPLRTGKNFKIEFLLDNVARVESRTFSLLAQLVPGGSGSSTSS
GGSGSSSSSSSTTSAMATSAQLLTIVISFLMGRMLFLEIFMASVFI
Identical sequences ENSSTOP00000013727
