SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSCAFP00000033994 from Canis familiaris 69_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSCAFP00000033994
Domain Number 1 Region: 371-481
Classification Level Classification E-value
Superfamily Anthrax protective antigen 5.36e-17
Family Anthrax protective antigen 0.013
 
Domain Number 2 Region: 1838-1918
Classification Level Classification E-value
Superfamily E set domains 1.16e-16
Family E-set domains of sugar-utilizing enzymes 0.013
 
Domain Number 3 Region: 2099-2186
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000071
Family E-set domains of sugar-utilizing enzymes 0.073
 
Domain Number 4 Region: 1922-2004
Classification Level Classification E-value
Superfamily E set domains 0.00000000000014
Family Other IPT/TIG domains 0.018
 
Domain Number 5 Region: 2008-2094
Classification Level Classification E-value
Superfamily E set domains 0.000000000000407
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.03
 
Domain Number 6 Region: 3333-3525
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.00000000000106
Family Galacturonase 0.053
 
Domain Number 7 Region: 1161-1238
Classification Level Classification E-value
Superfamily E set domains 0.00000000000153
Family E-set domains of sugar-utilizing enzymes 0.021
 
Domain Number 8 Region: 1571-1652
Classification Level Classification E-value
Superfamily E set domains 0.0000000000154
Family E-set domains of sugar-utilizing enzymes 0.018
 
Domain Number 9 Region: 1668-1751
Classification Level Classification E-value
Superfamily E set domains 0.0000000000178
Family E-set domains of sugar-utilizing enzymes 0.058
 
Domain Number 10 Region: 271-329
Classification Level Classification E-value
Superfamily E set domains 0.0000000000607
Family E-set domains of sugar-utilizing enzymes 0.029
 
Domain Number 11 Region: 1068-1148
Classification Level Classification E-value
Superfamily E set domains 0.00000000224
Family Other IPT/TIG domains 0.088
 
Domain Number 12 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.00000000498
Family E-set domains of sugar-utilizing enzymes 0.033
 
Domain Number 13 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.0000000153
Family E-set domains of sugar-utilizing enzymes 0.051
 
Domain Number 14 Region: 1335-1391
Classification Level Classification E-value
Superfamily E set domains 0.0000000242
Family E-set domains of sugar-utilizing enzymes 0.018
 
Domain Number 15 Region: 1757-1833
Classification Level Classification E-value
Superfamily E set domains 0.00000011
Family E-set domains of sugar-utilizing enzymes 0.04
 
Domain Number 16 Region: 1410-1507
Classification Level Classification E-value
Superfamily Cupredoxins 0.0000108
Family Plastocyanin/azurin-like 0.072
 
Domain Number 17 Region: 1246-1322
Classification Level Classification E-value
Superfamily E set domains 0.0000109
Family E-set domains of sugar-utilizing enzymes 0.02
 
Domain Number 18 Region: 2376-2412,2485-2693
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000251
Family Galacturonase 0.074
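
The Region values in the strong hits above are residue ranges, and a domain may span more than one segment (Domain Number 18 lists two). The following is a minimal Python sketch for reading those strings and ordering hits along the sequence. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_region` and `region_length` are hypothetical and are not part of any SUPERFAMILY tool. Only four of the listed hits are included for illustration.

```python
def parse_region(region):
    """Parse a region string such as '2376-2412,2485-2693' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total residues covered by a region, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# A subset of the strong hits listed above: (domain number, region, superfamily).
hits = [
    (1, "371-481", "Anthrax protective antigen"),
    (6, "3333-3525", "Pectin lyase-like"),
    (13, "32-125", "E set domains"),
    (18, "2376-2412,2485-2693", "Pectin lyase-like"),
]

# Order the hits by the start position of their first segment.
ordered = sorted(hits, key=lambda hit: parse_region(hit[1])[0][0])

for number, region, superfamily in ordered:
    print(f"Domain {number}: {region} ({region_length(region)} residues) {superfamily}")
```

Sorting by start position gives the N-terminal to C-terminal order of domains, which differs from the E-value order used in the listing.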
 
Weak hits

Sequence:  ENSCAFP00000033994
Domain Number - Region: 2226-2281
Classification Level Classification E-value
Superfamily Composite domain of metallo-dependent hydrolases 0.0918
Family Zn-dependent arginine carboxypeptidase-like 0.067
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSCAFP00000033994   Gene: ENSCAFG00000024786   Transcript: ENSCAFT00000038299
Sequence length 4257
Comment pep:novel chromosome:CanFam3.1:13:9924751:10026403:1 gene:ENSCAFG00000024786 transcript:ENSCAFT00000038299 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGHLWLLGTLGLWVLLLCAADPRADGSKIIPKVTEIIPKYGSINGATRLTIKGEGFAQAN
QFNYGVDNAELGNSVQLVSSFRSISCDVEKDSSHSTQITCYTRAMPEDSYTVRVSVDGVP
ITESNTCRGRINSWECTFNAKSFRTPMIKSITPLSGTPGTLITIQGRIFTDVYGSNTALS
SNGKNVRILRVYIGGMPCELLIPQSDNLYGLKLDHPNGDMGSMICKTTGTYIGHHNVSFI
LDSDYGRSFPQKMAYFVSSLNKISMFQTYAEVTMISPSRGSTQGGTLLTISGRFFDQTDF
PVRVLIGGQTCDILNVTENSIFCKTPPKPEVLRTVYPGGRGLKLEIWNNSRPVHLEEILE
YNEKTPGYMGAIWVDSASYIWPMEQDTFTARFSGFLVAPDSDVYRFYIKGDDRYAIYFSQ
TGHPKDKDKVRIAYHSSNANNYFSSPTQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGL
YQYRNVYTEQQTEDAVNEEQVIRSQSTIVQEVQLITLENWETSSATNEVQKITVTSPCVE
ANSCSRYQYRLTYNMEKTVLLPGDASDFILESALNDLWSIKPDTVQVIRTQNPQSYVYLV
TFISTRGDFDLLGYEVFEGNNVTLDITEQIKGKPSLETFTLHWDGVTSKPLTPWSSEAEF
QAAVEEMVTTKCPLQIANFEEGFVVKYFRDYETDFNLEHNNRGQKTAETDAYCGRYSLKN
PAVLFDSADVKPNRLPYGDILLFPYNQLCLAYKGFLANYIALKFQYQDRSKITRSNDIQF
TYDFTNGNTWTYTCIDLLDLIQTKYTGSNFSLRRIGLQKASESQSFYVDTVYIGQASTLS
AWDEMPKRRLPALANKGIFLKHFQVNHTKINGSSVTNQYSITMTSYNCSYNIPMMAVSFG
QIITNETENESVYRGNNWPGKSKIRIQRIQAASPPLSGSFDIQAYGHILKGLPATVSATE
LQFALQSVEKVGRVSVTREGTCAGYSWNIKWRSSCGKQDLLQINDSHVIGEKANMTVMKI
KEGGLFRQRILGDLLRTPSQQPQVEVYVNGIPAKCSGDCGFTWDAMATPLVRAISPSQGS
YEETTILTILGSGFSPNSAVSVSVGPMGCSLLSVDVQESEIKCQILNGSAGRFPVAVSVA
DVGLARNVEGEVFYFIYQNRISHIWPASGSLAGGTLLTLSGFGFNENSKALVGNETCNVI
ERDLSKITCRTPKRIEGTVDISVITDGFRATAKDAYSYNCLQTPVITDFSPKVRTIQGKK
NFMKHGFFCSCFLNLNMCKYVGGKPCRIFQWNFTDIRCLLPKLSPGKHDIYVEVRNWGFA
STRDKLSASIQYILEVTAMFPQRGSLYGGTEITIIGLGFSTIPTENTVLLGSFHCDVTSS
SENVIKCILHSTGSTFRITNTGEDSVHGLGYGWSPSVLNVSVGDTVIWHWQAHPFLRGIG
YKVFSVSSPGSVIYDGKGFTNGREKSASGSFSYQFTSPGIHYYSSGYVDEARSIFLQGVI
NVLPAETRHIPLHLFVGSTEATYAQVLLPKGPVDLHLGSSVAGCLATEPLCGPNNTRVKN
SDRLVFELSSCFSPFINNISPSAGTQNELITITGRGFSNLTCANKVTIGNYPCIVEESSD
NSIRCHIDPQNSMNVGIREIVTVTVYNLGTALNTLSSEFDRRFVLLPNINMVLPNAGSTT
GMTRVTVRGSGFAASPAGVEVFMGHFPCKVLSVNYTAIECETSPAPQQLVKVDLLIHGVP
AQCQGNCTFSYLESITAFVTRVFPNFIKGPVEVLIEGEGFGTILEDIAVFIGNQQFRAMD
VNENNVTVLVGPLPAGLHSLGVVVGSKGLALGNLTVSSPAIASVTPTSGSIAGGTTLLIT
GNGFDPGNTTVTVGYEPCEIISVNSSEVYCCTPAGTAGRVSVKIFVNEVAYPPLSFTYAL
EDTPLLRELVPNTGPPGTKIQIIGSNFGIDILEISVKIDNTPCNVTMVNDSVLQCIIGDH
AGGTFPVMMHRKTKGFAVSTVVFEYALTIQNIHPSQGSFGGGQTMTVTGTGFNPETSIIL
VCGSECAIDRLKSDCTRLLCKIPHNDGRGPEQTCEVSVVNGKDLSHSTTPFTYTMSLTPL
ISDIYPKRGSTAGGTRLTITGSGFSENVQDVLITIAEAKCNVEYSNKTYIICMTSAHSPS
GWAPVHVNIRSLGMAKLDDPDFLYVDAWSSNFSWGGKSPPEEGSLVVITKGQTVLLDQNT
PILKMLLIQGGTLIFDEADIELQAENILITDGGILQIGTEASPFQHRAVITLHGHLRSPE
LPVYGAKTLAVREGILDLHGLPVSVIWTHLAHTAKAASKLFHILQSWQWFPWSRIVNTSI
FYRHSQQENEKRTIASVSADGTNILLTNPLNYTHLGTTVTLPDGTLFEARAEVGILTRNI
LIRGSDHVEWSNKIPACPDGFDTGEFTTQTCFQGKFGEETGSDQFGGCIMFHAPIPSANM
VIGRIEYVEIFHAGQAFRLGRYPIHWHLLGDLQFKSYVKGCAIHQTYNRAVTIHNTHHLL
VERNIIYDIKGGAFFIEDGIEHGNILQYNLAVFVQQSTSLLNDDVTPAAFWVTNPNNTIR
HNAAAGGTHFGFWYRMNNHPDGPSYDRNICQKRVPLGEFFNNTVHSQGWFGLWIFEEYFP
MQTGSCTSTVPVPAIFSSLTTWNCEKGAEWVNGGALQFHNFVMVNNFEAGIETKRILAPY
VGGWGETNGAVIKNAKIVGHLDELGMGSAFCTRKGLVLPFSEGLTVSSVHFMNFDRPNCV
ALGVTSITGVCNERCGGWSAKFVDIQYFHTPNKAGFRWEHEVVLVDVDGSLTGHRGHTVI
PHSSLLDPSHCSQEAEWSIGFPGAVCDTTVSFHRLAFNKPSPVTLLEKDVVLSDSFGTSI
VPFQKKRLTHMSGWMALIPNAKHINWYFKDVDHITNISYTSTFYGFKEEDYVIISHNFTQ
NPDMFNVIDMRNGSSNPLSWNTSKNGDWHLEANTSTLYYLVSGRNDLQQSQPISGTLDPD
VKDVIINFQAYCCILQDCFPVHPPSRKPIPRKRPSIYNLWSNDSFWKSSRENNYTIPHPG
ANVVIPEGTWVVADTDIPPMESLTIWGVLELEAKHSMRAAESSYRKVVLNATYISLQGGR
LIGGWEDNPFKGELQIVLRGNHSTPEWALPEGPNQGSKVLGVFGELDLHGIPRSIYKTKL
SETAEAGSKVLSLVDAVDWQEGEEIVITTTSYDFHQTETRSIIKILHDQKILILNDTLSY
THLAERYHVPGTGQSYTLAADVGILSRNIKILGEDYPGWFKESFGARVLVSSFTQDMMTF
KGNARISNVEFYHSGQEGFRDNTDPRHAVTFLNLGQIQERGSSYIRGCAFHNGFSPAIGV
FGTDGLDIDNNIIHFTVGEGIRIWGDANRVRGNLVVLSVWPGTYQNKKDLSSTLWHAAIE
INRGTNTVLQNNVVAGFGRVGYRIDGEPCSSQSNPMEKWFDNEAHGGLYGIYMNQDGLPG
CSLIQGFTIWTCWDYGIYFQTTESVHIYNVTLVDNGMAISSMIYMPAAVSHKISSKTVQI
KSSLIVGSSPGFNCSDVLTNDDPNIELSAAHRSARPPSGGRSGICWPTFASAHNMAPRKP
HAGIMSYNAISGLLEISGSTFVGFKNICAVETNVIFITNPLNEDLQHPIHVKNIQLVDTT
EQSKIFIHRPDVSKVNPSDCVDMVCDAKRKSFLRDMDGSFLGDSGSVIPQAEYEWNGNSQ
FGIGDYRIPKVMLTFPNGSRIPITEKAPYKGIIRDSTCKYIPQWQSYQCFGMEYAMMVIE
SLDSDTETRRLSPVAIVSNGYVDLINGRTKYRWCAGYTCQRRLSLFHSIVALNKSYEVYF
TGTTPQNLRLMLLNVDHKKAVVVGIFFPTLQRLDVYVNNALVCPRNTVWNPQQKHCELNR
HLYTEQFLPNLNSTVLGENYFDRIYQMLYLLVKGTTPVEIHTTAVIFVSFQLPAVTEDDF
YISHNLVRNLALFLKIPSDKIRVSKIIQGESLRRKRSMGLTVELEIGDPPPQFISNDTAG
QMQLSELQKIASSLGQAVILGRTSSIIGFNISSMSITSPIPSTSDSAWIKVTAQPVERFA
FPVHHVASVSSLSVITQPVAAPPGQAFSQQPSIKAVDSDGNCVSVGITSLTLKAILKDSN
NNQISGLSGNTTIPFSSCWANYTDLTLLRTGKNYKIEFILDDIVRVESATLSLPAQSISS
GSGTSGGGGSSDSNSNSKAPTVGTAAQIMIIAIICLMGKVLLSEIFMATVLNLNINP
Identical sequences J9P647
ENSCAFP00000033994 ENSCAFP00000041128
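
The Comment line in the protein record above packs several fields into one string: a sequence type and status, a colon-separated genomic location, and `key:value` pairs for the gene and transcript. The sketch below splits it into a dictionary. The field layout is inferred from this one line only, and the function name `parse_ensembl_comment` and the dictionary keys for the location fields are hypothetical, so treat it as an illustration rather than a general parser.

```python
def parse_ensembl_comment(comment):
    """Split a comment like the one shown on this page into named fields.

    ASSUMED layout (inferred from this record only):
      <seq_type>:<status> <coord_system>:<assembly>:<name>:<start>:<end>:<strand> key:value ...
    """
    tokens = comment.split()
    seq_type, status = tokens[0].split(":")
    coord_system, assembly, name, start, end, strand = tokens[1].split(":")
    record = {
        "seq_type": seq_type,
        "status": status,
        "coord_system": coord_system,
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    # Remaining tokens are key:value pairs such as gene:... and transcript:...
    for token in tokens[2:]:
        key, _, value = token.partition(":")
        record[key] = value
    return record


comment = (
    "pep:novel chromosome:CanFam3.1:13:9924751:10026403:1 "
    "gene:ENSCAFG00000024786 transcript:ENSCAFT00000038299 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)
record = parse_ensembl_comment(comment)
print(record["assembly"], record["name"], record["start"], record["end"])
```

With the location parsed into integers, the genomic span of the gene model can be computed as `record["end"] - record["start"] + 1`.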
