SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for M3Y2W6 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  M3Y2W6
Domain Number 1 Region: 1840-1920
Classification Level Classification E-value
Superfamily E set domains 4.43e-17
Family E-set domains of sugar-utilizing enzymes 0.01
Further Details:      
 
Domain Number 2 Region: 3337-3526
Classification Level Classification E-value
Superfamily Pectin lyase-like 1.72e-16
Family Galacturonase 0.055
Further Details:      
 
Domain Number 3 Region: 373-493
Classification Level Classification E-value
Superfamily Anthrax protective antigen 0.00000000000000222
Family Anthrax protective antigen 0.013
Further Details:      
 
Domain Number 4 Region: 1924-2006
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000168
Family Other IPT/TIG domains 0.034
Further Details:      
 
Domain Number 5 Region: 2101-2188
Classification Level Classification E-value
Superfamily E set domains 0.000000000000028
Family Other IPT/TIG domains 0.051
Further Details:      
 
Domain Number 6 Region: 2010-2096
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000446
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.053
Further Details:      
 
Domain Number 7 Region: 1246-1326
Classification Level Classification E-value
Superfamily E set domains 0.000000000000294
Family E-set domains of sugar-utilizing enzymes 0.038
Further Details:      
 
Domain Number 8 Region: 1163-1241
Classification Level Classification E-value
Superfamily E set domains 0.0000000000056
Family E-set domains of sugar-utilizing enzymes 0.012
Further Details:      
 
Domain Number 9 Region: 1670-1753
Classification Level Classification E-value
Superfamily E set domains 0.0000000000446
Family E-set domains of sugar-utilizing enzymes 0.046
Further Details:      
 
Domain Number 10 Region: 1573-1639
Classification Level Classification E-value
Superfamily E set domains 0.0000000000574
Family E-set domains of sugar-utilizing enzymes 0.034
Further Details:      
 
Domain Number 11 Region: 272-353
Classification Level Classification E-value
Superfamily E set domains 0.000000000112
Family E-set domains of sugar-utilizing enzymes 0.035
Further Details:      
 
Domain Number 12 Region: 1070-1146
Classification Level Classification E-value
Superfamily E set domains 0.000000000182
Family E-set domains of sugar-utilizing enzymes 0.031
Further Details:      
 
Domain Number 13 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.000000000778
Family E-set domains of sugar-utilizing enzymes 0.041
Further Details:      
 
Domain Number 14 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.0000000153
Family E-set domains of sugar-utilizing enzymes 0.051
Further Details:      
 
Domain Number 15 Region: 1339-1397
Classification Level Classification E-value
Superfamily E set domains 0.0000000293
Family E-set domains of sugar-utilizing enzymes 0.029
Further Details:      
 
Domain Number 16 Region: 1758-1835
Classification Level Classification E-value
Superfamily E set domains 0.0000000504
Family E-set domains of sugar-utilizing enzymes 0.037
Further Details:      
 
Domain Number 17 Region: 1413-1511
Classification Level Classification E-value
Superfamily Cupredoxins 0.000000141
Family Plastocyanin/azurin-like 0.043
Further Details:      
 
Domain Number 18 Region: 2468-2696
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000051
Family Galacturonase 0.079
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) M3Y2W6
Sequence length 4257
Comment (tr|M3Y2W6|M3Y2W6_MUSPF) PKHD1 like 1 {ECO:0000313|Ensembl:ENSMPUP00000005667} KW=Complete proteome; Reference proteome OX=9669 OS=Mustela putorius furo (European domestic ferret) (Mustela furo). GN=PKHD1L1 OC=Mustelinae; Mustela.
Sequence
MGHLWLLGTWGFWGLLLCATDPRADGSKIIPKVTEIIPKYGSINGATRLTIKGEGFAQAN
QFNYGVDNAELGNSVQLVSSFRSISCDVEKDSSHSTQITCYTRAMPEDSYTVRVSVDGVP
ITESNTCRGRVKSWECTFNARSFRTPMITNITPLSGTPGTLITIQGRIFTDVYGSNTALS
SNGKNVRILRVYVGGMPCELLIPQSDNLYGLKLDHPNGDMGSMVCKTTGTYIGHHNVSFI
LDSDYGRSFPQKMAYFVSSLNKISMFQTYAEVTMISPSRGSIRGGTVLTISGRFFDQTDF
PVRVLIGGQDCDILNVTENSICCKTPPKPEVLRTVYPGGRGLKLEVWNNSRPARLEEILE
YNEETPGYMGASWVDSASYLWPTEQDTFVARFSGFLVAPDSDVYRFYIKGDDRYAIYFSQ
TRHPKDKDKVRIAYHSSNANGYFSSPTQRSDDIHLQKGKEYYLEILLQEYRLSAFIDVGL
YQYKNVYTEQQTEDAVNEEQVIRFQSTMVQEVQVITLENWETSSATNEVQKITVTSPCVE
ENSCSRYQYRLIYNMEKTAKVLLPGDASDFLLQSALNELWSIKPDTVQVVRTQNAQSYVY
MVTFISTRGDFDMLGYETFEGDNVTLNITEQIKGKPSLDTFTLNWDGVTSKPLTPWSSQA
EFQAAVEEMVTAKCPPQIANFEEGFDVKYFRDYETDFNLEHVNRGKKTAETDAYCGRYAL
KNPAVLFDSADVKPNRLPYGDILLFPYNQLCLAYKGFLANYIALKFQYKDRSKVTRSNDI
QFTYNFANGNNWTYTCIDLLDLIQTKYIGTNFSLQRIGLQKASETQSFYVDAVYVGQTST
VSAWDEMPKRRLPALANKGIFLKSFQVNHTKTNGSSMTNQYSITMTSYNCSYNIPMMAVS
FGQRITNETENESIYRGNNWPGESKIRIQRIQAASPPLRGSFDIQAYGHILKAGLPATVS
ATDLQFALQSVEEVGRVSVTREGNCAGYSWNIKWRSACGKQNLLQINDSNIIGEKANMTV
MKIKEGGLFRQRILGDLLRIPSQQPQVEVYVNGIPAKCSGDCGFTWDPMATPLVRAINPS
QGSYEESTVLTISGSGFSPNSAISVSVGPMACSLLSVSQENEIQCQILNGSAGHFPVAVS
VADAGLARNVEGEEFYFTYQNRISYIWPTSGSLAEGGTLLTVSGFGFNENSRVLVGNETC
NVIEGDLNKITCRTPKRIEGTVDISVITNRFQAKAKDTYSYNCLQTPVITDFSPKVRTII
IGEVNLTIKGYNFGNELTQKVEVYVGGKPCQIFQWNLTDIRCLLPKLSPGKHDIYVEVRN
WGFASTRDKLNASIQYILEVTAMFPQRGSLYGGTEITIMGLGFSTIPTENTVLLGPFPCD
VISSSENAIKCILHSTGNTFRITNTGEDSVHGLGYAWSPSVLNVSVGDTVTWHWQAHPFL
QGIGYRVFSVSSPGSVIYDGKGFTNGREKSVSGSFSYQFTSPGIRYYSSGYVDEAHSIFL
QGVINVSPAETRHIPLHLFVGSTEATYDQGLTGPVNLHLGSSVTGCLATEPLCGPNNTRV
KNSDGLVFELSSCFSPTINNINPSSGTLNELVTITGRGFSNLTCANKVTIGSYPCIVEES
SDNSIICRIDPQNSMDVGVREIVTLMVYNLGMAINTLPNEFDRRFVLLPNIDMVLPYAGS
TAGLTRVTIKGSGFAASSAGIEVFMGHFPCKVLSVNYTAIECETSPAPQQLVKVDLLVHG
VPAQCQGNCSFSYLESITAVVTRIFPNSIEGPVKVLIEGEGFGTILEDIAVFIGDQQFGA
IDVNEKNITVLVSPLPAGLHSLSVVVGSKGLALGNLTVSSPAVASVTPASGSIAGGTTLV
ITGNGFYPGNTTVTVGDEPCEIISVNSSEVHCYTPAGMAGRASVKIFVNAVAYPPLSFTY
ALENTPLLRELVPNTGPPGTKIQITGSNFGTNLLDILVMIDTLPCNVTMVNDSVLQCDTE
DHAGGTFPVTMHHKTKGFAVSTVVFEYPLTIQSIHPSQGSFGGGQTMTVTGTGFNPQTST
ILVCGSECAVDRLKSDRTALLCKIPHNDGRGPEQACEVSVVNGKDLFQSMTPFTYTVLLT
PLVTEICPRRGSTAGGTRLTITGSGFSENMQDVLITIAEAKCDVEYSNKTYILCMTSAHT
SSGWAPVRVNIRGIGMARLDDSDFLYVDAWSSNFSWGGKSPPEEGSLVVITKGQTVLLDQ
NTPILKMLLIQGGTLIFDEADIELQAENILITDGGLLQIGTEASPFQHRAVITLHGHLRS
PELPVYGAKTLAVREGTLDLHGLPVSVIWTRLARTAKAGERTLILQEGVTWKPGDKIVIA
STGHRHSQRENEKRTIASVSADGTEIMLTDPLNYTHLGIIVTLPDGTLFEARAEVGILTR
NILIRGSDNVEWNNKIPACPDGFDTGEFATQTCFQGKFGEEIGSDQFGGCIMFHAPIPGA
NMVTGRIEYVEIFHAGQAFRLGRYPIHWHLLGDLQFNSYVRGCAIHQTYNRAVTIHNTHH
LLVERNVIYDIKGGAFFIEDGIEHGNILQYNLAVFVQQSTSLLNDDVTPAAFWVTNPNNT
IRHNVAAGGTHFGFWYRMNNHPDGPSYDRNICQKRVPLGEFFNNTVHSQGWFGLWIFEEY
FPMETGACTSTVPVPAIFNSLTTWNCEKGAEWVNGGALQFRNFVMVNNFEAGIETKRILA
PYVGGWGETNGAVIKNAKIIGHLDELGMGSAFCTRKGLILPFSEGLTVSSVQFMNFDRPK
CVALGVTSITGVCNERCGGWSAKFVDIQYFHTPNKAGFRWEHEAVLIDVDGSLTGHRGHT
VIPHSSLLDPSHCTQEAEWSIGFPGAVCDTSVSFHRLAFNKPSPVSLLEKDVVLSDSFGT
SIVPFQKKRLTHMSGWMALIPNAKHINWYFKDVDHITNISYTSTFYGFKEEDYVIVSHNF
TQNPDMFKVIDMRNGSSNPLSWNTSKNGDWHLEANTSTLYYLVSGRNDVQQSQPISGTLD
PDAKDVVINFQAYCCILQDCFPVHPPSRKPVPRKRPATYNLWSNDSFWQSSRENNYTIPH
PGANVVIPEGTWVVADTDIPPMESLIIWGVLELEDKHNTGAAESSYRKIVLNATYISLQG
GRLIAGWEDNPFKGELQIVLRGNHSTPEWTLPEGPNQGSKVLGVFGELDLHGIPRSVYKT
KLSETAEAGSKVLSLKDAVDWQEGEEIVITTTSYDFHQTETRSIIKILHDQKILILNDTL
SYTHLVNRRNAFDCIPCYKLKSIFSFLTENNLPGEGLCKLSKSSIAPKKKKKCCHSSLPT
LPGNARISNVEFYHGGQEGFRDSTDPRYAITFLNLGQIQERGSSYIRGCAFHNGFSPAIG
VFGTDGLDIEDNIIHFTVGEGIRIWGDANRVRGNLVTLSIWPGTYQNRRDLSSTLWHAAI
EINRGTNTVLQNNIVAGFGRVGFRIDGEPCSSQSNPMEKWFDNEAHGGLYGIYMNQDGLP
GCSLIQGFTIWTCWDYGIYFQTTESVHIYNVTLVDNGMAISSMIYMPASVSHKISSKTVQ
IKSSLIVGSSPGFNCSDVLTNDDPNIELSAAHRSARPPSGGRSGICWPTFASAHNMAPRK
PHAGLMSYNAISGLLDISGSTFVGFKNVCSGETNVVFITNPLNEDLQHPIHVKNIQLVDT
TEQSKIFIHRPDISKVNPADCVDMVCDAKRKSFLRDMDGSFLGNSGSVIPQAEYEWNGNS
QFGIGDYRIPKVMLTFPNGSRMPVTEKAPYKGIIRDSTCKYIPQWQSYQCFGMEYAMMVI
ESLDSDTETRRLSPVAIVSNGYVDLINGPQDHGWCAGYTCQRRLSLFHSIVALNKSYEIY
FTGTSPQNLRLMLLNVDHRKAVVVGIFFPTLQRLDVYVNNALVCPKNTVWNPQQKYCELN
RHLYTEQFLPNLNSTVLGENYFDRTYQMLYLLVKGTIPVEIHTTAVIFVSFQLPAVTEDD
FYNSHNLVRNLALFLKIPSDKIRVSKLMRGESLRKKRSTGLTVELEIGDPPPPFLSNDTA
GGMQMQLSELQKIAGSLGKAVISGKTSSILGFNISSMSITNPIPSPSDSAWTKVTAQPVE
RFAFPVHHVAFVSSLSVIAQPVPAQPGQPFSQQPSVKAVDSDGNCVSVGITSLTLKAVLK
DSNNNQISGLSGNTTIPFSSCWANYTDLTLLRTGKNYKIEFILDDVVRVESGTLSLAAQS
VSGGGGTSGGGNSGRASTVGTAAQIMVPVISCLLGRLVLLEVFMATVLISNIDLGKR
Download sequence
Identical sequences M3Y2W6
ENSMPUP00000005667 ENSMPUP00000005667

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]