SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A2K5C359 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  A0A2K5C359
Domain Number 1 Region: 366-474
Classification Level Classification E-value
Superfamily Anthrax protective antigen 2.48e-19
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 1825-1905
Classification Level Classification E-value
Superfamily E set domains 0.00000000000000205
Family E-set domains of sugar-utilizing enzymes 0.015
 
Domain Number 3 Region: 2086-2173
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000043
Family Other IPT/TIG domains 0.036
 
Domain Number 4 Region: 1152-1229
Classification Level Classification E-value
Superfamily E set domains 0.000000000000013
Family E-set domains of sugar-utilizing enzymes 0.013
 
Domain Number 5 Region: 1909-1992
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000182
Family Other IPT/TIG domains 0.012
 
Domain Number 6 Region: 1234-1314
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000509
Family E-set domains of sugar-utilizing enzymes 0.02
 
Domain Number 7 Region: 1995-2080
Classification Level Classification E-value
Superfamily E set domains 0.000000000000122
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.023
 
Domain Number 8 Region: 3242-3363,3392-3515
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.000000000000238
Family Galacturonase 0.085
 
Domain Number 9 Region: 1656-1738
Classification Level Classification E-value
Superfamily E set domains 0.00000000000496
Family E-set domains of sugar-utilizing enzymes 0.024
 
Domain Number 10 Region: 1061-1139
Classification Level Classification E-value
Superfamily E set domains 0.0000000000056
Family E-set domains of sugar-utilizing enzymes 0.029
 
Domain Number 11 Region: 1558-1629
Classification Level Classification E-value
Superfamily E set domains 0.000000000196
Family E-set domains of sugar-utilizing enzymes 0.036
 
Domain Number 12 Region: 1326-1383
Classification Level Classification E-value
Superfamily E set domains 0.000000000306
Family E-set domains of sugar-utilizing enzymes 0.029
 
Domain Number 13 Region: 1743-1818
Classification Level Classification E-value
Superfamily E set domains 0.00000000168
Family Other IPT/TIG domains 0.073
 
Domain Number 14 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.00000000777
Family Other IPT/TIG domains 0.088
 
Domain Number 15 Region: 143-236
Classification Level Classification E-value
Superfamily E set domains 0.0000000132
Family E-set domains of sugar-utilizing enzymes 0.022
 
Domain Number 16 Region: 269-356
Classification Level Classification E-value
Superfamily E set domains 0.00000149
Family E-set domains of sugar-utilizing enzymes 0.057
 
Domain Number 17 Region: 1401-1497
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000149
Family Multidomain cupredoxins 0.099
 
Domain Number 18 Region: 2349-2391,2461-2673
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000576
Family Pectate lyase-like 0.074
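
The strong hits above are listed in order of superfamily E-value, not by position along the sequence, and two of them (domains 8 and 18) span two segments separated by a comma. A minimal Python sketch for reading the region strings and re-ordering hits by start position is given below. The helper names `parse_region` and `region_length` are hypothetical and not part of any SUPERFAMILY tool, and the length calculation assumes both endpoints of a segment are inclusive.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '3242-3363,3392-3515' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered by all segments (assumes inclusive endpoints)."""
    return sum(end - start + 1 for start, end in parse_region(region))


# A few (domain number, region) pairs copied from the listing above.
domains = [
    (1, "366-474"),
    (8, "3242-3363,3392-3515"),
    (14, "32-125"),
    (18, "2349-2391,2461-2673"),
]

# Re-order by the start of the first segment, i.e. by position in the sequence.
by_position = sorted(domains, key=lambda d: parse_region(d[1])[0][0])

for number, region in by_position:
    print(number, region, region_length(region))
```

Sorting on the first segment's start is enough here because none of the listed regions overlap; overlapping hits would need a tie-breaking rule.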
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) A0A2K5C359
Sequence length 4236
Comment (tr|A0A2K5C359|A0A2K5C359_AOTNA) PKHD1 like 1 {ECO:0000313|Ensembl:ENSANAP00000003132} KW=Complete proteome OX=37293 OS=Aotus nancymaae (Ma's night monkey). GN=PKHD1L1 OC=Platyrrhini; Aotidae; Aotus.
Sequence
MGHLWLLGIWGLWGLLLCAANPRTDGSEIIPKVTEIIPKYGSINGATRLTIRGEGFSQAN
QFNYGVDNAALGNSVQLVSSFQSIACDVEKDASHSTHITCYTRAMPEGSYTVRVSVDGVP
VTENNTCKGHINSWACTFNAKSFRTPTIRSITPLSGTPGTLITIQGRIFTDVYGSNIVRS
SNGKNVRILRVYIGGMPCELLIPQSDNLYGLKLDHPNGDMGSMICKTTGTFIGKSENLTF
FLNRSLPQKMAYFVSSLSKISMFQTYAEVTTIFPSRGSIQGGTTLTISGRFFDQTDFPVR
VLVKLCDTFNVTENSICCKTPPKPRILKTVYPGGRGLKLEVWNNSHPVHLEEILEYNEKT
PGYMGASWVDSASYIWPMEQDTFVARFSGFLVAPDSDVYRFYIKGDDRYAIYFSQTGLPE
DKVRIAYHSANANSYFSSPAQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGLYQYQNVY
TEQQTGDAVNEEQVIKSQSTIIQEVQVITLENWETTNAINEVQKIKVTSPCVEANSCSLY
QYRLIYNMEKTVFLPADASEFILQSALNDLWSIKPDTVQVKRTQNAQSYIYTITFISTRG
DFDLLGYEVFEGNNVTLDITEQTKGKPNLETFTLNWDGIASKPLTPWSSEAEFQRAVEEM
VSTKCPPQITNFEEGFVVKYFRDYETDFNLERINRGQKTAETDAYCGRYSLKNPAVLFDS
ADVKPNRLPYGDILLFPYNQLCLAYKGFLANYIGLKFQYQDSSKIIRSTDVQFTYNFAYR
NNWTYTCIDLLDLIRTKYTGTNISLQRISLQKASESQSFYVDVVYIGHTSTVSTLDEMPK
RRLPALANKGIFLEHFQVNRTKINGPTMTHQYFVTMTSYNCSYNIPMMAVSFGQIITHKT
ENEIVYRGNNWPGESKIHIQRIQAASPPLSGSFDIQAYGRILKGLPTGVSAADLQFALQS
LEGMGRVSVTRDGTCAGYTWSIKWRSTCGKQNLLQINDSNIIGEKANMTVTRIKEGGLFR
QHILGDLLRTPSQQPQVEVYVNGIPAKCSGDCGFTWDSAITPLVLGTSPSQGSYEEGTIL
TIVGSGFSPSSAVTVSVGPVGCYLLSVDEKEIKCQILNGSAGHAPVAVSIADVGLAQNVG
GEEFYFVYQSQISHIWPDSGSIAGGTLLTLSGFGFNENSKVLVGNETCNVIEGDLNRITC
RTPKKTEGTVDISVTTNGFQATARDAFSYNCLQTPIITDFNPKVRTILGEVNLTINGYNF
GNELTQNMVVYVGGKTCQILHWNFTDIRCLLPKLPPGKHDIYVEVRNWGFASTRDKLNSS
IQYVLEVTSIFPQRGSLFGGTEITIRGFGFSTIPAENTVLLGSIPCNVTSSSENVIKCIL
HSTGNIFRITNNGKDSVHGLGYAWSPSVLNVSVGDTVAWHWQTHPFLRGIGYRVFSVSSP
GSVIYDGKGFTNGRQKSTSGSFSYQFTSPGIHYYSSGFVDEAHSIFLQGVINVLPAETRH
IPLHLFVGSSEATYAHGGPENLHLGSSVAGCLATEPLCGLNNTRVKNSERLLFEVSSCFS
PSISNITPSSGTVNELITIIGHGFSNLPCANKVTIGSYPCVIEESSDDSITCHIDPQNSM
DVGIREIVTLTVYNLGTAINTLSNEFDRRFVLLPNIDLVLPNVGSTTGMTRVTIKGSGFA
VSSAGVEVLMGHFPCKILSVNYTAIECETSPAVQQLVDVYLLIHGVPAQCQGNCTFSYLE
SITPYITGISPNSIIESVKVIIEGEGFGNVLDDIAVFIGNQQFRTIDVNENNITALVTSL
PVGRHSLSVVVGSKGLALGNLSVSSPPVASVSPISGSIGGGTTLLITGNGFYPGNTTVTI
GDDPCQIISINPNEVYCHTPPRTAGMVGVKIFVNTISYPPLLFTYALEDTPFLRGIVPSR
GPPGTEIEITGSNFGIEILDISVTINNVQCNVTMANDSVLQCIVGDHAGGTFPVMMHHKT
KGSAISTVVFEYPLNIENINPSQGSFGGGQTMTVTGTGFNPQNSIILVCGSECAVDRLRS
DYTTLLCEIPSNNGRGAEQACEVSVVNGKYLSQSTTPFTYAVFLTPLITAVSPKRGSTAG
GTRLTVMGSGFSENIEDVHITIAEAKCAVECSNKTHIICMTDAHPLSGWAPVHVHIRGVG
MAKLDNADFLYVDAWSSNFSWGGRSPPEEGSLVVITKGQTILLDQSTPILKMLLIQGGTL
IFDEADIELQAENILLQMEIGTETSPFQHKAVITLHGHLRSPELPVYGAKNTVYMLALGV
PVPVIWTRLAHTAKAGERILILQEAVTWKPGDNIVIASTGHRHSQGENEKRTIASVSADG
ITVTLSNPLNYTHLGITVTLPDGTLFEARAEVGILTRNILIRGSNNVEWNNKIPACPDGF
DTGEFATQTCLQGKFGEEIGSDQFGGCIMFHAPVPGANMVTGRIEYVEVFHAGQAFRLGR
YPIHWHLLGDLRFKSYVRGCAIHQAYNRAVTIHNTHHLLVERNIIYDIRGGAFFIEDGIE
HGNILQYNLAVFVQQSTSLLNDDVTPAAFWVTNPNNTIRHNAVAGGTHFGFWYRMNNHPD
GPSYDRNICQKRVPLGEFFNNTVHSQGWFGMWIFEEYFPMQTGSCISTVPVPARFNSFTA
WNCQKGAEWVNGGALQFHNFMMVNNYEAGIETKRILAPYVGGWGETNGAMIKNAKIVGHL
YELGMGSAFCTRKGLVLPFSEGLTVSSVHFMNFDRPNCVALGVTSISGVCNDRCGGWSAK
FVDIQYFHTPNKAGFRWEHEMVLIDVDGSLTGHKGHTVIPHSSLLDPSHCTQKAEWSIGF
PGSVCDASVSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIIPFQKKRLTHMSGWMALIPNA
NHINWYFKGVDHITNISYTSTFYGFKDEDYVIISHNFTQNPDMFNIIDMRNGSLNPLNWN
TSKNGDWHLEANTSTLYYLVSGRNDLQQSQPISGNLDPDVKDVVINFQAYCCILQDCFPV
HPPSRKPMPKKRPATYNLWSNDSFWQSSRENNYTVPHPGANVIIPEGIWIVADIDMPSME
RLTIWGVLELEDKYNVGAAESSYREVVLNATYISLQGGRLIGGWEDNPFKGDLKIVLRGN
HTTRDWALPEGPNQGSKVLGVFGELDLHGIPRSIYKTKLSETADAGSKILSLVDAVDWQE
GEEIVITTTSYDFHQTETRSIVKILHDRKILILNDSLSYTHFAEKYHVPGTGESYTLAAD
VGILSRNIKIVGEDYPGWSQDSFGAHVLVGSFTENMMTFKGNARISNVEFYHSGQEGFRD
STDPRYAVTFLNLGQIQEHGSSYIRGCAFHHGFSPAIGVFGTDGLDIDDNIIHFTVGEGI
RIWGNANRVRGNLIALSVWPGTYQNRKDLSSTLWHAAIEINGGTNTVLQNNVVAGFARAG
YRIDGEPCPSQFNPVEMWFDNEAHGGLYGIYMNQDGFPGCSLIQGFTIWTCWDYGIYFQT
TENVYIYNVTLVDNGMAIFPMIYLPAAISHKISNKKVQIKSSLIVGSSPGFNCSDILTND
DPNVELTAAHRSPRSPSGGRSGICWPTFASAHNMAPRKPHAGIMSYNAISGLLDISGSTF
VGFKNVCSGETNVIFITNPLNEDLQHPIHVKNIKLVDTTEQSKIFIHRPDISKVNPSDCV
DMVCDAKRKSFLRDIDGSFLGNAGFVIPQAEYEWDGNSQLGIGDYRIPKVMLTSLNGSKI
PVTEKAPHKGIIRDSTCKYIPQWQSYQCFGMEYALMVIESLDPDTETRRISPVAIMGNGY
VDLLNGPQDHGWCAGYTCQRRLSLFHSIVALNKSYEIYFTGTSPQNLRLMLLNVDHNKAV
VVGIFFSTLQRLDVYVNNSLVCPKNTIWNAQQKHCELNTLLYKDQFLPNLDSAVLGENYF
DRTYQLLYLLVKGTIPVEIHTATVIFVSFQLPAITEDDFYTSHNLVRNLALFLKIPNDKI
RISKMIREKSLRRKRSMGFIIEIEIGDPPIQFLNNGTTGQMKLSELQEIAGSLGQAVILG
KISSILGFNISSMSITNPLPSPSDSGWIKVTAQPVERSAFPVHHVAFVSSLSVITQPVAA
QPGQPFPQQPSVKATDSDGNCVSVGITALTLRAILKDSSNNQVSGLSGNTTIPFSSCWAN
YTDLTPLRSGKNYKIEFILDNVVRVESRTFSLQAESVSSSSSSSGSSSSSSSSGGSSSSN
SKASTVGTSAQIMTVVISCLIGRMWLLEIFMAAVRL
Identical sequences A0A2K5C359
