SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A2K5C366 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  A0A2K5C366
Domain Number 1 Region: 369-477
Classification Level Classification E-value
Superfamily Anthrax protective antigen 2.48e-19
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 1828-1908
Classification Level Classification E-value
Superfamily E set domains 0.00000000000000205
Family E-set domains of sugar-utilizing enzymes 0.015
 
Domain Number 3 Region: 2089-2176
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000043
Family Other IPT/TIG domains 0.036
 
Domain Number 4 Region: 1155-1232
Classification Level Classification E-value
Superfamily E set domains 0.000000000000013
Family E-set domains of sugar-utilizing enzymes 0.013
 
Domain Number 5 Region: 1912-1995
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000182
Family Other IPT/TIG domains 0.012
 
Domain Number 6 Region: 1237-1317
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000509
Family E-set domains of sugar-utilizing enzymes 0.02
 
Domain Number 7 Region: 1998-2083
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000993
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.021
 
Domain Number 8 Region: 3245-3366,3395-3518
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.000000000000236
Family Galacturonase 0.085
 
Domain Number 9 Region: 1659-1741
Classification Level Classification E-value
Superfamily E set domains 0.00000000000496
Family E-set domains of sugar-utilizing enzymes 0.024
 
Domain Number 10 Region: 1064-1142
Classification Level Classification E-value
Superfamily E set domains 0.0000000000056
Family E-set domains of sugar-utilizing enzymes 0.029
 
Domain Number 11 Region: 1561-1632
Classification Level Classification E-value
Superfamily E set domains 0.000000000196
Family E-set domains of sugar-utilizing enzymes 0.036
 
Domain Number 12 Region: 1329-1386
Classification Level Classification E-value
Superfamily E set domains 0.000000000306
Family E-set domains of sugar-utilizing enzymes 0.029
 
Domain Number 13 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.000000000576
Family E-set domains of sugar-utilizing enzymes 0.022
 
Domain Number 14 Region: 1746-1821
Classification Level Classification E-value
Superfamily E set domains 0.00000000168
Family Other IPT/TIG domains 0.073
 
Domain Number 15 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.00000000777
Family Other IPT/TIG domains 0.088
 
Domain Number 16 Region: 272-359
Classification Level Classification E-value
Superfamily E set domains 0.00000149
Family E-set domains of sugar-utilizing enzymes 0.057
 
Domain Number 17 Region: 1404-1500
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000149
Family Multidomain cupredoxins 0.099
 
Domain Number 18 Region: 2352-2394,2464-2674
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000513
Family Pectate lyase-like 0.074
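
The strong hits above are listed in order of superfamily E-value, not in order of position along the protein. A minimal Python sketch of how the N- to C-terminal domain order could be recovered from the listed regions follows. The names HITS, parse_region, architecture and covered_residues are hypothetical helpers written for this illustration and are not part of SUPERFAMILY. The region strings are copied from the listing, and the sketch assumes that a comma separates the segments of a discontinuous domain and that each range is inclusive.

```python
# Strong hits copied from the listing above: (domain number, region, superfamily).
HITS = [
    (1, "369-477", "Anthrax protective antigen"),
    (2, "1828-1908", "E set domains"),
    (3, "2089-2176", "E set domains"),
    (4, "1155-1232", "E set domains"),
    (5, "1912-1995", "E set domains"),
    (6, "1237-1317", "E set domains"),
    (7, "1998-2083", "E set domains"),
    (8, "3245-3366,3395-3518", "Pectin lyase-like"),
    (9, "1659-1741", "E set domains"),
    (10, "1064-1142", "E set domains"),
    (11, "1561-1632", "E set domains"),
    (12, "1329-1386", "E set domains"),
    (13, "143-241", "E set domains"),
    (14, "1746-1821", "E set domains"),
    (15, "32-125", "E set domains"),
    (16, "272-359", "E set domains"),
    (17, "1404-1500", "Cupredoxins"),
    (18, "2352-2394,2464-2674", "Pectin lyase-like"),
]


def parse_region(region):
    """Split a region string such as '3245-3366,3395-3518' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def architecture(hits):
    """Return the hits sorted by the start of their first segment."""
    return sorted(hits, key=lambda hit: parse_region(hit[1])[0][0])


def covered_residues(hits):
    """Count residues inside any listed segment (assumes inclusive, non-overlapping ranges)."""
    return sum(
        end - start + 1
        for _, region, _ in hits
        for start, end in parse_region(region)
    )


ORDERED = architecture(HITS)
for number, region, superfamily in ORDERED:
    print(f"{region:>22}  domain {number:>2}  {superfamily}")
```

Sorting on the first segment is a simplification: it is sufficient here because none of the listed regions interleave, but it would not describe nested or inserted domains correctly.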
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) A0A2K5C366
Sequence length 4226
Comment (tr|A0A2K5C366|A0A2K5C366_AOTNA) PKHD1 like 1 {ECO:0000313|Ensembl:ENSANAP00000003146} KW=Complete proteome OX=37293 OS=Aotus nancymaae (Ma's night monkey). GN=PKHD1L1 OC=Platyrrhini; Aotidae; Aotus.
Sequence
MGHLWLLGIWGLWGLLLCAANPRTDGSEIIPKVTEIIPKYGSINGATRLTIRGEGFSQAN
QFNYGVDNAALGNSVQLVSSFQSIACDVEKDASHSTHITCYTRAMPEGSYTVRVSVDGVP
VTENNTCKGHINSWACTFNAKSFRTPTIRSITPLSGTPGTLITIQGRIFTDVYGSNIVRS
SNGKNVRILRVYIGGMPCELLIPQSDNLYGLKLDHPNGDMGSMICKTTGTFIGHHNVSFI
LDSDYGRSLPQKMAYFVSSLSKISMFQTYAEVTTIFPSRGSIQGGTTLTISGRFFDQTDF
PVRVLVKLCDTFNVTENSICCKTPPKPRILKTVYPGGRGLKLEVWNNSHPVHLEEILEYN
EKTPGYMGASWVDSASYIWPMEQDTFVARFSGFLVAPDSDVYRFYIKGDDRYAIYFSQTG
LPEDKVRIAYHSANANSYFSSPAQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGLYQYQ
NVYTEQQTGDAVNEEQVIKSQSTIIQEVQVITLENWETTNAINEVQKIKVTSPCVEANSC
SLYQYRLIYNMEKTVFLPADASEFILQSALNDLWSIKPDTVQVKRTQNAQSYIYTITFIS
TRGDFDLLGYEVFEGNNVTLDITEQTKGKPNLETFTLNWDGIASKPLTPWSSEAEFQRAV
EEMVSTKCPPQITNFEEGFVVKYFRDYETDFNLERINRGQKTAETDAYCGRYSLKNPAVL
FDSADVKPNRLPYGDILLFPYNQLCLAYKGFLANYIGLKFQYQDSSKIIRSTDVQFTYNF
AYRNNWTYTCIDLLDLIRTKYTGTNISLQRISLQKASESQSFYVDVVYIGHTSTVSTLDE
MPKRRLPALANKGIFLEHFQVNRTKINGPTMTHQYFVTMTSYNCSYNIPMMAVSFGQIIT
HKTENEIVYRGNNWPGESKIHIQRIQAASPPLSGSFDIQAYGRILKGLPTGVSAADLQFA
LQSLEGMGRVSVTRDGTCAGYTWSIKWRSTCGKQNLLQINDSNIIGEKANMTVTRIKEGG
LFRQHILGDLLRTPSQQPQVEVYVNGIPAKCSGDCGFTWDSAITPLVLGTSPSQGSYEEG
TILTIVGSGFSPSSAVTVSVGPVGCYLLSVDEKEIKCQILNGSAGHAPVAVSIADVGLAQ
NVGGEEFYFVYQSQISHIWPDSGSIAGGTLLTLSGFGFNENSKVLVGNETCNVIEGDLNR
ITCRTPKKTEGTVDISVTTNGFQATARDAFSYNCLQTPIITDFNPKVRTILGEVNLTING
YNFGNELTQNMVVYVGGKTCQILHWNFTDIRCLLPKLPPGKHDIYVEVRNWGFASTRDKL
NSSIQYVLEVTSIFPQRGSLFGGTEITIRGFGFSTIPAENTVLLGSIPCNVTSSSENVIK
CILHSTGNIFRITNNGKDSVHGLGYAWSPSVLNVSVGDTVAWHWQTHPFLRGIGYRVFSV
SSPGSVIYDGKGFTNGRQKSTSGSFSYQFTSPGIHYYSSGFVDEAHSIFLQGVINVLPAE
TRHIPLHLFVGSSEATYAHGGPENLHLGSSVAGCLATEPLCGLNNTRVKNSERLLFEVSS
CFSPSISNITPSSGTVNELITIIGHGFSNLPCANKVTIGSYPCVIEESSDDSITCHIDPQ
NSMDVGIREIVTLTVYNLGTAINTLSNEFDRRFVLLPNIDLVLPNVGSTTGMTRVTIKGS
GFAVSSAGVEVLMGHFPCKILSVNYTAIECETSPAVQQLVDVYLLIHGVPAQCQGNCTFS
YLESITPYITGISPNSIIESVKVIIEGEGFGNVLDDIAVFIGNQQFRTIDVNENNITALV
TSLPVGRHSLSVVVGSKGLALGNLSVSSPPVASVSPISGSIGGGTTLLITGNGFYPGNTT
VTIGDDPCQIISINPNEVYCHTPPRTAGMVGVKIFVNTISYPPLLFTYALEDTPFLRGIV
PSRGPPGTEIEITGSNFGIEILDISVTINNVQCNVTMANDSVLQCIVGDHAGGTFPVMMH
HKTKGSAISTVVFEYPLNIENINPSQGSFGGGQTMTVTGTGFNPQNSIILVCGSECAVDR
LRSDYTTLLCEIPSNNGKGAEQACEVSVVNGKYLSQSTTPFTYAVFLTPLITAVSPKRGS
TAGGTRLTVMGSGFSENIEDVHITIAEAKCAVECSNKTHIICMTDAHPLSGWAPVHVHIR
GVGMAKLDNADFLYVDAWSSNFSWGGRSPPEEGSLVVITKGQTILLDQSTPILKMLLIQG
GTLIFDEADIELQAENILLQMEIGTETSPFQHKAVITLHGHLRSPELPVYGAKNTVYMLA
LGVPVPVIWTRLAHTAKAGERILILQEAVTWKPGDNIVIASTGHRHSQGENEKRTIASVS
ADGITVTLSNPLNYTHLGITVTLPDGTLFEARAEVGILTRNILIRGSNNVEWNNKIPACP
DGFDTGEFATQTCLQGKFGEEIGSDQFGGCIMFHAPVPGANMVTGRIEYVEVFHAGQAFR
LGRYPIHWHLLGDLRFKSYVRGCAIHQAYNRAVTIHNTHHLLVERNIIYDIRGGAFFIED
GIEHGNILQYNLAVFVQQSTSLLNDDVTPAAFWVTNPNNTIRHNAVAGGTHFGFWYRMNN
HPDGPSYDRNICQKRVPLGEFFNNTVHSQGWFGMWIFEEYFPMQTGSCISTVPVPARFNS
FTAWNCQKGAEWVNGGALQFHNFMMVNNYEAGIETKRILAPYVGGWGETNGAMIKNAKIV
GHLYELGMGSAFCTRKGLVLPFSEGLTVSSVHFMNFDRPNCVALGVTSISGVCNDRCGGW
SAKFVDIQYFHTPNKAGFRWEHEMVLIDVDGSLTGHKGHTVIPHSSLLDPSHCTQKAEWS
IGFPGSVCDASVSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIIPFQKKRLTHMSGWMALI
PNANHINWYFKGVDHITNISYTSTFYGFKDEDYVIISHNFTQNPDMFNIIDMRNGSLNPL
NWNTSKNGDWHLEANTSTLYYLVSGRNDLQQSQPISGNLDPDVKDVVINFQAYCCILQDC
FPVHPPSRKPMPKKRPATYNLWSNDSFWQSSRENNYTVPHPGANVIIPEGIWIVADIDMP
SMERLTIWGVLELEDKYNVGAAESSYREVVLNATYISLQGGRLIGGWEDNPFKGDLKIVL
RGNHTTRDWALPEGPNQGSKVLGVFGELDLHGIPRSIYKTKLSETADAGSKILSLVDAVD
WQEGEEIVITTTSYDFHQTETRSIVKILHDRKILILNDSLSYTHFAEKYHVPGTGESYTL
AADVGILSRNIKIVGEDYPGWSQDSFGAHVLVGSFTENMMTFKGNARISNVEFYHSGQEG
FRDSTDPRYAVTFLNLGQIQEHGSSYIRGCAFHHGFSPAIGVFGTDGLDIDDNIIHFTVG
EGIRIWGNANRVRGNLIALSVWPGTYQNRKDLSSTLWHAAIEINGGTNTVLQNNVVAGFA
RAGYRIDGEPCPSQFNPVEMWFDNEAHGGLYGIYMNQDGFPGCSLIQGFTIWTCWDYGIY
FQTTENVYIYNVTLVDNGMAIFPMIYLPAAISHKISNKKVQIKSSLIVGSSPGFNCSDIL
TNDDPNVELTAAHRSPRSPSGGRSGICWPTFASAHNMAPRKPHAGIMSYNAISGLLDISG
STFVGFKNVCSGETNVIFITNPLNEDLQHPIHVKNIKLVDTTEQSKIFIHRPDISKVNPS
DCVDMVCDAKRKSFLRDIDGSFLGNAGFVIPQAEYEWDGNSQLGIGDYRIPKVMLTSLNG
SKIPVTEKAPHKGIIRDSTCKYIPQWQSYQCFGMEYALMVIESLDPDTETRRISPVAIMG
NGYVDLLNGPQDHGWCAGYTCQRRLSLFHSIVALNKSYEIYFTGTSPQNLRLMLLNVDHN
KAVVVGIFFSTLQRLDVYVNNSLVCPKNTIWNAQQKHCELNTLLYKDQFLPNLDSAVLGE
NYFDRTYQLLYLLVKGTIPVEIHTATVIFVSFQLPAITEDDFYTSHNLVRNLALFLKIPN
DKIRISKMIREKSLRRKRSMGFIIEIEIGDPPIQFLNNGTTGQMKLSELQEIAGSLGQAV
ILGKISSILGFNISSMSITNPLPSPSDSGWIKVTAQPVERSAFPVHHVAFVSSLSVITQP
VAAQPGQPFPQQPSVKATDSDGNCVSVGITALTLRAILKDSSNNQVSGLSGNTTIPFSSC
WANYTDLTPLRSGKNYKIEFILDNVVRVESRTFSLQAESVSSSSSSSGSSSSTSTVGTSA
QIMTVVISCLIGRMWLLEIFMAAVRL
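
To inspect the residues behind one of the assigned domains, the region coordinates can be applied to the sequence above. The following Python sketch assumes the coordinates are 1-based and inclusive and that the 60-residue lines are joined without separators. The function names join_lines and extract_region are hypothetical and chosen for this illustration only.

```python
def join_lines(lines):
    """Join wrapped sequence lines into one string, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def extract_region(sequence, region):
    """Return the residues of a region such as '369-477' or '3245-3366,3395-3518'.

    Assumes 1-based inclusive coordinates; the segments of a discontinuous
    region are concatenated in the order given.
    """
    pieces = []
    for part in region.split(","):
        start, end = (int(value) for value in part.split("-"))
        if not 1 <= start <= end <= len(sequence):
            raise ValueError(f"segment {part} lies outside the sequence")
        # Convert 1-based inclusive coordinates to a 0-based half-open slice.
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


# Only the first line of the sequence above is used here, as a short demonstration.
FIRST_LINE = join_lines(
    ["MGHLWLLGIWGLWGLLLCAANPRTDGSEIIPKVTEIIPKYGSINGATRLTIRGEGFSQAN"]
)
print(extract_region(FIRST_LINE, "1-5"))
```

With the full 4226-residue sequence passed in place of FIRST_LINE, the same call would accept any of the regions from the domain assignment details.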
Identical sequences A0A2K5C366
