SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for A0A2K6TGD9 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  A0A2K6TGD9
Domain Number 1 Region: 372-481
Classification Level Classification E-value
Superfamily Anthrax protective antigen 1.31e-19
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 1830-1910
Classification Level Classification E-value
Superfamily E set domains 0.00000000000000107
Family E-set domains of sugar-utilizing enzymes 0.015
 
Domain Number 3 Region: 2091-2178
Classification Level Classification E-value
Superfamily E set domains 0.00000000000000374
Family Other IPT/TIG domains 0.036
 
Domain Number 4 Region: 1157-1234
Classification Level Classification E-value
Superfamily E set domains 0.000000000000016
Family E-set domains of sugar-utilizing enzymes 0.014
 
Domain Number 5 Region: 1914-1997
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000336
Family Other IPT/TIG domains 0.012
 
Domain Number 6 Region: 1239-1319
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000904
Family E-set domains of sugar-utilizing enzymes 0.023
 
Domain Number 7 Region: 2000-2085
Classification Level Classification E-value
Superfamily E set domains 0.00000000000014
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.023
 
Domain Number 8 Region: 3293-3376,3405-3528
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.00000000000489
Family Galacturonase 0.085
 
Domain Number 9 Region: 1658-1745
Classification Level Classification E-value
Superfamily E set domains 0.000000000014
Family E-set domains of sugar-utilizing enzymes 0.023
 
Domain Number 10 Region: 1066-1144
Classification Level Classification E-value
Superfamily E set domains 0.0000000000318
Family E-set domains of sugar-utilizing enzymes 0.036
 
Domain Number 11 Region: 272-361
Classification Level Classification E-value
Superfamily E set domains 0.000000000205
Family E-set domains of sugar-utilizing enzymes 0.031
 
Domain Number 12 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.000000000576
Family E-set domains of sugar-utilizing enzymes 0.022
 
Domain Number 13 Region: 1563-1634
Classification Level Classification E-value
Superfamily E set domains 0.000000000644
Family E-set domains of sugar-utilizing enzymes 0.036
 
Domain Number 14 Region: 1331-1388
Classification Level Classification E-value
Superfamily E set domains 0.000000000827
Family E-set domains of sugar-utilizing enzymes 0.02
 
Domain Number 15 Region: 1748-1823
Classification Level Classification E-value
Superfamily E set domains 0.0000000021
Family Other IPT/TIG domains 0.086
 
Domain Number 16 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.00000000764
Family Other IPT/TIG domains 0.088
 
Domain Number 17 Region: 1406-1502
Classification Level Classification E-value
Superfamily Cupredoxins 0.000000199
Family Plastocyanin/azurin-like 0.039
 
Domain Number 18 Region: 2370-2380,2477-2686
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.000000942
Family Galacturonase 0.074
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) A0A2K6TGD9
Sequence length 4247
Comment (tr|A0A2K6TGD9|A0A2K6TGD9_SAIBB) PKHD1 like 1 {ECO:0000313|Ensembl:ENSSBOP00000018690} KW=Complete proteome OX=39432 OS=Saimiri boliviensis boliviensis (Bolivian squirrel monkey). GN=PKHD1L1 OC=Platyrrhini; Cebidae; Saimiriinae; Saimiri.
Sequence
MGHLWLLGIWGLWGLLLCAANPRADGSEIIPKVTEIIPKYGSINGATRLTIRGEGFSQAN
QFNYGADNTELGNSVQLVSSFQSIACDVEKDASHSTHITCYTRAMPEGSYTVRVSVDGVP
VTENNTCKGHINSWACTFNAKSFRTPTIRSITPLSGTPGTLITIQGRIFTDVYGSNIVRS
SNGKNVRILRVYIGGMPCELLIPQSDNLYGLKLDHPNGDMGSMICKTTGTFIGHHNVSFI
LDSDYGRSLPQKMAYFVSSLSKISMFQTYAEVTAIFPSRGSIQGGTMLTISGRFFDQTDF
PVRVLVGGETCDTLNVTENSISCKTPPKPHILKTVYPGGRGLKLEVWNNSRPVHLEEILE
YNEKTPGYMEASWVDSPSYIWPMEQDTFVARFSGFLVAPDSDVYRFYIKGDDRYAIYFSQ
TGLPEDKVRIAYHSANANSYFSSPAQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGLYQ
YQNVYTEQQTGDAVNEEQVIKSQSTIIQEVQVITLENWETTNAINEVQKIKITSPCVEAN
SCSLYQYRLIYNMEKTVFLPADASEFILQSSLNDLWSIKPDTVQVIRTQNTQSYIYTITF
ISTRGDFDLLGYEVFEGNNVTLDITEQTKGKPNLETFTLNWDGIASKPLTPWSSEAEFQR
AVEEMVSTKCPPQITNFEEGFVVKYFRDYETDFNLDRINRGQKTAETDAYCGRYSLKNPA
VLFDSADVKPNRLPYGDILLFPYNQLCLAYKGFLANYIGLKFQYQDTSKIIRSTDVQFTY
NFAYGNNWTYTCIDLLDLIRTKYTGTNISLQRISLQKASESQSFYVDVMYIGHTSTISTL
DEMPKRRLPALANKGIFLEHFQVNRTKINGPTMTYQYFVTMTSYNCSYNIPMMAVSFGQI
ITHETENEIVCRGNNWPGESKIHIQRIQAASPPLSGSFDIQAYGRILKGLPAGVSAADLQ
FALQSLEGMGRVSVTREGTCAGYTWNIKWRSTCGKQNLLQINDSNIIGEKANMTVTRIKE
GGLFRQHILGDLLRTPSQQPQVEVYVNGIPAKCSGDCGFTWDSAITPLVLGTSPSQGSYE
EGTILTIVGSGFSPSSAVTVSVGAGGCYLLSVDEKEIKCQILKGSAGHAPVAVSIADVGL
AQNVGGEEFYFVYQSQISHIWPDSGSIAGGTLLTLSGFGFNENSKVFVGNETCDVIEGDL
NRITCRTPKKTEGTVDISVTTNGFQATARDAFSYNCLQTPIITDFSPKVRTILGEVNLTV
NGYNFGNELTQNMVVYVGGKTCQILHWNFTDIRCLLPKLPPGKHDIYVEVRNWGFASTRD
KLNSSIQYVLEVTSMFPQRGSLFGGTEITIRGFGFSTIPAENTVLLGSIPCNVTSSSENV
IKCILHSTGNIFRITNNGKDSVHGLGYAWSPSVLNVSVGDTVTWHWQTHPFLRGIGYRVF
SVSSPGSVIYDGKGFTNGRQKSTSGSFSYQFTSPGIHYYSSGFVDEAHSIFLQGVINVLP
AETRHIPLHLFVGSSEATYAHGGPENLHLGSSVAGCLATEPLCGLNNTRVKNSERLLFEV
SSCFSPSISNITPSSGTVNELITIIGHGFSNLPCANKITIGSYPCVIEESNDDSITCHID
PQNSMDVGIREIVTLTVYNLGTAINTLSNEFDRRFVLLPNIDLVLPNVGSTTGMTRVTIK
GSGFAVSSAGVEVLMGHFPCKVLSVNYTAIECDTSPAVQQLVDVYLLIHGVPAHCQGNCT
FSYLESITPYITGISPNSIIESVKVIIEGEGFGTVLDDIAVYIGNQQFRAIDVNENNITA
VVTSLPVGRHSLSVVVRSKGLALGNLSVSSPPVASVSPTSGSIGGGTTLVITGNGFYPGN
TTVTIGDDPCQIIFVNPNKVYCHTPPGTAGMVGVKIFVNTISYPPLLFTYALEDTPFLRG
IVPSRGPPGTEIEITGSNFGIEILDISVMINNIQCNVTMANDSVLQCIVGDHAGGTFPVM
MHHKTKGSAVSTVVFEYPLNIQNINPSQGSFGGGQTMTVTGTGFNPQNSVILVCGSECAV
DRLRSDYTTLLCEIPSNNGRGAEQACEVSVINGKDLSQSTTPFTYAVFLTPLITAVSPKR
GSTAGGTRLTVMGSGFSENIEDVHITIAEAKCAVECSNKTHIICMTNAHPLSGWAPVHVH
IRGVGMAKLDNADFLYVDAWSSNFSWGGRSPPEEGSLVVISKGQTILLDQSTPILKMLLI
QGGTLIFDEADIELQAENILITDGGILQIGTETSPFQHKAVITLHGHLRSPELPVYGAKT
LAVREGILDLHGVPVPVIWTRLAHTAKAGERILILQEAVTWKPGDNIVIASTGHRHSQGE
NEKRIIASVSADGVTITLSNPLNYTHLGITVTLPDGTLFEARAEVGILTRNILIRGSDNV
EWNKKIPACPDGFDTGEFATQTCLQGKFGEEIGSDQFGGCIMFHAPVPGANMVTGRIEYV
EVFHTGQAFRLGRYPIHWHLLGDLHFKSYVRGCAIHQAYNRAVTIHNTHHLLVERNIIYD
IRGGAFFIEDGIEHGNILQYNLAVFVQQSTSLLNDDVTPAAFWVTNPNNTIRHNAVAGGT
HFGFWYRMNNHPDGPSYDRNICQKRVPLGEFFNNTVHSQGWFGMWIFEEYFPMQTGSCTS
TVPVPARFNSFTAWNCQKGAEWVNGGALQFHNFVMVNNYEAGIETKRILAPYVGGWGETN
GAMIKNAKIVGHLDELGMGSAFCTRKGLVLPFSEGLTVSCVHFMNFDHPNCVALGVTSIS
GVCNDRCGGWSAKFVDIQYFHTPNKAGFRWEHEMVLIDVDGSLTGHKGHTVIPHSSLLDP
SHCTQEAEWSIGFPGSVCDASVSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIIPFQKKRL
THMSGWMALIPNANHINWYFKGVDHITNISYTSTFYGFKDEDYVIISHNFTQNPDMFNII
DMRNGSSNPLNWNTSRNGDWHLEANTSTLYYLVSGRNDLQESQPISGNLDPDVKDVVINF
QAYCCILQDCFPVHPPSRKPMPKKRPATYNLWSNDSFWQSSRENNYTVPHPGANVIIPEG
IWIVADIDMPSMERLIIWGVLELEDKYNVGAAESSYREVVLNATYISLQGGRLIGGWEDN
PFKGDLKIVLRGNHTTRDWALPEGPNQGSKMLGVFGELDLHGIPRSIYKTKLSETADAGS
KILSLMDAVDWQEGEEVVITTTSYDFHQTETRSIVKILHDRKILILNDSLSYTHFAEKYH
VPGTGESYTLAADVGILSRNIKIVSEDYPGWSQDSFGAHVLVGSFTENMMTFKGNARISN
VEFYHSGQEGFRDSTDPRYAVTFLNLGQIQEHGSSYIQGCAFHHGFSPAIGVFGTDGLDI
DDNIIHFTVGEGIRIWGNANRVRGNLIALSVWPGTYQNRKDLSSTLWHAAIEINGGTNTV
LQNNVVAGFARAGYRIDGEPCPSQFNPVEKWFDNEAHGGLYGIYMNQDGLPGCSLIQGFI
IWTCWDYGIYFQTTENVYIYNVTLVDNGMAIFPMIYLPAAISHKISNKKVQIKSSLIVGS
SPGFNCSDILTNDDPNVELTAAHRSPRSPSGGRSGICWPTFASAHNMAPQKPHAGIMSYN
AISGLLDISGSTFVGFKNVCSGETNVIFITNPLNEDLQHPIHVKNIKLVDTTEQSKIFIH
RPDISKVNPSDCVDMVCDAKRKSFLRDIDGSFLGNAGFVIPQAEYEWDGNSQLGIGDYRI
PKVMLTFLNGSKIPVTEKAPYKGIIRDSTCKYIPQWQSYQCFGMEYALMVIESLDPDTET
RRISPVAIMGNGYVDLLNGPQDHGWCAGYTCQRRLSLFHSIVALNKSYEIYFTGTSPQNL
RLMLLNVDRNKAVLVGIFFSTLQRLDVYVNNSLVCPKNTIWNAQQRHCELNTLLYKDQFL
PNLDSTVLGENYFDRTYQMLYLLVKGTIPVEIHTATVIFVSFQLPAVTEDDFYTSHNLVR
NLALFLKIPSDKIRVSKMTRQKSLRRKRSIGFIIEIEIGDPPIQFLNNGTTGQMQFSELQ
ETAGSLGQAVILGKISSILGFNISSISITNPLPSPSDSGWIKVTAQPVERSAFPVHHVAF
VSSLSVITQPVAAQPGQPFPQQPSVKATDSDGNCVSVGITALTLRAILKDSNNNQVSGLS
GNTTIPFSSCWANYTDLTPLRTGKNYKIEFLLDNVVRVESRTFSLQAESVSSSSSSSGSS
SSNSKASTVGTSAQIMTVVISCLIGRMWLLEIFMAAVSTVNITLRSY
Identical sequences A0A2K6TGD9
XP_003938633.1.74449
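
The Region fields in the domain assignment details can list more than one segment (for example 3293-3376,3405-3528 for Domain Number 8). The following is a minimal sketch of how such a field might be parsed and applied to the protein sequence. It assumes that coordinates are 1-based and inclusive, which this page does not state explicitly, and the function names are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a Region string such as '3293-3376,3405-3528' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"segment start exceeds end: {part!r}")
        segments.append((start, end))
    return segments


def region_length(region: str) -> int:
    """Total number of residues covered, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def extract_region(sequence: str, region: str) -> str:
    """Concatenate the residues of every segment (assumes 1-based coordinates)."""
    return "".join(sequence[start - 1:end] for start, end in parse_region(region))


if __name__ == "__main__":
    print(parse_region("3293-3376,3405-3528"))   # [(3293, 3376), (3405, 3528)]
    print(region_length("3293-3376,3405-3528"))  # 208
    print(region_length("372-481"))              # 110
```

To pull out the residues of a domain, pass the full 4247-residue sequence (with line breaks removed) as the first argument to extract_region.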
