SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for H0WB44 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  H0WB44
Domain Number 1 Region: 361-475
Classification Level Classification E-value
Superfamily Anthrax protective antigen 1.96e-18
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 2078-2165
Classification Level Classification E-value
Superfamily E set domains 1.31e-16
Family Other IPT/TIG domains 0.025
 
Domain Number 3 Region: 3175-3208,3242-3363,3392-3515
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.00000000000000507
Family Pectate lyase-like 0.086
 
Domain Number 4 Region: 1821-1897
Classification Level Classification E-value
Superfamily E set domains 0.000000000000033
Family E-set domains of sugar-utilizing enzymes 0.036
 
Domain Number 5 Region: 1146-1223
Classification Level Classification E-value
Superfamily E set domains 0.000000000000151
Family E-set domains of sugar-utilizing enzymes 0.014
 
Domain Number 6 Region: 1228-1308
Classification Level Classification E-value
Superfamily E set domains 0.000000000000726
Family E-set domains of sugar-utilizing enzymes 0.019
 
Domain Number 7 Region: 1055-1132
Classification Level Classification E-value
Superfamily E set domains 0.000000000000938
Family E-set domains of sugar-utilizing enzymes 0.037
 
Domain Number 8 Region: 1901-1983
Classification Level Classification E-value
Superfamily E set domains 0.00000000000196
Family Other IPT/TIG domains 0.012
 
Domain Number 9 Region: 1553-1629
Classification Level Classification E-value
Superfamily E set domains 0.000000000154
Family E-set domains of sugar-utilizing enzymes 0.035
 
Domain Number 10 Region: 1987-2073
Classification Level Classification E-value
Superfamily E set domains 0.00000000031
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.027
 
Domain Number 11 Region: 1735-1807
Classification Level Classification E-value
Superfamily E set domains 0.00000000171
Family E-set domains of sugar-utilizing enzymes 0.031
 
Domain Number 12 Region: 132-230
Classification Level Classification E-value
Superfamily E set domains 0.00000000233
Family E-set domains of sugar-utilizing enzymes 0.028
 
Domain Number 13 Region: 1647-1731
Classification Level Classification E-value
Superfamily E set domains 0.00000000547
Family E-set domains of sugar-utilizing enzymes 0.068
 
Domain Number 14 Region: 1320-1377
Classification Level Classification E-value
Superfamily E set domains 0.00000000548
Family E-set domains of sugar-utilizing enzymes 0.023
 
Domain Number 15 Region: 22-113
Classification Level Classification E-value
Superfamily E set domains 0.0000000162
Family E-set domains of sugar-utilizing enzymes 0.042
 
Domain Number 16 Region: 260-320
Classification Level Classification E-value
Superfamily E set domains 0.000000149
Family E-set domains of sugar-utilizing enzymes 0.079
 
Domain Number 17 Region: 1395-1494
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000107
Family Multidomain cupredoxins 0.084
 
Domain Number 18 Region: 2473-2670
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000432
Family Galacturonase 0.072
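
The Region fields above can contain several comma-separated segments (Domain Number 3, for example). The following is a minimal Python sketch of how such region strings might be parsed and how hits could be ordered along the sequence. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names `parse_region` and `covered_residues` are hypothetical and are not part of any SUPERFAMILY tool. Only four of the eighteen hits are included, to keep the example short.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '3175-3208,3242-3363' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(region: str) -> int:
    """Count residues covered by a region, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# A subset of the hits listed above: (domain number, superfamily, region).
hits = [
    (1, "Anthrax protective antigen", "361-475"),
    (3, "Pectin lyase-like", "3175-3208,3242-3363,3392-3515"),
    (15, "E set domains", "22-113"),
    (18, "Pectin lyase-like", "2473-2670"),
]

# Order hits by the start of their first segment, i.e. N-terminus to C-terminus.
ordered = sorted(hits, key=lambda hit: parse_region(hit[2])[0][0])

for number, superfamily, region in ordered:
    print(number, superfamily, region, covered_residues(region))
```

Sorting by start position rather than by E-value gives the linear order of domains along the protein, which is the order a domain architecture diagram would normally show.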
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) H0WB44
Sequence length 4234
Comment (tr|H0WB44|H0WB44_CAVPO) PKHD1 like 1 {ECO:0000313|Ensembl:ENSCPOP00000020209} KW=Complete proteome; Reference proteome OX=10141 OS=Cavia porcellus (Guinea pig). GN=PKHD1L1 OC=Hystricomorpha; Caviidae; Cavia.
Sequence
LCDLTCAILLYLSDGSKIIPRVTEVMPKYGSVNGATRLTIKGEGFSQANQFNYGVDNADL
GNSVQLVSSFRSITCDVEKDSSHSTQITCYTRPMPEDSYAVRVSVDGVPIAENNTCKGRI
NSWACSFNAKNFRTPTIRSITPTSGTPGTLITIQGRIFTDVYGSNTALSSNGKNVRILRV
YVGGMPCELLKPQSDDLYGLKLDHPSGDVGTVVCKVTGTYVGHHNVSFILDSDYGRSLPQ
KMAYFVSSLNRISMFQTFAEVTMISPSKGSIQGGATLRINSRFFDQTDLPVTVLVGGQPC
DILNVTEDSICCKTPPRPHILKTVYPGGRGLKLEVWNSSQPEQLEEILGYSENTPGYLGA
TWVDSASYIWPMEHDTFVARFSGFLVAPDSDVYRFYIRGDDRYAVYFSQTGLPEDKVKIA
YHSANANSYFSSPTQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGLYQYKSVYTEQQTG
DAVNEEQVIKSQSTVIQEVQVITLENWETAIATNEVQKIIVTSPCVDTNSCSHYQFRLIY
NLEKTVWLPADASEVLLQSALNDLWSIKPDTVQVIRVQSLQHCIYTVTFLSNRGDFDLLA
YEVFEGNNVTVDIEEQTKGKPSLETFTLTWDGILSKPLTPWSSEAEFQVAVEEMVSTKCP
PQIAHFEEGFVVKYFRDYETDFDLENVNRGQKTAESDAYCGRYSLKNPSVLFDSADAKPN
RLPYGDILLFPYNQLCLAYKGFLANHIGLKFQYQDNGKITRSTERQFMYKFAQENSWTYT
CLDLLDLIQSKQTGTDFSLQRISLQKASEMQSFYVDVVYIGQVSTTSTLDEMPKRRLPAL
ASRGIFLKHFQVNESKRDESTMTIQYSVTMTSYNCSYNIPLMAVSFGQIITSETENESVY
RGNNWPGESKIRIRRIQEASPPISGSFDIQAYGHILKGIPADVSVADLRFALQSLEEAGQ
VSVTREGTCAGYAWSIKWRSACGKQSLLQINDSNIVGEKANMTVTKVKEGGLFRQRILGD
LFRTPHQQPQVEVYINGIPAKCSGDCSFTWDPAATPQVLAISPSQGSYEESTLLTIAGSG
FSPSSAVSVSIGSTNCSLLSVEENEIKCQVLKGSAGRLPVAVSVADVGLARTAHSQGFHF
VYQSGISHITPDSGSLAGGTLLTLSGFGFSENSKVLVGNETCDVIEGNLNRITCRTPKRS
EGTVDISVITNGFQATVRNVFRYNCLQTPVITDFSPKIRTILGDVNLTIKGYNFGNELTQ
NVVVYVGGKVCQILHWNFTDIRCLLPLLSPGEHSVYAEVRNWGFASTSDKLNASVQYVLE
VTDMFPRRGSLYGGTEITIMGFGFSTIPAENTVLLGSFPCNVTRSSENAIKCTLHSTRNV
FRVTNDGSDLVHGLGYAWSPSVLNVSVGDTVIWHWQAPPFLRGVGYRVFSVSSPGSVTYD
GKGFTNGRQKSISGSFSYQFTSPGIHYYSSGYVDEAHSISLQGVVNVLPAESRHLPLHLF
VSSTEATHIQGGPENLHLGSSGAGCLATEPLCGPNDTRVQNSSIFLFELSSCISPSISNI
TPSTGTANELITITGSGFSNLTCANKVTIGHYPCVVEESSANSITCHIDPQDSMEVGIRE
LVTLVVYNLGTAINTLSHEFDRRFVLLPSIDMVWPNTGSTTGLTRVIIKGSGFSWAGVEV
FMGHFPCKVLTVNYTAVECETSPAPRQLAQVHLLTRGVPGLCQGSCSFAYSDSITPYITG
VSPNTIRGPVKVLIEGAGFGTVLEEIAVLIGNQQFRAIDVNENNITVLVSSLPAGLHPLR
VVVGTKGLALGNLTVSSPMEASVSPSSGSLGGGTMLVITGNGFHPDSTTVTVGEGPCQIL
FANASEVHCSTPAGRAGKADLQVLVNAVPYPPLPFTYALEDTPFLRRIAPNIGPPGTEVE
ITGSNFGVDISEVSVTISNAGCNVTMVNGSVLRCVAGDHAGGTFPVLLHHKTKGSGVSTV
VFEYPLHIHSIHPEQGSFGGGQILTVTGVGFNPQDSVISICSTECATDRLRSDRTTLLCE
IPPHDGVGPDQVCEVRVVNGKDSSPSTTLFTYMMALTPRVTEISPTKGSTAGGTRLTVLG
SGFSENPQDVQVTIAEAKCDVEYSNKTHIICVTNAHTPSGWAPVHVNIRNIGMARQENTY
FLYVDIWSSNFSWGGKPPPEEGSLAVITKGQILLLDQSTPILKMLLIQGGTLIFDEADIE
LQAENILITDGGTLQVGTETSPFQHQAVITLHGHLRSPELPVYGAKTLAVREGTLDLHGL
PVPVVWTRLAQTAKAGQWTLILQDPVTWKAGDAIVIASTGHRHSQQENEKRTIASVSADG
VSITLTRPLNYTHLGITVTLPDGSLFEARAEVGILTRNIIIRGSENVEWNSKIPACPTGF
DTGEFATQTCLQGKFGEEIGSDQFGGCVMFHAPLPGADVVTGRIEHVEVFHAGQAFRLGR
YPIHWHLLGDLQFKSYVRGCAIHQSYNRAVTIHNTHHLLVERNIIYDIKGGAFFIEDGIE
HGNILQYNLAVFVRQSTSLLNDDVTPAAFWVTNPNNTIRHNAAAGGTHFGFWYRMNNHPD
GPSYDRNICQKRVPLGEFFNNTVHSQGWFGLWIFEEYFPMETGSCTSTVPLPAVFNSLTT
WNCQKGAEWVNGGALQFHNFVMVNNHEAGIETKRILAPYVGGWGETSGAVIKNAKIVGHL
DELGMGTGFCTSKGLVLPFSEGLTVSSVQFMNFDRPNCVALGVTSITGVCHDKCGGWSAK
FVDIQYFHTPNKAGFRWEHEAALIDVDGSLTGHKGHTVIPHSSLLDPSHCTQEAQWSIGF
PGSVCDASVSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIVPFQKKRLTHTSGWMALIPNA
NHINWYFRDVDHVTNISYTSTFYGFKEEDYVIISHNFTQSPDMFSVIDMRNGSSNPLNWN
TSKNGDWHLEANTSTLYYLVSGRSDHHQSQSISGTLDPDVKDVVINFQAYCCTLQDCFPV
HPPSRKPIPRRRPASYILWSNHSFWQSSQENNYTVPHPGAQVVIPEGAWIVADTDLPPME
RLIIWGVLELEDKSSVGASGPSYRRIVLNATYISVQGGRLIGGWEDNPFKGELQIILRGN
HSTPEWALPEGPNQGAKVLGVFGELDLHGLPRLIYKTKLSETAEAGSKVLSLMDAVDWQE
GDEIVITTTSYDLHQTETRSIVRILHGHKILILNDSLSYNHLAKRYYIPETGQNYILAAD
VGLLSRNIKIVGEDYPGSSRDSFGARILVGSFTGNMMTFKGNARISNVEFYHSGQEGFRD
STDPRYAVTFLNLGQIQEHSSSYIRGCAFHHGFSPAIGVFGTDGVDIDDNVIHFTVGEGI
RVWGDANRVRRNLVALSVWPGTYQNRKDSSSTLWHAAIEINRGTNTVLQNNVVAGFGRVG
YRIDGEPCSSQPNPVENWFDNEAHGGLYGVYMNQDGLPGCSLIQGFTIWMCWDYGIYFQT
TERVHIYNVTLVDNGMAIFSMIYMPPAVSHKISSKTVKSTLIVGSSPEFNCSDVLRDDDP
NIELTGAHRSARPPAGGRSGICWPTFASAHNMAPWKPHAGIMSYNAISGHLDVSGSTFVG
FKNVCSGETNVIFMTNPLNEDLQHPIYVKNIRLVDTTEPSKIYIHRPDISKVNPSDCVDM
VCDAKRKSFLKDLDGSFLGNSGSVIPQAEYEWNGNSQLGIGDYRIPQVMLTFLNGSRIPV
SEKAPYKGIIRDATCKYIPEWQSYQCFGMEYAMLVIESLDSDTETRRLSPVAIVSNGYVD
LLNGPQDHGWCVGYTCQRRLSLFHSIVALNKSYEVYFTGTSPQNLRLMLLNVEHTRAVLV
GIFYSTLQRLDVYVNNSLVCPRDTIWNPQQKLCAFNRDPRADQFLPKLNSTVLGENYFDR
TYQMLYILVKGTTPVEIHTAMVIFVSFQLPAVTEDDFYSSHNLVRNLALFLNIPSHKIRV
SKVIREKSLRRKRSTGHSIELEIGEPPMRILSNDTTGQMKLSEFQEISGSLGQAVILGKI
SGILGFNISSMSITNPIPSPNDSGWIKVTAQPVARAAFPVHHVARVSSLSVVTQPVAGQP
GQPFSQQPSVKAVDPEGNCVSVGITSLTLKAVLKDSSNNQVSGLGGNTTILFSSCWANYT
DLTPLRTGKNYKLEFLLDDVIRAESRTFSLPVTSSSSSGGSNGNHSKASAVGTAAQTVAV
LMSCLAGRVLLLEIFMTAAFVLNINVGKHSSFCT
Identical sequences H0WB44