SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSCPOP00000020209 from Cavia porcellus 69_3


Domain assignment details

Strong hits

Sequence:  ENSCPOP00000020209
Domain Number 1 Region: 349-465
Classification Level Classification E-value
Superfamily Anthrax protective antigen 8.24e-18
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 2070-2157
Classification Level Classification E-value
Superfamily E set domains 1.31e-16
Family Other IPT/TIG domains 0.025
 
Domain Number 3 Region: 3168-3200,3234-3355,3384-3506
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000000000000243
Family Pectate lyase-like 0.086
 
Domain Number 4 Region: 1813-1889
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000355
Family E-set domains of sugar-utilizing enzymes 0.036
 
Domain Number 5 Region: 1138-1215
Classification Level Classification E-value
Superfamily E set domains 0.000000000000163
Family E-set domains of sugar-utilizing enzymes 0.014
 
Domain Number 6 Region: 1220-1300
Classification Level Classification E-value
Superfamily E set domains 0.000000000000726
Family E-set domains of sugar-utilizing enzymes 0.019
 
Domain Number 7 Region: 1893-1975
Classification Level Classification E-value
Superfamily E set domains 0.00000000000196
Family Other IPT/TIG domains 0.012
 
Domain Number 8 Region: 1046-1124
Classification Level Classification E-value
Superfamily E set domains 0.000000000101
Family E-set domains of sugar-utilizing enzymes 0.045
 
Domain Number 9 Region: 1545-1621
Classification Level Classification E-value
Superfamily E set domains 0.000000000154
Family E-set domains of sugar-utilizing enzymes 0.035
 
Domain Number 10 Region: 1979-2065
Classification Level Classification E-value
Superfamily E set domains 0.000000000323
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.027
 
Domain Number 11 Region: 1727-1799
Classification Level Classification E-value
Superfamily E set domains 0.00000000171
Family E-set domains of sugar-utilizing enzymes 0.031
 
Domain Number 12 Region: 120-218
Classification Level Classification E-value
Superfamily E set domains 0.00000000233
Family E-set domains of sugar-utilizing enzymes 0.028
 
Domain Number 13 Region: 1639-1723
Classification Level Classification E-value
Superfamily E set domains 0.00000000547
Family E-set domains of sugar-utilizing enzymes 0.068
 
Domain Number 14 Region: 1312-1369
Classification Level Classification E-value
Superfamily E set domains 0.00000000598
Family E-set domains of sugar-utilizing enzymes 0.023
 
Domain Number 15 Region: 10-101
Classification Level Classification E-value
Superfamily E set domains 0.0000000162
Family E-set domains of sugar-utilizing enzymes 0.042
 
Domain Number 16 Region: 248-308
Classification Level Classification E-value
Superfamily E set domains 0.000000204
Family E-set domains of sugar-utilizing enzymes 0.079
 
Domain Number 17 Region: 1387-1486
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000107
Family Multidomain cupredoxins 0.084
 
Domain Number 18 Region: 2465-2662
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000432
Family Galacturonase 0.072
 
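Region strings in the assignments above can list several comma-separated segments for one domain (for example, Domain Number 3 spans 3168-3200,3234-3355,3384-3506). The following is a minimal Python sketch for splitting such a string into segments and counting the residues covered. The helper names parse_region and region_length are hypothetical and not part of SUPERFAMILY, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
def parse_region(region):
    """Split a region string like '10-20,30-40' into (start, end) integer pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments):
    """Count residues covered, assuming 1-based inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in segments)


# Domain Number 3 from the listing above: one domain built from three segments.
segments = parse_region("3168-3200,3234-3355,3384-3506")
print(segments)
print(region_length(segments))
```

Under the inclusive-coordinate assumption, summing per-segment lengths gives the number of residues assigned to the domain, not the distance between its first and last residue.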

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSCPOP00000020209   Gene: ENSCPOG00000019466   Transcript: ENSCPOT00000027015
Sequence length 4219
Comment pep:known scaffold:cavPor3:scaffold_0:14639024:14784258:-1 gene:ENSCPOG00000019466 transcript:ENSCPOT00000027015 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
SDGSKIIPRVTEVMPKYGSVNGATRLTIKGEGFSQANQFNYGVDNADLGNSVQLVSSFRS
ITCDVEKDSSHSTQITCYTRPMPEDSYAVRVSVDGVPIAENNTCKGRINSWACSFNAKNF
RTPTIRSITPTSGTPGTLITIQGRIFTDVYGSNTALSSNGKNVRILRVYVGGMPCELLKP
QSDDLYGLKLDHPSGDVGTVVCKVTGTYVGHHNVSFILDSDYGRSLPQKMAYFVSSLNRI
SMFQTFAEVTMISPSKGSIQGGATLRINSRFFDQTDLPVTVLVGGQPCDILNVTEDSICC
KTPPRPHILKTVYPGGRGLKLEVWNSSQPEQLEEILGYSENTPGYLGATWVDSASYIWPM
EHDTFVARFSGFLVAPDSDVYRFYIRGDDRYAVYFSQTGLPEDKDKVKIAYHSANANSYF
SSPTQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGLYQYKSVYTEQQTGDAVNEEQVIK
SQSTVIQEVQVITLENWETAIATNEVQKIIVTSPCVDTNSCSHYQFRLIYNLEKTEVWLP
ADASEVLLQSALNDLWSIKPDTVQVIRVQSLQHCIYTVTFLSNRGDFDLLAYEVFEGNNV
TVDIEEQTKGKPSLETFTLTWDGILSKPLTPWSSEAEFQVAVEEMVSTKCPPQIAHFEEG
FVVKYFRDYETDFDLENVNRGQKTAESDAYCGRYSLKNPSVLFDSADAKPNRLPYGDILL
FPYNQLCLAYKGFLANHIGLKFQYQDNGKITRSTERQFMYKFAQENSWTYTCLDLLDLIQ
SKQTGTDFSLQRISLQKASEMQSFYVDVVYIGQVSTTSTLDEMPKRRLPALASRGIFLKH
FQVNESKRDESTMTIQYSVTMTSYNCSYNIPLMAVSFGQIITSETENESVYRGNNWPGES
KIRIRRIQEASPPISGSFDIQAYGHILKGIPADVSVADLRFALQSLEEAGQVSVTREGTC
AGYAWSIKWRSACGKQSLLQINDSNIVGEKANMTVTKVKEGGLFRQRILGDLFRTPHQQP
QVEVYINGIPAKCSGDCSFTWDPAATPQVLAISPSQGSYEESTLLTIAGSGFSPSSAVSV
SIGSTNCSLLSVEEENEIKCQVLKGSAGRLPVAVSVADVGLARTAHSQGFHFVYQSGISH
ITPDSGSLAGGTLLTLSGFGFSENSKVLVGNETCDVIEGNLNRITCRTPKRSEGTVDISV
ITNGFQATVRNVFRYNCLQTPVITDFSPKIRTILGDVNLTIKGYNFGNELTQNVVVYVGG
KVCQILHWNFTDIRCLLPLLSPGEHSVYAEVRNWGFASTSDKLNASVQYVLEVTDMFPRR
GSLYGGTEITIMGFGFSTIPAENTVLLGSFPCNVTRSSENAIKCTLHSTRNVFRVTNDGS
DLVHGLGYAWSPSVLNVSVGDTVIWHWQAPPFLRGVGYRVFSVSSPGSVTYDGKGFTNGR
QKSISGSFSYQFTSPGIHYYSSGYVDEAHSISLQGVVNVLPAESRHLPLHLFVSSTEATH
IQGGPENLHLGSSGAGCLATEPLCGPNDTRVQNSSIFLFELSSCISPSISNITPSTGTAN
ELITITGSGFSNLTCANKVTIGHYPCVVEESSANSITCHIDPQDSMEVGIRELVTLVVYN
LGTAINTLSHEFDRRFVLLPSIDMVWPNTGSTTGLTRVIIKGSGFSWAGVEVFMGHFPCK
VLTVNYTAVECETSPAPRQLAQVHLLTRGVPGLCQGSCSFAYSDSITPYITGVSPNTIRG
PVKVLIEGAGFGTVLEEIAVLIGNQQFRAIDVNENNITVLVSSLPAGLHPLRVVVGTKGL
ALGNLTVSSPMEASVSPSSGSLGGGTMLVITGNGFHPDSTTVTVGEGPCQILFANASEVH
CSTPAGRAGKADLQVLVNAVPYPPLPFTYALEDTPFLRRIAPNIGPPGTEVEITGSNFGV
DISEVSVTISNAGCNVTMVNGSVLRCVAGDHAGGTFPVLLHHKTKGSGVSTVVFEYPLHI
HSIHPEQGSFGGGQILTVTGVGFNPQDSVISICSTECATDRLRSDRTTLLCEIPPHDGVG
PDQVCEVRVVNGKDSSPSTTLFTYMMALTPRVTEISPTKGSTAGGTRLTVLGSGFSENPQ
DVQVTIAEAKCDVEYSNKTHIICVTNAHTPSGWAPVHVNIRNIGMARQENTYFLYVDIWS
SNFSWGGKPPPEEGSLAVITKGQILLLDQSTPILKMLLIQGGTLIFDEADIELQAENILI
TDGGTLQVGTETSPFQHQAVITLHGHLRSPELPVYGAKTLAVREGTLDLHGLPVPVVWTR
LAQTAKAGQWTLILQDPVTWKAGDAIVIASTGHRHSQQENEKRTIASVSADGVSITLTRP
LNYTHLGITVTLPDGSLFEARAEVGILTRNIIIRGSENVEWNSKIPACPTGFDTGEFATQ
TCLQGKFGEEIGSDQFGGCVMFHAPLPGADVVTGRIEHVEVFHAGQAFRLGRYPIHWHLL
GDLQFKSYVRGCAIHQSYNRAVTIHNTHHLLVERNIIYDIKGGAFFIEDGIEHGNILQYN
LAVFVRQSTSLLNDDVTPAAFWVTNPNNTIRHNAAAGGTHFGFWYRMNNHPDGPSYDRNI
CQKRVPLGEFFNNTVHSQGWFGLWIFEEYFPMETGSCTSTVPLPAVFNSLTTWNCQKGAE
WVNGGALQFHNFVMVNNHEAGIETKRILAPYVGGWGETSGAVIKNAKIVGHLDELGMGTG
FCTSKGLVLPFSEGLTVSSVQFMNFDRPNCVALGVTSITGVCHDKCGGWSAKFVDIQYFH
TPNKAGFRWEHEAALIDVDGSLTGHKGHTVIPHSSLLDPSHCTQEAQWSIGFPGSVCDAS
VSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIVPFQKKRLTHTSGWMALIPNANHINWYFR
DVDHVTNISYTSTFYGFKEEDYVIISHNFTQSPDMFSVIDMRNGSSNPLNWNTSKNGDWH
LEANTSTLYYLVSGRSDHHQSQSISGTLDPDVKDVVINFQAYCCTLQDCFPVHPPSRKPI
PRRRPASYILWSNHSFWQSSQENNYTVPHPGAQVVIPEGAWIVADTDLPPMERLIIWGVL
ELEDKSSVGASGPSYRRIVLNATYISVQGGRLIGGWEDNPFKGELQIILRGNHSTPEWAL
PEGPNQGAKVLGVFGELDLHGLPRLIYKTKLSETAEAGSKVLSLMDAVDWQEGDEIVITT
TSYDLHQTETRSIVRILHGHKILILNDSLSYNHLAKRYYIPETGQNYILAADVGLLSRNI
KIVGEDYPGSSRDSFGARILVGSFTGNMMTFKGNARISNVEFYHSGQEGFRDSTDPRYAV
TFLNLGQIQEHSSSYIRGCAFHHGFSPAIGVFGTDGVDIDDNVIHFTVGEGIRVWGDANR
VRRNLVALSVWPGTYQNRKDSSSTLWHAAIEINRGTNTVLQNNVVAGFGRVGYRIDGEPC
SSQPNPVENWFDNEAHGGLYGVYMNQDGLPGCSLIQGFTIWMCWDYGIYFQTTERVHIYN
VTLVDNGMAIFSMIYMPPAVSHKISSKTVKVESTLIVGSSPEFNCSDVLRDDDPNIELTG
AHRSARPPAGWGKGGRSGICWPTFASAHNMAPWKPHAGIMSYNAISGHLDVSGSTFVGFK
NVCSGETNVIFMTNPLNEDLQHPIYVKNIRLVDTTEPSKIYIHRPDISKVNPSDCVDMVC
DAKRKSFLKDLDGSFLGNSGSVIPQAEYEWNGNSQLGIGDYRIPQVMLTFLNGSRIPVSE
KAPYKGIIRDATCKYIPEWQSYQCFGMEYAMLVIESLDSDTETRRLSPVAIVSNGYVDLL
NGPQDHGWCVGYTCQRRLSLFHSIVALNKSYEVYFTGTSPQNLRLMLLNVEHTRAVLVGI
FYSTLQRLDVYVNNSLVCPRDTIWNPQQKLCAFNRDPRADQFLPKLNSTVLGENYFDRTY
QMLYILVKGTTPVEIHTAMVIFVSFQLPAVTEDDFYSSHNLVRNLALFLNIPSHKIRVSK
VIREKSLRRKRSTGHSIELEIGEPPMRILSNDTTGQMKLSEFQEISGSLGQAVILGKISG
ILGFNISSMSITNPIPSPNDSGWIKVTAQPVARAAFPVHHVARVSSLSVVTQPVAGQPGQ
PFSQQPSVKAVDPEGNCVSVGITSLTLKAVLKDSSNNQVSGLGGNTTILFSSCWANYTDL
TPLRTGKNYKLEFLLDDVIRAESRTFSLPVQSVSGDKASAVGTAAQTVAVLMSCLAGRVL
LLEIFMTAAFVLNINVVSY
Identical sequences ENSCPOP00000020209 10141.ENSCPOP00000020209
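
The Comment field in this record is a run of space-separated key:value tokens, and the scaffold token appears to pack an assembly name, sequence name, start, end and strand into one colon-separated value. The following is a minimal Python sketch for reading it. The function name parse_comment and the labels given to the scaffold sub-fields are assumptions made for illustration, not definitions taken from this page.

```python
def parse_comment(comment):
    """Split space-separated 'key:value' tokens; the value keeps any further colons."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


comment = (
    "pep:known scaffold:cavPor3:scaffold_0:14639024:14784258:-1 "
    "gene:ENSCPOG00000019466 transcript:ENSCPOT00000027015 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)
fields = parse_comment(comment)

# Assumed layout of the location value: assembly:name:start:end:strand
assembly, name, start, end, strand = fields["scaffold"].split(":")
span = int(end) - int(start) + 1  # genomic span, assuming inclusive coordinates

print(fields["gene"], fields["transcript"])
print(assembly, name, strand, span)
```

Splitting each token on its first colon only keeps the multi-part location value intact, so it can be unpacked in a separate step.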
