SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSOARP00000014090 from Ovis aries 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSOARP00000014090
Domain Number 1 Region: 372-481
Classification Level Classification E-value
Superfamily Anthrax protective antigen 6.02e-19
Family Anthrax protective antigen 0.011
 
Domain Number 2 Region: 1835-1915
Classification Level Classification E-value
Superfamily E set domains 1.12e-16
Family E-set domains of sugar-utilizing enzymes 0.01
 
Domain Number 3 Region: 2096-2183
Classification Level Classification E-value
Superfamily E set domains 1.49e-16
Family Other IPT/TIG domains 0.064
 
Domain Number 4 Region: 1919-2001
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000105
Family Other IPT/TIG domains 0.016
 
Domain Number 5 Region: 1161-1238
Classification Level Classification E-value
Superfamily E set domains 0.000000000000993
Family E-set domains of sugar-utilizing enzymes 0.014
 
Domain Number 6 Region: 1663-1750
Classification Level Classification E-value
Superfamily E set domains 0.00000000000104
Family E-set domains of sugar-utilizing enzymes 0.045
 
Domain Number 7 Region: 2005-2090
Classification Level Classification E-value
Superfamily E set domains 0.00000000000178
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.022
 
Domain Number 8 Region: 1242-1321
Classification Level Classification E-value
Superfamily E set domains 0.00000000000267
Family E-set domains of sugar-utilizing enzymes 0.035
 
Domain Number 9 Region: 3329-3532
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000000000223
Family Galacturonase 0.038
 
Domain Number 10 Region: 271-354
Classification Level Classification E-value
Superfamily E set domains 0.0000000000542
Family E-set domains of sugar-utilizing enzymes 0.034
 
Domain Number 11 Region: 1569-1648
Classification Level Classification E-value
Superfamily E set domains 0.0000000000826
Family E-set domains of sugar-utilizing enzymes 0.012
 
Domain Number 12 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.0000000011
Family E-set domains of sugar-utilizing enzymes 0.039
 
Domain Number 13 Region: 1068-1146
Classification Level Classification E-value
Superfamily E set domains 0.00000000182
Family E-set domains of sugar-utilizing enzymes 0.023
 
Domain Number 14 Region: 1336-1393
Classification Level Classification E-value
Superfamily E set domains 0.00000000318
Family E-set domains of sugar-utilizing enzymes 0.038
 
Domain Number 15 Region: 2361-2409,2460-2694
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000000255
Family Pectate lyase-like 0.082
 
Domain Number 16 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.0000000484
Family E-set domains of sugar-utilizing enzymes 0.048
 
Domain Number 17 Region: 1756-1828
Classification Level Classification E-value
Superfamily E set domains 0.000000154
Family E-set domains of sugar-utilizing enzymes 0.032
 
Domain Number 18 Region: 1411-1508
Classification Level Classification E-value
Superfamily Cupredoxins 0.000000234
Family Plastocyanin/azurin-like 0.042
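The Region values in the listing above are either a single residue range (for example 372-481) or several comma-separated ranges (2361-2409,2460-2694 for domain 15). The following is a minimal sketch of how such strings might be parsed and used to pull the matching residues out of the protein sequence. The helper names (`parse_region`, `region_length`, `extract_region`) are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '2361-2409,2460-2694' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total number of residues covered, treating both coordinates as inclusive (assumption)."""
    return sum(end - start + 1 for start, end in parse_region(region))


def extract_region(sequence: str, region: str) -> str:
    """Concatenate the residues of every segment, assuming 1-based inclusive coordinates."""
    return "".join(sequence[start - 1:end] for start, end in parse_region(region))


# Region strings copied from three of the domain assignments above.
regions = {1: "372-481", 15: "2361-2409,2460-2694", 18: "1411-1508"}

# List the domains in order of their first residue rather than by E-value.
for number, region in sorted(regions.items(), key=lambda item: parse_region(item[1])[0][0]):
    print(number, parse_region(region), region_length(region))
```

Sorting by start position instead of E-value gives the order in which the domains occur along the chain, which is often the more convenient view when comparing architectures.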
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOARP00000014090   Gene: ENSOARG00000013110   Transcript: ENSOART00000014297
Sequence length 4249
Comment pep:known_by_projection chromosome:Oar_v3.1:9:67545553:67716867:-1 gene:ENSOARG00000013110 transcript:ENSOART00000014297 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGHLWLWGTCGLWGLLLSIAEPRADGSKIIPKITEIIPKYGSINGATRLTIKGEGFAQAN
QFDYGVGNAELGNRVQLISSFRSISCDVEKDSSHSTHITCYTRAMPEDSYTVRVSVDGVP
VTENNTCKGHVSSRACIFSAKIFRTPTIRSITPSSGTPGTLITIRGRIFTDVYGSNTALS
SNGKNVRILRVYIGGMPCELLISQSDNLYGLKLDHPNGDMGSMVCKMSGTYIGHHNVSFI
LDSDYGRSLPEKTAYFVSSLNKISMFQTYAEITRISPSQGSTQGGTLLTISGRFFDQTDF
PVRVLVGGQACHILNVTENSICCKTPPEPDILRTIYPGGRGLKLEVWNNSRPVRLEEILE
YSEKTPGYMGAIWVDSASYVWPMEQDTFVARFSGFLVAPESDVYRFYIKGDDRYAIYFSQ
TGLPEDKDKVRIAYHSSNANNYFSSPTQRSDDIHLQKGKEYYIEILLQEYRLSAFVDVGL
YQYKSVYTEQQTEDAVNEEQVIKSQSAVVQEVQVITLENWETTRATNEVQKVMVTSPCVE
ANLCSLYQYRLIYNMEKTVWLPADASEFILQSALNDLWSIKPDTVQVIRIQDPRSYVYLI
TFISTRGDFDLLSYEAFEGNNVTLAITEQTKGKPSLDTFTLSWDGITSRPLTPQSSVAEF
QAAVEEMVSSKCPPQIATFEEGFVVKYFRDYETDFNLEHINRGQKTAETDAYCGRYSLKN
PAVLFDSADVKQNRLPYGDILLFPYNQLCLAYKGFLANYIGLKFQYQDTSKIARSTDTQF
PFKFAYGNNWTYTCIDLLDLIQTKYTGTSFSLQRISLQKASESQSFYVDIVYIGQTSTMT
TLDEMSKRRLPALANKGIFLKDFQVNQTKSNGSSITNQYSVTMTSYNCSYNIPMMAVSFG
QRITNETENESVYRGNNWPGESKIRIQRIQAASPPLRGSFDIHAYGHVLTGIPAAVSATD
LQFALQSLEEVGQVSVTREGGCAGYSWRVKWRSTCGKQPLLQINDSNIFGENANMTVTKI
KEGGLFRQHVPGDLLRTPSKQPQVEVYVNEIPAKCSGNCGFTWEPTTTPQIRAINPSQGS
YEESTILTISGSGFSPSSSVSVSVGPAGCSLLSLSAQENEIKCQILNGSAGRFPVAVSIA
DVGRARNVEEKGFHFTYQSQISHIWPASGSLAGGTLLTVSGFGFHEYSKVLVGNETCSVI
EGDLNKITCRTPKSIEGTVDISVITSGVQATAKNAYSYNCLQTPVITDFSPKVRTILGEV
NLMIKGYNFGNELTQNMEVYVGGVSCQVLHRNFTDIRCLLPKLSPGKHDICVEVRNWGFA
STSRDKSSASIQYILEVTNMFPQKGSLYGGTEITIMGLGFSTIPTENTVLLGSFPCNVTS
SSETVIKCVLHSTGNVFRITNNGEDSVHGLGYAWSPSVLNVSVGDTVTWLWRAHPYLRGI
GYSVFSVSSPGSVIYDGKGFTNGREKSASGSFSYQFTSPGIHYYSSGYVDEAHSIFLQGV
INVLPAETSHIPLHLFVGGTEATYAQGRPVSLHLGSSVAGCQAREPLCGLNSTRAENSER
LLFELSSCLSPSISHISPSIGTLNELITITGRGFSNLTCANKVKIGSYPCVVEESSNNSL
VCRIDPQNSMDVGIREIVTVTVYNLGTAINTLSSEFDRRFVLLPNIDMISPNAGSTTGMT
KVTIKGSGFAVSSAGAQVLMGHFPCKVLSVNYTAIECETSPAPQQLVKVNLLIHGVPAQC
QGNCTFSYLESIAVFVTRIFPNSLQGSEKVLIEGEGFGTALEDISVFTGNQQFRAVDVNE
NNITVLMTPLPAGLHSLRVVVGTKGLALGNLTVSSPAVASVTPTSGSIGGGTTLMVTGNG
FSPGNTTVTVGDEPCQILSVNSSDIHCRTPAGTAGRVSVKVSVNAVAYPPLSFMYALEDT
PLLKGIVPSTGPPETEIQVTGSNFGTDILEISVMISSTQCNVTKVNDTVLQCIVGDHAGG
TFPVTMHHKTKGFAMSTVVFKYPLTIDSIHPSQGSFGGDQTMTVTGTGFNPQNSVVLVCG
SKCAVDKLKSNHTTLLCDIPPSNGRGPEQACEVSVVNGKDSSQSTATFTYNMSMTPFITK
IAPKRGSTAGGTRLTVLGSGFSENVEDVLVTVAEATCDVEYSNKTCIICMTNAHSPSGWA
PVHVSIRSTGLAKRANADFLYVDTWSSNSSWGGKSPPEEGSLVVITKGQIILLDQNTPIL
KMLLIQGGTLIFHEADIELQAENILITDGGILQIGTEAAPFQHRAVITLHGHLRSPELPV
YGAKTLAVREGILDLHGLPVPVIWTRLAHTAKAGERTLILQEAVTWKPGDKIVIASTGHR
HSQRENEERTIASVSADGRNITLTDPLNYTHLGITVTLPDGTLFEARAEIGILTRNILIR
GSDNVEWNNKIPACPDGFDTGEFATQTCLQGKFGEEIGSDQFGGCIMLHAPLAGANMVTG
RIEYVEIFHAGQAFRLGRYPIHWHLLGDLQFKSYVKGCAIHQTYNRAITIHNTHHLLVER
NIIYDIRGGAFFIEDGIEHGNILQYNLAVFVHQSTSLLNDDVTPAAFWVTNPNNTIRHNV
AAGGTHFGFWYRMNNHPDGPSYDRNICQKRVPLGEFFNNTVHSQGWFGLWIFEEYFPMQT
GSCTSSVPVPAVFHSLTTWNCQKGAEWVNGGALQFHNFVMVNNYEAGIETKRILAPYVGG
WGETNGAMIKNGKIVGHLDELGMGSAFCTTKGLVLPFNEGLTVSSVHFMNFDRPSCAALG
VTSITGVCNHRCGGWSAKFAGIQYSHTPNKAGFRWEHEVVLIDVDGSLTGHKGHTVIPHS
PLLDPSHCTQEAEWSIGFPGSVCDASVSFHRLAFNKPSPVSLLEKDVVLSDSFGTSIIPF
QKKRLTHMSGWMALIPNAKHINWYFKGVDHITNISYTSTFYGFKEEDYVIISHNFTQNPD
MFNIIDMRNGSSNPLNWNTSKNGDWHLEANTSTLYYLVSGKNDLHQSQSISGTLDPDVKD
VIINFQAYCCVLQDCFPVHPPSRKPIPRKRPAAYNLWSNNSFWQSSRENNYTIPYPGANV
VIPEGTWIVADTDIPPMARLVIRGVLELEDKHNVGAAESSYRKVVLNATYISLQGGRLIG
GWEDNPFKGELQIVLRGNHSTPEWALPEGPNQGSKVLGVFGELDLHGIPHSIYKTKLSET
AEAGSKVLSLMDAVDWQEGEEIVITTTSYDFHQTETRSIVKILHDHKILILNDTLSYTHF
AERYHVPGTSQSYTLAADVGILSRNIKILGEDYPGWLKESFGARVLVSSFTENMVTFKGN
ARISNVEFYHSGQEGFRDSTDPRYAVTFLNLGQIQERGSSYVQGCAFHNAFSPAIGVFGT
DGLDIDDNVIHFTVGEGIRIWGNANRVRGNLVTLSVWPGTYQNRKDLSSTLWHAAIEINR
GTNTVLQNNVVAGFGRAGYRIDGEPCSGQSHSLEKWFDNEAHGGLFGIYMNQDGLPGCLL
IQGFTIWSCWDYGIYFQTTESVHIYNVTLIDNGMAIFSMIYMPSAVSHKISSKTVQIKNS
LIVGSSPEFNCSDVLTNDDPNIELSAAHRSSRPPSGGRSGICWPTFSSAHNMAPRKPHAG
IMSYNSISGLLDISDTTFVGFKNVCSGETNVIFITNPLNEDLQHPIHVKNIQLVDTTEQS
KIFIHRPDLSKVNPSDCVDMVCDAKRKSLLRDMDGSFLGDSGSVIPEAEYEWNGNSQFGI
GDYRIPKVMLTFPNGSRIPVTEKAPYKGIIRDSTCKYIPEWQGHQCFGMEYAMMVIESLD
SDTETRRVSPVAIVSNGYVDLINDVGPQDHGWCAGYTCQRRLSLFHSIVALGKSYEVYFT
GTSPQNLRLMLLNVDHDKAVLVGIFFPTRQRLDVYVNNTLVCPENTEWNSQQKYCEPARQ
LYTDQLLPNLNSTVLGENYFDRTYQMLYLLVKGTIPVEVHTAAVIYVSFQLPAVTEDDFY
SSHNLVRNLALFLKIPSDKIRVSKVIRGESLRRKRSMGLTVELEIGDPRPRFITKDTAGQ
MQLPELQEIADSLGQAVILGKTNSILGFNVSSMSITDPIHSPNDSGWAKVAAQPVGRLSF
PVHHVAFVTSLSVITQPVATQLGQPFSQQPSIKAEDADGNCVSVGITALALKAVLKDSNN
NQISGLSGNTTIPFSSCWANYTDLTLLRTGKNFKIEFILDEFIWVESQPLSLASQSVSSP
GSSSGGSSNSKASTVGTAAQIVTTVISCLVGRVLLLEMFMTAVLTLNIL
Identical sequences: W5PUF9, ENSOARP00000014090
