SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A075EB17 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  A0A075EB17
Domain Number 1 Region: 2998-3297
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 5.84e-147
Family Viral cysteine protease of trypsin fold 0.00000000136
Further Details:      
 
Domain Number 2 Region: 6156-6346
Classification Level Classification E-value
Superfamily S-adenosyl-L-methionine-dependent methyltransferases 4.82e-72
Family Nsp15 N-terminal domain-like 0.0000664
Further Details:      
 
Domain Number 3 Region: 3700-3853
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 2.22e-69
Family Coronavirus NSP8-like 0.0000123
Further Details:      
 
Domain Number 4 Region: 6337-6489
Classification Level Classification E-value
Superfamily EndoU-like 1.18e-62
Family Nsp15 C-terminal domain-like 0.0000154
Further Details:      
 
Domain Number 5 Region: 3971-4092
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 6.41e-55
Family Coronavirus NSP10-like 0.00000888
Further Details:      
 
Domain Number 6 Region: 4514-4818,4859-5002
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 7.91e-52
Family RNA-dependent RNA-polymerase 0.02
Further Details:      
 
Domain Number 7 Region: 3860-3965
Classification Level Classification E-value
Superfamily Replicase NSP9 1.96e-43
Family Replicase NSP9 0.00014
Further Details:      
 
Domain Number 8 Region: 3580-3662
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 1.44e-33
Family Coronavirus NSP7-like 0.00045
Further Details:      
 
Domain Number 9 Region: 1264-1485
Classification Level Classification E-value
Superfamily Macro domain-like 7.23e-32
Family Macro domain 0.00019
Further Details:      
 
Domain Number 10 Region: 5317-5622
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 4.77e-27
Family Nitrogenase iron protein-like 0.049
Further Details:      
 
Domain Number 11 Region: 2178-2303
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.00000000596
Family Pectate lyase-like 0.01
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) A0A075EB17
Sequence length 6795
Comment (tr|A0A075EB17|A0A075EB17_9ALPC) Polyprotein {ECO:0000313|EMBL:AID57026.1} KW=Complete proteome OX=28295 OS=Porcine epidemic diarrhea virus. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Alphacoronavirus.
Sequence
MASNHVTLAFANDAEISAFGFCTASEAVSYYSEAAASGFMQCRFVSFDLADTVEGLLPED
YVMVVVGTTKLSAYVDTFGSRPKNICGWLLFSNCNYFLEELELTFGRRGGNIVPVDQYMC
GADGKPVLQESEWEYTDFFADSEDGQLNIAGITYVKAWIVERSDVSYASQNLTSIKSITY
CSTYEHTFPDGTAMKVARTPKIKKTVVLSEPLATIYREIGSPFVDNGSDARSIIKRPVFL
HAFVKCKCGSYHWTVGDWTSYVSTCCGFKCKPVLVASCSATPGSVVVTRAGAGTGVKYYN
NMFLRHVADIDGLAFWRILKVQSKDDLACSGKFLEHHEEGFTDPCYFLNDSSIATKLKFD
ILSGKFSDEVKQAIFAGHVVVGSALVDIVDDALGQPWFIRKLGDLASAAWEQLKAVVRGL
NLLSDEVVLFGKRLSCATLSIVNGVFEFIAEVPEKLAAAVTVFVNFLNELFESACDCLKV
GGKTFNKVGSYVLFDNALVKLVKAKVRGPRQAGVCEVRYTSLVIGSTTKVVSKRVENANV
NLVVVDEDVTLNTTGRTVVVDGLAFFESDGFYRHLADADVVIEHPVYKSACELKPVFECD
PIPDFPMPVAASVAELCVQTDLLLKNYNTPYKTYSCVVRGDKCCITCTLHFTAPSYMEAA
ANFVDLCTKNIGTAGFHEFYITAHEQQDLQGFVTTCCTMSGFECFMPIIPQCPAVLEEID
GGSIWRSFITGLNTMWDFCKHLKVSFGLDGIVVTVARKFKRLGALLAEMYNTYLSTVVEN
LVLAGVSFKYYATSVPKIVLGCCFHSVKSVLASAFQIPVQAGVEKFKVFLNCVHPVVPRV
IETSFVELEETTFKPPALNGSIAIVDGFAFYYDGTLYYPTDGNSVVPICFKKKGGGDVKF
SDEVSVKTIDPVYKVSLEFEFESETIMAVLNKAVGNCIKVTGGWDDVVEYINVAIEVLKD
HIDVPKYYIYDEEGGTDPNLPVMVSQWPLNDDTISQDLLDVEVVTDAPVDFEGDEVDSSD
PDKVADVANSEPEDDGLNVAPETNVESEVEEVAATLSFIKDTPSTVTKDPFAFDFASYGG
LKVLRQSHNNCWVTSTLVQLQLLGIVDDPAMELFSAGRVGPMVRKCYESQKAILGSLGDV
SACLESLTKDLHTLKITCSVVCGCGTGERIYDGCAFRMTPTLEPFPYGACAQCAQVLMHT
FKSIVGTGIFCRDTTALSLDSLVVKPLCAAAFIGKDSGHYVTNFYDAAMAIDGYGRHQIK
YDTLNTICVKDVNWTAPFVPDVEPVLEPVVKPFYSYKNVDFYQGDFSDLVKLPCDFVVNA
ANENLSHGGGIAKAIDVYTKGMLQKCSNDYIKAHGPIKVGRGVMLEALGLKVFNVVGPRK
GKHAPELLVKAYKSVFANSGVALTPLISVGIFSVPLEESLSAFLACVGGRHCKCFCYSDK
EREAIINYMDGLVDAIFKDALVDTTPVQEDVQQVSQKPVLPNFEPFRIEGAHAFYECNPE
GLMSLGADKLVLFTNSNLDFCSVGKCLNNVTGGALLETINVFKKSNKTVPAGNCVTFECA
DMISITMVVLPSDGDANYDKNYARAVVKVSKLKGKLLLAVGDAMLYSKLSHLSVLGFVST
PDDVERFYANKSVVIKVTEDTRSVKTVKVESTVTYGQQIGPCLVNDTVVTDNKPVVADVV
AKVVPSANWDSHYGFDKAGEFHMLDHTGFAFPSEVVNGRRVLKTTDNNCWVNVTCLQLQF
ARFRFKSAGIQAMWESYCTGDVAMFVHWLYWLTGVDKGQPSDSENALNMLSKYIVPAGSV
TIERVTHDGCCCSKRVVTAPVVNASVLKLGVEDGLCPHGLNYIDKVVVVKGTTIVVNVGK
PVVAPSHLFLKGVSYTTFLDNGNGVAGHYTVFDHDTGMVHDGDVFVPGDLNVSPVTNVVV
SEQTAVVIKDPVKKVELDATKLLDTMNYASERFFSFGDFMSRNLITVFLYILSILGLCFR
AFRKRDVKVLAGVPQRTGIILRKSVRYNAKALGVFFKLKLYWFKVLGKFSLGIYALYALL
FMTIRFTPIGGPVCDDVVAGYANSSFDKNEYCNSVICKVCLYGYQELSDFSHTQVVWQHL
RDPLIGNVMPFFYLAFLAIFGGVYVKAITLYFIFQYLNILGVFLGLQQSIWFLQLVPFDV
FGDEIVVFFIVTRVLMFLKHVFLGCDKASCVACSKSARLKRVPVQTIFQGTSKSFYVHAN
GGSKFCKKHNFFCLNCDSYGPGCTFINDVIATEVGNVVKLNVQPTGPATILIDKVEFSNG
FYYLYSGDTFWKYNFDITDNKYTCKESLKNCSIITDFIVFNNNGSNVNQVKNACVYFSQM
LCKPVKLVDSALLASLSVDFGASLHSAFVSVLSNSFGKDLSSCNDMQDCKSTLGFDDVPL
DTFNAAVAEAHRYDVLLTDMSFNNFTTSYAKPEEKLPVHDIATCMRVGAKIVNHNVLVKD
SIPVVWLVRDFIALSEETRKYIIRTTKVKGITFMLTFNDCRMHTTIPTVCIANKKGAGLP
SFSKVKKFFWFLCLFIVAVFFALSFFDFSTQVSSDSDYDFKYIESGQLKTFDNPLSCVHN
VFSNFDQWHDAKFGFTPVNNPSCPIVVGVSDEARTVPGIPAGVYLAGKTLVFAINTIFGT
SGLCFDASGVADKGACIFNSACTTLSGLGGTAVYCYKNGLVEGAKLYSELAPHSYYKMVD
GNAVSLPEIISRGFGIRTIRTKAMTYCRVGQCVQSAEGVCFGADRFFVYNAESGSDFVCG
TGLFTLLMNVISVFSKTVPVTVLSGQILFNCIIAFAAVAVCFLFTKFKRMFGDMSVGVFT
VGACTLLNNVSYIVTQNTLGMLGYATLYFLCTKGVRYMWIWHLGFLISYILIAPWWVLMV
YAFSAIFEFMPNLFKLKVSTQLFEGDKFVGSFENAAAGTFVLDMHAYERLANSISTEKLR
QYASTYNKYKYYSGSASEADYRLACFAHLAKAMMDYASNHNDTLYTPPTVSYNSTLQAGL
RKMAQPSGVVEKCIVRVCYGNMALNGLWLGDTVICPRHVIASSTTSTIDYDYALSVLRLH
NFSISSGNVFLGVVGVTMRGALLQIKVNQNNVHTPKYTYRIVRPGESFNILACYDGSAAG
VYGVNMRSNYTIRGSFINGACGSPGYNINNGTVEFCYLHQLELGSGCHVGSDLDGVMYGG
YEDQPTLQVEGASSLFTXNVLAFLYAALINGSTWWLSSSRIAVDRFNEWAVHNGMTTVVN
TDCFSILAAKTGVDVQRLLASIQSLHKNFGGKQILGYTSLTDEFTTGEVIRQMYGVNLQS
GYVSRACRNVLLVGSFLTFFWSELVSYTKFFWVNPGYVTPMFACLSLLSSLLMFTLKHKT
LFFQVFLIPALIVTSCINLAFDVEVYNYLAEHFDYHVSLMGFNAQGLVNIFVCFVVTILH
GTYTWRFFNTPVSSVTYVVALLTAAYNYFYASDILSCAMTLFASVTGNWFVGAVCYKAAV
YMALRFPTFVAIFGDIKSVMFCYLVLGYFTCCFYGILYWFNRFFKVSVGVYDYTVSAAEF
KYMVANGLRAPTGTLDSLLLSAKLIGIGGERNIKISSVQSKLTDIKCSNVVLLGCLSSMN
VSANSTEWAYCVDLHNKINLCNDPEKAQEMLLALLAFFLSKNSAFGLDDLLESYFNDNSM
LQSVASTYVGLPSYVIYENARQQYEDAVNNGSPPQLVKQLRHAMNVAKSEFDREASTQRK
LDRMAEQAAAQMYKEARAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILNLAKDGVVPLS
VIPAVSATKLNIVTSDIDSYNRIQREGCVHYAGTIWNIIDIKDNDGKVVHVKEVTAQNAE
SLSWPLVLGCERIVKLQNNEIIPGKLKQRSIKAEGDGIVGEGKALYNNEGGRTFMYAFIS
DKPDLRVVKWEFDGGCNTIELEPPRKFLVDSPNGAQIKYLYFVRNLNTLRRGAVLGYIGA
TVRLQAGKQTEQAINSSLLTLCAFAVDPAKTYIDAVKSGHKPVGNCVKMLANGSGNGQAV
TNGVEASTNQDSYGGASVCLYCRAHVEHPSMDGFCRLKGKYVQVPLGTVDPIRFVLENDV
CKVCGCWLANGCTCDRSIMQSTDMAYLNEYGALVQLDYGLFKRVRGSSAARLEPCNGTDT
QHVYRAFDIYNKDVACLGKFLKVNCVRLKNLDKHDAFYVVKRCTKSAMEHEQSIYSRLEK
CGAVAEHDFFTWKDGRAIYGNVCRKDLTEYTMMDLCYALRNFDENNCDVLKSILIKVGAC
EESYFNNKVWFDPVENEDIHRVYALLGTIVSRAMLKCVKFCDAMVEQGIVGVVTLDNQDL
NGDFYDFGDFTCSIKGMGIPICTSYYSYMMPVMGMTNCLASECFVKSDIFGEDFKSYDLL
EYDFTEHKTALFNKYFKYWGLQYHPNCVDCSDEQCIVHCANFNTLFSTTIPITAFGPLCR
KCWIDGVPLVTTAGYHFKQLGIVWNNDLNLHSSRLSINELLQFCSDPALLIASSPALVDQ
RTVCFSVAALGTGMTNQTVKPGHFNKEFYDFLLEQGFFSEGSELTLKHFFFAQKGDAAVK
DFDYYRYNRPTVLDICQARVVYQIVQRYFDIYEGGCITAKEVVVTNLNKSAGYPLNKFGK
AGLYYESLSYEEQDELYAYTKRNILPTMTQLNLKYAISGKERARTVGGVSLLSTMTTRQY
HQKHLKSIVNTRGASVVIGTTKFYGGWDNMLKNLIDGVENPCLMGWDYPKCDRALPNMIR
MISAMILGSKHTTCCSSTDRFFRLCNELAQVLTEVVYSNGGFYLKPGGTTSGDATTAYAN
SVFNIFQAVSANVNKLLSVDSNVCHNLEVKQLQRKLYECCYRSTTVDDQFVVEYYGYLRK
HFSMMILSDDGVVCYNNDYASLGYVADLNAFKAVLYYQNNVFMSASKCWIEPDINKGPHE
FCSQHTMQIVDKDGTYYLPYPDPSRILSAGVFVDDVVKTDAVVLLERYVSLAIDAYPLSK
HENPEYKKVFYVLLDWVKHLYKTLNAGVLESFSVTLLEDSTAKFWDESFYANMYEKSAVL
QSAGLCVVCGSQTVLRCGDCLRRPMLCTKCAYDHVIGTTHKFILAITPYVCCASDCGVND
VTKLYLGGLSYWCHEHKPRLAFPLCSAGNVFGLYKNSATGSPDVEDFNRIATSDWTDVSD
YRLANDVKDSLRLFAAETIKAKEESVKSSYACATLHEVVGPKELLLKWEVGRPKPPLNRN
SVFTCYHITKNTKFQIGEFVFEKAEYDNDAVTYKTTATTKLVPGMVFVLTSHNVQPLRAP
TIANQERYSTIHKLHPAFNIPEAYSSLVPYYQLIGKQKITTIQGPPGSGKSHCVIGLGLY
YPGARIVFTACSHAAVDSLCVKASTAYSNDKCSRIIPQRARVECYDGFKSNNTSAQYLFS
TVNALPECNADIVVVDEVSMCTNYDLSVINQRISYRHVVYVGDPQQLPAPRVMISRGTLE
PKDYNVVTQRMCALKPDVFLHKCYRCPAEIVRTVSEMVYENQFIPVHPDSKQCFKIFCKG
NVQVDNGSSINRRQLDVVRMFLAKNPRWSKAVFISPYNSQNYVASRMLGLQIQTVDSSQG
SEYDYVIYTQTSDTAHACNVNRFNVAITRAKKGILCIMCDRSLFDVLKFFELKLSDLQAN
EGCGLFKDCSRGDDLLPPSHANTFMSLADNFKTDQDLAVQIGVNGPIKYEHVISFMGFRF
DINIPNHHTLFCTRDFAMRNVRGWLGFDVEGAHVVGSNVGTNVPLQLGFSNGVDFVVRPE
GCVVTESGDYIKPVRARAPPGEQFAHLLPLLKRGQPWDVVRKRIVQMCSDYLANLSDILI
FVLWAGGLELTTMRYFVKIGPSKSCDCGKVATCYNSALHTYCCFKHALGCDYLYNPYCID
IQQWGYKGSLSLNHHEHCNVHRNEHVASGDAIMTRCLAIHDCFVKNVDWSITYPFIGNEA
VINKSGRIVQSHTMRSVLKLYNPKAIYDIGNPKGIRCAVTDAKWFCFDKNPTNSNVKTLE
YDYITHGQFDGLCLFWNCNVDMYPEFSVVCRFDTRCRSPLNLEGCNGGSLYVNNHAFHTP
AFDKRAFAKLKPMPFFFYDDTECDKLQDSINYVPLRASNCITKCNVGGAVCSKHCAMYHS
YVNAYNTFTSAGFTIWVPTSFDTYNLWQTFSNNLQGLENIAFNVVKKGSFVGAEGELPVA
VVNDKVLVRDGTVDTLVFTNKTSLPTNVAFELYAKRKVGLTPPITILRNLGVVCTSKCVI
WDYEAERPLTTFTKDVCKYTDFEGDVCTLFDNSIVGSLERFSMTQNAVLMSLTAVKKLTG
IKLTYGYLNGVPVNTHEDKPFTWYIYTRKNGKFEDYPDGYFTQGRTTADFSPRSDMEKDF
LSMDMGLFINKYGLEDYGFEHVVYGDVSKTTLGGLHLLISQVRLACMGVLKIDEFVSSND
STLKSCTVTYADNPSSKMVCTYMDLLLDDFVSILKSLDLSVVSKVHEVMVDCKMWRWMLW
CKDHKLQTFYPQLQASEWKCGYSMPSIYKIQRMCLEPCNLYNYGAGIKLPDGIMFNVVKY
TQLCQYLNSTTMCVPHHMRVLHLGAGSDKGVAPGTAVLRRWLPLDAIIVDNDSVDYVSDA
DYSVTGDCSTLYLSDKFDLVISDMYDGKIKSCDGENVSKEGFFPYINGVITEKLALGGTV
AIKVTEFSWNKKLYELIQKFEYWTMFCTSVNTSSSEAFLIGVHYLGDFASGAVIDGNTMH
ANYIFWRNSTIMTMSYNSVLDLSKFNCKHKATVVINLKDSSISDVVLGLLKNGKLLVRNN
DAICGFSNHLVNVNK
Download sequence
Identical sequences A0A075EB17

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]