SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for W5ZZG7 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  W5ZZG7
Domain Number 1 Region: 2629-2931
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 3.32e-118
Family Viral cysteine protease of trypsin fold 0.0000000415
Further Details:      
 
Domain Number 2 Region: 5814-5998
Classification Level Classification E-value
Superfamily S-adenosyl-L-methionine-dependent methyltransferases 3.57e-71
Family Nsp15 N-terminal domain-like 0.00000886
Further Details:      
 
Domain Number 3 Region: 3348-3500
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 4.45e-62
Family Coronavirus NSP8-like 0.00000387
Further Details:      
 
Domain Number 4 Region: 6001-6151
Classification Level Classification E-value
Superfamily EndoU-like 3.73e-56
Family Nsp15 C-terminal domain-like 0.00000507
Further Details:      
 
Domain Number 5 Region: 3624-3746
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 7.45e-54
Family Coronavirus NSP10-like 0.00000537
Further Details:      
 
Domain Number 6 Region: 4165-4470,4509-4655
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 2.03e-51
Family RNA-dependent RNA-polymerase 0.019
Further Details:      
 
Domain Number 7 Region: 496-650
Classification Level Classification E-value
Superfamily Macro domain-like 3.03e-33
Family Macro domain 0.00000984
Further Details:      
 
Domain Number 8 Region: 3511-3618
Classification Level Classification E-value
Superfamily Replicase NSP9 1.44e-32
Family Replicase NSP9 0.0000347
Further Details:      
 
Domain Number 9 Region: 4966-5269
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 1.84e-30
Family Tandem AAA-ATPase domain 0.068
Further Details:      
 
Domain Number 10 Region: 3227-3309
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 3.79e-29
Family Coronavirus NSP7-like 0.000097
Further Details:      
 
Domain Number 11 Region: 235-346
Classification Level Classification E-value
Superfamily NSP3A-like 7.19e-28
Family NSP3A-like 0.00083
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) W5ZZG7
Sequence length 6459
Comment (tr|W5ZZG7|W5ZZG7_9BETC) ORF1ab {ECO:0000313|EMBL:AHI48614.1} OX=1335626 OS=Middle East respiratory syndrome-related coronavirus. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus.
Sequence
LSVASTYFLVRLLQDKTGDFMSTIITSCQTAVSKLLDTCFEATEATFNFLLDLAGLFRIF
LRNAYVYTSQGFVVVNGKVSTLVKQVLDLLNKGMQLLHTKVSWAGSNISAVIYSGRESLI
FPSGTYYCVTTKAKSVQQDLDVILPGEFSKKQLGLLQPTDNSTTVSVTVSSNMVETVVGQ
LEQTNMHSPDVIVGDYVIISEKLFVRSKEEDGFAFYPACTNGHAVPTLFRLKGGAPVKKV
AFGGDQVHEVAAVRSVTVEYNIHAVLDTLLASSSLRTFVVDKSLSIEEFADVVKEQVSDL
LVKLLRGMPIPDFDLDDFIDAPCYCFNAEGDASWSSTMIFSLHPVECDEECSEVEASDLE
EGESECISETSTEQVDVSHEVSDDEWAAAVDEAFPLDEAEDVTESVQEESQPVEVPVEDI
AQVVIADTLQETPVVSDTVEVPPQVVKLPSEPQTIQPEVKEVAPVYEADTEQTQSVTVKP
KRLRKKRNVDPLSNFEHKVITECVTIVLGDAIQVAKCYGESVLVNAANTHLKHGGGIAGA
INAASKGAVQKESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDARAKQDVSLLSKC
YKAMNAYPLVVTPLVSAGIFGVKPAVSFDYLIREAKTRVLVVVNSQDVYKSLTIVDIPQS
LTFSYDGLRGAIRKAKDYGFTVFVCTDNSANTKVLRNKGVDYTKKFLTVDGVQYYCYTSK
DTLDDILQQANKSVGIISMPLGYVSHGLDLIQAGSVVRRVNVPYVCLLANKEQEAILMSE
DVKLNPSEDFIKHVRTNGGYNSWHLVEGELLVQDLRLNKLLHWSDQTICYKDSVFYVVKN
STAFPFETLSACRAYLDSRTTQQLTIEVLVTVDGVNFRTVVLNNKNTYRSQLGCVFFNGA
DISDTIPDEKQNGHSLYLADNLTADETKALKELYGPVDPTFLHRFYSLKAAVHKWKMVVC
DKVRSLKLSDNNCYLNAVIMTLDLLKDIKFVIPALQHAFMKHKGGDSTDFIALIMAYGNC
TFGAPDDASRLLHTVLAKAELCCSARMVWREWCNVCGIKDVVLQGLKACCYVGVQTVEDL
RARMTYVCQCGGERHRQIVEHTTPWLLLSGTPNEKLVTTSTAPDFVAFNVFQGIETAVGH
YVHARLKGGLILKFDSGTVSKTSDWKCKVTDVLFPGQKYSSDCNVVRYSLDGNFRTEVDP
DLSAFYVKDGKYFTSEPPVTYSPATILAGSVYTNSCLVSSDGQPGGDAISLSFNNLLGFD
SSKPVTKKYTYSFLPKEDGDVLLAEFDTYDPIYKNGAMYKGKPILWVNKASYDTNLNKFN
RASLRQIFDVAPIELENKFTPLSVXSTPVEPPTVDVVALQQEMTIVKCKGLNKPFVKDNV
SFVADDSGTPVVEYLSKEDLHTLYVDPKYQVIVLKDNVLSSMLRLHTVESGDINVVAASG
SLTRKVKLLFRASFYFKEFATRTFTATTAVGSCIKSVVRHLGVTKGILTGCFSFVKMLFM
LPLAYFSDSKLGTTEVKVSALKTAGVVTGNVVKQCCTAAVDLSMDKLRRVDWKSTLRLLL
MLCTTMVLLSSVYHLYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGISSACDGLASAYRA
NSFDVPTFCANRSAMCNWCLISQDSITHYPALKMVQTHLSHYVLNIDWLWFAFETGLAYM
LYTSAFNWLLLAGTLHYFFAQTSIFVDWRSYNYAVSSAFWLFTHIPMAGLVRMYNLLACL
WLLRKFYQHVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYITANGGISFCRRHNW
NCVDCDTAGVGNTFICEEVANDLTTALRRPINATDRSHYYVDSVTVKETVVQFNYRRDGQ
PFYERFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQESLARSACVYYSQVLC
KSILLVDSSLVTSVGDSSEIATKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVRRGDNF
HSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQITNESYNNYVPSYVKPDSVST
SDLGSLIDCNAASVNQIVLRNSNGACIWNAAAYMKLSDALKRQIRIACRKCNLAFRLTTS
KLRANDNILSVRFTANKIVGGAPTWFNVLRDFTLKGYVLATIIVFLCAVLMYLCLPTFSM
VPVEFYEDRILDFKVLDNGIIRDVNPDDKCFANKHRSFTQWYHEHVGGVYDNSITCPLTV
AVIAGVAGARIPDVPTTLAWVNNQIIFFVSRVFANTGSVCYTPIDEIPYKSFSDSGCILP
SECTMFRDAEGRMTPYCHDPTVLPGAFAYSQMRPHVRYDLYDGNMFIKFPEVVFESTLRI
TRTLSTQYCRFGSCEYAQEGVCITTNGSWAIFNDHHLNRPGVYCGSDFIDIVRRLAVSLF
QPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRAFADYTQCAVIAVVAAVLNSLCICFV
ASIPLCIVPYTALYYYATFYFTNEPAFIMHVSWYIMFGPIVPIWMTCVYTVAMCFRHFFW
VLAYFSKKHVEVFTDGKLNCSFQDAASNIFVINKDTYAALRNSLTNDAYSRFLGLFNKYK
YFSGAMETAAYREAAACHLAKALQTYSETGSDLLYQPPNCSITSGVLQSGLVKMSHPSGD
VEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQLSDPNYDALLISMTNHSFSVQKHI
GAPANLRVVGHAMQGTLLKLTVDVANPSTPAYTFTTVKPGAAFSVLACYNGRPTGTFTVV
MRPNYTIKGSFLCGSCGSVGYTKEGSVINFCYMHQMELANGTHTGSAFDGTMYGAFMDKQ
VHQVQLTDKYCSVNVVAWLYAAILNGCAWFVKPNRTSVVSFNEWALANQFTEFVGTQSVD
MLAVKTGVAIEQLLYAIQQLYTGFQGKQILGSTMLEDEFTPEDVNMQIMGVVMQSGVRKV
TYGTAHWLFATLVSTYVIILQATKFTLWNYLFETIPTQLFPLLFVTMAFVMLLVKHKHTF
LTLFLLPVAICLTYANIVYEPTTPISSALIAVANWLAPTNAYMRTTHTDIGVYISMSLVL
VIVVKRLYNPSLSNFALALCSGVMWLYTYSIGEASSPIAYLVFVTTLTSDYTITVFVTVN
LAKVCTYAIFAYSPQLTLVFPEVKMILLLYTCLGFMCTCYFGVFSFLNLKLRAPMGVYDF
KVSTQEFRFMTANNLTAPRNSWEAMALNFKLIGIGGTPCIKVAAMQSKLTDLKCTSVVLL
SVLQQLHLEANSRAWAFCVKCHNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASD
IFDTPSVLQATLSEFSHLATFAELEAAQKAYQEAMDSGDTSPQVLKALQKAVNIAKNAYE
KDKAVARKLERMADQAMTSMYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNA
RNGCIPLSVIPLCASNKLRVVIPDFTVWNQVVTYPSLNYAGALWDITVINNVDNEIVKSS
DVVDSNENLTWPLVLECTRASTSAVKLQNNEIKPSGLKTMVVSAGQEQTNCNTSSLAYYE
PVQGRKMLMALLSDNAYLKWARVEGKDGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLN
NLHRGQVLGHIAATVRLQAGSNTEFASNSSVLSLVNFTVDPQKAYLDFVNAGGAPLTNCV
KMLTPKTGTGIAISVKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPAQ
CVRDPVGFCLSNTPCNVCQYWIGYGCNCDSLRQVALPQSKDSNFLNRVRGSIVNARIEPC
SSGLSTDVVFRAFDICNYKAKVAGIGKYYKTNTCRFVELDDQGHHLDSYFVVKRHTMENY
ELEKHCYDLLRDCDAVAPHDFFIFDVDKVKTPHIVRQRLTEYTMMDLVYALRHFDQNSEV
LKAILVKYGCCDVTYFENKLWFDFVENPSVIGVYHKLGERVRQAILNTVKFCDHMVKAGL
VGVLTLDNQDLNGKWYDFGDFVITQPGSGVAIVDSYYSYLMPVLSMTDCLAAETHRDCDF
NKPLIEWPLTEYDFTDYKVQLFEKYFKYWDQTYHANCVNCTDDRCVLHCANFNVLFAMTM
PKTCFGPIVRKIFVDGVPFVVSCGYHYKELGLVMNMDVSLHRHRLSLKELMMYAADPAMH
IASSNAFLDLRTSCFSVAALTTGLTFQTVRPGNFNQDFYDFVVSKGFFKEGSSVTLKHFF
FAQDGNAAITDYNYYSYNLPTMCDIKQMLFCMEVVNKYFEIYDGGCLNASEVVVNNLDKS
AGHPFNKFGKARVYYESMSYQEQDELFAMTKRNVIPTMTQMNLKYAISAKNRARTVAGVS
ILSTMTNRQYHQKMLKSMAATRGATCVIGTTKFYGGWDFMLKTLYKDVDNPHLMGWDYPK
CDRAMPNMCRIFASLILARKHGTCCTTRDRFYRLANECAQVLSEYVLCGGGYYVKPGGTS
SGDATTAYANSVFNILQATTANVSALMGANGNKIVDKEVKDMQFDLYVNVYRSTSPDPKF
VDKYYAFLNKHFSMMILSDDGVVCYNSDYAAKGYIAGIQNFKETLYYQNNVFMSEAKCWV
ETDLKKGPHEFCSQHTLYIKDGDDGYFLPYPDPSRILSAGCFVDDIVKTDGTLMVERFVS
LAIDAYPLTKHEDIEYQNVFWVYLQYIEKLYKDLTGHMLDSYSVMLCGDNSAKFWEEAFY
RDLYSSPTTLQAVGSCVVCHSQTSLRCGTCIRRPFLCCKCCYDHVIATPHKMVLSVSPYV
CNAPGCGVSDVTKLYLGGMSYFCVDHRPVCSFPLCANGLVFGLYKNMCTGSPSIVEFNRL
ATCDWTESGDYTLANTTTEPLKLFAAETLRATEEASKQSYAIATIKEIVGERQLLLVWEA
GKSKPPLNRNYVFTGYHITKNSKVQLGEYIFERIDYSDAVSYKSSTTYKLTVGDIFVLTS
HSVATLTAPTIVNQERYVKITGLYPTITVPEEFASHVANFQKSGYSKYVTVQGPPGTGKS
HFAIGLAIYYPTARVVYTACSHAAVDALCEKAFKYLNIAKCSRIIPAKARVECYDRFKVN
ETNSQYLFSTINALPETSADILVVDEVSMCTNYDLSIINARIKAKHIVYVGDPAQLPAPR
TLLTRGTLEPENFNSVTRLMCNLGPDIFLSMCYRCPKEIVSTVSALVYNNKLLAKKELSG
QCFKILYKGNVTHDASSAINRPQLTFVKNFITANPAWSKAVFISPYNSQNAVARSMLGLT
TQTVDSSQGSEYQYVIFCQTADTAHANNINRFNVAITRAQKGILCVMTSQALFESLEFTE
LSFTNYKLQSQIVTGLFKDCSRETSGLSPAYAPTYVSVDDKYKTSDELCVNLNLPANVPY
SRVISRMGFKLDATVPGYPKLFITREEAVRQVRSWIGFDVEGAHASRNACGTNVPLQLGF
STGVNFVVQPVGVVDTEWGNMLTGIAARPPPGEQFKHLVPLMHKGAAWPIVRRRIVQMLS
DTLDKLSDYCTFVCWAHGFELTSASYFCKIGKEQKCCMCNRRAAAYSSPLQSYACWTHSC
GYDYVYNPFFVDVQQWGYVGNLATNHDRYCSVHQGAHVASNDAIMTRCLAIHSCFIERVD
WDIEYPYISHEKKLNSCCRIVERNVVRAALLAGSFDKVYDIGNPKGIPIVDDPVVDWHYF
DAQPLTRKVQQLFYTEDMASRFADGLCLFWNCNVPKYPNNAIVCRFDTRVHSEFNLPGCD
GGSLYVNKHAFHTPAYDVSAFRDLKPLPFFYYSTTPCEVHGNGSMIEDIDYVPLKSAVCI
TTCNLGGAVCRKHATEYREYMEAYNLVSASGFRLWCYKTFDIYNLWSTFTKVQGLENIAF
NVVKQGHFIGVEGELPVAVVNDKIFTKSGVNDICMFENKTTLPTNIAFELYAKRAVRSHP
DFKLLHNLQADICYKFVLWDYERSNIYGTATIGVCKYTDIDVNSALNICFDIRDNGSLEK
FMSTPNAIFISDRKIKKYPCMVGPDYAYFNGAIIRDSDVVKQPVKFYLYKKVNNEFIDPT
ECIYTQSRSCSDFLPLSDMEKDFLSFDSDVFVKKYGLENYAFEHVVYGDFSHTTLGGLHL
LIGLYKKQQEGHIIMEEMLKGSSTIHNYFITETNTAAFKAVCSVIDLKLDDFVMILKSQD
LGVVSKVVKVPIDLTMIEFMLWCKDGQVQTFYPRLQASADWKPGHAMPSLFKVQNVNLER
CELANYKQSIPMPRGVHMNIAKYMQLCQYLNTCTLAVPANMRVIHFGAGSDKGIAPGTSV
LRQWLPTDAIIIDNDLNEFVSDADITLFGDCVTVRVGQQVDLVISDMYDPTTKNVTGSNE
SKALFFTYLCNLINNNLALGGSVAIKITEHSWSVELYELMGKFAWWTVFCTNANASSSEG
FLLGINYLGTIKENIDGGAMHANYIFWRNSTPMNLSTYSLFDLSKFQLKLKGTPVLQLKE
SQINELVISLLSQGKLLIRDNDTLSVSTDVLVNTYRKLR
Download sequence
Identical sequences W5ZZG7

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]