SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A291I627 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  A0A291I627
Domain Number 1 Region: 3248-3550
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 2.11e-118
Family Viral cysteine protease of trypsin fold 0.000000041
Further Details:      
 
Domain Number 2 Region: 3967-4119
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 2.88e-62
Family Coronavirus NSP8-like 0.00000387
Further Details:      
 
Domain Number 3 Region: 4243-4365
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 4.84e-54
Family Coronavirus NSP10-like 0.00000537
Further Details:      
 
Domain Number 4 Region: 1115-1269
Classification Level Classification E-value
Superfamily Macro domain-like 5.21e-34
Family Macro domain 0.0000104
Further Details:      
 
Domain Number 5 Region: 4130-4237
Classification Level Classification E-value
Superfamily Replicase NSP9 9.81e-33
Family Replicase NSP9 0.0000347
Further Details:      
 
Domain Number 6 Region: 3846-3928
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 2.48e-29
Family Coronavirus NSP7-like 0.000097
Further Details:      
 
Domain Number 7 Region: 854-965
Classification Level Classification E-value
Superfamily NSP3A-like 4.71e-28
Family NSP3A-like 0.00083
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) A0A291I627
Sequence length 4391
Comment (tr|A0A291I627|A0A291I627_9BETC) ORF1a {ECO:0000313|EMBL:ATG84733.1} OX=1335626 OS=Middle East respiratory syndrome-related coronavirus. GN=orf1ab OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus.
Sequence
MSFVAGVIAQGARGTYRAALNSEKHQDHVSLTVPLCGSGNLVEKLSPWFMDGENAYEVVK
AMLLKKEPLLYVPIRLAGHTRHLPGPRVYLVERLIACENPFMVNQLAYSSSANGSLVGTT
LQGKPIGMFFPYDIELVTGKQNILLRKYGRGGYHYTPVHYERDNTSCPEWMDDFEADPKG
KYAQNLLKKLIGGDVTPVDQYMCGVDGKPISAYAFLMAKDGITKLADVEADVAARADDEG
FITLKNNLYRLVWHVERKDVPYPKQSIFTINSVVQKDGVENTPPHYFTLGCKILTLTPRN
KWSGVSDLSLKQKLLYTFYGKESLENPTYIYHSAFIECGSCGNDSWLTGNAIQGFACGCG
ASYTANDVEVQSSGMIKPNALLCATCPFAKGDSCSSNCKHSVAQLVSYLSERCNVIADSK
SFTLIFGGVAYAYFGCEEGTMYFVPRAKSVVSRIGDSIFTGCTGSWNKVTQIANMFLEQT
QHSLNFVGEFVVNDVVLAILSGTTTNVDKIRQLLKGVTIDKLRDYLADYDVAVTAGPFMD
NAINVGGTGLQYAAITAPYVVLTGLGESFKKVATIPYKVCSSVKDTLTYYAHSVLYRVFP
YDMDSGVSSFSELLFDCVDLSVASTYFLVRLLQDKTGDFMSTIITSCQTAVSKLLDTCFE
ATEATFNFLLDLAGLFRIFLRNAYVYTSQGFVVVNGKVSTLVKQVLDLLNKGMQLLHTKV
SWAGSNISAVIYSGRESLIFPSGTYYCVTTKAKSVQQDLDVILPGEFSKKQLGLLQPTDN
STTVSVTVSSNMVETVVGQLEQTNMHSPDVIVGDYVIISEKLFVRSKEEDGFAFYPACTN
GHAVPTLFRLKGGAPVKKVAFGGDQVHEVAAVRSVTVEYNIHAVLDTLLASSSLRTFVVD
KSLSIEEFADVVKEQVSDLLVKLLRGMPIPDFDLDDFIDAPCYCFNAEGDASWSSTMIFS
LHPVECDEECSEVEASDLEEGESECISETSTEQVDVSHDVSDDEWAAAVDEAFPLDEAED
VTESVQEEAQPVEVPVEDIAQVVIADTLQETPVVSDTVEVPPQVVKLPSEPQTIQPEVKE
VAPVYEADTEQTQSVTVKPKRLRKKRNVDPLSNFEHKVITECVTIVLGDAIQVAKCYGES
VLVNAANTHLKHGGGIAGAINAASKGAVQKESDEYILAKGPLQVGDSVLLQGHSLAKNIL
HVVGPDARAKQDVSLLSKCYKAMNAYPLVVTPLVSTGIFGVKPAVSFDYLIREAKTRVLV
VVNSQDVYKSLTIVDIPQSLTFSYDGLRGAIRKAKDYGFTVFVCTDNSANTKVLRNKGVD
YTKKFLTVDGVQYYCYTSKDTLDDILQQANKSVGIISMPLGYVSHGLDLIQAGSVVRRVN
VPYVCLLANKEQEAILMSEDVKLNPSEDFIKHVRTNGGYNSWHLVEGELLVQDLRLNKLL
HWSDQTICYKDSVFYVVKNSTAFPFETFSACRAYLDSRTTQQLTIEVLVTVDGVNFRTVV
LNNKNTYRSQLGCVFFNGADISDTIPDEKQNGHSLYLADNLTADETKALKELYGPVDPTF
LHRFYSLKAAVHKWKMVVCDKVRSLKLSDNNCYLNAVIMTLDLLKDIKFVIPALQHAFMK
HKGGDSTDFIALIMAYGNCTFGAPDDASRLLHTVLAKAELCCSARMVWREWCNVCGIKDV
VLQGLKACCYVGVQTVEDLRARMTYVCQCGGERHRQIVEHTTPWLLLSGTPNEKLVTTST
APDFVAFNVFQGIETAVGHYVHARLKGGLILKFDSGTVSKTSDWKCKVTDVIFPGQKYSS
DCNVVRYSLDGNFRTEVDPDLSAFYVKDGKYFTSEPPVTYSPATILAGSVYTNSCLVSSD
GQPGGDAISLSFNNLLGFDSSKPVTKKYTYSFLPKEDGDVLLAEFDTYDPIYKNGAMYKG
KPILWVKKASYDTNLNKFNRASLRQIFDVAPIELENKFTPLSVASTPVEPPTVDVVALQQ
EMTIVKCKGLNKPFVKDNVSFVVDDSGTPVVEYLSKEDLHTLYVDPKYQVIVLKDNVLSS
MLRLHTVESGDINVVAASGSLTRKVKLLFRASFYFKEFATRTFTATTAVGSCIKSVVRHL
GVTKGILTGCFSFVKMLFILPLAYFSDSKLGTTEVKVSALKTAGVVTGNVVKQCCTAAVD
LSMDKLRRVDWKSTLRLLLMLCTTMVLLSSVYHLYVFNQVLSSDVMFEDAQGLKKFYKEV
RAYLGISSACDGLASAYRANSFDVPTFCANRSAMCNWCLISQDSITHYPALKMVQTHLSH
YVLNIDWLWFAFETGLAYMLYTSAFNWLLLAGTLHYFFAQTSIFVDWRSYNYAVSSAFWL
FTHIPMAGLVRMYNLLACLWLLRKFYQHVINGCKDTACLLCYKRNRLTRVEASTVVCGGK
RTFYITANGGISFCRRHNWNCVDCDIAGVGNTFICEEVANDLTTALRRPINATDRSHYYV
DSVTVKETVVQFNYRRDGQPFYERFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSD
RGQESLARSACVYYSQVLCKSILLVDSSLVTSVGDSSEIATKMFDSFVNSFVSLYNVTRD
KLEKLISTARDGVRRGDNFHSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQIT
NESYNNYVPSYVKPDSVSTSDLGSLIDCNAASVNQIVLRNSNGACIWNAAAYMKLSDALK
RQIRIACRKCNLAFRLTTSKLRANDNILSVRFTANKIVGGAPTWFNVLRDFTLKGYVLAT
IIVFLCAVLMYLCLPTFSMVPVEFYEDRILDFKVLDNGIIRDVNPDDKCFANKHRSFTQW
YHEHVGGVYDNSITCPLTVAVIAGVAGARIPDVPTTLAWVNNQIIFFVSRVFANTGSVCY
TPIDEIPYKSFSDSGCILPSECTMFRDAEGRMTPYCHDPTVLPGAFAYSQMRPHVRYDLY
DGNMFIKFPEVVFESTLRITRTLSTQYCRFGSCEYAQEGVCITTNGSWAIFNDHHLNRPG
VYCGSDFIDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRAFADYTQ
CAVIAVVAAVLNSLCICFVASIPLCIVPYTVLYYYATFYFTNEPAFIMHVSWYIMFGPIV
PIWMTCVYTVAMCFRHFFWVLAYFSKKHVEVFTDGKLNCSFQDAASNIFVINKDTYAALR
NSLTNDAYSRFLGLFNKYKYFSGAMETAAYREAAACHLAKALQTYSETGSDLLYQPPNCS
ITSGVLQSGLVKMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQLSDPN
YDALLISMTNHSFSVQKHIGAPANLRVVGHAMQGTLLKLTVDVANPSTPAYTFTTVKPGA
AFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVINFCYMHQMELANG
THTGSAFDGTMYGAFMDKQVHQVQLTDKYCSVNVVAWLYAAILNGCAWFVKPNRTSVVSF
NEWALANQFTEFVGTQSIDMLAVKTGVAIEQLLYAIQQLYTGFQGKQILGSTMLEDEFTP
EDVNMQIMGVVMQSGVRKVTYGTAHWLFATLVSTYVIILQATKFTLWNYLFETIPTQLFP
LLFVTMAFVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPTTPISSALIAVANWLAPTNA
YMRTTHTDIGVYISMSLVLVIVVKRLYNPSLSNFALALCSGVMWLYTYSIGEASSPIAYL
VFVTTLTSDYTITVFVTVNLAKVCTYAIFAYSPQLTLVFPEVKMILLLYTCLGFMCTCYF
GVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAPRNSWEAMALNFKLIGIGGTPCIK
VAAMQSKLTDLKCTSVVLLSVLQQLHLEANSRAWAFCVKCHNDILAATDPSEAFEKFVSL
FATLMTFSGNVDLDALASDIFDTPSVLQATLSEFSHLATFAELEAAQKAYQEAMDSGDTS
PQVLKALQKAVNIAKNAYEKDKAVARKLERMADQAMTSMYKQARAEDKKAKIVSAMQTML
FGMIKKLDNDVLNGIISNARNGCIPLSVIPLCASNKLRVVIPDFTVWNQVVTYPSLNYAG
ALWDITVINNVDNEIVKSSDVVDSNENLTWPLVLECTRASTSAVKLQNNEIKPSGLKTMV
VSAGQEQTNCNTSSLAYYEPVQGRKMLMALLSDNAYLKWARVEGKDGFVSVELQPPCKFL
IAGPKGPEIRYLYFVKNLNNLHRGQVLGHIAATVRLQAGSNTEFASNSSVLSLVNFTVDP
QKAYLDFVNAGGAPLTNCVKMLTPKTGTGIAISVKPESTADQETYGGASVCLYCRAHIEH
PDVSGVCKYKGKFVQIPAQCVRDPVGFCLSNTPCNVCQYWIGYGCNCDSLRQAALPQSKD
SNFLNESGVLL
Download sequence
Identical sequences A0A291I627

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]