SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|565952979|ref|NP_937947.2| from NCBI viral sequences

Domain architecture


Domain assignment details

Strong hits

Sequence:  gi|565952979|ref|NP_937947.2|
Domain Number 1 Region: 3247-3547
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 3.59e-132
Family Viral cysteine protease of trypsin fold 0.000000115
 
Domain Number 2 Region: 6422-6605
Classification Level Classification E-value
Superfamily S-adenosyl-L-methionine-dependent methyltransferases 9.14e-75
Family Nsp15 N-terminal domain-like 0.0000000739
 
Domain Number 3 Region: 3964-4113
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 1.96e-61
Family Coronavirus NSP8-like 0.0000072
 
Domain Number 4 Region: 6641-6791
Classification Level Classification E-value
Superfamily EndoU-like 4.9e-59
Family Nsp15 C-terminal domain-like 0.000000352
 
Domain Number 5 Region: 4770-5074,5115-5259
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.09e-53
Family RNA-dependent RNA-polymerase 0.021
 
Domain Number 6 Region: 4238-4359
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 1.16e-52
Family Coronavirus NSP10-like 0.0000075
 
Domain Number 7 Region: 4125-4232
Classification Level Classification E-value
Superfamily Replicase NSP9 7.72e-40
Family Replicase NSP9 0.0000767
 
Domain Number 8 Region: 850-966
Classification Level Classification E-value
Superfamily NSP3A-like 1.7e-35
Family NSP3A-like 0.00042
 
Domain Number 9 Region: 1273-1428
Classification Level Classification E-value
Superfamily Macro domain-like 2.52e-35
Family Macro domain 0.00016
 
Domain Number 10 Region: 5555-5879
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 1.32e-29
Family Tandem AAA-ATPase domain 0.079
 
Domain Number 11 Region: 3837-3925
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 4.71e-27
Family Coronavirus NSP7-like 0.00021
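
The strong hits above are listed by E-value rather than by position, and Domain Number 5 spans two segments. As a rough sketch only, the regions could be parsed, ordered along the polyprotein, and summed for coverage as follows. The names `HITS`, `parse_region`, `order_by_position`, and `covered_residues` are hypothetical, and the code assumes region coordinates are 1-based and inclusive, which the page does not state.

```python
from typing import List, Tuple

# Strong hits transcribed from the listing above:
# (domain number, superfamily, region string as shown on the page).
HITS: List[Tuple[int, str, str]] = [
    (1, "Trypsin-like serine proteases", "3247-3547"),
    (2, "S-adenosyl-L-methionine-dependent methyltransferases", "6422-6605"),
    (3, "Coronavirus NSP8-like", "3964-4113"),
    (4, "EndoU-like", "6641-6791"),
    (5, "DNA/RNA polymerases", "4770-5074,5115-5259"),
    (6, "Coronavirus NSP10-like", "4238-4359"),
    (7, "Replicase NSP9", "4125-4232"),
    (8, "NSP3A-like", "850-966"),
    (9, "Macro domain-like", "1273-1428"),
    (10, "P-loop containing nucleoside triphosphate hydrolases", "5555-5879"),
    (11, "Coronavirus NSP7-like", "3837-3925"),
]
SEQUENCE_LENGTH = 7095  # "Sequence length" reported on the page


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '4770-5074,5115-5259' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def order_by_position(hits: List[Tuple[int, str, str]]) -> List[Tuple[int, str, str]]:
    """Sort hits by the start of their first segment (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: parse_region(hit[2])[0][0])


def covered_residues(hits: List[Tuple[int, str, str]]) -> int:
    """Count distinct residues covered, assuming 1-based inclusive coordinates."""
    covered = set()
    for _, _, region in hits:
        for start, end in parse_region(region):
            covered.update(range(start, end + 1))
    return len(covered)


if __name__ == "__main__":
    for number, superfamily, region in order_by_position(HITS):
        print(f"{region:>22}  domain {number:>2}  {superfamily}")
    fraction = covered_residues(HITS) / SEQUENCE_LENGTH
    print(f"Fraction of sequence covered by strong hits: {fraction:.1%}")
```

Sorting on the first segment keeps a split domain in one row; a per-segment view would need each segment emitted separately.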
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|565952979|ref|NP_937947.2|
Sequence length 7095
Comment replicase polyprotein [Human coronavirus OC43]
Sequence
MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDMICSTTAQKLETDGICPENHVMV
DCRRLLKQECCVQSSLIREIVMNASPYDLEVLLQDALQSREAVLVTTPLGMSLEACYVRG
CNPKGWTMGLFRRRSVCNTGRCTVNKHVAYQLYMIDPAGVCLGAGQFVGWVIPLAFMPVQ
SRKFIVPWVMYLRKRGEKGAYNKDHGRGGFGHVYDFKVEDAYDQVHDEPKGKFSKKAYAL
IRGYRGVKPLLYVDQYGCDYTGSLADGLEAYADKTLQEMKALFPTWSQELLFDVIVAWHV
VRDPRYVMRLQSAATIRSVAYVANPTEDLCDGSVVIKEPVHVYADDSIILRQYNLVDIMS
HFYMEADTVVNAFYGVALKDCGFVMQFGYIDCEQDSCDFKGWIPGNMIDGFACTTCGHVY
EVGDLIAQSSGVLPVNPVLHTKSAAGYGGFGCKDSFTLYGQTVVYFGGCVYWSPARNIWI
PILKSSVKSYDSLVYTGVLGCKAIVKETNLICKALYLDYVQHKCGNLHQRELLGVSDVWH
KQLLLNRGVYKPLLENIDYFNMRRAKFSLETFTVCADGFMPFLLDDLVPRAYYLAVSGQA
FCDYADKLCHAVVSKSKELLDVSLDSLGAAIHYLNSKIVDLAQHFSDFGTSFVSKIVHFF
KTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVKNIPRYASAVAQAFQSVAKVVLDSLRV
TFIDGLSCFKIGRRRICLSGRKIYEVERGLLHSSQLPLDVYDLTMPSQVQKAKQKPIYLK
GSGSDFSLADSVVEVVTTSLTPCGYSEPPKVADKICIVDNVYMAKAGDKYYPVVVDDHVG
LLDQAWRVPCAGRRVTFKEQPTVKEIISMPKIIKVFYELDNDFNTILNTACGVFEVDDTV
DMEEFYAVVIDAIEEKLSPCKELEGVGAKVSAFLQKLEDNPLFLFDEAGEEVLAPKLYCA
FTAPEDDDFLEESDVEEDDVEGEETDLTVTSAGQPCVASEQEESSEVLEDTLDDGPSVET
SDSQVEEDVEMSDFVDLESVIQDYENVCFEFYTTEPEFVKVLGLYVPKATRNNCWLRSVL
AVMQKLPCQFKDKNLQDLWVLYKQQYSQLFVDTLVNKIPANIVLPQGGYVADFAYWFLTL
CDWQCVAYWKCIKCDLALKLKGLDAMFFYGDVVSHICKCGESMVLIDVDVPFTAHFALKD
KLFCAFITKRIVYKAACVVDVNDSHSMAVVDGKQIDDHRITSITSDKFDFIIGHGMSFSM
TTFEIAQLYGSCITPNVCFVKGDIIKVSKLVKAEVVVNPANGHMAHGGGVAKAIAVAAGQ
QFVKETTDMVKSKGVCATGDCYVSTGGKLCKTVLNVVGPDARTQGKQSYVLLERVYKHLN
NYDCVVTTLISAGIFSVPSDVSLTYLLGTAKKQVVLVSNNQEDFDLISKCQITAVEGTKK
LAARLSFNVGRSIVYETDANKLILINDVAFVSTFNVLQDVLSLRHDIALDDDARTFVQSN
VDVVPEGWRVVNKFYQINGVRTVKYFECTGGIDICSQDKVFGYVQQGIFNKATVAQIKAL
FLDKVDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDN
LSSEDLKAVRSSFNFDQKELLAYYNMLVNCFKWQVVVNGKYFTFKQANNNCFVNVSCLML
QSLHLTFKIVQWQEAWLEFRSGRPARFVALVLAKGGFKFGDPADSRDFLRVVFSQVDLTG
AICDFEIACKCGVKQEQRTGLDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDVPFL
ICSNTPASVKLPKGVGSANIFIGDKVGHYVHVKCEQSYQLYDASNVKKVTDVTGKLSDCL
YLKNLKQTFKSVLTTYYLDDVKKIEYKPDLSQYYCDGGKYYTQRIIKAQFKTFEKVDGVY
TNFKLIGHTVCDSLNAKLGFDSSKEFVEYKITEWPTATGDVVLATDDLYVKRYERGCITF
GKPVIWLSHEKASLNSLTYFNRPSLVDDNKFDVLKVDDVDDGGDSSESGAKETKEINIIK
LSGVKKPFKVEDSVIVNDDTSETKYVKSLSIVDVYDMWLTGCKYVVRTANALSRAVNVPT
IRKFIKFGMTLVSIPIDLLNLREIKPAVNVVKAVRNKISVCFNFIKWLFVLLFGWIKISA
DNKVIYTTEIASKLTCKLVALAFKNAFLTFKWSMVARGACIIATIFLLWFNFIYANVIFS
DFYLPKIGFLPTFVGKIAQWIKNTFSLVTICDLYSMQDVGFKNQYCNGSIACQFCLAGFD
MLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWL
PELFMLSTLHWSFRLLVALANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFC
YKRNRSLRVKCSTIVGGMIRYYDVMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALD
LSKELKRPIQPTDVAYHTVTDVKQVGCSMRLFYDRDGQRTYDDVNASLFVDYSNLLHSKV
KSVPNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVY
VDTFLSMFDVDKKSLNALIATAHSSIKQGTQIYKVLDTFLSCARKSCSIDSDVDTKCLAD
SVMSAVSAGLELTDESCNNLVPTYLKSDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIW
SVDAFNQFSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKGGAVFSYFVYVC
FVLSLVCFIGLWCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQ
WYESTFGLSYYSNSMACPIVVAVIDQDFGSTVFNVPTKVLRYGYHVLHFITHALSADGVQ
CYTPHSQISYSNFYASGCVLSSACTMFTMADGSPQPYCYTEGLMQNASLYSSLVPHVRYN
LANAKGFIRFPEVLREGLVRIVRTRSMSYCRVGLCEEADEGICFNFNGSWVLNNDYYRSL
PGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDY
TSVVFVNVIVWCVNFMMLFVFQVYPILSCVYAICYFYATLYFPSEISVIMHLQWLVMYGT
IMPLWFCLLYIAVVVSNHAFWVFSYCRKLGTSVRSDGTFEEMALTTFMITKDSYCKLKNS
LSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFTNNNGSDVLYQPPTASV
STSFLQSGIVKMVNPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSASDMTNPDY
TNLLCRVTSSDFTVLFDRLSLTVMSYQMRGCMLVLTVTLQNSRTPKYTFGVVKPGETFTV
LAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVGYVIMGDCVKFVYMHQLELSTGCHTG
TDFNGDFYGPYKDAQVVQLLIQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWA
LSNGFSQVKSDLVIDALASMTGVSLETLLAAIKRLKNGFQGRQIMGSCSFEDELTPSDVY
QQLAGIKLQSKRTRLFKGTVCWIMASTFLFSCIITAFVKWTMFMYVTTNMFSITFCALCV
ISLAMLLVKHKHLYLTMYITPVLFTLLYNNYLVVYKHTFRGYVYAWLSYYVPSVEYTYTD
EVIYGMLLLVGMVFVTLRSINHDLFSFIMFVGRLISVFSLWYKGSNLEEEILLMLASLFG
TYTWTTVLSMAVAKVIAKWVAVNVLYFTDIPQIKIVLLCYLFIGYIISCYWGLFSLMNSL
FRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQSKLT
DVKCANVVLLNCLQHLHVASNSKLWHYCSTLHNEILATSDLSVAFEKLAQLLIVLFANPA
AVDSKCLTSIEEVCDDYAKDNTVLQALQSEFVNMASFVEYEVAKKNLDEARFSGSANQQQ
LKQLEKACNIAKSAYERDRAVAKKLERMADLALTNMYKEARINDKKSKVVSALQTMLFSM
VRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLNIIVPDKSVYDQVVDNVYVTYAGNVW
QIQTIQDSDGTNKQLNEISDDCNWPLVIIANRYNEVSATVLQNNELMPAKLKIQVVNSGP
DQTCNTPTQCYYNNSNNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDAK
GLKIKYLYFVKGCNTLARGWVVGTISSTVRLQAGTATEYASNSSILSLCAFSVDPKKTYL
DFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRARVEHPDVDG
LCKLRGKFVQVPVGIKDPVSYVLTHDVCRVCGFWRDGSCSCVSTDTTVQSKDTNFLNRVR
GTSVDARLVPCASGLSTDVQLRAFDIYNASVAGIGLHLKVNCCRFQRVDENGDKLDQFFV
VKRTDLTIYNREMKCYERVKDCKFVAEHDFFTFDVEGSRVPHIVRKDLTKYTMLDLCYAL
RHFDRNDCMLLCDILSIYAGCEQSYFTKKDWYDFVENPDIINVYKKLGPIFNRALVSATE
FADKLVEVGLVGVLTLDNQDLNGKWYDFGDYVIAAPGCGVAIADSYYSYIMPMLTMCHAL
DCELYVNNAYRLFDLVQYDFTDYKLELFNKYFKHWSMPYHPNTVDCQDDRCIIHCANFNI
LFSMVLPNTCFGPLVRQIFVDGVPFVVSIGYHYKELGIVMNMDVDTHRYRLSLKDLLLYA
ADPALHVASASALYDLRTCCFSVAAITSGVKFQTVKPGNFNQDFYDFVLSKGLLKEGSSV
DLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGCIPASQVIV
NNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRAR
TVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLM
GWDYPKCDRAMPNLLRIVSSLVLARKHETCCSQSDRFYRLANECAQVLSEIVMCGGCYYV
KPGGTSSGDATTAFANSVFNICQAVSANVCALMSCNGNKIEDLSIRALQKRLYSHVYRSD
KVDSTFVTEYYEFLNKHFSMMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMS
ESKCWVEHDINNGPHEFCSQHTMLVKMDGDDVYLPYPNPSRILGAGCFVDDLLKTDSVLL
IERFVSLAIDAYPLVYHENEEYQKVFRVYLAYIKKLYNDLGNQILDSYSVILSTCDGQKF
TDESFYKNMYLRSAVMQSVGACVVCSSQTSLRCGSCIRKPLLCCKCCYDHVMATDHKYVL
SVSPYVCNAPGCDVNDVTKLYLGGMSYYCEDHKPQYSFKLVMNGLVFGLYKQSCTGSPYI
DDFNRIASCKWTDVDDYILANECTERLKLFAAETQKATEEAFKQSYASATIQEIVSEREL
ILSWEIGKVKPPLNKNYVFTGYHFTKNGKTVLGEYVFDKSELTNGVYYRATTTYKLSVGD
VFVLTSHSVANLSAPTLVPQENYSSIRFASVYSVLETFQNNVVNYQHIGMKRYCTVQGPP
GTGKSHLAIGLAVFYCTARVVYTAASHAAVDALCEKAYKFLNINDCTRIVPAKVRVECYD
KFKINDTTRKYVFTTINALPEMVTDIVVVDEVSMLTNYELSVINARIRAKHYVYIGDPAQ
LPAPRVLLSKGTLEPKYFNTVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYENKLKAK
NESSSLCFKVYYKGVTTHESSSAVNMQQIYLINKFLKANPLWHKAVFISPYNSQNFAAKR
VLGLQTQTVDSAQGSEYDYVIYSQTAETAHSVNVNRFNVAITRAKKGILCVMSNMQLFEA
LQFTTLTLDKVPQAVETKVQCSTNLFKDCSKSYSGYHPAHAPSFLAVDDKYKATGDLAVC
LGIGDSAVTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSI
GTNFPLQLGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGHRWDV
VRPRIVQMFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATVYNSRT
GYYGCWRHSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLA
VYDCFCNNINWNVEYPIISNELSINTSCRVLQRVILKAAMLCNRYTLCYDIGNPKAIACV
KDFDFKFYDAQPIVKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVL
NNLNLPGCNGGSLYVNKHAFHTKPFARAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYV
PLKSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKL
QSLENVVYNLVKTGHYTGQAGEMPCAIINDKVVAKIDKEDVVIFINNTTYPTNVAVELFA
KRSVRHHPELKLFRNLNIDVCWKHVIWDYARESIFCSNTYGVCMYTDLKFIDKLNVLFDG
RDNGALEAFKRSNNGVYISTTKVKSLSMIRGPPRAELNGVVVDKVGDTDCVFYFAVRKEG
QDVIFSQFDSLGVSSNQSPQGNLGSNGKPGNVGGNDALSISTIFTQSRVISSFTCRTDME
KDFIALDQDVFIQKYGLEDYAFEHIVYGNFNQKIIGGLHLLIGLYRRQQTSNLVVQEFVS
YDSSIHSYFITDEKSGGSKSVCTVIDILLDDFVALVKSLNLNCVSKVVNVNVDFKDFQFM
LWCNDEKVMTFYPRLQAASDWKPGYSMPVLYKYLNSPMERVSLWNYGKPVTLPTGCMMNV
AKYTQLCQYLNTTTLAVPVNMRVLHLGAGSEKGVAPGSAVLRQWLPAGTILVDNDLYPFV
SDSVATYFGDCITLPFDCQWDLIISDMYDPITKNIGEYNVSKDGFFTYICHMIRDKLALG
GSVAIKITEFSWNAELYKLMGYFAFWTVFCTNANASSSEGFLIGINYLCKPKVEIDGNVM
HANYLFWRNSTVWNGGAYSLFDMAKFPLKLAGTAVINLRADQINDMVYSLLEKGKLLIRD
TNKEVFVGDSLVNVI
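
To recover the residues behind a domain region from the wrapped sequence above, the lines first have to be joined into one string. The following is a minimal sketch, not part of the SUPERFAMILY site: `join_sequence` and `extract_region` are hypothetical helper names, and coordinates are assumed to be 1-based and inclusive. Only the first two sequence lines are used here to keep the example short.

```python
from typing import Iterable


def join_sequence(lines: Iterable[str]) -> str:
    """Concatenate wrapped sequence lines into a single string."""
    return "".join(line.strip() for line in lines)


def extract_region(sequence: str, region: str) -> str:
    """Return the residues for a region such as '3247-3547' or '4770-5074,5115-5259'.

    Coordinates are treated as 1-based and inclusive (an assumption);
    the segments of a split region are concatenated in the order given.
    """
    pieces = []
    for part in region.split(","):
        start, end = (int(value) for value in part.split("-"))
        if not 1 <= start <= end <= len(sequence):
            raise ValueError(f"segment {part} is outside 1..{len(sequence)}")
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


if __name__ == "__main__":
    # First two lines of the sequence listed above.
    wrapped = [
        "MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDMICSTTAQKLETDGICPENHVMV",
        "DCRRLLKQECCVQSSLIREIVMNASPYDLEVLLQDALQSREAVLVTTPLGMSLEACYVRG",
    ]
    sequence = join_sequence(wrapped)
    print(extract_region(sequence, "1-10"))
    print(extract_region(sequence, "1-10,61-65"))
```

With the full 7095-residue sequence joined the same way, any region string from the domain assignment details can be passed to `extract_region` unchanged.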
Identical sequences gi|565952979|ref|NP_937947.2|
