SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for H9AA28 from the UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence: H9AA28

Domain Number 1 Region: 3307-3607
Classification Level  Classification                                        E-value
Superfamily           Trypsin-like serine proteases                         6.74e-136
Family                Viral cysteine protease of trypsin fold               0.00000011

Domain Number 2 Region: 6478-6661
Classification Level  Classification                                        E-value
Superfamily           S-adenosyl-L-methionine-dependent methyltransferases  4.48e-75
Family                Nsp15 N-terminal domain-like                          0.0000000739

Domain Number 3 Region: 4024-4173
Classification Level  Classification                                        E-value
Superfamily           Coronavirus NSP8-like                                 3.01e-60
Family                Coronavirus NSP8-like                                 0.00000709

Domain Number 4 Region: 6698-6847
Classification Level  Classification                                        E-value
Superfamily           EndoU-like                                            1.84e-58
Family                Nsp15 C-terminal domain-like                          0.000000343

Domain Number 5 Region: 4832-5134,5175-5319
Classification Level  Classification                                        E-value
Superfamily           DNA/RNA polymerases                                   4.84e-53
Family                RNA-dependent RNA-polymerase                          0.021

Domain Number 6 Region: 4298-4419
Classification Level  Classification                                        E-value
Superfamily           Coronavirus NSP10-like                                5.23e-53
Family                Coronavirus NSP10-like                                0.00000705

Domain Number 7 Region: 4185-4292
Classification Level  Classification                                        E-value
Superfamily           Replicase NSP9                                        9.94e-41
Family                Replicase NSP9                                        0.000081

Domain Number 8 Region: 840-953
Classification Level  Classification                                        E-value
Superfamily           NSP3A-like                                            3.53e-36
Family                NSP3A-like                                            0.00039

Domain Number 9 Region: 1289-1444
Classification Level  Classification                                        E-value
Superfamily           Macro domain-like                                     1.68e-33
Family                Macro domain                                          0.00016

Domain Number 10 Region: 5618-5940
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  2.31e-30
Family                Tandem AAA-ATPase domain                              0.076

Domain Number 11 Region: 3897-3985
Classification Level  Classification                                        E-value
Superfamily           Coronavirus NSP7-like                                 2.09e-27
Family                Coronavirus NSP7-like                                 0.0002
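
The strong hits above are listed in order of increasing superfamily E-value, not in order along the polyprotein. The short Python sketch below re-sorts them by start position and adds up how many residues the assigned regions cover. The values are transcribed from the listing; the names `DOMAINS`, `region_length`, `by_position`, and `covered` are illustrative, and the length calculation assumes the region coordinates are 1-based and inclusive, which the page does not state.

```python
# Strong hits transcribed from the listing above:
# (domain number, regions, superfamily, superfamily E-value)
DOMAINS = [
    (1, [(3307, 3607)], "Trypsin-like serine proteases", 6.74e-136),
    (2, [(6478, 6661)], "S-adenosyl-L-methionine-dependent methyltransferases", 4.48e-75),
    (3, [(4024, 4173)], "Coronavirus NSP8-like", 3.01e-60),
    (4, [(6698, 6847)], "EndoU-like", 1.84e-58),
    (5, [(4832, 5134), (5175, 5319)], "DNA/RNA polymerases", 4.84e-53),
    (6, [(4298, 4419)], "Coronavirus NSP10-like", 5.23e-53),
    (7, [(4185, 4292)], "Replicase NSP9", 9.94e-41),
    (8, [(840, 953)], "NSP3A-like", 3.53e-36),
    (9, [(1289, 1444)], "Macro domain-like", 1.68e-33),
    (10, [(5618, 5940)], "P-loop containing nucleoside triphosphate hydrolases", 2.31e-30),
    (11, [(3897, 3985)], "Coronavirus NSP7-like", 2.09e-27),
]
SEQUENCE_LENGTH = 7151  # "Sequence length" reported on this page


def region_length(regions):
    """Total residues in a list of (start, end) pairs.

    Assumption: coordinates are 1-based and inclusive.
    """
    return sum(end - start + 1 for start, end in regions)


# Order the domains by the start of their first region.
by_position = sorted(DOMAINS, key=lambda d: d[1][0][0])
order = [d[0] for d in by_position]

# The regions listed above do not overlap, so a plain sum gives the coverage.
covered = sum(region_length(d[1]) for d in DOMAINS)

for number, regions, superfamily, evalue in by_position:
    span = ",".join(f"{s}-{e}" for s, e in regions)
    print(f"{span:<20} {region_length(regions):>4} aa  {superfamily} ({evalue:.2e})")
print(f"covered: {covered} of {SEQUENCE_LENGTH} residues ({covered / SEQUENCE_LENGTH:.1%})")
```

Sorting by position makes it easier to see which stretches of the polyprotein have no strong hit at all.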
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) H9AA28
Sequence length 7151
Comment (tr|H9AA28|H9AA28_9BETC) Polyprotein {ECO:0000313|EMBL:AFE48800.1} KW=Complete proteome OX=1160968 OS=Rabbit coronavirus HKU14. GN= OC=unclassified Betacoronavirus.
Sequence
MPKINKYGLELQWAPEFPWMFEDTEEKLDYPSSSEVGMVCPTTAQKLGSSGIFLKNHVMV
DCRRLVKYECCVQSNLIREINMNTGPDAMDAVIQEALQSRSAVLVTPPQRMSLDMYYKLG
CCPKGWTMGLFRRHCQCRTGNCNVETHVANQLYMIDPEGVCLGAGTFIGWFVPLFALPEC
VRDKTFPWVLYLRKHGEKGAHSKGHGDTFYVDYDFDVEDAYEEVHDNPKGKYSKKAYALL
RGYRGVKPLLYVDQYGCDYTGNLAEGLEAYAEFSLQEMKELFPIWSLSLPYDVVVAWHVV
RDPQFVMKLQSLATIRSIEYVAEPTEDLVDGTVVIKEPVHTLAPDAIVLKLPKLIDLMQH
TDSTVVESIYKTKLKHCGFVMQFGYVECSQDDCTFTGWVPGNMIDGFACTSCAFVYGTVD
LLAQSSGVMPQNPVLFTKGQAITNGDSFKLYGNSVIPFGGCLYWSPTPGVWLPLIKSSVK
AYDGMVYTGVVGCKTIVKETEAVCKALYLDYVQYKCSDLKQREGLGLADVWHKQLLINRG
DYQPLLDNVDYFSMRRARFSMETATVCSEGFMPFLLDGLVPRTYYLVKSGQAFCDMLCEF
GQEVADLSKELLVVTLDSVTSALQFLTLNVGRLTECLKGFGIKFVNKLIQYFKTATRCTA
LAFAWVLLHVLRGAYIVVESDIYFIMSIPDYARVVVRTFQNVFKMALDCVKVSFLKGLSA
FKIGREKICFVGSKFYKVERGNLNDLVRRDLVVPSVTQRAKNQQPVYLTGNCAPVNVDDD
VVEVVTNPVTSCGYQKPPQKCDKICIVDNVYMAKCGEKFFPVVVNEDYIGLLDQAWRFPC
AGKKISFVEEPSVKEIVTKKTVKVCFELDANFNTILDTACSVFEVDNTVDMEEFAAVVAD
AIEEKLTPCKELDGVGIKVSAFLQKLEDNNLHFFDEAGDCVLASKLYCTFSAPIDEDFGE
SEYEEGDVDVDETDSIVTSTSQEVCAGSGIPCGTDCNSIANGQEEASEVFEIAEVEDSIL
EELQVSIQAHDEVDVVSADSDSVLVHDYIDGVNYDTFYCDTVFDFYVAHKEPDFVKVLGV
YVPKATRNNCWLRSVLAVFQKLPCTFKDKNLQSLWLSYKQQFDQLFVDTIMQKIPANIVV
PQGGYVADFAYWLLTLCDWNAASHWRCLKCDLALNLQGLDAIFFYGDIVSHVCKCGESMV
LLKVDVPFTAHFAVKDKQFCSFTTQRRIFKAACVIDKNDRHSMAVIDGKQIDDKLVTDIN
SDKFDFIIGHEMAFSMSSFEIAQLYGCCITPNVCFVKGDVIRIAQLVYADVVVNPANGHM
AHGGGVAKAIANAAGQSFIKETANMVKSKGVCATGDCYVSSGGKLCKTVLNVVGPDARAQ
GKQCYALLEKTYKHLNKYDCSLTTLISAGIFSVPSDVSLTYLLGVVEKQVILVSNNKEDF
DLISKCQLTAVEGTRKFAERLSFNVGRTIKYETDANKLLISNDVVFVSTFNVLQDVNTLR
HDIKLDDDARVFVQSNMENLPTDWRIVNKFDQINGVRTVKYFECPGGIDICSQGKDFGYI
QQGSFYKATVSQIKALFVDKIDVLLTVDGVNFTTRYVPLGEVFGKTLGNVFCDAINVTKC
KAEQKYKGKVFFQFDNLSNADLKAVKSSFNFDQKELLAYYNVLVSCGKWQIVVNGKYFTF
KQANNNCFVNAACLMLQNVNLKFVSMQWQEAWLEFRAGKPLRFVALVYAKGAFKFGDPAD
SRDFIRVVLSQTDLAEAACDYEFVCKCGVKQEQRTGIDAVMHFGTLSREDLEKGYTIDCS
CGDKLIHCTRINVPFLICSNTPKDSVVPKGVTCANVFIGGNVGHYTHLRCDSSYQLFDAS
TVKKITTVNGKITDCLYLKNLKQTFRSVLTTYYLDDVKKVEYNPDLTQYYCEGGKYYTQR
IIKAQFRTFEKVDGVYTNFKLVGHTICDSLNAKLGFDANKHFEEFKVTEWPIATGDVVLV
TDDMYVKRYEKGCITFGKPVIWYNHDQASLNSLTYFNRPSLVDVNKFDVLKVDDVVAEVE
SCSSDLSHGSLSGSVSTGSSYLSPQGNQGSNVEAHTTVLASGNQGSNAINGSANSNKIVK
LNGVKKPFKVENSVVVNDSTSETKFVKSLSIVDVYDMWLTGSRYVVKTANALSAAVNVPT
IKKFIKFGMTLVSIPIDLLNLREIKPVFGAAKVVRDRVSDCYRFIKWLFVLLFGWIKISS
YNRVVYTTEIASKLTCKLVALAVKNALLTFKWSMVVRGFFLIATIFLLWFNFIYANVIFS
DFYLPKIGFLPTFVGSVVQWLKTTFGFYTLCDFYDTASIGFKNQYCNGSLACQLCVSGFD
MLDNYKAIDVVQYEADRRSVVDYTGMIKLIIELVVSYALYTVWFYPLFGLICLQMLTTWL
PEFFMVSSLHWFLRILVSVANLLPAHVFLRFYITVTFIFKIFSLFRHVIHGCNKAGCLFC
YKRNRSVRVKCSTVVGGMIRYYDVMANGGTGFCSKHQWNCINCDSYKPGNTFITVEAAAE
LSKELRRPVVPTDVAYHTVTDVKQVGCSMRLFYDRDGQRVYDDVNASLFVDYNNLLHSKV
KSVPNLHVVVVENDADKANFLNAAVFYAQSLFRPVLMVDKSLITTANTGTSVSQTMFDVY
VDTLLSMFDVDRKSLTSFINTAHSSIKEGVQLDKVLNTFISCARKSCSIDSDVDTKCVAD
SVMSAVAAGIELTDESYNNLVPTYIKSDNIVAADLGVLIQNSSKHVQGNVAKIAGVSCIW
SVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQSSNVSVLTTPFSLKGGAVFSYFVYSC
FVVSLICFIGLWCLMPTYSVHKSDFELPIYASYKVLDNGVIRDVSVNDVCFANKFEQFDA
WYESTFGLTYYSNSMACPIVVAVIDQDIGSTVFNVPTKVLRYGFHVLHFITHALSTDSVQ
CYTPHYQIPYSNFYDSGCVLSSACTMFAMSDGKPQPFCYTDGLMNNASLYSSLAPHVRYN
LANVKGYIRFPEVLREGLVRIVRTRSMTYCRVGLCEVSDEGICFNFNGSWVLNNDYYRSL
PGTFCGRDVFDLVYQFLSGLSQPVDFFALTASSIAGAILAIIVVLVFYYLIKLKRAFGDY
TSVVVVNVIVWFVNFLMLFVFQVYPTLSCIYAAFYFYITLYFPSEISVIMHLQWVVMYGS
IMPLWFSLLYIAIVISNHAFWVFSYCRKLGTGVRSDGTFEEMALTTFMITKDSYCKLKNS
LSDVAFNRYLGLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFTNNNGSDVLYQPPTASV
STSFLQSGIVKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSASDMTNPDY
PNLLCRVTSSDFTIMSDRMSLTVMSYQMQGCMLVLTVTLQNPRTPKYTFGVVKPGETFTV
LAAYNGRPQGAFHVTMRSSFTIKGSFLCGSCGSVGYVLMGDCVKFVYMHQLELSTGCHTG
TDFNGDFYGPYKDAQVVQLPVQDYVQSVNFVAWLYAAILNNCNWFVQSDRCSIEDYNVWA
MSNGFSQIKSDLVVDALASMTGVSLENLLAAIKRLHKGFQGRQIMGSCAFEDELTPSDVY
QQLAGVKLQSKRSRVIKGTICWVIASTFLFSCIITAFVKWTMFMYVTTHMLSVTVLALCC
VSFTMLLVKHKHLYLTMYIIPVLLTLLYNNYLVVYKHSFRGYVYAWLSHFMPSVDYTYTD
EVIYSIVLLFGMIFITMRSINHDVFSVIMFAGRVISTVSMWYIGSNLEEEVLLLLVSAFG
TYTWTTVLSLAVSKIIAKWISVNLLYFTDIPLIKLVLLSYLFVGYVVSCYWGLFSLMNKL
FRMPLGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPIIEVSQIQSKLT
DVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLSTAFEKLAQLLIVLFANPA
AVDSKCLSSIEEVCDDYAKDNTVLQALQSEFVNMASFVEYEVAKKNLDEARSSGSANQQQ
LKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSKVVSALQTMLFSM
VRKLDTQALNSILDNAVKGCVPLNAIPLLTANTLTIIVPDKQVFDQVVDNVYVAYAGNVW
HIQSVQDADGTNKQLNEISEDSNWPLVIVANRHNEVSQAVLQNNELMPAKLRTQIVNSGP
DMNCNTPTQCYYNNSNTGRIIYAILSDVDGLKYTKIVKEDGNCVVLELDPPCKFTVQDAK
GLKVKYLYFVKGCNTLARGWVVGTISSTVRLQAGTATEYASNSSILSLCAFSVDPKKTYL
DFIQQGGAPISNCVKMLCDHAGTGMAITVKPEATTSQDSYGGASVCIYCRARIEHPDVDG
LCKLRGKFVQVPVGIKDPVSYILTHDVCQVCGFWRDGSCSCVSTGAFVQSKDTNFLNRVR
GASVDARLVPCATGLSTDVQLRAFDICNASVAGIGLHLKVNCCRFQRLDESGNKMDRFFV
VKRTDLVTYNREMECYERVKGCRVVAEHDFFTFAVEGSRVPHIVRKDLTKYTMLDLCYAL
RHFDRNDCSLLCDILSMYAGCEQSYFTQKDWYDFVENPDIINVYKKLGPIFNRALVNTAE
FADALVEAGLVGVLTLDNQDLNGMWYDFGDYVVTAPGCGVAVADSYYSYMMPMLTMCHAL
DCELYVNNTYRQFDLVQYDFTDYKLELFNKYFKHWSMPYHPNTIDCQDDRCIIHCANFNI
LFSMVLPNTCFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMDVDTHRYRLSLKDLLLYA
ADPALHVASATALYDLRTCCFSVAAIASGVKFQTVKPGNFNQDFYDFILSKGLLKEGSSV
DLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGCIPASQVIV
NNYDKSAGYPFNKFGKARLYYEALSFDEQDDIYAYTKRNVLPTLTQMNLKYAISAKNRAR
TVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLM
GWDYPKCDRAMPNILRIVSSLVLARKHDACCTQSDRFYRLANECAQVLSEIVMCGGCYYV
KPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGNKIEDLSIRALQKRLYSHVYRSD
TVDPTFVTEYYEFLNKHFSMMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMS
EAKCWVENDINHGPHEFCSQHTMLVKMDGDDVYLPYPDPSRILGAGCFVDDLLKTDSVLL
IERFVSLAIDAYPLVYHENEEYQKVFRVYLEYIKKLYNDLGNQILDSYSVILSTCDGQKF
TDESFYKNMYLRSAVMQSVGACVVCSSQTSLRCGSCIRKPLLCCKCCYDHVMATDHKYVL
SVSPYVCNSPGCDVNDVTKLYLGGMSYYCEDHKPQYSFKLVMNGMVFGLYKQSCTGSPYI
DDFNRIASCKWTDVDDYILANECTERLKLFAAETQKATEEAFKQSYASATIQEIVSDREL
ILSWETGKVKPPLNKNYVFTGYHFTKNGKTVLGEYIFDKSELTNGVYYRATTTYKLSVGD
VFVLTSHSVANLSAPTLVPQENYSSIRFASVYSVLETFQSNVVNYQHIGMKRYCTVQGPP
GTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAYKFLNINDCTRIVPAKVRVDCYD
KFKINDTSRKYVFTTINALPEMVTDIVVVDEVSMLTNYELSVINARIRAKHYVYIGDPAQ
LPAPRVLLSKGSLEPKYFNTVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYDNKLKAK
NENSSLCFKVYFKGVTTHESSSAVNMQQIYLISKFLKANPLWHNAVFISPYNSQNFAAKR
VLGLQTQTVDSAQGSEYDYVIYSQTAETAHSVNVNRFNVAITRAKKGILCVMCNMQLFEA
LQFTALTLDKVPSKLQCTTNLFKDCSKSYIGYHPAHAPSFLAVDEKYKVNGDLAVCLGVG
DSSVTYSRLISLMGFKLDLTLEGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDNIGTNF
PLQLGFSTGIDFVVEATGLFAERDGYSFRKAVAKAPPGEQFKHLIPLMSQGQRWDVVRPR
IVQMFSDHLVDLADSVVLVTWAASFELTCLRYFAKIGKETCCNVCTNRATVYNSRTGYYG
CWRHSVSCDYLYNPLIVDIQQWGYVGSLSSNHDMYCSIHKGAHVASSDAIMTRCLAVYDC
FCNNINWNVEYPIISNELSINSSCRTLQRVMLKAAMLCNRYSLCYDIGNPKAIACIKGYD
FKFYDAQPIVKSVKTLFYSYEAHKDSFKDGLCMFWNCNVDKYPSNAVVCRFDTRVLNNLN
LPGCNGGSLYVNKHAFHTNPFSRAAFEYLKPMPFFYYSDTPCVYMDGMDNKQVDYVPLKA
ATCITKCNLGGAVCLKHAEEYREYLECYNTATTAGFTFWVYKTFDFYNLWNTFTKLQSLE
NVVYNLVKTGHYTGQTGEMPCAIINDKVVAKIEQEDVVIFTNNTTYPTNIAVELFAKRSV
RHHPELKLLRNLNIDVCWKHVIWDYVRQSIYCSNTYGVCTYTDLKFIDKLNVLFDGRDNG
ALEAFKRCENGVYISTTKIKSLQMIKGPPRAELNGVVVDKVGDTDVVFYFAMRKDGQDVI
FSHIDSLGVSPYWSPQGNPGGNGKPGNVGGNDALAQVTIFTQSRVISSFECRSDMEKDFI
ALDEEMFIQKYGLEDYAFDHIVYGSFNQKIIGGLHLLIGLFRRHQKSNLVVQEFVSYDSS
IHSYFITDDKSGSSKSVCTVVDILLDDFVALVKSLNLNCVSKVVNVNVDFKDFQFMLWCN
EEKVMTFYPRLQAASDWKPGYSMPVLYKYLTSPMERVNLWNYGKPITLPTGCMMNVAKYT
QLCQYLNTTTLAVPVNMRVLHLGAGSEKGVAPGSAVLRQWLPAGTILIDNDLYPFVSDSV
ATYFGDCITLPFECQWDLIISDMYDPITKNIGEYNVSKDGFFTYICHMIRDKLALGGSVA
IKITEFSWNAELYKLMGYFAFWTVFCTNANASSSEGFLIGINYLGKPKVDIDGNVMHANY
LFWRNSTVWNGGAYSLFDMAKFPLKLAGTAVINLKPDQINDMVYSLLEKGKLLIRDTNKE
VFVGDSLVNVI
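
To pull a domain's residues out of the wrapped sequence above, the lines first have to be joined into one string. The Python sketch below shows one way to do that and to slice a region out of the result. The helper names `join_sequence` and `extract_region` are illustrative, only the first two sequence lines are used as sample input, and the slicing assumes region coordinates are 1-based and inclusive.

```python
def join_sequence(lines):
    """Concatenate wrapped sequence lines, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def extract_region(sequence, start, end):
    """Return residues start..end.

    Assumption: coordinates are 1-based and inclusive, so they are
    shifted by one to fit Python's 0-based, end-exclusive slices.
    """
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# Sample input: the first two wrapped lines of the sequence above.
sample_lines = [
    "MPKINKYGLELQWAPEFPWMFEDTEEKLDYPSSSEVGMVCPTTAQKLGSSGIFLKNHVMV",
    "DCRRLVKYECCVQSNLIREINMNTGPDAMDAVIQEALQSRSAVLVTPPQRMSLDMYYKLG",
]
sample = join_sequence(sample_lines)

first_ten = extract_region(sample, 1, 10)
across_wrap = extract_region(sample, 58, 63)  # spans the line break

print(len(sample), first_ten, across_wrap)
```

With the full sequence joined the same way, `len()` of the result can be compared against the reported sequence length of 7151 before any region is extracted.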
Identical sequences H9AA28
YP_005454239.1.11934 gi|394935459|ref|YP_005454239.1|
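
The Comment field in the protein sequence record packs several annotations into one line as two-letter `KEY=value` pairs (`KW`, `OX`, `OS`, `GN`, `OC`). The Python sketch below splits that line into a dictionary. The regular expression and the name `parse_comment_fields` are illustrative and not part of SUPERFAMILY; the sketch assumes every tag is exactly two uppercase letters followed by `=` and that values never contain such a pattern themselves.

```python
import re

# Comment line copied from the protein sequence record above.
COMMENT = (
    "(tr|H9AA28|H9AA28_9BETC) Polyprotein {ECO:0000313|EMBL:AFE48800.1} "
    "KW=Complete proteome OX=1160968 OS=Rabbit coronavirus HKU14. "
    "GN= OC=unclassified Betacoronavirus."
)

# A tag is two uppercase letters and "="; its value runs lazily up to the
# next tag or the end of the line (assumed format, see the note above).
FIELD = re.compile(r"\b([A-Z]{2})=(.*?)(?=\s+[A-Z]{2}=|$)")


def parse_comment_fields(comment):
    """Return a dict mapping each two-letter tag to its (possibly empty) value."""
    return {key: value.strip() for key, value in FIELD.findall(comment)}


fields = parse_comment_fields(COMMENT)
for key, value in fields.items():
    print(f"{key}: {value!r}")
```

Note that `GN` is present but empty in this record, so it comes back as an empty string instead of being dropped.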
