SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A0U2GRF0 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  A0A0U2GRF0
Domain Number 1 Region: 2971-3270
Classification Level  Classification                           E-value
Superfamily           Trypsin-like serine proteases            7.19e-146
Family                Viral cysteine protease of trypsin fold  0.0000000000266
 
Domain Number 2 Region: 3672-3825
Classification Level  Classification         E-value
Superfamily           Coronavirus NSP8-like  8.11e-71
Family                Coronavirus NSP8-like  0.0000086
 
Domain Number 3 Region: 6116-6305
Classification Level  Classification                                        E-value
Superfamily           S-adenosyl-L-methionine-dependent methyltransferases  7.06e-68
Family                Nsp15 N-terminal domain-like                          0.0000657
 
Domain Number 4 Region: 6307-6458
Classification Level  Classification                E-value
Superfamily           EndoU-like                    2.35e-60
Family                Nsp15 C-terminal domain-like  0.0000228
 
Domain Number 5 Region: 3944-4065
Classification Level  Classification          E-value
Superfamily           Coronavirus NSP10-like  1.7e-55
Family                Coronavirus NSP10-like  0.0000096
 
Domain Number 6 Region: 4476-4773,4818-4957
Classification Level  Classification                E-value
Superfamily           DNA/RNA polymerases           7.91e-47
Family                RNA-dependent RNA-polymerase  0.026
 
Domain Number 7 Region: 3832-3938
Classification Level  Classification  E-value
Superfamily           Replicase NSP9  4.05e-45
Family                Replicase NSP9  0.0000974
 
Domain Number 8 Region: 3552-3634
Classification Level  Classification         E-value
Superfamily           Coronavirus NSP7-like  1.01e-33
Family                Coronavirus NSP7-like  0.00052
 
Domain Number 9 Region: 5276-5580
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  2.9e-27
Family                Nitrogenase iron protein-like                         0.081
 
Domain Number 10 Region: 1286-1435
Classification Level  Classification     E-value
Superfamily           Macro domain-like  3.53e-25
Family                Macro domain       0.00031
 
Domain Number 11 Region: 2153-2266
Classification Level  Classification      E-value
Superfamily           Pectin lyase-like   0.0000565
Family                Pectate lyase-like  0.027
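
The domains above are listed by decreasing significance, not by position along the polyprotein, and Domain 6 has a region made of two segments. The following Python sketch shows one way to parse the region strings, check them against the reported sequence length of 6763, and put the domains back in sequence order. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names parse_region, covered_residues and domains_in_sequence_order are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
from typing import List, Tuple

# "Sequence length" reported on this page for A0A0U2GRF0.
SEQUENCE_LENGTH = 6763

# (domain number, superfamily, region string), copied from the listing above.
DOMAINS = [
    (1, "Trypsin-like serine proteases", "2971-3270"),
    (2, "Coronavirus NSP8-like", "3672-3825"),
    (3, "S-adenosyl-L-methionine-dependent methyltransferases", "6116-6305"),
    (4, "EndoU-like", "6307-6458"),
    (5, "Coronavirus NSP10-like", "3944-4065"),
    (6, "DNA/RNA polymerases", "4476-4773,4818-4957"),
    (7, "Replicase NSP9", "3832-3938"),
    (8, "Coronavirus NSP7-like", "3552-3634"),
    (9, "P-loop containing nucleoside triphosphate hydrolases", "5276-5580"),
    (10, "Macro domain-like", "1286-1435"),
    (11, "Pectin lyase-like", "2153-2266"),
]


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split 'a-b,c-d' into [(a, b), (c, d)] as integer pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(segments: List[Tuple[int, int]]) -> int:
    """Count residues, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


def domains_in_sequence_order(domains):
    """Sort domains by the start of their first segment."""
    return sorted(domains, key=lambda d: parse_region(d[2])[0][0])


if __name__ == "__main__":
    for number, superfamily, region in domains_in_sequence_order(DOMAINS):
        segments = parse_region(region)
        # Every segment should fall inside the polyprotein.
        assert all(1 <= s <= e <= SEQUENCE_LENGTH for s, e in segments)
        print(number, region, covered_residues(segments), superfamily)
```

Sorting by the first segment start is a simplification; it is enough here because no two assigned regions begin at the same residue.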
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s) A0A0U2GRF0
Sequence length 6763
Comment (tr|A0A0U2GRF0|A0A0U2GRF0_CVH22) Polyprotein ORF1ab {ECO:0000313|EMBL:ALA50241.1} KW=Complete proteome OX=1699095 OS=Camel alphacoronavirus. GN=orf1ab OC=Nidovirales; Coronaviridae; Coronavirinae; Alphacoronavirus.
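
The comment line packs several fields into one string: a database prefix, accession and entry name in parentheses, a free-text description, and two-letter KEY=value tags. The sketch below shows one possible way to pull these apart in Python. The splitting rule (a tag begins at whitespace followed by two capital letters and "=") is an assumption inferred from this single record, not a documented format, and parse_comment is a hypothetical helper name.

```python
import re

# Comment text copied verbatim from the record above.
COMMENT = (
    "(tr|A0A0U2GRF0|A0A0U2GRF0_CVH22) Polyprotein ORF1ab "
    "{ECO:0000313|EMBL:ALA50241.1} KW=Complete proteome OX=1699095 "
    "OS=Camel alphacoronavirus. GN=orf1ab "
    "OC=Nidovirales; Coronaviridae; Coronavirinae; Alphacoronavirus."
)


def parse_comment(comment: str) -> dict:
    """Split the comment into identifiers, description and KEY=value tags."""
    match = re.match(r"\((\w+)\|(\w+)\|(\w+)\)\s*", comment)
    if match is None:
        raise ValueError("comment does not start with (db|accession|name)")
    database, accession, entry_name = match.groups()
    rest = comment[match.end():]

    # ASSUMPTION: each tag starts at whitespace followed by 'XX='.
    pieces = re.split(r"\s+(?=[A-Z]{2}=)", rest)
    tags = {}
    for piece in pieces[1:]:
        key, value = piece.split("=", 1)
        tags[key] = value.strip()

    return {
        "database": database,
        "accession": accession,
        "entry_name": entry_name,
        "description": pieces[0],
        "tags": tags,
    }


if __name__ == "__main__":
    record = parse_comment(COMMENT)
    print(record["accession"], record["tags"]["OS"])
```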
Sequence
MACNRVTLAVASDTEISATGCSTIALAVRRYSEAASNGFRACRFVSFGLHDCVVGIANDD
YVMGLHGNQTLSCNIMKFSDRPFMLRGWLVFSNSNYLLEEFDVVFGKRGGGNVTYTDQYL
CGADGKPVISDDLWQFVDHFGENEEIIINGHTYVCAWLTKRKPLDYKRQNNLAIEEIEYV
RGDALHTLRNGSVLEMAKEVKTSSKVVLSDALDKLYKVFGSPVMTNGSNILEAFIKPVFI
SAFVQCTCGNKSWSVGDWTGFKSTCCNVLSNKLCVVPGNVKPGDAVVTTQQAGVGVKYFC
GMTLKFVANIEGVSVWRVIAVQSVDGFVASATFVEEEHANRMDTFCFNVRNSTTDECRLA
MLGAEMTSNVRRQVAAGVIDISTGWFDVYDDIFAENKPWFVRKAEDIFGPCWSALVSVLK
QLKVTTGELMRFVKSICSSAVAVVSGTIQIVASVPDMFLPAFDVFVKAVQTVFDCAVETS
TIAGKSFDKVFDYVLLDNALVKLVTIKLKGVRASGLKTVKYATAVVGSTEEVKSSRVERS
TAVLTIANNYPKLSDEGYTAVIGDVAYFVSDGYFRLMASPNSVLTTAVYKPLFAFNVNVM
GTRPEKFPTIVTCENLESAVLFVNDKITEFQLDCSVDVIDNEIIVKPNISLCVPLYVRDY
VDKWDDFCRQYSNESWFEDDYRAFISVLDVADADVKAAESKAFIDTIIPSCPSILKIIDG
GKIWSGIIKAVSSVADWLKSLKLTLTPEGLFGTCAKRFKRFLTVLLDAYNAFLDTVASIV
KIGGKAFKKYAFDKPYIVVCDIVCKVEHKTDADWVELMPRNDRIKSFSTFENAYLPIADP
THFDIEEVELLDTEFVEPGCGGILALIDDHVFYKKDDIYYPSNGTKILPVAFTKAAGGKV
SFSDAVEVKDIPPVYRVKLCFEFEDEKLVDVCEKAIGEKIKHEGDWDSFCKTIQSALSVV
SSYVNLPTYYIYDEQGGTDLSLPVMISEWPLSESDKEEEVQQEQQEDTVVPEVEVVVDQV
EEVNSSFAIEAVDVKYEVSPFEMPFEELNGLKILKQMDNNCWVNSVMLQLQLTGILDDDY
AMQFFKIGRVSKMVERCYNAEQCIRGAMGDVGLCLYRLLKDLHTGFMVMDYKCSCTSGRL
EESGSVLFCTPTKKAFPYGTCLNCNAPRMCTIRQLQGTIIFVQQNPEPVNPCAFVVKPVC
ASVFRGAVSSGHYQINIYPQKLCVDGFGVNKIQPWPNDALNTICIRDANYSAKVEKPVTP
GKPPAELAPIDETVVKVKLNSFLTCNNVSFYQGDIDAVVNGVDFDFIVNAANENLAHCGG
LAKALDVYTKGKLQRLSKEHIGLAGKVKVGAGVMVECDGLRIFNVVGPRKGKHERDLLIK
AYNTINNEQGIPLTPILSCGIFGVKLETSLEVLFAVCNTKEVKVFVYTDTEVCKVKDFVS
GLVKVQKVEQPKIEPKSVSVTKVAPKPYKVDGKFSYFTDDLLCVAVGKPIVLFTDSMLTL
DDRGLALDNALNGVLSAAIKDCIDTNKAIPSGNLIKFDIESVVVYMCVVPSDQDKHLDKN
VQRCTRKLNRLMCDIVCTIPAEHVLPLLLSSLTCNVSFVGELKAVESKVITIKVTEDGVN
VHDVTVTTDKSFEQQVGVIAVKDKDLSGAVPSDLNTSELLTKAIDVDWVEFYGFGDAVTF
ATVDHSDFAYDSAVVNGFRVLKTSDNNCWVNAVCISLQYLKPHFISQGLDAAWNKFVLGD
VETFVAFIYYVAGLVKGAKGDAEDILNKLSKYLANEAQVQLEHYSSCVECEATFKNPVAS
VNSAIVCASVKRDGVQVGYCAHGIKYYSRVRSVSGRAIIFSVEQLEPCSQSRLLSGVAYT
AFSGPADNGHYTVYDTAKKSMYDGDRFVKHDLSLLSVTSVVMVGGYVAPVKTVKPKPVIN
QLDEKAQKFFDFGDFLVHNFVTFFTWLLSMFTLCKTAVTTCDVKIMAKAPQRTGVVLKRS
LKYNLKASTAVLKSKWWLLAKFMKLLLLIYTLYSVVLLGVLFGPFNLCSETVNGYAKSNF
VKDDYCDGSLGCKMCLFGYQELSQFSHLDVVWKHITDPLFSNMQPFIVMVLLLIFGDNYL
RCFLLYFVAQMISTVGVFLGYKETNWFLHFVPFDVICDELLVTVIVIKVISFVRHVLFGC
ENPDCIACSKSARLKRFPVNTIVNGVQRSFYVNANGGSKFCKKHRFFCVDCDSYGYGNTF
ITPEVSRELGNITKTNVQPTGPAYVMVDKVEFENGFYRLYSGEAFWRYNFDITESKYSCK
EVLKNCNVLDDFIVFNNNGTNVTQVKNASVYFSQLLCRPIKLVDSELLSTLSVDFNGVLH
KAYIDVLRNSFGKDLNANMSLAECKSALGLSISDHEFTSAISNAHRCDVLLSDLSFNNFV
SSYAKPDEKLSAYDLACCMRAGAKVVNANVLTKDQTPIVWHAKDFNSLSAEGRKYIVKTS
KAKGLTFLLTINENQAVTQIPATSIVAKQGAGDAGHSSTWLWLLCGLVCLIQFYLCFFMP
YFDTVRSFEGYDFKYIENGQLKNFEAPLKCVRNVFENFEDWHYAKFGFIPLNKQSCPIVV
GVSEIVNTVAGIPSNVYLVGKTLIFTLQAAFGNAGVCYDIFGVTTPEKCIFTSACTRLEG
LGGNNVYCYNTDLMEGSLPYSSIQANAYYKYDNGNFIKLPEVIAQGFGFRTVRTIATKYC
RVGECVDSNAGVCFGFDKWFVNDGRVDNGYVCGTGLWNLVFNILSMFSSSFSVAAMSGQI
LLNCALGAFAIFCCFLVTKFRRMFGDLSVGVCTVVMAVLLNNVSYIVTQNLVTMIAYAVL
YFFATRSLRYAWIWCAAYLIAYISFAPWWLCAWYFLAMLTGLLPSLLKLKVSTNLFEGDK
FVGTFESAAAGTFVIDMRSYEKLANSISPEKLKSYAASYNRYKYYSGNANEADYRCACYA
YLAKAMLDFSRDHNDILYTPPTVSYGSTLQAGLRKMAQPSGIVEKCVVRVCYGNTVLNGL
WLGDIVYCPRHVIASNTTAAIDYDHEYSIMRLHNFSINSGTAFLGVVGATMHGATLKIKV
SQTNMHTPRHSFKTLKSGEGFNILACYDGCAQGVFGVNMRTNWTIRGSFINGACGSPGYN
LKNGEVEFVYMHQIELGSGSHVGSSFDGVMYGGFEDQPNLQVESANQMLTVNVVAFLYAA
ILNGCIWWLKGDKLSVEHYNEWAQANGFTAMNGEDAFSILAAKTGVCVERLLHAIQVLNN
GFGGKNILGYSSLNDEFNINEVVKQMFGVNLQSGKTTSMFKSLSLFAGFFIMFWAELFVY
TTTVWVNPGFLTPFMILLVALSLCLTSFVKHKVLFLQVFLLPSIIVAAIQNCAWDYHVTK
VLAEKFDYNVSVMQMDIQGFVNIFICLFVALLHTWRFAKERCTHWCTYLFSLLAVLYTAL
YSYDYVSLLVMLLCAISNEWYIGAIIFRICRFGVACLPVAYVAYFGSVKTVLLFYMLLGF
VSCMYYGLLYWINRFCKCTLGVYDFCVSPAEFKYMVANGLNAPDGPFDALFLSFKLMGIG
GPRTIKVSTVQSKLTDLKCTNVVLMGILSNMNIASNSKEWAYCVETHNKINLCDNPETAQ
ELLLALLAFFLSKHSDFGLGDLVDSYFENDSILQSVASSFVGMPSFVAYETARQEYENAV
ANGSSPQIIKQLKKAMNVAKAEFDRESSVQRKINRMAEQAAAAMYKEARAVNRKSKVVSA
MHSLLFGMLRRLDMSSVDTILNMARNGVVPLSVIPATSASKLVVVVPDHDSFARMMVDGF
VHYAGVVWTLQEVKDNDGKNVHLKDVTKENQETLVWPLILTCERVVKLQNNEIMPGKMKV
KATKAEGDGGITSEGNALYNNEGGRAFMYAYVTTKPDMKYVKWEHDSGVVTVELEPPCRF
VVDTPTGPQIKYLYFVKNLNTLRRGAVLGYIGATVRLQAGKQTEFVSNSHLLTHCSFAVD
PAAAYLDAVKQGAKPVGNCVKMLTNGSGSGQAITSTIDSNTTQDTYGGASVCIYCRAHVA
HPTMDGFCQYKGKWVQVPIGTNDPIRFCLENTVCKVCGCWLNHGCTCDRTAIQSFDNSYL
KRVRGSSAARLEPCNGTDIDYCVRAFDVYNKDASFIGKNLKSNCVRFKNADKDDAFYIVK
RCIKSVMDHEQSMYNLLKGCNAVAKHDFFTWHEGRTIYGNVSRQDLTKYTMMDLCFALRN
FDEKDCEVLKEILVLTGCCGTDYFEMKNWFDPVENEDIHRVYAALGTVVANAMLKCVALC
DEMVLRGVVGVLTLDNQDLNGNFYDFGDFVLCPPGMGIPYCTSYYSYMMPVMGMTNCLAS
ECFMKSDIFGQDFKTYDLLKYDFTEHKLVLFNKYFKYWGQGYHPDCVDCYDEMCILHCSN
FNTLFATTIPNTAFGPLCRKVFIDGVPVVATAGYHFKQLGLVWNKDVNTHSTRLTITELL
QFVTDPALIVASSPALVDKRTVCFSVAALSTGLTSQTVKPGHFNKEFYDFLRSQGFFDEG
SELTLKHFFFTQKGDAAIKDFDYYRYNRPTMLDIGQARVAYQVASRYFDCYEGGCITSRE
VVVTNLNKSAGWPLNKFGKAGLYYESISYEEQDAMFALTKRNILPTMTQLNLKYAISGKE
RARTVGGVSLLATMTTRQFHQKCLKSIVATRNATVVIGTTKFYGGWDNMLKNLIADVDDP
KLMGWDYPKCDRAMPSMIRMLSAMILGSKHVTCCTASDKFYRLSNELAQVLTEVVYSNGG
FYFKPGGTTSGDATTAYANSVFNIFQAVSSNINRILSVNSSNCNNLNVKKLQKQLYDNCY
RNSNVDESFVDDFYGYLQKHFSMMILSDDGVVCYNKIYAELGYIADISAFKATLYYQNGV
FMSTAKCWTEEDLSVGPHEFCSQHTMQIVDENGKYYLPYPDPSRIISAGVFVDDITKTDA
VILLERYVSLAIDAYPLSKHPKPEYRKVFYALLDWVKYLNKTLNEGVLESFSVTLLDEQE
SKFWDESFYASMYEKSTVLQAAGLCVVCGSQTVLRCGDCLRKPMLCTKCAYDHVFGTDHK
FILAITPYVCNTSGCNVNDVTKLYLGGLNYYCVDHKPHLSFPLCSAGNVFGLYKSSALGS
IDVDVFNKLSTSDWSDIRDYKLANEAKESLRLFAAETVKAKEESVKSSYAYATLKEIVGP
KELLLSWESGKAKPPLNRNSVFTCFQITKDSKFQVGEFVFEKVDYGSDTVTYKSTATTKL
VPGMLFILTSHNVAPLRAPTMANQEKYSTIYKLHPSFNVSDAYANLVPYYQLIGKQRITT
IQGPPGSGKSHCSIGIGVYYPGARIVFTACSHAAVDSLCAKAATAYSVDKCTRIIPARAR
VECYSGFKPNNNSAQYVFSTVNALPEVNADIVVVDEVSMCTNYDLSVINQRISYKHIVYV
GDPQQLPAPRVLISKGVMEPIDYNVVTQRMCAIGPDVFLHKCYRCPAEIVNTVSELVYEN
KFVPVKEASKQCFKIFERGSVQVDNGSSINRRQLDVVKRFIHKNPTWSKAVFISPYNSQN
YVAARLLGLQTQTVDSAQGSEYDYVIFAQTSDTAHACNANRFNVAITRAKKGIFCIMSDR
TLFDALKFFEITMTDLQSENSCGLFKDCARNPIDLPPSHATTYLSLSDRFKTSGDLAVQI
GSNNVCTYEHVISYMGFRFDVSMPGSHSLFCTRDFAMRHVRGWLGMDVEGAHVTGDNVGT
NVPLQVGFSNGVDFVAQPEGCVVTNIGSVVKPVRARAPPGEQFTHLVPLLRKGQPWSVLR
KRIVQMIADYLAGSSDVLVFVLWAGGLELTTMRYFVKIGAVKHCQCGTVATCYNSVSNDY
CCFKHALGCDYVYNPYVIDIQQWGYVGSLSINHHAICNVHRNEHVASGDAIMTRCLAVYD
CFVKNVDWSITYPMIANEKAINRGGRTVQSHIMRAAIKLYNPKAIHDIGNPKGIRCAVTD
AKWYCYDKDPINSNVKTLEYDYMTHGQMDGLCLFWNCNVDMYPEFSIVCRFDTRTRSTLN
LEGVNGGSLYVNNHAFHTPAYDKRAMAKLKPAPFFYYDDGPCEVVHDQVNYVPLRATNCI
TKCNIGGAVCSKHANLYRAYVESYNTFTQAGFNIWVPTTFDCYNLWQTFTEVNLQGLENI
AFNVLKKGSFVCADGELPVAISGDKVFVRDGNIDNLVFVNKTSLPTNIAFELFAKRKVGL
TPPLSILKNLGVVATYKFVLWDYEAERPFTSFTKSVCGYTDFTEDVCTCYDNSIQGSYER
FTLSNNAVLFSATAVKAGGKSLPAIKLNFGMLNGNAIATVKSEDGNIKNVNWFVYVRKDG
KPVDHYDGFYTQGRNLQDFLPRSTMEEDFLNMDIGVFIQKYGLEDFNFEHVVYGDVSKTT
LGGLHLLISQVRLSKMGILKAEEFVSASDITLKCCTVTYLNDPSYKTVCTYMDLLLDDFV
AILKSLDLTVVSKVHEVIIDNKPWRWMLWCKDNAVATFYPQLQSAEWKCGYSMPGIYKTQ
RMCLEPCNLYNYGAGLKLPSGIMFNVVKYTQLCQYLNSTTLCVPHNMRVLHLGAGSDYGV
APGTAVLKRWLPHDAIVVDNDVVDYVSDADFSVTGDCATVYLEDKFDLLISDMYDGRTKA
IDGENVSKEGFFTYINGVICEKLAIGGSVAIKVTEYSWNKKLYELVQKFSFWTMFCTSVN
TSSSEAFVVGINYLGDFAKGPFIDGNIIHANYVFWRNSTVMTLSYNSVLDLSKFNCKHKA
TVVVQLKDGDINEMVLSLVRNGKLLVRGNGKCLSFSNHLVSTK
Identical sequences A0A0U2GRF0
