SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A166ZLH2 from UniProt 2018_03 genome


Domain assignment details

Strong hits

Sequence:  A0A166ZLH2
Domain Number 1 Region: 3305-3607
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.89e-118
Family Viral cysteine protease of trypsin fold 0.0000000641
 
Domain Number 2 Region: 6489-6673
Classification Level Classification E-value
Superfamily S-adenosyl-L-methionine-dependent methyltransferases 5.65e-74
Family Nsp15 N-terminal domain-like 0.00000506
 
Domain Number 3 Region: 4024-4176
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 5.62e-62
Family Coronavirus NSP8-like 0.00000498
 
Domain Number 4 Region: 6674-6825
Classification Level Classification E-value
Superfamily EndoU-like 4.71e-54
Family Nsp15 C-terminal domain-like 0.00000635
 
Domain Number 5 Region: 4300-4421
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 1.57e-51
Family Coronavirus NSP10-like 0.00000625
 
Domain Number 6 Region: 4841-5147,5185-5332
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.19e-50
Family RNA-dependent RNA-polymerase 0.017
 
Domain Number 7 Region: 848-960
Classification Level Classification E-value
Superfamily NSP3A-like 7.85e-38
Family NSP3A-like 0.00077
 
Domain Number 8 Region: 1169-1326
Classification Level Classification E-value
Superfamily Macro domain-like 5.05e-37
Family Macro domain 0.0000143
 
Domain Number 9 Region: 4187-4294
Classification Level Classification E-value
Superfamily Replicase NSP9 9.29e-32
Family Replicase NSP9 0.0000378
 
Domain Number 10 Region: 5642-5946
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 3.58e-31
Family Extended AAA-ATPase domain 0.059
 
Domain Number 11 Region: 3903-3985
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 3.14e-29
Family Coronavirus NSP7-like 0.00011
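
The strong hits above appear to be ordered by superfamily E-value rather than by position along the protein. As a minimal sketch, the Python below re-sorts them by start residue and counts how many of the 7132 residues they cover. The region strings and superfamily names are copied from the listing; the function names are hypothetical, and the code assumes regions are 1-based, inclusive, and non-overlapping.

```python
from typing import List, Tuple

# (domain number, superfamily, region) copied from the strong hits listed above.
DOMAINS: List[Tuple[int, str, str]] = [
    (1, "Trypsin-like serine proteases", "3305-3607"),
    (2, "S-adenosyl-L-methionine-dependent methyltransferases", "6489-6673"),
    (3, "Coronavirus NSP8-like", "4024-4176"),
    (4, "EndoU-like", "6674-6825"),
    (5, "Coronavirus NSP10-like", "4300-4421"),
    (6, "DNA/RNA polymerases", "4841-5147,5185-5332"),
    (7, "NSP3A-like", "848-960"),
    (8, "Macro domain-like", "1169-1326"),
    (9, "Replicase NSP9", "4187-4294"),
    (10, "P-loop containing nucleoside triphosphate hydrolases", "5642-5946"),
    (11, "Coronavirus NSP7-like", "3903-3985"),
]
SEQUENCE_LENGTH = 7132  # "Sequence length" field of this record


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region such as '4841-5147,5185-5332' into (start, end) segments."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def domains_in_sequence_order(domains):
    """Sort domains by the start residue of their first segment."""
    return sorted(domains, key=lambda d: parse_region(d[2])[0][0])


def covered_residues(domains) -> int:
    """Count residues inside any segment (assumes inclusive, non-overlapping)."""
    return sum(
        end - start + 1
        for _, _, region in domains
        for start, end in parse_region(region)
    )


if __name__ == "__main__":
    for number, superfamily, region in domains_in_sequence_order(DOMAINS):
        print(f"{region:>22}  domain {number:>2}  {superfamily}")
    covered = covered_residues(DOMAINS)
    print(f"covered {covered} of {SEQUENCE_LENGTH} residues")
```

Read in sequence order, the hits run from the NSP3A-like and Macro domains near the N-terminus to the EndoU-like domain near the C-terminus. Domain 6 is the only hit split into two segments.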
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) A0A166ZLH2
Sequence length 7132
Comment (tr|A0A166ZLH2|A0A166ZLH2_9NIDO) p1ab protein {ECO:0000313|EMBL:ANA96038.1} KW=Complete proteome OX=1508220 OS=Bat coronavirus. GN=ORF1ab OC=Nidovirales; Coronaviridae; Coronavirinae; unclassified coronaviruses.
Sequence
MLSKASVTTQGARGKYRAELYNEKRSDHVACTVPLCDTEDMACKLTPWFEDGETAFNQVS
SILKEKGKILFVPMHMQKAMKFLPGPRVYLVERLTGGMLSKHFLVNQLAYKDHVGAAMMR
TTLNAKPLGMFFPYDSSLETGEHTFLLRKSGLGGQFFRERPWDRKEAPYVEILDDLEADP
TGKYSQNLLKKLIGGDFTPVDQYMCGKNGKPIADYAKIVAKEGLTTLADIEADVQSRVDQ
DRFIVLNKKLYRIVWNVTRRNVPYPKQTAFTIVSVVQCDDKESVPEHTFTIGAEILIVSP
LKATNNKDFNLKQRLLYTFYGKDAVQQPGYIYHSAFADCSACGRGSWCTGNAIQGFACDC
GAIYSANDVELQSSGLVPKNALFLATCPCSVDGVCNHSAAQAYSVLDGKACVEAGGKSFT
LTFGGVVYAYMGCSDGTMYFIPRAKSCVSRIGDAIFTGCTGAWTKVLETAHLFLEKAQRS
LNFCQQFALTETVLAILSGTTSTFEELCDLCHNASYDKVRDHLVSRGFVVTVGDYVKDAI
NIGVNGICNATINAPFIVFTGLGESFKKVATIPWKVCGNLKQALDYYCFNINFRVFPYDI
PCDVNQFVELLLDCGKLTVATSYFVLRYLDEKVETVVNTVNTACQTALSSFLNACVSASK
ATIGFITDMFNLFKVLMHKLYVYTSCGYVAVAEHSSKFVQQVLDIMSKAMKLLHTKISWA
GAKVSAIIYEGRDALLFNSGTYFCVSTKAKALQDQMNLVLPGDYNRKILGILDPTPNADT
VDVNVNSTVVDVVHGQLEPTNEHGPSMIVGNYVLVSDKVFIRTEEEEFYPLCTNGKVVST
LFRLKGGMPLKKVTFGDVNTVEVTAFRSVSITYDIHPVLDALLSSSKLATFTVEKDLPVE
DFVDVIKDEVLTLLTPLLRGYDIDGFDVEDFIDVPCYVYNQDGDCAWSSNMVFSVNPVED
VDETNEFMEDNYLSDELPIDDDEEAWARAVEEVMPLDDALIAEIELEEELPLETALESVE
AEVGESVSDEVCVVETAEAQEPSVESIDYTPSTSAVVGENDSCVKPVPRVAETVDVLEVE
KAVVGGPASEVSNTETNDVVSVEQAQQCGSSSLSIQNEAHQILVSQAPEIQSVEVLCSET
TVAQSSEIIQHRQEKPKRSRKSKVDLSKYKHTAINNSVTLVLGDAIQIASLLPKCILVNA
ANRHLKHGGGIAGAINKASGGDVQEESDEYISNNGPLHVGDSVLLKGHGLADAILHVVGP
DARNNEDSVLLKRCYKAFNKHTTVVTPLISAGIFSVDPKVSFEYLVANVTTTTYVVVNNE
DIYNALATPSKPDGLVYSFEGWRGTVRTAKNYGFTCFICTEYSANVKFLRSKGVDINKKL
QTVDGVSYYLYSAKDSLTDVIAAANGCPGICAMPFGYVTHGLDLAQSGNYVRQVKVPYVC
LLASKEQIPIMNSDVAIQTPETAFINNVTANGGYHCWHLVSGDLIVKDVCYKKLLHWSGQ
TICYADNKFYVVKNDVALPFGDLEACRAYLTSRAAQQINIEVLVTIDGVNFRTVILNDTT
TFRNQLGATFYKGVDISDALPTVKMGGESLFVADNLSEAEKVVLKEYYGTSDITFLQRYY
SLQPLVQQWKFVVHDGVKSLKLSNYNCYINATIMMIDMLHDIKFVVPALQNAYLRYKGGD
PYDFLALIMAYGDCTFGNPDDEAKLLHTLLAKAELTVSAKMVWREWCTVCGVRDIEYTGM
RACVYAGVNSIEELQSVFNETCVCGSVKHRRLVEHSVPWLLVSGLNEVKVSTSTDPAYRA
FNVFQGLETSVGHYLHVRVKDGLFYKYDSGSLTKTSDMKCKMTSVCYPRVRYTADCNVVV
YDLDGVTKVEVNPDLSNYYMKDGKYYTSKPTTKYSPATILPGSVYSNSCLVGADGTPGSD
TISKFFNNLLGFDETKLIAKKLTYSLLPNESGDVLLDEFSNYNPLYKKGAMFKGRPILWV
INGACDSTLNKPNRASLRQLYDVAPIVLDNKYTVLQDNTPQTQEVKAPGVEDVSIITRKL
IEVKCKGLNKPFVKGNFSFVNDPNGVTVVDTLGLTELRALYVDINTRYIVLRDNNWSSLF
KLHTVESGDLQVVANGGSVTRGARVLLGASSLFASFAKITVTATTAACKTAGRGFCKFVV
NYGVLQNMFVFLKMLFFLPFNYLWPKKQPTVDVGVSGLRIAGVVTTNIVKQCGTAAYYML
LGKFKRVDWKATLRLFLLLCTTVLLLSSIYHLIIFNQVLSSDVMLEDATGILAIYKEVRS
YLGISTLCDGLVAEYRNTSFDVVDFCSNRSVLCQWCLIGQDSLTRYSALQMLQTHITSYV
LNIDWIWFALEFLLAYVLYTSSFNVLLLVVTAQYFFAYTSAFVNWRAYNYIVSGIFFLVT
HIPLHGLVRVYNFLACLWFLRKFYSHVINGCKDTACLLCYKRNRLTRVEASTIVCGTKRT
FYIAANGGMSYCCKHNWNCVDCDTAGVGNTFICTEVANDLTTSLRRLIKPTDQSHYYVDS
VVVKDAVVELHYNRDGSSCYERYPLCYFTNLEKLKFKEVCKTPTGIPEHNFLIYDTNDRG
QENLARAACVYYSQVLCKPMLLVDANLVTTVGDSREIAIKMLDSFINSFISLFSVSRDKL
EKLINTARDCVRRGDDFQNVLKTFTDAARGHAGVESDVETTTVVDALQYAHKNDIQLTTE
GYNNYVPGYIKPESINTLDLGCLIDLKAASVNQTSMRNANGACVWNSGDYMKLSDSFKRQ
IRIACRKCTIPFRLTTSKLRAADNILSVKFSATKIVGGAPNWLLRVRDLTVKGYCILTLV
IFSFAVLSWFCLPSYSIATVNFNDDRILTYKVIENGIVRDIAPNDACFANKYGHFSKWFN
ENHGGVYRNSVDCPITVAVIAGVAGTRVANVPANLAWVGRQIVLFVSRVFANTNVCFTPT
NEIPYDTFSDSGCVLSSECTLFRDAEGNLNPFCYDPTVLPGASSYADMKPHVRYDMYDSD
MYIKFPEVIFESTLRITKTLATQYCRFGSCEESDAGVCISTNGSWALYNQNYSTRPGIYC
GDDYLDIVRRLAVSLFQPVTYFQLSTSLAMGLVLCVFLTAAFYYINKVKRALADYTQCAV
VAVVAALLNSLCLCFIVANPLLVAPYTAMYYYATFYLTGEPAFIMHISWYVMFGTVVPIW
MLASYTVGVMLRHLFWVLAYFSKKHVDVFTDGKLNCSFQDAASNIFVIGKDTYVALRNAI
TQDSFVRYLSLFNKYKYYSGAMDTASYREACAAHLCKALQTYSETGSDILYQPPNCSVTS
SVLQSGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLTDPNYDA
LLISKTNHSFIVQKHIGAQANLRVVAHTMVGVLLKLTVDVANPSTPAYTFSTVKPGTSFS
VLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGSVGYTENGGVLNFVYMHQMELSNGTHT
GSSFDGVMYGAFEDKQTHQLQLTDKYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEW
ALSNQFTEFVGTQSIDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDV
NMQVMGVVMQSGVKRISYGFMHWLLSTLVLAYVSVMQLTKFTMWTYLFETIPTQMTPLLL
GFMACVMFTVKHKHTFLSLFLLPVALCLTYANIVYEPQTLVSSTLIAVANWLTPTSVYMR
TTHLDFGLYISLSFVLAIIVRRLYRPSMSNLALALCSGVMWFYTYVIGDHSSPITYLMFI
TTLTSDYTITVFVIVNLAKFISGMVFLYAPHLGFILPEVKLVLLVYLCLGYMCTMYFGVF
SLLNLKLRVPLGVYDYSVSTQEFRFLTGNGLHAPRNSWEALILNFKLLGIGGTPCIKVAT
VQSKLTDLKCTSVVLLTVLQQLHLESNSKAWSYCVRLHNEILAAVDPTEAFDKFVCLFAT
LMSFSANVDLEALANDLFETSSVLQATLTEFSHLATYAELETAQSSYQKALNSGDASPQV
LKALQKAVNVAKNAYEKDKAVARKLERMAEQAMTSMYKQARAEDKKAKIVSAMQTMLFGM
IKKLDNDVLNGVIANARNGCVPLSIVPLCASNKLRVVIPDISVWNKVVNWPSVSYAGSLW
DITVINNVDNEVVKPTDVVETNESLTWPLVIECSRASSSAVKLQNNEIQPKGLKTMVITA
GVDQVNCNSSAVAYYEPVQGHRMVMGLLSDNAHLKWAKVEGKDGFVNIELQPPCKFLIAG
PKGPEIRYLYFVKNLNNLHRGQLLGHIAATVRLQAGANTEFASNSTVLTLVAFAVDPAKA
YLDYVGSGGTPLSNYVKMLAPKTGTGVAISVKPEATADQETYGGASVCLYCRAHIEHPDV
SGVCKYKTRFVQIPSHIRDPVGFLLKNVQCNVCQYWVGYGCNCDALRNNTIPQSKDTNFL
NRVRGSSVNARLEPCSSGLTTDVVYRAFDICNFKAKVAGIGKYYKTNTCRFVQVDDEGHK
LDSYFIVKRHTMSNYELEKRCYDLLKDCDAVAVHDFFIFDVDKTKTPHIVRQCLTEYTMM
DLVYALRHFDQNNCEVLKSILVKYGCCEQSYFDNKLWFDFVENPSVIGVYHKLGERVRQA
MLSTVKMCDHMVKNGLVGVLTLDNQDLNGKWYDFGDFVITQPGAGVAIVDSYYSYLMPVL
SMTNCLAAETHKDCDFNKPLVEWPLLEYDYTDYKIDLFNKYFKHWDQAYHPNCVNCGDDR
CILHCANFNVLFSMVLPNTSFGPIVRKIFVDGVPFIVSCGYHYKELGLVMNMDVNIHRHR
LALKELMMYAADPAMHIASASALWDLRTPCFSVAALTTGLTFQTVRPGNFNKDFYDFVVS
RGFFKEGSSVTLKHFFFAQDGHAAITDYSYYAYNLPTMVDIKQMLFCMEVVDKYFDIYDG
GCLNASEVIVNNLDKSAGHPFNKFGKARVYYESMSYQEQDELFAVTKRNVLPTITQMNLK
YAISAKNRARTVAGVSILSTMTNRQYHQKMLKSMAATRGATCVIGTTKFYGGWDFMLKTL
YKDVESPHLMGWDYPKCDRAMPNMCRILASLILARKHSTCCTNSDRFYRLANECAQVLSE
YVLCGGGYYVKPGGTSSGDATTAYANSVFNILQATTANVSALMSANGNTIVDREIKDMQF
DLYINVYRKVVPDPKFIDRYYAFLNKHFSMMILSDDGVVCYNSDYASKGYVASIQNFKET
LYYQNNVFMSEAKCWVETDLEKGPHEFCSQHTLYIKDGDDGYFLPYPDPSRILSAGCFVD
DIVKTDGTVMMERYVSLAIDAYPLTKHDDTEYQNVFWVYLQYIEKLYKDLTGHMLDSYSV
MLCGDDSAKFWEEGFYRDLYSAPTTLQAVGSCVVCHSQTSLRCGTCIRRPFLCCKCCYDH
VIATTHKMVLSVSPYVCNAPGCDVSDVTKLYLGGMSYYCNDHRPVCSFPLCANGLVFGLY
KNMCTGSSSIMDFNRLATCDWSDSGDYTLANTTTEPLKLFAAETLRATEEASKQSYAIAT
IKEIVGERELILVWEVGKSKPPLNRNYVFTGYHLTKNSKVQLGEYVFERIDYSDAVFYKS
STTYKLAVGDIFVLTSHSVATLSAPTIVNQERYLKITGIYPTITVPEEFANHVVNFQKAG
FSKYVTVQGPPGTGKSHFAIGLAIYYPTARIVYTACSHAAVDALCEKAFKYLNIAKCSRI
IPAKARVECYDRFKVNDTNAQYLFSTVNALPETSVDILVVDEVSMCTNYDLSIINARVKA
KHIVYVGDPAQLPAPRTLLTRGTLEPENFNSVTRLMCNLGPDIFLSVCYRCPKEIVSTVS
ALVYNNKLSAKKDASGQCFKILFKGSVTHDASSAINRPQLNFVKTFITANPNWSKAVFIS
PYNSQNAVARSMLGLTTQTVDSSQGSEYPYVIFCQTADTAHANNLNRFNVAVTRAQKGIL
CVMTSQVLFDSLEFAELSLNNYKLQSQIATGLFKDCSREESGLPPAYAPTYLSVDAKYKT
TDELCVNLNITPNVTYSRVISRMGFKLDATIPGYPKLFITRDEAIRQVRSWVGFDVEGAH
ASRNACGTNVPLQLGFSTGVNFVVQPVGVVDTEWGSMLTTIAARPPPGEQFKHLVPLMHK
GATWPIVRRRIVQMLSDTLDKLSDYCTFVCWAHGFELTSASYFCKIGKEQRCCMCSRRAS
TFSSPLQSYACWSHSSGYDYVYNPFFVDVQQWGYIGNLATNHDRYCGIHAGAHVASSDAI
MTRCLAIYDCFIERVDWDITYPYISHEQKLNSCCRIVERNVVRSAVLSGKFDKIYDIGNP
KGIAIISDPVEWHFYDAQPLSNKVKKLFYTDDVSKQFEDGLCLFWNCNVSKYPSNAVVCR
FDTRVHSEFNLPGCNGGSLYVNKHAFHTPAYDINAFRDLKPLPFFYYSTTPCEVHGNGNM
LEDIDYVPLKSAVCITACNLGGAVCRKHAAEYRDYMEAYNIVSAAGFRLWVFKTFDIYNL
WSTFVKVQGLENIAFNVIKQGHFTGVDGELPVAVVNDKIFTKNGTDDVCIFKNETALPTN
VAFELYAKRAVRSHPDLNLIRNLEVDVCYNFVLWDYGRNNIYGTTTIGVCKYTDIDVNPN
LNICFDIRDKGSLERFMSMPNGVLISDRKIKNYPCISGPKYAYFNGAILRNVDAKQPISF
YLYKKVNNEFVSFSNTFYTCGRTVEDFSALTPMEEDFLVLDSDVFIKKYNLEDYAFEHVV
YGDFSHTTLGGLHLLIGLYKKVCDGHILMEEMLKDRATVHNYFITDSKTASYKAVCSVID
LKLDDFVNIIKEMDLGVVSKVVKVPIDLTMIEFMLWCKGGKVQTFYPRLQAANDWKPGLT
MPSLFKVQQMNLEPCLLANYKQSIPMPNGVHMNVAKYMQLCQYLNTCTLAVPANMRVIHF
GAGCEKGVAPGTSVLRQWLPLDAVLIDNDLNEFVSDADITIFGDCVTVHVGQQVDLLISD
MYDPCTKAVGEVNQTKALFFVYLCNFIKNNLALGGSVAIKITEHSWSADLYKIMGRFAYW
TVFCTNANASSSEGFLIGINFLGELREDIDGNVMHANYIFWRNSTPMNLSTYSLFDLSRF
PLKLKGTPVLQLKESQINELVISLLSQGKLLIRDNDTLNVSTDVLVNFRKRL
Identical sequences A0A166ZLH2
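
The Comment field in this record packs several annotations into one line: a database/accession/entry-name triple in parentheses, followed by two-letter `KEY=value` pairs. The sketch below shows one way to pull these apart. The function name and the regular expressions are illustrative assumptions, not part of SUPERFAMILY or UniProt tooling. Reading `OX` as the taxonomy identifier, `OS` as the organism, `GN` as the gene name, `OC` as the lineage, and `KW` as a keyword follows the usual UniProt conventions, which this page does not itself state.

```python
import re
from typing import Dict

# Comment field copied verbatim from the record above.
COMMENT = (
    "(tr|A0A166ZLH2|A0A166ZLH2_9NIDO) p1ab protein "
    "{ECO:0000313|EMBL:ANA96038.1} KW=Complete proteome OX=1508220 "
    "OS=Bat coronavirus. GN=ORF1ab OC=Nidovirales; Coronaviridae; "
    "Coronavirinae; unclassified coronaviruses."
)


def parse_comment(comment: str) -> Dict[str, str]:
    """Extract the identifier triple and the KEY=value pairs from a comment line."""
    fields: Dict[str, str] = {}

    # Leading "(db|accession|entry_name)" triple.
    ids = re.match(r"\((\w+)\|(\w+)\|(\w+)\)", comment)
    if ids:
        fields["db"], fields["accession"], fields["entry_name"] = ids.groups()

    # Each two-letter KEY=value pair runs until the next KEY= or the end of the line.
    pairs = re.findall(r"\b([A-Z]{2})=(.*?)(?=\s+[A-Z]{2}=|$)", comment)
    for key, value in pairs:
        fields[key] = value.strip()
    return fields


if __name__ == "__main__":
    for key, value in parse_comment(COMMENT).items():
        print(f"{key}: {value}")
```

The lookahead keeps multi-word values such as the `OC` lineage intact, since a value ends only where the next two-letter key begins.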