SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org.

Domain assignment for I1TMH0 from Uniprot 2018_03 genome

Domain assignment details

Strong hits

Sequence:  I1TMH0
Domain Number 1 Region: 3332-3632
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.71e-137
Family Viral cysteine protease of trypsin fold 0.000000101
 
Domain Number 2 Region: 6504-6691
Classification Level Classification E-value
Superfamily S-adenosyl-L-methionine-dependent methyltransferases 3.99e-74
Family Nsp15 N-terminal domain-like 0.00000000678
 
Domain Number 3 Region: 6722-6872
Classification Level Classification E-value
Superfamily EndoU-like 1.45e-60
Family Nsp15 C-terminal domain-like 0.000000144
 
Domain Number 4 Region: 4049-4198
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 2.35e-59
Family Coronavirus NSP8-like 0.00000637
 
Domain Number 5 Region: 4856-5159,5200-5344
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 8.4e-55
Family RNA-dependent RNA-polymerase 0.021
 
Domain Number 6 Region: 4323-4444
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 6.15e-52
Family Coronavirus NSP10-like 0.00000674
 
Domain Number 7 Region: 4210-4317
Classification Level Classification E-value
Superfamily Replicase NSP9 1.06e-42
Family Replicase NSP9 0.0001
 
Domain Number 8 Region: 830-945
Classification Level Classification E-value
Superfamily NSP3A-like 1.06e-37
Family NSP3A-like 0.00036
 
Domain Number 9 Region: 1318-1474
Classification Level Classification E-value
Superfamily Macro domain-like 4.37e-34
Family Macro domain 0.00018
 
Domain Number 10 Region: 5656-5962
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.36e-30
Family Tandem AAA-ATPase domain 0.068
 
Domain Number 11 Region: 3922-4009
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 7.45e-25
Family Coronavirus NSP7-like 0.00032
 
Domain Number 12 Region: 980-1163
Classification Level Classification E-value
Superfamily S-adenosyl-L-methionine-dependent methyltransferases 6.15e-24
Family Nsp15 N-terminal domain-like 0.041
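The strong hits above are listed by increasing superfamily E-value rather than by position along the polyprotein, and Domain Number 5 spans two segments. The following is a minimal Python sketch, not part of the SUPERFAMILY server, that parses the region strings and reorders the hits by start position. The function names `parse_region` and `covered_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '4856-5159,5200-5344' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_length(segments: List[Tuple[int, int]]) -> int:
    """Total residues covered, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# (domain number, region, superfamily) copied from the strong hits listed above.
hits = [
    (1, "3332-3632", "Trypsin-like serine proteases"),
    (2, "6504-6691", "S-adenosyl-L-methionine-dependent methyltransferases"),
    (3, "6722-6872", "EndoU-like"),
    (4, "4049-4198", "Coronavirus NSP8-like"),
    (5, "4856-5159,5200-5344", "DNA/RNA polymerases"),
    (6, "4323-4444", "Coronavirus NSP10-like"),
    (7, "4210-4317", "Replicase NSP9"),
    (8, "830-945", "NSP3A-like"),
    (9, "1318-1474", "Macro domain-like"),
    (10, "5656-5962", "P-loop containing nucleoside triphosphate hydrolases"),
    (11, "3922-4009", "Coronavirus NSP7-like"),
    (12, "980-1163", "S-adenosyl-L-methionine-dependent methyltransferases"),
]

# Order the hits by the start of their first segment (N- to C-terminus).
ordered = sorted(hits, key=lambda hit: parse_region(hit[1])[0][0])

for number, region, superfamily in ordered:
    length = covered_length(parse_region(region))
    print(f"{region:>20}  {length:>4} aa  domain {number:>2}  {superfamily}")
```

Sorting by the first segment's start is enough here because none of the listed regions overlap.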
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by the dcGO Predictor.

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) I1TMH0
Sequence length 7176
Comment (tr|I1TMH0|I1TMH0_9BETC) 1ab polyprotein {ECO:0000313|EMBL:AFG25758.1} KW=Complete proteome OX=31632 OS=Rat coronavirus. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus.
Sequence
MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRV
DYSRLPALECCVQSAIIRDIFVDEDPQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLG
VLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGVCLGNGRFIGWFVPVTAIPEY
AKQWLQPWSILLRKGGNKGSVTSGHRRAVTMPVYHFNVEDACEEVHLNPKSEYSRKAYTL
LKGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRDSLDNEVVVAWHV
DRDPRAVMRLQTLATLRSIDYVGQPTEDVVDGDVVVREPAHLLAADAVVKRLPRLVETML
YTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCDFRGWVPGNMLDGFPCPGCSKSYMPW
ELEAQSSGVIPEGGVLFTQSTDTVNREAFKLYGHAVVPFGSAVYWSPYPGMWLPVVWSSV
KSYSGLTYTGVVGCKAIVQETDAICRSLYMDYVQHKCGNLDQRATLGLDDVYHRQLLVNR
GDYSLLLENVDLFVKRRAEFACKFATCGDGFVPLLLDGLVPRSYYLIKSGQAYTSMMVNF
SHEVIDMCMDMALLFMHDVKVATKYVKKFTGKLAVRFKALGVAVVRKITEWFDLAVDIAA
SAAGWLCYQLVNGLFAVANGVITFVQEAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTV
VKTASNRVCLAGSKFYEVVQKSLSAYVLPVGCSEATCLVGESEPAVFEDDVVGVVKTPLT
YQGCCKPPTSFEKICIVDKLYMAKCGDQFYPVVVDNDTVGVLDQCWRFPCAGKKVVFNDK
PKVKEVPSTRKIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCK
EHDVIGTKVCALLDRLAEDYVYLFDEGGDEVIAPRMYCSFSAPDDEDCVAADVVDADENQ
DDDADDSVVLVADAQEDGVAKEQVEVDSEICVAHTGGQDELTEPDAVGSQTPIASAEETE
VGEASDREGIAEAKRTVCADALDACPDQVEAFEIEEVEDSILDELQTELNAPADRTYEDV
LAFDAIYSEALSAVYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLEM
QKLWLSYKAGYDQCFVDKLVKSVPRSIILPQGGYVADFAYYFLSQCSFKAHANWRCLKCD
MELKLQGLDAMFFYGDVVSHMCKCGSGMTLLSADIPYTLHFGVRDDKFCAFYTPRKAFRA
ACAVDVNDCHSMAVVDGKLIDGKNVTKFTGDKFDFMVGHGMTFSMSPFETAQLYGSCITP
NVCFVKGDVIKVARLVEAEVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETADMVKNQGV
CLVGECYESAGGKLCKKVLNIVGPDAPRQGRQCYSLLERAYQHINKCDNVVTTLISAGIF
SVPTDVSLTYLLGVVTKNVILVSNNKDDFDVIEKCQVTSVAGTKALSLQLAKNLCRDVKF
VTNACDSLFGASCFVASYDVLQEVELLQHDIQLDDDARVFVQANMDCLPTDWRLVNKLDV
VDGVRTIKHFECPGEIFVSSQGKKFGYVQNGLFKVASVSQIRALLANKVDVLCTVDGVNF
RSCCVAEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSEADLVAVKDAFGFD
EPQLLKYYNMLGMCKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWN
EFRSGKPLRFVSLVLAKGSFKFNEPSDSTDFIRVVVREADLSGATCDLEFICKCGVKQEQ
RKGVDAVMHFGTLDKSDLVKGYNIACTCGSKLVHCTQFNVPFLICSYTPEGRKLPDDVVA
ANIFTGGSLGHYTHVKCKPKYQLYDACNVSKVSEAKGNFXDCLYLKNLKQTFSSXLTTYY
LDDVKCVEYKPDLSQYYCESGNIIQNPLLRPNLEHLEKVDGVYTNFKLVGHSIAEKLNAK
LGFDCDSPFVEYKITEWPTATGDVVLASDDLYVSRYLSGCITFGKPVVWLGHEEASLKSL
TYFNRPSVVCENKFNVLPVDVSEPTDKEPVPAAVLVTGVPSADASRDAGTAKEQKACASD
NVEEQVVTEVRQEPSVSSVDVKEVKLNGVKKPVKVEDSVVVNDPTSDTXVVKSLSIVDVY
DMFLTGCKYVVWTANELSRLVNSPTVREYMKWGMGKIVIPTKLLLLRDERQEFVAPKVVK
AKAIACYGAVKWFFFYCFSWIKFNTGNKVIYTTEVASKLTFKLCCLAFKNALQTFNWSVV
SRGFFLVATVFLLWFNFLYANVILSDFYLPNIGSLPTFVGQIVAWFKTTLGVSTICDFYQ
VTDLGYRSSFCNGSMVCELCFSGFDMLDNYDAINVVQHVIDRRVSFDYISILKLVVELII
GYSLYTVCFYPLFVLIGMQLLTTWLPEFFMLETMHWSARLFVFVANMLPAFTLLRFYIVV
TAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTK
HQWNCLNCDSWKPGNTFITFEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYER
DGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVENEADKAGFLGAAXFYAQSLYRPM
LMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKNLTSFVNAAHNSLKEGVQLEQV
MDTFVGCARRKCAIDSDVETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVAADL
GVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQE
ANVPILTTPFSLKGGAVFSKFLQWLFVANLICFIVLWALMPTYAVHKSDMQLPLYASFKV
IENGVLRDVSVTDACFANKFNQFDQWYESTFGLAYCRNSKACPVVVAVIDQDIGHTLFNV
PTKVLRHGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTVLAHADGTPH
PYCYTEGVMHNASLYSSLVPHVRYNLASSNGYIRFPEVVSEGIVRVVRTRSMTYCRVGLC
EEAEEGICFNFNSSWVLNNPYYRAIPGTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVA
GAILAIIVVLAFYYLIKLKRAFGDYTSVVVINVIVWCINFLMLFVFQVYPTLSCLYACFY
FYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCIIYVAVVVSNHALWLFSYCRKIGTEVRS
DGTFEEMALTTFMITKESYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREAACSQ
LAKAMETXNHNNGNDVLYQPPTASVTTSFLQSGIVKMVSPTSKVEPCVVSVTYGNMTLNG
LWLDDKVYCPRHVICSSDDMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSXQMQGSLLVL
TVTLQNLNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVG
YVLTGDSVRFVYMHQLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLY
AAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKADLVLDALASMTGVTSGEVLAAIKRL
HSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQSKRTRVIKGTCCWILASTFLFCSIIA
AFVKWTMFMYVTTHMLGVTLCALCFVSFAMLLIKHKHLYLTMYIMPVLCTLFYTNYLVVY
KQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLVAMVFVTMRSINHDVFSIMFLVGRLV
SLVSMWYFGANLEEEVLLFLTSLFGTYTWTTMLSLATAKVIAKWLAVNVLYFTDVPQIKL
VLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPRNSFEAL
VLNFKLLGIGGVPVIEVSQIQSRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEI
LATSDLSVAFDKLAQLLVVLFANPAAVDSKCLASIEEVSDDYVRDNTVLHALQSEFVNMA
SFVEYELAKKNLDEAKASGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTN
MYKEARINDKKSKVVSALQTMLFSMVRKLDNQALNSILDNAVRGCVPLNAIPSLTSNTLT
IIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDADGAVKQLNEIDVNSIWPLVIAANRHNE
VSTVVLQNNELMPQKLRTQVVNSGSDMNCNTPTQCYYNTTGTGKIVYAILSDCDGLKYTK
IVKEDGNCVVLELDPPCKFSVQDVKGLKIKYLYFVKGCNTLARGWVVGTLSSTVRLQAGT
ATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATT
NQDSYGGASVCIYCRSRVEHPDVDGLCKLRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWR
DGSCSCVGTGSQFQSKDTNFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIG
LYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKEKECYELTKECGVVAEHEFFTFDV
EGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWYDFV
ENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTV
PGCGVAVADSYYSYMMPMLTMCHALDSELYVNGTYREFDLVQYDFTDFKLELFNKYFKHW
SMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFGPLVRQIFVDGVPFVVSIGYHYKE
LGVVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKFQTV
KPGNFNQDFYEFILSKGLLKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLL
FVLEVVNKYFEIYEGGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAY
TKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIG
TTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCSHTD
RFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMAC
NGHKIEDLSIRELQKRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEF
ASKGYIANISAFQQVLYYQNNVFMSEAKCWVETDIEKGPHEFCSQHTMLVKMDGDEVYLP
YPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVYLEYIKK
LYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQSVGACVVCSSQTSLRCGS
CIRKPLLCCKCSYDHVMATDHKYVLSVSPYVCNSPGCDVNDVTKLYLGGMSYYCEDHKPQ
YSFKLVMNGMVFGLYKQSCTGSPYIEDFNKIASCKWTEVDDYALANECTERLKLFAAETQ
KATEEAFKQCYASATIREIVSDRELILSWEIGKVRPPLNKNYVFTGYHFTNNGKTVLGEY
VFDKSELTNGVYYRATTTYKLSVGDVFILTSHAVSSLSAPTLVPQENYTSIRFASVYSVP
ETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCE
KAHKFLNINDCTRIVPAKVRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSML
TNYELSVINSRVRAKHYVYIGDPAQLPAPRVLLNKGTLEPRYFNSVTKLMCCLGPDIFLG
TCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKGQTTHESSSAVNMQQIHLISKF
LKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVN
RFNVAITRAKKGILCVMSSMQLFESLNFTTLTLDKINNPRLQCTTNLFKDCSKSYIGYHP
AHAPSFLAVDDKYKVGGDLAVCLNVADSAVTYSRLISLMGFKLDLTLDGYCKLFITRDEA
IKRVRAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAERDGYVFKKAAAR
APPGEQFKHLVPLLSRGQKWDEVRIRIVPMVSDHLVDLADSVVLVTWAASFELTCLRYFA
KVGKEVVCSVCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIVDIEQWGYTGSLTSNHDP
ICSVHKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIISNEVSVNTSCRLLQRVMFRA
AMLCNRYDVCYDIGNPKGLACVKGYDFKFYDASPVVKSVKQFVYKYEAHKDLFLDGLCMF
WNCNVDKYPANAVVCRFDTRVLNKLNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPF
FYYSDTPCVYMEGMDSKQVDYVPLRSATCITRCNLGGAVCLKHAEEYREYLESYNTATTA
GFTFWVYKTFDFYNLWNTFAKLQSLENVVYNLVNAGHFDGRAGELPCAVIGEKVIAKIQN
EDVVVFKNNTPFPTNVAVELFAKRSIRPHPELKLFRNLNIDVCWSHVLWDYTKDSVFCSS
TYKVCKYTDLQCIESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKSLSMIKGPQRADLN
GVVVEKVGDSDVEFWFAMRRDGDDVIFSRIGSLEPSHYRSPQGNPGGNRVADLSGNEALA
RGTIFTQSRFLSSFAPRSEMEKDFMDLEEDVFIAKYSLQDYAFEHVVYGSFNQKIIGGLH
LLIGLARRQKRSNLVIQEFVPYDSSIHSYFVTDENSGSSKSVCTVIDLLLDDFVDIVKSL
NLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPRLQAAADWKPGYVMPVLYKYLESPLE
RVNLWNYGKPITLPTGCLMNVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPGSA
VLRQWLPAGSILVDNDVNPFVSDSVASYYGNCITLPFDCQWDLIISDMYDPLTKNIGEYN
VSKDGFFAYLCHLIRDKLALGGSVAIKVTEFSWNAELYSLMGKFAFWTIFCTNVNASSSE
GFLIGINWLNRTRTEIDGKTMHANYLFWRNSTMWNGGAYSLFDMSKFPLKAAGTAVVSLK
PDQINDLVLSLIEKGKLLVRDTRKEVFVCDSLVNVK
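The sequence above is wrapped over 120 lines. If each of the first 119 lines holds 60 residues, the final 36-residue line brings the total to 119 × 60 + 36 = 7176, which agrees with the stated sequence length. The sketch below shows one way to rejoin the wrapped lines and cut out a domain region. The helper names `join_sequence` and `slice_region` are hypothetical, and the slicing assumes the region coordinates are 1-based and inclusive, which the page does not confirm.

```python
from typing import Iterable


def join_sequence(lines: Iterable[str]) -> str:
    """Concatenate wrapped sequence lines, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# Toy input: the first 12 residues of the sequence above, split over two lines.
wrapped = ["MAKMGK ", "YGLGFK"]
sequence = join_sequence(wrapped)

print(len(sequence))                  # length to compare against "Sequence length"
print(slice_region(sequence, 1, 3))   # first three residues
print(sequence.count("X"))            # number of unknown residues (X)
```

With the full 7176-residue sequence loaded, `slice_region(sequence, 3332, 3632)` would return the segment assigned to Domain Number 1 under the same coordinate assumption.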
Identical sequences I1TMH0
