SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A1Y0EV22 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  A0A1Y0EV22
Domain Number 1 Region: 3247-3547
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.57e-132
Family Viral cysteine protease of trypsin fold 0.000000114
Further Details:      
 
Domain Number 2 Region: 3964-4113
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 1.57e-61
Family Coronavirus NSP8-like 0.00000731
Further Details:      
 
Domain Number 3 Region: 4238-4359
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 6.8e-53
Family Coronavirus NSP10-like 0.0000075
Further Details:      
 
Domain Number 4 Region: 4125-4232
Classification Level Classification E-value
Superfamily Replicase NSP9 4.58e-40
Family Replicase NSP9 0.0000767
Further Details:      
 
Domain Number 5 Region: 850-966
Classification Level Classification E-value
Superfamily NSP3A-like 1.23e-35
Family NSP3A-like 0.00037
Further Details:      
 
Domain Number 6 Region: 1273-1428
Classification Level Classification E-value
Superfamily Macro domain-like 1.68e-35
Family Macro domain 0.00016
Further Details:      
 
Domain Number 7 Region: 3837-3925
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 2.75e-27
Family Coronavirus NSP7-like 0.00021
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) A0A1Y0EV22
Sequence length 4383
Comment (tr|A0A1Y0EV22|A0A1Y0EV22_CVHOC) Replicase polyprotein 1a {ECO:0000313|EMBL:ARU07607.1} OX=31631 OS=Human coronavirus OC43 (HCoV-OC43). GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus. OH=9606
Sequence
MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDMICSTTAQKLETDGICPENHVMV
DCRRLLKQECCVQSSLIREIVMNASPYYLEVLLQDALQSREAVLVTTPLGMSLEACYVRG
CNPKGWTMGLFRRRSVCNTGRCTVNKHVAYQLYMIDPTGVCLGAGQFVGWVIPLAFMPVQ
SRKFIVPWVMYLRKRGEKGAYNKDHGCGGFGHVYDFKVEDAYDQVHDEPKGKFSKKAYAL
IRGYRGVKPLLYVDQYGCDYTGSLADGLEAYADKTLQEMKALFPTWSQELPFDVIVAWHV
VRDPRYVMRLQSAATICSVAYVANPTEDLCDGSVVIKEPVHVYADDSIILRQYNLFDIMS
HFYMEADTVVNAFYGVALKDCGFVMQFGYIDCEQDSCDFKGWIPGNMIDGFACTTCGHVY
EVGDLIAQSSGVLPVNPVLHTKSAAGYGGFGCKDSFTLYGQTVVYFGGCVYWSPARNIWI
PILKSSVKSYDSLVYTGVLGCKAIVKETNLICKALYLDYVQHKCGNLHQRELLGVSDVWH
KQLLINRGVYKPLLENIDYFNMRRAKFSLETFTVCADGFMPFLLDDLVPRAYYFAVSGQA
FCDYADKLCHAVVSKSKELLDVSLDSLGAAIHYLNSKIVDLAQHFSDFGTSFVSKIVHFF
KTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVKNIPRYASAVAQAFQSVAKVVLDSLRV
TFIDGLSCFKIGRRRICLSGRKIYEVARGLLHSSQLPLDVYDLTMPSQVQKAKQKPIYLK
GSGSDFSLADSVVEVVTTSLTPCGYSEPPKVADKICIVDNVYMAKAGDKYYPVVVDDHVG
LLDQAWRVPCAGRRVTFKEQPTVKEIISMPKIIKVFYELDNDFNTILNTACGVFEVDDTV
DMEEFYAVVIDAIEEKLSPCKELEGVGAKVSAFLQKLEDNPLFLFDEAGEEVFAPKLYCA
FTAPEDDDFLEESDVEEDDVEGEETDLTITSAGQPCVASEQEESSEVLEDTLDDGPSVET
SDSQVEEDVEMSDFVDLESVIQDYENVCFEFYTTEPEFVKVLGLYVPKATRNNCWLRSVL
AVMQKLPCQFKDKNLQDLWVLYKQQYSQLFVDTLVNKIPANIVLPQGGYVADFAYWFLTL
CDWQCVAYWKCIKCDLALKLKGLDAMFFYGDVVSHICKCGESMVLIDVDVPFTAHFALKD
KLFCAFITKRIVYKAACVVDVNDSHSMAVVDGKQIDDHRITSITSDKFDFIIGHGMSFSM
TTFEIAQLYGSCITPNVCFVKGDIIKVSKLVKAEVVVNPANGHMVHGGGVAKAIAVAAGQ
QFVKETTNMVKSKGVCATGDCYVSTGGKLCKTVLNVVGPDARTQGKQSYVLLERVYKHFN
NYDCVVTTLISAGIFSVPSDVSLTYLLGTAKKQVVLVSNNQEDFDLISKCQITAVEGTKK
LAARLSFNVGRSIVYETDANKLILINDVAFVSTFNVLQDVLSLRHDIALDDDARTFVQSN
VDVLPEGWRVVNKFYQINGVRTVKYFECTGGIDICSQDKVFGYVQQGIFNKATVAQIKAL
FLDKVDILLTVDGVNFTNRFVPVGDSFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDN
LSSEDLKAVRSSFNFDQKELLAYYNMLVNCFKWQVVVNGTYFTFKQANNNCFVNVSCLML
QSLHLTFKIVQWQEAWLEFRSGRPARFVALVLAKGGFKFGDPADSRDFLRVVFSQVDLTG
AICDFEIACKCGVKQEQRTGLDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDVPFL
ICSNTPASVKLPRGVGSANIFIGDNVGHYVHVKCEQSYQLYDASNVKKVTDVTGKLSDCL
YLKNLKQTFKSVLTTYYLDDVKKIEYKPDLSQYYCDGGKYYTQRIIKAQFKTFEKVDGVY
TNFKLIGHTVCDSLNSKLGFDSSKEFVEYKITEWPTATGDVVLANDDLYVKRYERGCITF
GKPVIWLSHEKASLNSLTYFNRPLLVDDNKFDVLKVDDVDDSGDSSESGVKETKEINIIK
LSGVKKPFKVEDSVIVNDDTSETKYVKSLSIVDVYDMWLTGCKYVVRTANALSRAVNVPT
IRKFIKFGMTLVSIPIDLLNLREIKPAVNVVKAVRNKTSACFNFIKWLFVLLFGWIKISA
DNKVIYTTEIASKLTCKLVALAFKNAFLTFKWSMVARGACIIATIFLLWFNFIYANVIFS
DFYLPKIGFLPTFVGKIXXXXXXTFSLVTICDLYSIQDVGFKNQYCNGSIACQFCLAGFD
MLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWL
PELFMLSTLHWSFRLLVALANMLPAHVFMRFYIIIASFIKLFSLFKHVAYGCSKSGCLFC
YKRNRSLRVKCSTIVGGMIRYYDVMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALD
LSKELKRPIQPTDVAYHTVTDVKQVGCSMRLFYDRDGQRIYDDVNASLFVDYSNLLHSKV
KSVPNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVY
VDTFLSMFDVDKKSLNALIATAHSSIKQGTQIYKVLDTFLSCARKSCSIDSDVDTKCLAD
SVMSAVSAGLELTDESCNNLVPTYLKSDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIW
SVDAFNQFSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKGGAVFSYFVYVC
FVLSLVCFIGLWCVMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQ
WYESTFGLSYYSNSMACPIVVAVIDQDFGSTVFNVPTKVLRYGYHVLHFITHALSADGVQ
CYTPHSQISYSNFYASGCVLSSACTMFTMADGSPQPYCYTDGLMQNASLYSSLVPHVRYN
LANAKGFIRFPEVLREGLVRVVRTRSMSYCRVGLCEEADEGICFNFNGSWVLNNDYYRSL
PGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDY
TSVVFVNVIVWCVNFMMLFVFQVYPTLSCVYAICYFYATLYFPSEISVIMHLQWLVMYGT
IMPLWFCLLYIAVVVSNHAFWVFSYCRKLGTTVRSDGTFEEMALTTFMITKDSYCKLKNS
LSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFTNNNGSDVLYQPPTASV
STSFLQSGIVKMVNPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSASDMTNPDY
TNLLCRVTSSDFTVLFDRLSLTVMSYQMRGCMLVLTVTLQNSRTPKYTFGVVKPGDTFTV
LAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVGYVIMGDCVKFVYMHQLELSTGCHTG
TDFNGDFYGPYKDAQVVQLPIQDYIQSVNFLAWLYAAILNNCNWFIQSDKCSVEDFNVWA
LSNGFSQVKSDLVIDALASMTGVSLETLLAAIKRLKNGFQGRQIMGSCSFEDELTPSDVY
QQLAGIKLQSKRTRLFKGTVCWIMASTFLFSCIITAFVKWTMFMYVTTNMFSITFCALCV
ISLAMLLVKHKHLYLTMYITPVLFTLLYNNYLVVYKHTFRGYVYAWLSYYVPSVEYTYTD
EVIYGMLLLVGMVFVTLRSINHDLFSFIMFVGRLISVFSLWYKGSNLEEEILLMLASLFG
TYTWTTVLSMAVAKVIAKWVAVNVLYFTDIPQIKIVLLCYLFIGYIISCYWGLFSLMNSL
FRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQSKLT
DVKCANVVLLNCLQHLHVASNSKLWHYCSTLHNEILATSDLSVAFEKLAQLLIVLFANPA
AVDSKCLTSIEEVCDDYAKDNTVLQALQSEFVNMASFVEYEVAKKNLDEARFSGSANQQQ
LKQLEKACNIAKSAYERDRAVAKKLERMADLALTNMYKEARINDKKSKVVSALQTMLFSM
VRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLNIIVPDKSVYDQIVDNIYVTYAGNVW
QIQTIQDSDGTNKQLNEISDDCNWPLVIIANRYNEVPATVLQNNELMPAKLKIQVVNSGP
DQTCNTPTQCYYNNSNNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDAK
GLKIKYLYFVKGCNTLARGWVVGTISSTVRLQAGTATEYASNSSILSLCAFSVDPKKTYL
DFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRARVEHPDVDG
LCKLRGKFVQVPVGIKDPVSYVLTHDVCRVCGFWRDGSCSCVSTDTTVQSKDTNFLNAFG
VRV
Download sequence
Identical sequences A0A1Y0EV22

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]