SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for E0AGX5 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  E0AGX5
Domain Number 1 Region: 2887-3186
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.15e-143
Family Viral cysteine protease of trypsin fold 0.0000000000598
Further Details:      
 
Domain Number 2 Region: 3603-3756
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 1.44e-67
Family Coronavirus NSP8-like 0.0000121
Further Details:      
 
Domain Number 3 Region: 3877-3998
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 8.11e-55
Family Coronavirus NSP10-like 0.00001
Further Details:      
 
Domain Number 4 Region: 3763-3871
Classification Level Classification E-value
Superfamily Replicase NSP9 2.88e-41
Family Replicase NSP9 0.00028
Further Details:      
 
Domain Number 5 Region: 508-645
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 6.41e-34
Family Coronavirus NSP10-like 0.036
Further Details:      
 
Domain Number 6 Region: 3483-3565
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 7.98e-34
Family Coronavirus NSP7-like 0.00034
Further Details:      
 
Domain Number 7 Region: 1321-1479
Classification Level Classification E-value
Superfamily Macro domain-like 4.71e-27
Family Macro domain 0.00047
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) E0AGX5
Sequence length 4025
Comment (tr|E0AGX5|E0AGX5_9ALPC) Polyprotein orf1a {ECO:0000313|EMBL:ADL71474.1} KW=Complete proteome OX=871681 OS=Feline coronavirus UU24. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Alphacoronavirus.
Sequence
MSSKQFKILVNEDYQVNVPSLPFRDVLQEIKYCYRNGFEGYVFVPEYRRDLVDCDRKNHY
VIGVLGNGISDLKPVLLTEPSVMLQGFIVKANCNGVLEDFDLKIARTGNGAIYVDQYMCG
ADGKPVIEGEFKDYFGDEDVIFYEGEEYHCAWSTVRDEKPLCQQTLLTIKEIQYNLDIPH
KLPNCAIREVAPPVKKNSKVVLSEEYRKLYDIFGSPFMNNGDSLNKCFDNLHFIAATLKC
PCGAESSGVGDWTGFKTACCGLHGKVKGVTLGAVKPGDAVVTSMSAGKGVKFFANSVLQY
AGDVENVSVWKVIKTFTVNETVCTTDFEGELNDFIKPESTSLVSCSIKKAFITGEVDDAV
HDCIITGKLDFSTNLFGSANLLFKKTPWFVQKCGAIFADAWKVVEELLCSLKLTYKQIYD
VVASLCTSAFTIMDYKPVFVVSSNSVKDLVDKCVKILVKAFDVFTQTITIAGVEAKCFIL
GSKYLLFNNALVKLVSVKILGKRQKGLDSAFFATNLIGATVNVTPQRTEAANISLTKVDD
VVTPGEGHIVIIGDMAFYKSDEYYFMMASPDSVLVNNVFKAARVPSYNIVYNVDDDTKSK
MVVKLGTSFDFDGDLDAAIVKVNDLLIEFRQEKLCFRALKDGDNILVEAHLKKYKMPACL
KNHVGLWDIIRRDSDKKGFLDTFNHLNELEDVRDTNVQAIKNILCPDLLLELDFGAIWYR
CMPTCSDISILGNVKIMLGNGVKVVCDGCNSFAKRLTISYNKLCDTARKDIEIGGIPFST
FKTPSSNFIDMKDAIYSVVEYGEAFSFKTASVPVTNSGTVTTDDWSDPILLEPADYVEPK
DNGDVIVIAGYTFYKDEDDHFYPYGSGMVVQKMYNKMGGGDKSVSFSDNVNIREIEPVTR
VRLEFEFDNEVVTQVLEKVIGTKYKFIGTTWEEFEASISEKLDNIFDTLAEQGVELEGYF
IYDTCGGFDINNPDGVMISQYDLNTADDVKSDSDASMEDTSSISDNEAVEQIEEENVSTV
AVEEETVSVADVEDSIEQVTFVEELTKPDEQLSSVEEKVEVSAKNDPWAAAVGEQEAEQP
KPSLTPFRTTNLNGKIILKQQDNNCWINACCYQLQAFDFFNHDLWDGFKKDDVMPFVDFC
YAALTLKQGDSGDAEYLLEMVLNDYSTAKVTLSAKCGCGVKEIVLERTVFKLTPLKSEFK
YGVCGDCKQINMCRFASVVGSGVFVHDRIEKQTPVSQFIVPPTMHAVYTGTTQSGHYMIE
DCIHDYCVDGMGIKPRKHKFYTSTLFLNANVMTADFKTKVEPPAPVKEKCVEECQSPKDL
ITPFYKAGKVSFYQGELDVLINFLEPDVVVNAANGDLRHIGGVAKAIDVFTGGKLTKRSK
DYLKSNKPLTPGNVVLFENVLEHLSVLNAVGPRNGDSRVEGKLCNVYKAIAKCDGKILTP
LVSVGIFKVKLEVSLQCLLKTVTDRELSVFVYTDQERIAIENFFNGXIPVKVTEDNVNQK
RVSVALDKTYGEQLKGTVVIKDKDVTDQLPSVSDAGEKVVKALDVDWNAYYGFPNAAAFS
ASSHDAYKFDVVTHNNFIVHKQTDNNCWVNAICLALQRLKPTWKFPGVKSLWDDFLTRKT
AGFVHMLYYISGLKKGQPGDAELTLHKLGELMLSDSAVTVTHSTACDKCAKVETFTGPVV
AAPLMIYGTDETCVHGVSVNVKVTSVRGTVAITSLIGPVVGDVIDATGYICYTGLNSRGH
YTYYDHRNGLMIDAEKSYHFEKNLLQVTTAIASNFVSNTPKKETLPTNLVKEPNTARVFS
EVEETPKNIVRKEKLLAIESGVDYTITTFGKCADAFFMTGDRILRFLLEVFKYLLVVFMC
LRKSKMPKIKVKPPHVFKDIGAKARTLNYVRQLNKPALWRYGKLVLLLIALYHFFYLFVS
IPVMHKLVCSSSVQAYSNSSFVKSEVCGNSILCKACLASYDELADFDHLQVSWDYKSDPL
WNRVIQLSYFIFLAVFGNNYVRCLLMYFVSQYLNLWLSYFGYXKYSWFLHVVNFESISVE
FVIIVVVFKAVLALKHIFVPCNNPSCKTCSKIARQTRIPIQVVVNGSMKTVYVQANGTGK
LCKKHNFYCKNCDSYGFDHTFICDEIVRELSNSIKQTVYATDRSYQEVTKVECSDGFYRF
YVGEEFTAYDYDVKHKKYSSQEVLKTMFLLDDFIVYSPSGSSLASVRNVCVYFSQLIGRP
IKIVNSDLLEDLSVDFKGALFNAKKNVIKNSFNVDVSECKNLEECYKACNLDVTFSTFEM
AVNNAHRFGVLITDRSFNNFWPSKIKPGSSGVTAMDIGKCMTFDAKIVNAKVLTQRGKSV
VWLSQDFSALSSTAQKVLVKTFVEEGVNFSLTFNAVGSDEDLPYERFTESVSAKNGSGFF
DVLKQLKQLFWCFVLFIIIYGLCSVYSVVTQSYVDSAEGYDYMVIKNGVVQPFDDSINCV
HNTYKGFGVWFKAKYGFVPTFDKSCPIVLGTVFDLGNMRPIPDVPAYVALVGRSLVFAIN
AAFGVTNVCYDHTGAAVSENSYFDTCVFNSACTTLAGLGGTIVYCAKQGLVEGSRLYSEL
MPDYYYEHASGNMVKIPTIVRGFGLRFVKTQATTYCRVGECTESQAGFCFGGDNWFVYDK
EFGDGYICGSSTLGFFKNVFALFNSNMSVIATSGAMLANIVIACFAIAVCYGVLKFKKIF
GDCTLLVVMIIVTLVVNNVSYFVTQNTFFMIVYAIVYYFTTRKLAYPGILDAGFIIAYVN
MAPWYVLVLYIMVFLYDSLPSLFKLKVTTNLFEGDKFVGSFESAAMGTFVIDMRSYETLV
NSTSLDRIKSYANSFNKYKYYTGSMGEADYRMACYAHLGKALMDYSVSRNDMLYTPPTVS
VNSTLQSGLRKMAQPSGVVEPCIVRVAYGNNVLNGLWLGDEVICPRHVIASDTSRVINYE
NELSSVRLHNFSIAKNNVFLGVVSAKYRGVNLVLKVNQVNPNTPEHKFKSVRPGESFNIL
ACYEGCPGSVYGVNMRSQGTIKGSFIAGTCGSVGYVLENGTLYFVYMHHLELGNGSHVGS
NLEGEMYGGYEDQPSMQLEGTNVMSSDNVVAFLYAALINGERWFVTNASMSLESYNAWAK
TNSFTEIVSTDAFNMLAAKTGYSVEKLLECIVRLSKGFGGRTILSYGSLCDEFTPTEVIR
QMYGVNLQGGKVKSLFYPVMTVMTILFSFWLEFFMYTPFTWINPTFVSVILAITTLVSVI
LVAGIKHKMLFFMSFVMPSVVLATAHNVVWDMTYYESLQVLVENVNTTFLPVDMQGIMLA
LFCVVVFVTYTIRFFTCKQSWFSLFVTTVFVVFNIVKLLGMVGEPWTEDHILLCLVNMLT
MLISLTTKDWFVVFASYKFAYYIVVYVMNPAFVQDFGFVKCVSIIYMACGYFFCCYYGIL
YWVNRFTCMTCGVYQFTVSPAELKYMTANNLSAPKTAYDAMILSVKLMGIGGERNIKIST
VQSKLTEMKCTNVVLLGLLSKMHVESNSKEWNYCVGLHNEINLCDDPEVVLEKLLALIAF
FLSKHNTCDLSDLIDSYFENTTILQSVASAYAALPSWIAYEKARADLEEAKKNDVSPQLL
KQLTKACNIAKSEFEREASVQKKLDKMAEQAAASMYKEARAVDRKSKIVSAMHSLLFGML
KKLDMSSVNTIIEQARNGVLPLSIIPAASATRLIVVTPNLEVLSKVRQENNVHYAGAIWS
IVEVKDANGAQVHLKEVTAANELNITWPLSITCERTTKLQNNEILPGKLKEKAVKASATI
DGDAYGSGKALMASECGKSFIYAFIASDSNLKYVKWESNNDVIPIELEAPLRFYVDGVNG
PEVKYLYFVKNLNTLRRGAVLGYIGATVRLQAGKPTEHPSNSGLLTLCAFAPDPAKAYVD
AVKRGMQPVTNCVKMLSNGAGNGMAITNGVESNTQQDSYGGASVCIYCRCHVEHPAIDGL
CRFKGKFVQVPTGTQDPIRFCIENEVCVVCGCWLNNGCMCDRTSMQGSTIDQSYLNECGV
LVQLD
Download sequence
Identical sequences E0AGX5
E0AGX5_9NIDO

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]