SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for S5Y3P7 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  S5Y3P7
Domain Number 1 Region: 3350-3650
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 6.29e-132
Family Viral cysteine protease of trypsin fold 0.000000107
 
Domain Number 2 Region: 4067-4216
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 2.35e-60
Family Coronavirus NSP8-like 0.00000742
 
Domain Number 3 Region: 4341-4462
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 7.98e-54
Family Coronavirus NSP10-like 0.00000738
 
Domain Number 4 Region: 808-922
Classification Level Classification E-value
Superfamily NSP3A-like 2.48e-40
Family NSP3A-like 0.00042
 
Domain Number 5 Region: 4228-4335
Classification Level Classification E-value
Superfamily Replicase NSP9 3.27e-40
Family Replicase NSP9 0.00011
 
Domain Number 6 Region: 1374-1531
Classification Level Classification E-value
Superfamily Macro domain-like 2.52e-36
Family Macro domain 0.00018
 
Domain Number 7 Region: 3940-4028
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 6.02e-25
Family Coronavirus NSP7-like 0.00027
 
Weak hits

Sequence:  S5Y3P7
Domain Number - Region: 1253-1328,1437-1452,1628-1661
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.0816
Family Growth factor receptor domain 0.027
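The hits above are listed by E-value rather than by position, so it can help to reorder them along the polyprotein. The following is a minimal sketch, not part of the SUPERFAMILY server or any of its APIs: the `Hit` record, `ordered_strong`, and `covered_residues` are hypothetical names introduced here, and the regions are assumed to be 1-based and inclusive. The values are copied from the listing above.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """Hypothetical record for one superfamily-level assignment."""
    superfamily: str
    segments: tuple   # ((start, end), ...), assumed 1-based inclusive
    evalue: float     # superfamily-level E-value from the listing
    strong: bool      # True for "Strong hits", False for "Weak hits"


SEQ_LEN = 4486  # "Sequence length" reported for S5Y3P7

HITS = [
    Hit("Trypsin-like serine proteases", ((3350, 3650),), 6.29e-132, True),
    Hit("Coronavirus NSP8-like", ((4067, 4216),), 2.35e-60, True),
    Hit("Coronavirus NSP10-like", ((4341, 4462),), 7.98e-54, True),
    Hit("NSP3A-like", ((808, 922),), 2.48e-40, True),
    Hit("Replicase NSP9", ((4228, 4335),), 3.27e-40, True),
    Hit("Macro domain-like", ((1374, 1531),), 2.52e-36, True),
    Hit("Coronavirus NSP7-like", ((3940, 4028),), 6.02e-25, True),
    Hit("Growth factor receptor domain",
        ((1253, 1328), (1437, 1452), (1628, 1661)), 0.0816, False),
]


def ordered_strong(hits):
    """Return the strong hits sorted by the start of their first segment."""
    return sorted((h for h in hits if h.strong),
                  key=lambda h: h.segments[0][0])


def covered_residues(hits):
    """Count distinct residue positions covered by the given hits."""
    covered = set()
    for hit in hits:
        for start, end in hit.segments:
            covered.update(range(start, end + 1))
    return len(covered)


if __name__ == "__main__":
    for hit in ordered_strong(HITS):
        start, end = hit.segments[0]
        print(f"{start:>5}-{end:<5} {hit.superfamily}  (E={hit.evalue:g})")
    strong = [h for h in HITS if h.strong]
    print("residues covered by strong hits:", covered_residues(strong))
```

Under these assumptions the strong hits run from NSP3A-like near the N-terminus through to Coronavirus NSP10-like near the C-terminus, and none of their regions overlap one another.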
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) S5Y3P7
Sequence length 4486
Comment (tr|S5Y3P7|S5Y3P7_CVHK1) Replicase polyprotein 1a {ECO:0000313|EMBL:AGT17766.1} KW=Complete proteome OX=290028 OS=Human coronavirus HKU1 (HCoV-HKU1). GN=Pp1a OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus. OH=9606
Sequence
MIKTSKYGLGFKWAPEFRWLLPDAAEELASPMKSDEGGLCPSTGQAMESVGFVYDNHVKI
DCRCILGQEWHVQSNLIRDIFVHEDLHVVEVLTKTAVKSGTAILIKSPLHSLGGFPKGYV
MGLFRSYKTKRYVVHHLSMTTSTTNFGEDFLGWIVPFGFMPSYVHKWFQFCRLYIEESDL
IISNFKFDDYDFSVEDVYAEVHAEPKGKYSQKAYALLRQYRGIKPVLFVDQYGCDYSGKL
ADCLQAYGHYSLQDMRQKQSVWLANCDFDIVVAWHVVRDSRFVMRLQTIATICGIKYVAQ
PTEDVVDGDVVIREPVHLLSADAIVLKLPSLMKVMTHMDDFSIKSIYNVDLCDCGFVMQY
GYVDCFNDNCDFYGWVSGNMMDGFSCPLCCTVYDSSEVKAQSSGVIPENPVLFTNSTDTV
NHDSFNLYGYSVTPFGSCIYWSPRPGLWIPIIKSSVKSYDDLVYSGVVGCKSIVKETALI
THALYLDYVQCKCGNLEQNHILGVNNSWCRQLLLNRGDYNMLLKNIDLFVKRRADFACKF
AVCGDGFVPFLLDGLIPRSYYLIQSGIFFTSLMSQFSQEVSDMCLKMCILFMDRVSVATF
YIEHYVNRLVTQFKLLGTTLVNKMVNWFNTMLDASAPATGWLLYQLLNGLFVVSQANFNF
VALIPDYAKILVNKFYTFFKLLLECVTVDVLKDMPVLKTINGLVCIVGNKFYNVSTGLIP
GFVLPCNAQEQQIYFFEGVAESVIVEDDVIENVKSSLSSYEYCQPPKSVEKICIIDNMYM
GKCGDKFFPIVMNDKNICLLDQAWRFPCAGRKVNFNEKPVVMEIPSLMTVKVMFDLDSTF
DDILGKVCSEFEVEKGVTVDDFVAVVCDAIENALNSCKEHPVVGYQVRAFLNKLNDNVVY
LFDEAGDEAMASRMYCTFAIEDVEDVISSEAVEDTIDGVVEDTINDDEDVVTGDNDDEDV
VTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDV
VTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDV
VTGDNDDEDVVTGDNDDEDNKDEEIVTGDNDDQIVVTGDDVDDIESIYDFDTYKALLVFN
DVYNDALFVSYGSSVETETYFKVNGLWSPTITHTNCWLRSVLLVMQKLPFKFKDLAIENM
WLSYKVGYNQSFVDYLLTTIPKAIVLPQGGFVADFAYWFLNQFDINAYANWCCLKCGFSF
DLNGLDALFFYGDIVSHVCKCGHNMTLIAADLPCTLHFSLFDDNFCAFCTPKKIFIAACA
VDVNVCHSVAVIGDEQIDGKFVTKFSGDKFDFIVGYGMSFSMSSFELAQLYGLCITPNVC
FVKGDIINVARLVKADVIVNPANGHMLHGGGVAKAIAVAAGKKFSKETAAMVKSKGVCQV
GDCYVSTGGKLCKTILNIVGPDARQDGRQSYVLLARAYKHLNNYDCCLSTLISAGIFSVP
ADVSLTYLLGVVDKQVILVSNNKEDFDIIQKCQITSVVGTKALAVRLTANVGRVIKFETD
AYKLFLSGDDCFVSNSSVIQEVLLLRHDIQLNNDVRDYLLSKMTSLPKDWRLINKFDVIN
GVKTVKYFECPNSIYICSQGKDFGYVCDGSFYKATVNQVCVLLAKKIDVLLTVDGVNFKS
ISLTVGEVFGKILGNVFCDGIDVTKLKCSDFYADKILYQYENLSLADISAVQSSFGFDQQ
QLLAYYNFLTVCKWSVVVNGPFFSFEQSHNNCYVNVACLMLQHINLKFNKWQWQEAWYEF
RAGRPHRLVALVLAKGHFKFDEPSDATDFIRVVLKQADLSGAICELELICDCGIKQESRV
GVDAVMHFGTLAKTDLFNGYKIGCNCAGRIVHCTKLNVPFLICSNTPLSKDLPDDVVAAN
MFMGVGVGHYTHLKCGSPYQHYDACSVKKYTGVSGCLTDCLYLKNLTQTFTSMLTNYFLD
DVEMVAYNPDLSQYYCDNGKYYTKPIIKAQFKPFAKVDGVYTNFKLVGHDICAQLNDKLG
FNVDLPFVEYKVTVWPVATGDVVLASDDLYVKRYFKGCETFGKPVIWFCHDEASLNSLTY
FNKPSFKSENRYSVLSVDSVSEESQGNVVTPVMESQISTKEVKLKGVRKTVKIEDAIIVN
DENSSIKVVKSLSLVDVWDMYLTGCDYVVWVANELSRLVKSPTVREYIRYGIKPITIPID
LLCLRDDNQTLLVPKIFKARAIEFYGFLKWLFIYVFSLLHFTNDKTIFYTTEIASKFTFN
LFCLALKNAFQTFRWSIFIKGFLVVATVFLFWFNFLYINVIFSDFYLPNISVFPIFVGRI
VMWIKATFGLVTICDFYSKLGVGFTSHFCNGSFICELCHSGFDMLDTYAAIDFVQYEVDR
RVLFDYVSLVKLIVELVIGYSLYTVWFYPLFCLIGLQLFTTWLPDLFMLETMHWLIRFIV
FVANMLPAFVLLRFYIVVTAMYKVVGFIRHIVYGCNKAGCLFCYKRNCSVRVKCSTIVGG
VIRYYDITANGGTGFCVKHQWNCFNCHSFKPGNTFITVEAAIELSKELKRPVNPTDASHY
VVTDIKQVGCMMRLFYDRDGQRVYDDVDASLFVDINNLLHSKVKVVPNLYVVVVESDADR
ANFLNAVVFYAQSLYRPILLVDKKLITTACNGISVTQTMFDVYVDTFMSHFDVDRKSFNN
FVNIAHASLREGVQLEKVLDTFVGCVRKCCSIDSDVETRFITKSMISAVAAGLEFTDENY
NNLVPTYLKSDNIVAADLGVLIQNGAKHVQGNVAKAANISCIWFIDAFNQLTADLQHKLK
KACVKTGLKLKLTFNKQEASVPILTTPFSLKGGVVLSNLLYILFFVSLICFILLWALLPT
YSVYKSDIHLPAYASFKVIDNGVVRDISVNDLCFANKFFQFDQWYESTFGSVYYHNSMDC
PIVVAVMDEDIGSTMFNVPTKVLRHGFHVLHFLTYAFASDSVQCYTPHIQISYNDFYASG
CVLSSLCTMFKRGDGTPHPYCYSDGVMKNASLYTSLVPHTRYSLANSNGFIRFPDVISEG
IVRIVRTRSMTYCRVGACEYAEEGICFNFNSSWVLNNDYYRSMPGTFCGRDLFDLFYQFF
SSLIRPIDFFSLTASSIFGAILAIVVVLVFYYLIKLKRAFGDYTSVVVINVVVWCINFLM
LFVFQVYPICACVYACFYFYVTLYFPSEISVIMHLQWIVMYGAIMPFWFCVTYVAMVIAN
HVLWLFSYCRKIGVNVCSDSTFEETSLTTFMITKDSYCRLKNSVSDVAYNRYLSLYNKYR
YYSGKMDTAAYREAACSQLAKAMETFNHNNGNDVLYQPPTASVSTSFLQSGIVKMVSPTS
KIEPCIVSVTYGSMTLNGLWLDDKVYCPRHVICSSSNMNEPDYSALLCRVTLGDFTIMSG
RMSLTVVSYQMQGCQLVLTVSLQNPYTPKYTFGNVKPGETFTVLAAYNGRPQGAFHVTMR
SSYTIKGSFLCGSCGSVGYVLTGDSVKFVYMHQLELSTGCHTGTDFTGNFYGPYRDAQVV
QLPVKDYVQTVNVIAWLYAAILNNCAWFVQNDVCSTEDFNVWAMANGFSQVKADLVLDAL
ASMTGVSIETLLAAIKRLYMGFQGRQILGSCTFEDELAPSDVYQQLAGVKLQSKTKRFIK
ETIYWILISTFLFSCIISAFVKWTIFMYINTHMIGVTLCVLCFVSFMMLLVKHKHFYLTM
YIIPVLCTLFYVNYLVVYKEGFRGFTYVWLSYFVPAVNFTYVYEVFYGCILCVFAIFITM
HSINHDIFSLMFLVGRIVTLISMWYFGSNLEEDVLLFITAFLGTYTWTTILSLAIAKIVA
NWLSVNIFYFTDVPYIKLILLSYLFIGYILSCYWGFFSLLNSVFRMPMGVYNYKISVQEL
RYMNANGLRPPRNSFEAILLNLKLLGIGGVPVIEVSQIQSKLTDVKCANVVLLNCLQHLH
VASNSKLWQYCSVLHNEILSTSDLSVAFDKLAQLLIVLFANPAAVDTKCLASIDEVSDDY
VQDSTVLQALQSEFVNMASFVEYEVAKKNLADAKNSGSVNQQQIKQLEKACNIAKSVYER
DKAVARKLERMADLALTNMYKEARINDKKSKVVSALQTMLFSMVRKLDNQALNSILDNAV
KGCVPLSAIPALAANTLTIIIPDKQVFDKVVDNVYVTYAGSVWHIQTVQDADGINKQLTD
ISVDSNWPLVIIANRYNEVANAVMQNNELMPHKLKIQVVNSGSDMNCNIPTQCYYNNGSS
GRIVYAVLSDVDGLKYTKIMKDDGNCVVLELDPPCKFSIQDVKGLKIKYLYFIKGCNTLA
RGWVVGTLSSTIRLQAGVATEYAANSSILSLCAFSVDPKKTYLDYIQQGGVPIINCVKML
CDHAGTGMAITIKPEATINQDSYGGASVCIYCRARVEHPDVDGICKLRGKFVQVPLGIKD
PILYVLTHDVCQVCGFWRDGSCSCVGSSVAVQSKDLNFLNGFGVLV
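To relate the region coordinates in the domain assignments to this sequence, the wrapped lines first have to be joined into one string. The following is a minimal sketch under two assumptions that the page does not state explicitly: that region coordinates are 1-based and inclusive, and that the sequence is wrapped at a fixed width. `join_wrapped` and `slice_region` are hypothetical helper names, and only the first line of the sequence is embedded here to keep the example short.

```python
# First wrapped line of the S5Y3P7 sequence shown above (60 residues).
FIRST_LINE = "MIKTSKYGLGFKWAPEFRWLLPDAAEELASPMKSDEGGLCPSTGQAMESVGFVYDNHVKI"


def join_wrapped(lines):
    """Join wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


if __name__ == "__main__":
    seq = join_wrapped([FIRST_LINE])
    print(len(seq))                  # length of the joined fragment
    print(slice_region(seq, 1, 6))   # first six residues of the fragment
```

With the full sequence joined, a call such as `slice_region(seq, 3350, 3650)` would extract the region assigned to the Trypsin-like serine proteases superfamily. Comparing `len(seq)` with the reported sequence length of 4486 is a quick check that no line was lost when copying.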
Identical sequences S5Y3P7
