SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for S5YNN7 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  S5YNN7
Domain Number 1 Region: 2964-3263
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.93e-147
Family Viral cysteine protease of trypsin fold 0.0000000000186
Further Details:      
 
Domain Number 2 Region: 3665-3818
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 1.7e-70
Family Coronavirus NSP8-like 0.00000959
Further Details:      
 
Domain Number 3 Region: 3937-4058
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 2.48e-55
Family Coronavirus NSP10-like 0.00000975
Further Details:      
 
Domain Number 4 Region: 3825-3931
Classification Level Classification E-value
Superfamily Replicase NSP9 1.7e-43
Family Replicase NSP9 0.0001
Further Details:      
 
Domain Number 5 Region: 3545-3627
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 8.37e-35
Family Coronavirus NSP7-like 0.00044
Further Details:      
 
Domain Number 6 Region: 1274-1426
Classification Level Classification E-value
Superfamily Macro domain-like 9.92e-27
Family Macro domain 0.00023
Further Details:      
 
Domain Number 7 Region: 2144-2260
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.0000596
Family Pectate lyase-like 0.026
Further Details:      
 
Weak hits

Sequence:  S5YNN7
Domain Number - Region: 2819-2897,3267-3445
Classification Level Classification E-value
Superfamily Calcium ATPase, transmembrane domain M 0.000458
Family Calcium ATPase, transmembrane domain M 0.038
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) S5YNN7
Sequence length 4083
Comment (tr|S5YNN7|S5YNN7_CVH22) Replicase polyprotein 1a {ECO:0000313|EMBL:AGT21358.1} KW=Complete proteome OX=11137 OS=Human coronavirus 229E (HCoV-229E). GN=Pp1a OC=Nidovirales; Coronaviridae; Coronavirinae; Alphacoronavirus. OH=9606
Sequence
MACNRVTLAVASDSEISAYGCSTIAQAVRRYSEAASNGFRACRFVSLDLQDCIVGIADDT
YVMGLHGNQTLFCNIMKFSDRPFMLHGWLVFSNSNYLLEEFDVVFGKRGGGNVTYTDQYL
CGADGKPVMSEDLWQFVDHFGENEEIVINGRTYVCAWLTKRKPLDYKRQNNLAIEEIEYV
HGDALHTLRNGSVLEMAKEVKTSSKVVLSDALDKLYKVFGSPVMTNGSNILEAFTKPVFI
SALVQCTCGTKSWSVGDWTGFKSSCCNLISNKLCVVPGNVKPGDAVITTQQAGAGVKYFC
GMTLKFVANIEGVSVWRVIALQSVDCFVASSTFVEEEHVNRMDTFCFNVRNSVTDECRLA
MLGAEMTSNVRRQVASGVIDISTGWFDVYDDIFAESKPRFVRKAEDIFGPCWSALVSALK
QLKVTTGELVRFVKSICNSAVAVVGGTIQILASVPEKFLNAFDVFVTAIQTVFDCAVETC
TIAGKAFDKVFDYVLLDNALVKLVTTKLKGVREGGLNKVKYATVVVGSTEEVKSSRVERS
NAVLTIANNYSKLFDEGYTVVIGDVAYFASDGYFRLMASPNSVLTTAVYKPLVDFNVNVM
GTRPEKFPTTVTCENLESAVLFVNDKITEFQLDYSIDVIDNEIIVKPNISLCVPLYVRDY
VDKWDDFCRQYSNESWFEDDYKAFISVLDITDAAVKAAESKAFVDTIVPPCPSILKVIDG
GKIWNGVIKNVNYVRDWLKSLKLNLTQQGLLGTCAKRFKRWLGILLEAYNAFLDTVVSTV
KIGGLTFKTYAFDKPYIVIRDIVCKVENKTEAEWIELFPHNDRIKSFSTFESAYMPIADP
THFDIEEVELLDAEFVEPGCGGILSVIDEHVFYKKDGVYYPSNGTNILPVAFTKAAGGKV
SFSDDVEVKDIEPVYRVKLCFEFEDEKLVDVCEKAIGKKIKHEGDWDSFCKTIQSALSVV
SCYVNLPTYYIYDEEGGNDLSLPVMISEWPLSVQQAQQEATHIAEDVVDQVEEVNSIFDI
ETVDVKHEMSPFEMPFEELNGLKILKQLDNNCWVNSVMLQIQLTGILDGDYAMQFFKMGR
VAKMIERCYTAEQCIRGAMGDVGLCMYRLLKDLHTGFMVMDYKCSCTSGRLEESGAVLFC
TPTKKAFPYGTCLNCNAPRMCTIRQLQGTIIFVQQKPEPVNPVSFVVKPVCSSIFRGAVS
CGHYQTNIYSQNLCVDGFGVNKIQPWTNDALNTICIKDADYNAKDEISVTPIKNTVDTTP
KEEFVVKEKLNAFLVHDNVAFYQGDVDTVVNGVDFDFIVNAANENLAHGGGLAKALDVYT
KGKLQRLSKEHIGLAGKVRVGTGVMVECDSLRIFNVVGPRKGKHERDLLVKAYNTINNEQ
GTPLTPILSCGIFGVKLETSLEVLLDVCNTKEFKVFVYTDTEVCKVKDFVSGLVNVQKVA
QPKIEQKPVSLIKVAPKPYRVDGKFSYFTEDLLCVADDKPIVLFTDSMLTLDDRGLALDN
ALSGVLSAAIKDCVDINKAIPSGNLIKFDIDSVVVYMCVVPSEKDKHLDNNVQRCTRKLN
RLMCDIVCTIPADYILPLVLSSLTCNVSFVGELKAAEAKVITIKVTEDGVNVHDVTVTTD
KSFEQQVGVIADKDKDLSGAVPSDLNTSELLTKAIDVDWVEFYGFKDAVTFATVDHSAFA
YESAVVNGIRVLKTSDNNCWVNAVCIALQYSKPHFISQGLDAAWNKFVLGDVEIFVAFVY
YVARLVKGDKGDAEDTLNKLSKYLANEAQVQLEHYSSCVECDAKFKNSVTSINSAIVCAS
VKRDGVQVGYCVHGIKYYSRVKSVRGRAIIVSVEQLEPCVQSRLLSGVAYTAFSGPVDKG
HYTVYDTAKKSMYDGDRFVKHDLSLLSVTSVVMVGDYVAPVSTVKPKPVISQLDENAQKF
FDFGDFLIHNFVIFFTWLLSMFTLCKTAVTTCDVKIMAKAPQRTGVVLKRSLKYNLKASA
AVLKSKWWLLAKFTKLLLLIYTLYSVVLFCVRFGPFKFCSETVNGYAKSNFVKDDYCDGS
LGCKMCLFGYQELSQFSHLDVVWKHITDPLFSNIQPFIVMVLLLIFGDNYLRCFLLYFVA
QMISTVGVFLGYKETNWFLHFIPFDVICDELLVTVIVIKVISFVRHVLFGCENPDCIACS
KSARLKRFPVNTIVNGVQRSFYVNANGGSKFCKKHRFFCVDCDSYGYGSTFITPEVSREL
GNITKTNVQPTGPAYVMIDKVEFENGFYRLYSGETFWRYNFDITESKYSCKEVFKNCNVL
DDFIVFNNNGTNVTQVKNASVYFSQLLCRPIKLVDSELLSTLSVDFNGVLHKAYIDVLRN
SFGKDLNANMSLAECKSALGLSISDHEFTSAISNAHRCDVLLSDLSFNNFVSSYAKPEEK
LSAYDLACCMRAGAKVVNANVLTKDQTPIVWHAKDFNSLSAEGRKYIVKTSKAKGLTFLL
TINENQAVTQIPATSIVAKQGAGDAGHSLTWLWLLCGLVCLIQFYLCFFMPYFMYDVVSS
FEGYDFKYIENGQLKNFEAPLKCVRNVFENFEDWHYAKFGFTPLNKQSCPIVVGVSEIVN
TVAGIPSNVYLVGKTLIFTLQAAFGNAGVCYDIFGVTTPEKCIFTSACTRLEGLGGNNVY
CYNTELMEGSLPYSSIQANAYYKYDNGNFIKLPEVIAQGFGFRTVRTIATKYCRVGECVD
SNAGVCFGFDKWFVNDGRVANGYVCGTGLWNLVFNILSMFSSSFSVAAMSGQILLNCALG
AFAIFCCFLVTKFRRMFGDLSVGVCTVVVAVLLNNVSYIVTQNLVTMIAYAILYFFATRS
LRYAWIWCAAYLIAYISFAPWWLCAWYFLAMLTGLLPSLLKLKVSTNLFEGDKFVGTFES
AAAGTFVIDIRSYEKLANSISPEKLKSYAASYNRYKYYSGNANEADYRCACYAYLAKAML
DFSRDHNDILYTPPTVSYGSTLQAGLRKMAQPSGFVEKCVVRVCYGNTVLNGLWLGDIVY
CPRHVIASNTTSAIDYDHEYSIMRLHNFSIISGTAFLGVVGATMHGVTLKIKVSQTNMHT
PRHSFRTLKSGEGFNILACYDGCAQGVFGVNMRTNWTIRGSFINGACGSPGYNLKNGEVE
FVYMHQIELGSGSHVGSSFDGVMYGGFEDQPNLQVESANQMLTVNVVAFLYAAILNGCTW
WLKGDKLFVEHYNEWAQANGFTAMNGEDAFSILAAKTGVCVERLLHAIQVLNNGFGGKQI
LGYSSLNDEFSINEVVKQMFGVNLQSGKTTSMFKSISLFAGFFVMFWAELFVYTTTIWVN
PGFLTPFMILLVALSLCLTFVVKHKVLFLQVFLLPSIIVAAIQNCAWDYHITKVLAEKFD
YNVSVMQMDIQGFVNIFICLFVALLHTWRFAKERCTHWCTYLFSLIAVLYTALYSYDYVS
LLVMLLCAISNEWYIGAIIFRICRFGVAFLPVEYVSYFDGVKTVLLFYMLLGFVSCMYYG
LLYWINRFCKCTLGVYDFCVSPAEFKYMVANGLNAPNGPFDALFLSFKLMGIGGPRTIKV
STVQSKLTDLKCTNVVLMGILSNMNIASNSKEWAYCVEMHNKINLCDDPETAQELLLALL
AFFLSKHSDFGLGDLVDSYFENDSILQSVASSFVGMPSFVAYETARQEYENAVANSSSPQ
IIKQLKKAMNVAKAEFDRESSVQKKINRMAEQAAAAMYKEARAVNRKSKVVSAMHSLLFG
MLRRLDMSSVDTILNMARNGVVPLSVIPATSAARLVVVVPDHDSFVKMMVDGFVHYAGVV
WTLQEVKDNDGKNVHLKDVTKENQEILVWPLILTCERVVKLQNNEIMPGKMKVKATKGEG
DGGIISEGNALYNNECGRAFMYAYVTTKPGMKYVKWEHDSGVVTVELEPPCRFVIDTPTG
PQIKYLYFVKNLNNLRRGAVLGYIGATVRLQAGKQTEFVSNSHLLTHCSFAVDPAVAYLD
AVKQGAKPVGNCVKMLTNGSGSGQAITSTIDSNTTQDTYGGASVCIYCRAHVAHPTMDGF
CQYKGKWVQVPIGTNDPIRFCLENTVCKVCGCWLNHGCTCDRTAIQSFDSSYLNESGALV
PLD
Download sequence
Identical sequences S5YNN7

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]