SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed at supfam.org.

Domain assignment for C6GHP6 from UniProt 2018_03 genome


Domain assignment details

Strong hits

Sequence: C6GHP6

Domain  Region     Level        Classification                           E-value
1       3247-3547  Superfamily  Trypsin-like serine proteases            1.53e-132
                   Family       Viral cysteine protease of trypsin fold  0.000000112
2       3964-4113  Superfamily  Coronavirus NSP8-like                    3.4e-62
                   Family       Coronavirus NSP8-like                    0.00000677
3       4238-4359  Superfamily  Coronavirus NSP10-like                   2.88e-53
                   Family       Coronavirus NSP10-like                   0.00000738
4       4125-4232  Superfamily  Replicase NSP9                           8.37e-41
                   Family       Replicase NSP9                           0.0000753
5       850-966    Superfamily  NSP3A-like                               1.2e-36
                   Family       NSP3A-like                               0.00036
6       1273-1428  Superfamily  Macro domain-like                        2.52e-35
                   Family       Macro domain                             0.00016
7       3837-3925  Superfamily  Coronavirus NSP7-like                    8.37e-28
                   Family       Coronavirus NSP7-like                    0.00022
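
The strong hits above are numbered by superfamily E-value, not by position along the polyprotein. The following Python sketch reorders them from N-terminus to C-terminus and reports how much of the 4383-residue sequence they cover. It assumes the regions are 1-based and inclusive, which the page does not state; the names Domain, DOMAINS, domain_length, in_sequence_order and find_overlaps are hypothetical helpers introduced only for this illustration.

```python
from typing import List, NamedTuple, Tuple


class Domain(NamedTuple):
    number: int        # "Domain Number" as listed on the page
    start: int         # first residue of the region
    end: int           # last residue of the region (assumed inclusive)
    superfamily: str
    evalue: float      # superfamily-level E-value


# Values copied from the strong hits listed above.
DOMAINS = [
    Domain(1, 3247, 3547, "Trypsin-like serine proteases", 1.53e-132),
    Domain(2, 3964, 4113, "Coronavirus NSP8-like", 3.4e-62),
    Domain(3, 4238, 4359, "Coronavirus NSP10-like", 2.88e-53),
    Domain(4, 4125, 4232, "Replicase NSP9", 8.37e-41),
    Domain(5, 850, 966, "NSP3A-like", 1.2e-36),
    Domain(6, 1273, 1428, "Macro domain-like", 2.52e-35),
    Domain(7, 3837, 3925, "Coronavirus NSP7-like", 8.37e-28),
]

SEQUENCE_LENGTH = 4383  # "Sequence length" reported for C6GHP6


def domain_length(domain: Domain) -> int:
    """Number of residues in the region, assuming inclusive coordinates."""
    return domain.end - domain.start + 1


def in_sequence_order(domains: List[Domain]) -> List[Domain]:
    """Domains sorted by start position (N-terminus to C-terminus)."""
    return sorted(domains, key=lambda d: d.start)


def find_overlaps(domains: List[Domain]) -> List[Tuple[int, int]]:
    """Pairs of neighbouring domain numbers whose regions overlap."""
    ordered = in_sequence_order(domains)
    return [
        (a.number, b.number)
        for a, b in zip(ordered, ordered[1:])
        if b.start <= a.end
    ]


if __name__ == "__main__":
    for d in in_sequence_order(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} {d.superfamily} ({domain_length(d)} aa)")
    covered = sum(domain_length(d) for d in DOMAINS)
    print(f"Assigned residues: {covered} of {SEQUENCE_LENGTH}")
    print(f"Overlapping neighbours: {find_overlaps(DOMAINS)}")
```

Sorting by start position rather than by E-value gives the order in which the domains occur in the Orf1a polyprotein, which is usually what is meant by a domain architecture.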
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Namespaces: Cellular Component, Molecular Function, Biological Process
Columns: IC (bits), H-Score

Protein sequence

External link(s) C6GHP6
Sequence length 4383
Comment (tr|C6GHP6|C6GHP6_9BETC) Orf1a polyprotein {ECO:0000313|EMBL:ACT11015.1} KW=Complete proteome OX=502105 OS=Bovine respiratory coronavirus bovine/US/OH-440-TC/1996. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus.
Sequence
MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMV
DCRRLLKQECCVQSSLIREIVMNTRPYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRG
CNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGVCFGAGQFVGWVIPLAFMPVQ
SRKFIVPWVMYLRKCGEKGAYNKDHKRGGFEHVYNFKVEDAYDLVHDEPKGKFSKKAYAL
IRGYRGVKPLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPVWSQELPFDVTVAWHV
VRDPRYVMRLQSASTIRSVAYVANPTEDLCDGSVVIKEPVHVYADDSIILRQHNLVDIMS
CFYMEADAVVNAFYGVDLKDCGFVMQFGYIDCEQDLCDFKGWVPGNMIDGFACTTCGHVY
ETGDLLAQSSGVLPVNPVLHTKSAAGYGGFGCKDSFTLYGQTVVYFGGCVYWSPARNIWI
PILKSSVKSYDGLVYTGVVGCKAIVKETNLICKALYLDYVQHKCGNLHQRELLGVSDVWH
KQLLLNRGVYKPLLENIDYFNMRRAKFSLETFTVCADGFMPFLLDDLVPRAYYLAVSGQA
FCDYADKICHAVVSKSKELLDVSLDSLSAAIHYLNSKIVDLAQHFSDFGTSFVSKIVHFF
KTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVKNIPRYASAVAQAFRSVAKVVLDSLRV
TFIDGLSCFKIGRRRICLSGSKIYEVERGLLHSSQLPLDVYDLTMPSQVQKAKQKPIYLK
GSGSDFSLADSVVEVVTTSLTPCGYSEPPKVADKICIVDNVYMAKAGDKYYPVVVDGQVG
LLDQAWRVPCAGRRVTFKEQPTVNEIASTPKTIKVFYELDKDFNTILNTACGVFEVDDTV
DMEEFYAVVVDAIEEKLSPCKELEGVGAKVSAFLQKLEDNSLFLFDEAGEEVLAPKLYCA
FTAPEDDDFLEESGVEEDDVEGEETDLTVTSAGEPCVASEQEESSEILEDTLDDGPCVET
SDSQVEEDVEMSDFVDLESVIQDYENVCFEFYTTEPEFVKVLDLYVPKATRNNCWLRSVL
AVMQKLPCQFKDKNLQDLWVLYKQQYSQLFVDTLVNKIPANIVVPQGGYVADFAYWFLTL
CDWQCVAYWKCIKCDLALKLKGLDAMFFYGDVVSHVCKCGESMVLIDVDVPFTAHFALKD
KLFCAFITKRSVYKAACVVDVNDSHSMAVVDGKQIDDHRVTSITSDKFDFIIGHGMSFSM
TTFEIAQLYGSCITPNVCFVKGDIIKVSKRVKAEVVVNPANGHMAHGGGVAKAIAVAAGQ
QFVKETTDMVKSKGVCATGDCYVSTGGKLCKTVLNVVGPDARTQGKQSYALLERVYKHLN
KYDCVVTTLISAGIFSVPSDVSLTYLLGTAEKQVVLVSNNQEDFDLISKCQITAVEGTKK
LAERLSFNVGRSIVYETDANKLILSNDVAFVSTFNVLQDVLSLRHDIALDDDARTFVQSN
VDVVPEGWRVVNKFYQINGVRTVKYFECPGGIDICSQDKVFGYVQQGSFNKATVAQIKAL
FLDKVDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDN
LSSEDLKAVRSSFNFDQKELLAYYNMLVNCSKWQVVFNGKYFTFKQANNNCFVNVSCLML
QSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGDPADSRDFLRVVFSQVDLTG
AICDFEIACKCGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDVPFL
ICSNTPASVKLPKGVGSANIFKGDKVGHYVHVKCEQSYQLYDASNVKKVTDVTGNLSDCL
YLKNLKQTFKSVLTTYYLDDVKKIEYNPDLSQYYCDGGKYYTQRIIKAQFKTFEKVDGVY
TNFKLIGHTICDILNAKLGFDSSKEFVEYKVTEWPTATGDVVLATDDLYVKRYERGCITF
GKPVIWLSHEQASLNSLTYFNRPLLVDENKFDVLKVDDVDDGGDISESDAKESKEINIIK
LSGVKKPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDMWLTGCRYVVRTANALSMAVNVPT
IRKFIKFGMTIVSIPIDLLNLREIKPVFNVVKAVRNKISACFNFIKWLFVLLFGWIKISA
DNKVIYTTEVASKLTCKLVALAFKNAFLTFKWSVVARGACIIATIFLLWFNFIYANVIFS
DFYLPKIGFLPTFVGKIAQWIKSTFSLVTICDLYSIQDVGFKNQYCNGSIACQFCLAGFD
MLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWL
PELFMLSTLHWSVRLLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKPGCLFC
YKRNRSLRVKCSTIVGGMIRYYDVMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALD
LSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYERDGQRTYDDVNASLFVDYSNLLHSKV
KGVPNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVY
VDTFLSMFDVDKKSLNALIATAHSSIKQGTQICKVLDTFLSCARKSCSIDSDVDTKCLAD
SVMSAVSAGLELTDESCNNLVPTYLKGDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIW
SVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKGGAVFSYFVYVC
FLLSLVCFIGLWCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQ
WYESTFGLSYYSNSMACPIVVAVVDQDLGSTVFNVPTKVLRYGYHVLHFITHALSADGVQ
CYTPHSQISYSNFYASGCVLSSACTMFAMADGSPQPYCYTEGLMQNASLYSSLVPHVRYN
LANAKGFIRFPEVLREGLVRIVRTRSMSYCRVGLCEEADEGICFNFNGSWVLNNDYYRSL
PGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDY
TSIVFVNVIVWCVNFMMLFVFQVYPTLSCVYAICYFYATLYFPSEISVIMHLQWLVMYGT
IMPLWFCLLYISVVVSNHAFWVFAYCRRLGTSVRSDGTFEEMALTTFMITKDSYCKLKNS
LSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFTNNNGSDVLYQPPTASV
STSFLQSGIVKMVNPTSKVEPCIVSVTYGNMTLNGLWLDDKVYCPRHVICSASDMTNPDY
TNLLCRVTSSDFTVLFDRLSLTVMSYQMQGCMLVLTVTLQNSRTPKYTFGVVKPGETFTV
LAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVGYVLMGDCVKFVYMHQLELSTGCHTG
TDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWA
LSNGFSQVKSDLVIDALASMTGVSLETLLAAIKRLKNGFQGRQVMGSCSFEDELTPSDVY
QQLAGIKLQSKRTRLVKGIVCWIMASTFLFSCIITAFVKWTMFMYVTTNMLSITFCALCV
ISLAMLLVKHKHLYLTMYIIPVLFTLLYNNYLVVYKQTFRGYVYAWLSYYVPSVEYTYTD
EVIYGMLLLIGMVFVTLRSINHDLFSFIMFVGRVISVVSLWYMGSNLEEEILLMLASLFG
TYTWTTALSMAAAKVIAKWVAVNVLYFTDIPQIKIVLVCYLFIGYIISCYWGLFSLMNSL
FRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQSKLT
DVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPA
AVDSKCLTSIEEVCDDYAKDNTVLQALQSEFVNMASFVEYEVAKKNLDEARSSGSANQQQ
LKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSKVVSALQTMLFSM
VRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVW
QIQTIQDSDGTNKQLNEISDDCNWPLVIIANRHNEVSATVLQNNELMPAKLKTQVVNSGP
DQTCNTPTQCYYNNSNNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVK
GLKIKYLYFVKGCNTLARGWVVGTISSTVRLQAGTATEYASNSSILSLCAFSVDPKKTYL
DFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTNQDSYGGASVCIYCRARVEHPDVDG
LCKLRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSCVSTDTTVQSKDTNFLNGFG
VRV
Identical sequences C6GHP6
gi|253756594|ref|YP_003038508.1| C6GHP6_9NIDO
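
The Comment line above packs several fields into one string: a database prefix, accession and entry name in parentheses, a free-text description, and two-letter KEY=value pairs. A minimal Python sketch for splitting it apart follows. The function name parse_comment and the output keys db, accession, entry_name and description are hypothetical. Reading OX as a taxonomy identifier, OS as the organism, GN as the gene name, OC as the lineage and KW as a keyword follows the usual UniProt conventions and is an assumption here, since the page itself does not define them.

```python
import re
from typing import Dict

# The Comment field for C6GHP6, copied verbatim from the record above.
COMMENT = (
    "(tr|C6GHP6|C6GHP6_9BETC) Orf1a polyprotein "
    "{ECO:0000313|EMBL:ACT11015.1} KW=Complete proteome OX=502105 "
    "OS=Bovine respiratory coronavirus bovine/US/OH-440-TC/1996. "
    "GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus."
)


def parse_comment(comment: str) -> Dict[str, str]:
    """Split a comment string into header parts and KEY=value fields."""
    # Splitting on two-letter uppercase keys followed by "=" yields
    # [head, key1, value1, key2, value2, ...]; empty values such as
    # "GN=" are kept as empty strings.
    parts = re.split(r"\s*\b([A-Z]{2})=", comment)
    head, rest = parts[0], parts[1:]
    fields = {key: value.strip() for key, value in zip(rest[0::2], rest[1::2])}

    # The head looks like "(db|accession|entry_name) description".
    match = re.match(r"\((\w+)\|(\w+)\|(\w+)\)\s*(.*)", head)
    if match:
        fields["db"] = match.group(1)
        fields["accession"] = match.group(2)
        fields["entry_name"] = match.group(3)
        fields["description"] = match.group(4)
    return fields


if __name__ == "__main__":
    parsed = parse_comment(COMMENT)
    for key, value in parsed.items():
        print(f"{key}: {value!r}")
    # The OC value is a "; "-separated list ending in a period.
    print(parsed["OC"].rstrip(".").split("; "))
```

The sketch keeps every value as a string, including the empty GN field, so nothing in the record is reinterpreted or dropped.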
