SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for U5WI49 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  U5WI49
Domain Number 1 Region: 3243-3540
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.39e-141
Family Viral cysteine protease of trypsin fold 0.000000000015
Further Details:      
 
Domain Number 2 Region: 13-127
Classification Level Classification E-value
Superfamily SARS Nsp1-like 2.09e-68
Family SARS Nsp1-like 0.000000537
Further Details:      
 
Domain Number 3 Region: 3957-4110
Classification Level Classification E-value
Superfamily Coronavirus NSP8-like 2.48e-67
Family Coronavirus NSP8-like 0.0000000483
Further Details:      
 
Domain Number 4 Region: 4236-4358
Classification Level Classification E-value
Superfamily Coronavirus NSP10-like 8.24e-55
Family Coronavirus NSP10-like 0.000000157
Further Details:      
 
Domain Number 5 Region: 4120-4230
Classification Level Classification E-value
Superfamily Replicase NSP9 1.83e-45
Family Replicase NSP9 0.000000514
Further Details:      
 
Domain Number 6 Region: 819-930
Classification Level Classification E-value
Superfamily NSP3A-like 1.44e-42
Family NSP3A-like 0.000000746
Further Details:      
 
Domain Number 7 Region: 1003-1163
Classification Level Classification E-value
Superfamily Macro domain-like 1.55e-36
Family Macro domain 0.0000000368
Further Details:      
 
Domain Number 8 Region: 3837-3919
Classification Level Classification E-value
Superfamily Coronavirus NSP7-like 1.96e-30
Family Coronavirus NSP7-like 0.00000651
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) U5WI49
Sequence length 4382
Comment (tr|U5WI49|U5WI49_CVHSA) Non-structural polyprotein 1a {ECO:0000313|EMBL:AGZ48829.1} KW=Complete proteome OX=1415852 OS=Bat SARS-like coronavirus WIV1. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Betacoronavirus.
Sequence
MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEAREHLKNGTCGLVELEKGV
LPQLEQPYVFIKRSDALSTNHGHKVVELVAELDGIQYGRSGITLGVLVPHVGETPIAYRN
VLLRKNGNKGAGGHSFGIDLKSYDLGDELGTDPIEDYEQNWNTKHGSGVLRELTRELNGG
AFTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQLDYIESKRGVYCCRDHEHEVAW
FTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPLDSKVKVIQPRVEKKKTEGFMGRI
RSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFLKATCEHCGTENSVTEGPTTCGYL
PTNAVVKMPCPACQDPEIGPEHSVADYHNHSNIETRLRKGGRTRCFGGCVFAYVGCYNKR
AYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRERVNINIVGDFQLNEEVAIILASF
SASTSAFIDTIRSLDYKSFKAIVESCGNYKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQA
AGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQSLRLVDVMVYTSDLLTNSVIIMAY
VTGGLVQQTSQWLSNLLGTTVEKLRPIFVWIEAKLSAGVEFLKDAWEILKFLITGVFDIV
KGQIQVASDNIKGCVKCFIDVVNKALEMCIDQVTIAGTKLRSLNLGEVFIAQSKGLYRQC
IRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAVVG
TPVCVNGLMLLEIKDKEQYCALSPGLLATNNVFRLKGGAPTKGVTFGEDTVLEVQGYKNV
RITFELDERVDKVLNEKCSVYTVESGTEVTEFACVVAEAVVKTLQPVSDLLTNMGIDLDE
WSVATFYLFDDAGEEKLSSRMYCSFYPPDDEEDCDEYDEEEEVLEESCAHEYGTEEDYQG
LSLEFGASTEMQVEEEEEEDWLGDATELSEHEPEPELTPEEPVNQFTGYLKLTDNVAIKC
VDIVKEAQNANPTVIVNAANIHLKHGGGVAGALNKATNGAMQKESDDYIKLNGPLVVGGS
CLLSGHNLAKKCLHVVGPNLNAGEDIQLLKAAYENFNSQDTLLAPLLSAGIFGAKPLQSL
QVCVQTVRTQVYIAVNDKALYEQVVMDYLDSLKPRVEAPKQEEPPRTEDPKIEEKSVVQK
PIDVKPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSHNMLRGEDMSFLEKDAP
YVVGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKT
ALKKCKSAFYVLPSETPNAKEEILGTVSWNLREMLAHAEETRKLMPICMDVRAIMATIQR
KYKGIKVQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLEEAAR
CMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWYYSGQRTELG
VEFLKRGDKIVYHTLESPVEFHLDGEVLPLDKLKSLLSLREVKTIKVFTTVDNTNLHTQL
VDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTLRSEAFEYYHTLDESFL
GRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLALQQIEVKFNAPALQEAYYRAR
AGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLESAKRVLNVVCKHCGQKTTTLT
GVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQESSFVMMSAPPAEYKLQQGTFLC
ANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTDVFYKETSYTTTIKPVSYK
LDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPLPNASFDNFKLTCSNTKFADDLNQ
MTGFTKPASRELYVTFFPDLNGDVVAIDYRHYSASFKKGAKLLHKPIVWHINQATTKTTF
KPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNLACESQQPTSEEVVENPTIQKEVIE
CDVKTTEVVGNVILKPSDEGVKVTQELDHEDLMAAYVENTSITIKKPNELSLALGLKTIA
THGIAAINSVPWSKILAYVKPFLGQAAITTSNCAKRLAQRVFNNYMPYVLTLLFQLCTFT
KSTNSRIRASLPTTIAKNSVRGVARLCLDAGINYVKSPKFSKLFTIAMWLLLLSICLGSL
IYVTAALGVLLSNFGAPSYCSGVRESYLNSSNVTTMDFCEGSFPCSVCLSGLDSLDSYPA
LETIQVTISSYKLDLTILGLAAEWFLAYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSW
LMWFIISIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVEC
TTIVNGMKRSFYVYANGGRGFCKTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINP
TDQSSYIVDSVAVKNGALHLYFDKAGQKTYERHPLSHFINLDNLRANNTKGSLPINVIVF
DGKSKCDESAARSASVYYSQLMCQPILLLDQALISDVGDSTEVSVKMFDAYVDTFSATFS
VPMEKLRALVATAHSELAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDL
EVTGDSCNNFMLTYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSE
QLRKQIRSAAKKNNIPFRLTCATTRQVVNVITTKISLKGGKIVSTWFKLMLKATLLCVIA
TLVCYIVMPVHTLSVHDGYTNEIIGYKAIQDGVTRDIVSTDDCFANKHAGFDSWLSQRGG
SYKNDKSCPVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLI
EYSDFATSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQ
FPNTYLEGSVRVVTTFDAEYCRHGTCERSEAGICLSTSGRWVLNNEHYRALPGVFCGVDA
MNLIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRAFGEYNHVVAANAL
LFLMSFTILCLAPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIVPFWITAI
YVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLLPLTQ
YNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSITSAVLQ
SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIR
KSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNG
SPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGK
FYGPFIDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE
PLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQC
SGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFFVYENAFLPFTLGIMAIAA
CAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWFELADTSLSGYRLKDCV
MYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKVYYGNALDQAISMWALVISVTSN
YSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCIMLVYCFLGYCCCCYFGLFCLLNRY
FRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDAFKLNIKLLGIGGKPCIKVATVQSKMS
DVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQG
AVDINKLCEEMLDNRATLQAIASEFSSLPSYAAYATAQEAYEQAVANGDSEVVLKKLKKS
LNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDND
ALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVD
ADSKIVQLSEINMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTAC
TDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGP
KVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDPAKAYKDY
LASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFC
DLKGKYVQIPTTCANDPVGFTLRNTVCTVCGMWKGYGCSCDQLREPMMQSADASTFLNGF
AV
Download sequence
Identical sequences U5WI49

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]