SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A142IJR5 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  A0A142IJR5
Domain Number 1 Region: 2781-3084
Classification Level | Classification | E-value
Superfamily | Trypsin-like serine proteases | 3.95e-120
Family | Viral cysteine protease of trypsin fold | 0.000000603
 
Domain Number 2 Region: 3506-3664
Classification Level | Classification | E-value
Superfamily | Coronavirus NSP8-like | 5.36e-59
Family | Coronavirus NSP8-like | 0.0000194
 
Domain Number 3 Region: 3797-3917
Classification Level | Classification | E-value
Superfamily | Coronavirus NSP10-like | 1.31e-50
Family | Coronavirus NSP10-like | 0.0000079
 
Domain Number 4 Region: 3677-3785
Classification Level | Classification | E-value
Superfamily | Replicase NSP9 | 1.06e-40
Family | Replicase NSP9 | 0.00021
 
Domain Number 5 Region: 3382-3464
Classification Level | Classification | E-value
Superfamily | Coronavirus NSP7-like | 3.66e-30
Family | Coronavirus NSP7-like | 0.00031
 
Domain Number 6 Region: 1007-1178
Classification Level | Classification | E-value
Superfamily | Macro domain-like | 2.35e-29
Family | Macro domain | 0.00034
 
Domain Number 7 Region: 674-744
Classification Level | Classification | E-value
Superfamily | NSP3A-like | 0.0000602
Family | NSP3A-like | 0.0082
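
The strong hits above are listed by superfamily E-value, not by position along the protein. As a rough illustration, the sketch below re-sorts the same seven assignments from N-terminus to C-terminus and counts how many residues they cover. The `Domain` type and the function names are hypothetical, and the regions are assumed to be 1-based and inclusive, which the page does not state explicitly.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    """One strong hit; a hypothetical container, not a SUPERFAMILY type."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float   # superfamily-level E-value


# Values copied from the "Strong hits" listing for A0A142IJR5.
DOMAINS = [
    Domain("Trypsin-like serine proteases", 2781, 3084, 3.95e-120),
    Domain("Coronavirus NSP8-like", 3506, 3664, 5.36e-59),
    Domain("Coronavirus NSP10-like", 3797, 3917, 1.31e-50),
    Domain("Replicase NSP9", 3677, 3785, 1.06e-40),
    Domain("Coronavirus NSP7-like", 3382, 3464, 3.66e-30),
    Domain("Macro domain-like", 1007, 1178, 2.35e-29),
    Domain("NSP3A-like", 674, 744, 6.02e-5),
]


def architecture(domains):
    """Return the domains ordered by start position along the sequence."""
    return sorted(domains, key=lambda d: d.start)


def covered_residues(domains):
    """Count distinct residue positions covered by at least one domain."""
    positions = set()
    for d in domains:
        positions.update(range(d.start, d.end + 1))
    return len(positions)


if __name__ == "__main__":
    for d in architecture(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} {d.superfamily} ({d.evalue:.3g})")
    print("covered residues:", covered_residues(DOMAINS))
```

Under these assumptions the assigned regions do not overlap, so the covered-residue count is simply the sum of the individual region lengths.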
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component | IC (bits) | H-Score
Biological Process | IC (bits) | H-Score
Molecular Function | IC (bits) | H-Score

Protein sequence

External link(s) A0A142IJR5
Sequence length 3953
Comment (tr|A0A142IJR5|A0A142IJR5_9GAMC) 1a polyprotein {ECO:0000313|EMBL:AMR60478.1} KW=Complete proteome OX=11120 OS=Infectious bronchitis virus. GN=1a OC=Nidovirales; Coronaviridae; Coronavirinae; Gammacoronavirus.
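
The comment line packs several fields into one string. The following is a minimal sketch of how it could be split apart, assuming the two-letter codes follow the usual UniProt meanings (KW keywords, OX taxonomy identifier, OS organism, GN gene name, OC lineage). The function name `parse_comment` and the returned dictionary layout are illustrative only and are not part of any SUPERFAMILY or UniProt tool.

```python
import re

COMMENT = (
    "(tr|A0A142IJR5|A0A142IJR5_9GAMC) 1a polyprotein "
    "{ECO:0000313|EMBL:AMR60478.1} KW=Complete proteome OX=11120 "
    "OS=Infectious bronchitis virus. GN=1a "
    "OC=Nidovirales; Coronaviridae; Coronavirinae; Gammacoronavirus."
)


def parse_comment(comment):
    """Split a comment line into identifiers, description and KEY=value fields."""
    match = re.match(r"\((\w+)\|(\w+)\|(\w+)\)\s*(.*)", comment)
    if match is None:
        raise ValueError("comment does not start with (db|accession|entry_name)")
    database, accession, entry_name, rest = match.groups()

    # re.split with a capturing group keeps the two-letter keys in the result:
    # [description, key1, value1, key2, value2, ...]
    parts = re.split(r"\s*\b([A-Z]{2})=", rest)
    fields = {key: value.strip() for key, value in zip(parts[1::2], parts[2::2])}

    return {
        "database": database,
        "accession": accession,
        "entry_name": entry_name,
        "description": parts[0].strip(),
        "fields": fields,
    }


if __name__ == "__main__":
    record = parse_comment(COMMENT)
    print(record["accession"], record["fields"]["OS"])
    # The lineage is a semicolon-separated list ending in a full stop.
    print([t.strip() for t in record["fields"]["OC"].rstrip(".").split(";")])
```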
Sequence
MASSLKQGVSSQNQRSVILVAKDIPDQLRDALFFYTSHNPKDYADAFAFRQKFDRSLGAG
KQFKFETVCGPFFLKGVDKITPGVPAKALKATSKLADLEDIFGVSPFARKYRELLKTACH
WSMTVETLDARAQTLEEIFGPSEILWLQVAAKVQVSAMAMRRLVGEVTAKVMDSLGSNLS
ALFQVVKETMARIFQKALAIFESVSELPQRIAALKMAFAKCARSITVVVVGHALTIREFA
GTCLASINGAVAKFFEELPTGFMGAKVFSTLAFFKEAAVKIVENIPNAPRGTKGFEVVGN
AKGTQVVVRGMRNDLTLLDQKAEVPVEQEGWSAILEGHLCYVFKSGDRFYAAPLSGNFAL
HDVHCCERVVCLADGVTPEINDGLVLAAIYSSFSVSELVAALKKGEPFKFLGHKFVYAKE
AAVSFTLAKAATIGDVLKLFQSARVKTKDVWSALTEQSFEFWKLAYNRVRNLEEVLKTHF
CKAQMSVVILAVVLGEGIWHLCSQVIYKLSGLFAKVVDFCEKHWKGFCTQLRKARLVVTE
NLCVLKGIAQHCFQLLLESIHSMYRSFKKCALGRISGDLFFWKGGVHKIIHDGDEIWFDA
VDTIEVEDLGTVQEKPIDFEVCEDVILPENQPGHMVQIQDDGKNYMFFRFKKDENIYYTP
MSQLGAINVICKAGGKTVTFGETTVKEIPSPDVVPIKVSIECCGEPWNTIFKKAYKEPIE
VETDLTVEQLLTVIYDKMCEDLKLFPEAPEPPPFENVALVDKNGKALDCIKSCHLIYRDY
ESDDDIEEDDAEECDTDPADAEECDTASECEQEDEDTKVLSLLQDPASNKYPLPLDDDYS
VYNGCIVHKDALDVVNLPSGEETFVVNNCFEGAVKPLPQMVVDVLGDWGEAVDAQEQLCE
QESESVKEPVEKPTGIGGSAVEEAVVVEQEVVPVVEESQEVVVFTPADLKVVKETSEDVD
EFVLVADVPTEEVVPQEKEEPQVEQEPIQVVKSQREKKAKKFKVRQTTCEKPKFLEYTTC
VGDLTVVIAKALDEFKEFCVVNAANEYMSHGGGVAKAIADFCGPDFVEYCEDYVKKHGPQ
QRLVTPSYVKGIQCVNNVVGPRHGDKDLEEKLVAAYKNVLVDGVVNYVVPVLSSGIFGVD
FKTSIDAMRKAFEGLNIRVLLFSLSQEHIEYFDATCKQKKIYLTEDGVKYRSVVVKPGDS
LGQFGQVFAKNKTVFTADDVEDEEVLFIPTTDKAVLEYYGLDAQKYVIYLQTLAQKWEVQ
CRDNFILLKWRDGNCWISSAIVLLQAAKIRFRGFLAEAWAKLLGGDPTDFVAWCYASCNA
KVGEFSDCNWLLANLAEYFDADYTNALLKKRVSCNCGVKSYELRGLEACIQPVRAPNLLH
FKTQYSNCPTCGANSMDEVVEASLPYLLLVATDGPAAVDCDENAVGNVVFIGSTNSGHCY
TQVVGKAFDNLAQDRKFGKRSPYITAMYMRFSLKSQNSLSVAKKSKSKSEVVKEDVSNLA
TSSKFSFDDLTDFEQWYDSNIYESLKVQETPDNMDEYVSFTTKEDSKLPLTLKVRGIKSV
VDFKSKDGFTYKLIPDTDENSKAPVYYPTLDSVSLKAIWVDGTANFVVGHPNYNSRALRI
PTFWECAETFVKIGEKVDGVTMGLWRAEHLNRPNLERIFNVVKKTMVGTSVVTTQCGKLI
SKAATFVADKVGDGVVRNVFDRIKECFGSTREHFERRVSPQFLKTLFFFVFCFLKASVKG
LMASYKSVLCKVLSTALLILWIVYTSDPVISTGIRVLDFLFEGSFCSPYADYGKESFDVL
RYCGSDFTCRVCLHGKDSLHLYKHAYSVEQFYKDAVNGISFTWNWLYMLFLLLFVKPVAG
FVIICYCIRYLVLSTTVLQTGVGFLDWFIQTVFANFNFMGAGFYFWLFYKVYIQVHHIMY
CKDITCEVCKRVARSNRHEVSVVVNGRKQLVHVYTNSGYSFCKRHNWYCRNCDKCGHQNT
FMSPEVAGELSEKLKRHVKPTAFAYHVVDDACLVDDFVNLKYNAATPGKEGVHSAVKCFS
VSDFLKKAVFLKDAQKCEQISNDSFIVCNTHSAHALEEAKNAAIYYAQCLCKPILILDQA
LYEQLVVEPVSKSVVDKVCSILSNIISVDSAALDYKAGTLRDALLSVTKDEEAVDMAIFC
HNNDVEYTSDGFTNVVPSYGIDTDKLTPRDRGFLINADASIANLKVKNSPPVVWKYSDLI
KLSDSCLKYLISATVKSGGRFFITRSGAKQVISCYTQKLLVEKKAGGIVSSTISSFKICC
KWLLVFYLIFTVCCLGYYQWEMSREFAHPMYDVNSTMHVEGFKVIDKGVLRDIVPEDTCF
SNKYVNFDSFWGKSYVNSRDCPIVTAIIDGSGTVAAGVPGFVQWVMDGVMFIHMTQTERK
PWYIPTWFNREIVGYTHDSIITEGEFYTSIALFASRCLYLTSSNTPQLYCFNGDNDAPGA
LPFASILPHRVYFQPNGVRLIVPQQIMHTPYVVKLVSDSYCRGSVCEVTKPGYCVSMNSQ
WVLFNDEYTIKPGVFCGSTVRELLFSMVSTFFTGVNPNIYMQLATMFLTLVAVVLVFAMV
IKFQGVFKAYAITVFTIMLVWLINAFILCVHSYNGVLAIILLVLYCYASLVTSRNTAIIM
HCWLVFTFGLIVPNWIACVYLGFVLYMYTPLLLWCYGTTKNCRKLYEGNDFVGNYDLAAK
STFVIRGPEFVKLTNEIGDKFEHYLSAYARLKYYSGTGSEQDYLQACRAWLAYALDQFRS
SGVEVVYTPPRYSIGVSRLQAGFKKLVSPSSAVEKCIVSVSYRGSNLNGLWLGDSIYCPR
HVLGKFSGDQWGDVLNLANNHEFEVVTGNGVTLSVVSRRLKGAVLILQTAIVNADTPKYK
FLKANCGDSFTIACSYGGIVIGLYPVTMRSNGTIRASFLAGACGSVGFNIEKGVVNFYYM
HHLELPNALHTGTDLMGEFYGGYIDEEVAQKVQPDKLVTNNILAWLYAAIISVRESSFST
PKWLESTTVSIEDYNKWAVDNGFTSFVSCTAITKLSAITGVDVCKLLRTIMVKSTQWGSD
PVLGQYNFEDEMTPESVFNQVGGVRLQSSVVKKAASWFWSRCVLACFLFVLCSIVLFTAL
PYRYYLYGAAVLFAAVLFISFTVKHVMAYMDTFLLPTLITVIIGVCAEVPFIYNTLISQI
VIFFSQWYDPVVFDTVVPWMFLPLVLYTAFKCIQGCYSVNSFNTSLLVLYQFMKLGFVIY
TSSNTLTAYSEGNWELFFELVHTTVLANVSSNSLIGLIVFKFAKWMLYYCNASYLNNYVL
MAVIINGIGWMFTCYFGFYWWINKVFGLTLGKYSFKVSVDQYRYMCLHKINPPKTVWEVF
STNILIQGIGGDRVLPIATVQSKLSDVKCTTVVLMQLLTKLNVEANSKMHAYLVELHNKI
LASDDVNECMDNLLGMLVTLFCVDSTIDLSEYCDDILKRSTVLQSVTQEFSHIPSYAEYE
RAKDLYEKVLAESKNGSVTQQELAAYRKAANIAKSIFDRDLAVQKKLDSMAERAMTTMYK
EARVTDRRAKLVSSLHALLFSMLKKIDSEKLNVLFDQASSGVVPLATVPIVCSNKLTLVV
PDPETWVKCVEGMHVTYSTVVWNIDNVIDADGTELQPISTGDGLAYCISGDNIAWPLKVN
LTRNVHNKVDAVLQNNELMPHGVKTKACVAGVDQAHCSVESKCYYTNISGNSVVAAITSS
NPNLKVASFLNEAGNQIYVDLDPPCKFGMKVGDKVEVVYLYFIKNTRSIVRGMVLGAISN
VVVLQSKGYETEEVDAVGILSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVHNGSGFA
ITSKPSPTPDQDSYGGASVCLYCRAHIAHPGSAGNLDGRCQFKGSFVQIPTTEKDPVGFC
LRNKVCTVCQCWIGHGCQCDSLRQSKPSVQSGAGAPGFDKNYLNGYGVAVRLG
Identical sequences A0A142IJR5
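
To relate a domain region such as 2781-3084 to the residues printed above, the wrapped sequence lines first have to be joined into one string. The sketch below shows one way to do that and to slice out a region, again assuming 1-based inclusive coordinates. The helper names are hypothetical, and only the first two sequence lines are embedded to keep the example short.

```python
def join_wrapped(lines):
    """Concatenate wrapped sequence lines into a single string."""
    return "".join(line.strip() for line in lines)


def region(sequence, start, end):
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# First two lines of the sequence shown above (60 residues each).
FIRST_LINES = [
    "MASSLKQGVSSQNQRSVILVAKDIPDQLRDALFFYTSHNPKDYADAFAFRQKFDRSLGAG",
    "KQFKFETVCGPFFLKGVDKITPGVPAKALKATSKLADLEDIFGVSPFARKYRELLKTACH",
]

if __name__ == "__main__":
    prefix = join_wrapped(FIRST_LINES)
    print(len(prefix))
    print(region(prefix, 58, 63))  # spans the break between the two lines
```

With all of the sequence lines supplied instead of two, the joined string should have the stated length of 3953, and `region(sequence, 2781, 3084)` would return the residues assigned to the first domain.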
