SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for F4MIZ5 from the UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  F4MIZ5
Domain Number 1 Region: 2782-3085
Classification Level  Classification                           E-value
Superfamily           Trypsin-like serine proteases            4e-124
Family                Viral cysteine protease of trypsin fold  0.000000529
 
Domain Number 2 Region: 3506-3664
Classification Level  Classification         E-value
Superfamily           Coronavirus NSP8-like  7.06e-60
Family                Coronavirus NSP8-like  0.0000221
 
Domain Number 3 Region: 3797-3917
Classification Level  Classification          E-value
Superfamily           Coronavirus NSP10-like  3.4e-50
Family                Coronavirus NSP10-like  0.00000743
 
Domain Number 4 Region: 3677-3785
Classification Level  Classification  E-value
Superfamily           Replicase NSP9  5.62e-40
Family                Replicase NSP9  0.00023
 
Domain Number 5 Region: 3382-3464
Classification Level  Classification         E-value
Superfamily           Coronavirus NSP7-like  7.06e-31
Family                Coronavirus NSP7-like  0.00033
 
Domain Number 6 Region: 1022-1176
Classification Level  Classification     E-value
Superfamily           Macro domain-like  8.75e-29
Family                Macro domain       0.00072
 
Weak hits

Sequence:  F4MIZ5
Domain Number - Region: 673-743
Classification Level  Classification  E-value
Superfamily           NSP3A-like      0.000445
Family                NSP3A-like      0.0082
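The hits above are listed in order of superfamily E-value, not by position along the polyprotein. The following is a minimal sketch, not part of the SUPERFAMILY output, that re-lists them by start residue, checks for overlapping regions, and estimates how much of the 3953-residue sequence the strong hits cover. The regions and E-values are copied from the listing above; the names `Domain`, `span`, `by_position`, `overlapping_pairs`, and `coverage` are hypothetical helpers introduced here, and the regions are assumed to be 1-based and inclusive.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    superfamily: str
    start: int      # first residue (assumed 1-based, inclusive)
    end: int        # last residue (assumed inclusive)
    evalue: float   # superfamily-level E-value from the listing
    strong: bool    # False for the single weak hit


SEQ_LEN = 3953  # "Sequence length" reported for F4MIZ5

DOMAINS = [
    Domain("Trypsin-like serine proteases", 2782, 3085, 4e-124, True),
    Domain("Coronavirus NSP8-like", 3506, 3664, 7.06e-60, True),
    Domain("Coronavirus NSP10-like", 3797, 3917, 3.4e-50, True),
    Domain("Replicase NSP9", 3677, 3785, 5.62e-40, True),
    Domain("Coronavirus NSP7-like", 3382, 3464, 7.06e-31, True),
    Domain("Macro domain-like", 1022, 1176, 8.75e-29, True),
    Domain("NSP3A-like", 673, 743, 0.000445, False),
]


def span(d: Domain) -> int:
    """Number of residues in an inclusive region."""
    return d.end - d.start + 1


def by_position(domains):
    """Return the domains sorted by start residue."""
    return sorted(domains, key=lambda d: d.start)


def overlapping_pairs(domains):
    """Return position-adjacent pairs whose regions overlap.

    Checking neighbours in sorted order is enough to detect whether
    any overlap exists, although it does not list every overlapping pair.
    """
    ordered = by_position(domains)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b.start <= a.end]


def coverage(domains, seq_len):
    """Fraction of the sequence covered; valid only if no regions overlap."""
    return sum(span(d) for d in domains) / seq_len


if __name__ == "__main__":
    for d in by_position(DOMAINS):
        tag = "strong" if d.strong else "weak"
        print(f"{d.start:>5}-{d.end:<5} {span(d):>4} aa  {tag:<6} {d.superfamily}")
    strong = [d for d in DOMAINS if d.strong]
    print(f"overlapping pairs: {len(overlapping_pairs(DOMAINS))}")
    print(f"strong-hit coverage: {coverage(strong, SEQ_LEN):.1%}")
```

Sorted this way, the NSP7-like, NSP8-like, NSP9 and NSP10-like hits appear consecutively near the C-terminus, which is harder to see in the E-value ordering.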
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) F4MIZ5
Sequence length 3953
Comment (tr|F4MIZ5|F4MIZ5_9GAMC) Replicase polyprotein 1a {ECO:0000313|EMBL:ADA83575.1} KW=Complete proteome OX=11120 OS=Infectious bronchitis virus. GN= OC=Nidovirales; Coronaviridae; Coronavirinae; Gammacoronavirus.
Sequence
MASSLKQGVSPKLRDVILVSKDIPEQLCDALFFYTSHNPKDYADAFAVRQKFDRNLQTGK
QFKFETVCGLFLLKGVDKITPGVPAKVLKATSKLADLEDIFGVSPFARKYRELLKTACQW
SLTVETLDARAQTLDEIFDPTEILWLQVAAKIQVSAMAMRRLVGEVTAKVMDALGSNMSA
LFQIFKQQIVRIFQKALAIFENVSELPQRIAALKMAFAKCAKSITVVVMERTLVVREFAG
TCLASINGAVAKFFEELPNGFMGAKIFTTLAFFREAAVKIVDNIPNAPRGTKGFEVVGNA
KGTQVVVRGMRNDLTLLDQKAEIPVESEGWSAILGGHLCYVFKSGDRFYAAPLSGNFALH
DVHCCERVVCLSDGVTPEINDGLILAAIYSSFSVAELVAAIKRGEPFKFLGHKFVYAKDA
AVSFTLAKAATIADVLKLFQSARVKVEDVWSSLTEKSFEFWRLAYGKVRNLEEFVKTCFC
KAQMAIVILATVLGEGIWHLVSQVIYKVGGLFTKVVDFCEKYWKGFCAQLKRAKLIVTET
LCVLKGVAQHCFQLLLDAIQFMYKSFKKCALGRIHGDLLFWKGGVHKIIQEGDEIWFDAI
DSIDVEDLGVVQEKLIDFDVCDNVTLPENQPGHMVQIEDDGKNYMFFRFKKDENIYYTPM
SQLGAINVVCKAGGKTVTFGETTVQEIPPPDVVFIKVSIECCGEPWNTIFKKAYKEPIEV
ETDLTVEQLLSVVYEKMCDDLKLFPEAPEPPPFENVTLVDKNGKDLDCIKSCHLIYRDYE
SDDDIEEEDAEECDTDSGDAEECDTNSECEEEDEDTKVLALIQDPASNKYPLPLDDDYSV
YNGCIVHKDALDVVNLPSGEETFVVNNCFEGAVKALPQKVIDVLGDWGEAVDAQEQLCQQ
ESTRVISEKSVEGFTGSCDAMAEQAIVEEQEIVPVVEQSQDVVVFTPADLEVVKETAEEV
DEFILISAVPKEEVVSQEKEEPQVEQEPTLVVKAQREKKAKKFKVKPATCEKPKFLEYKT
CVGDLAVVIAKALDEFKEFCIVNAANEHMSHGGGVAKAIADFCGPDFVEYCADYVKKHGP
QQKLVTPSFVKGIQCVNNVVGPRHGDSNLREKLVAAYKSVLVGGVVNYVVPVLSSGIFGV
DFKISIDAMREAFKGCAIRVLLFSLSQEHIDYFDATCKQKTIYLTEDGVKYRSVVLKPGD
SLGQFGQVFARNKVVFSADDVEDKEILFIPTTDKTILEYYGLDAQKYVTYLQTLAQKWDV
QYRDNFVILEWRDGNCWISSAIVLLQAAKIRFKGFLAEAWAKLLGGDPTDFVAWCYASCN
AKVGDFSDANWLLANLAEHFDADYTNALLKKCVSCNCGVKSYELRGLEACIQPVRAPNLL
HFKTQYSNCPTCGASSTDEVIEASLPYLLLFATDGPATVDCDENAVGTVVFIGSTNSGHC
YTQADGKAFDNLAKDRKFGRKSPYITAMYTRFSLRSENPLLVVEHSKGKAKVVKEDVSNL
ATSSKASFDDLTDFEQWYDSNIYESLKVQETPDNLDEYVSFTTKEDSKLPLTLKVRGIKS
VVDFRSKDGFTYKLTPDTDENSKTPVYYPVLDSISLRAIWVEGSANFVVGHPNYYSKSLR
IPTFWENAESFVKMGYKIDGVTMGLWRAEHLNKPNLERIFNIAKKAIVGSSVVTTQCGKI
LVKAATYVADKVGDGVVRNITDRIKGLCGFTRGHFEKKMSLQFLKTLVFFFFYFLKASAK
SLVSSYKIVLCKVVFATLLIVWFIYTSNPVVFTGIRVLDFLFEGSLCGPYNDYGKDSFDV
LRYCAGDFTCRVCLHDRDSLHLYKHAYSVEQIYKDAASGINFNWNWLYLVFLILFVKPVA
GFVIICYCVKYLVLSSTVLQTGVGFLDWFVKTVFTHFNFMGAGFYFWLFYKIYVQVHHIL
YCKDVTCEVCKRVARSNRQEVSVVVGGRKQIVHVYTNSGYNFCKRHNWYCRNCDDYGHQN
TFMSPEVAGELSEKLKRHVKPTAYAYHVVYEACVVDDFVNLKYKAAIPGKDNASSAVKCF
SVTDFLKKAVFLKEALKCEQISNDGFIVCNTQSAHALEEAKNAAVYYAQYLCKPILILDQ
ALYEQLIVEPVSKSVIDKVCSILSNIISVDTAALNYKAGTLRDALLSITKDEEAVDMAIF
CHNHEVEYTGDGFTNVIPSYGMDTDKLTPRDRGFLINADASIANLRVKNAPPVVWKFSDL
IKLSDSCLKYLISATVKSGGRFFITKSGAKQVISCHTQKLLVEKKAGGVINNTFKWFMSC
FKWLFVFYILFTACCLGYYYMEMNKSFVHPMYDVNSTLHVEGFKVIDKGVIREIVSEDNC
FSNKFVNFDAFWGKSYENNKNCPIVTVVIDGDGTVAVGVPGFVSWVMDGVMFVHMTQTDR
RPWYIPTWFNREIVGYTQDSIITEGSFYTSIALFSARCLYLTASNTPQLYCFYGDNDAPG
ALPFGSIIPHRVYFQPNGVRLIVPQQILHTPYIVKFVSDSYCRGSVCEYTKPGYCVSLDS
QWVLFNDEYISKPGVFCGSTVRELMFNMVSTFFTGVNPNIYIQLATMFLILVVIVLIFAM
VIKFQGVFKAYATIVFTIMLVWVINAFVLCVHSYNSVLAVILLVLYCYASLVTSRNTAII
MHCWLVFTFGLIVPTWLACCYLGFILYMYTPLVFWCYGTTKNTRKLYDGNEFVGNYDLAA
KSTFVIRGTEFVKLTNEIGDKFEAYLSAYARLKYYSGTGSEQDYLQACRAWLAYALDQYR
NSGVEVVYTPPRYSIGVSRLHAGFKKLVSPSSAVEKCIVSVSYRGNNLNGLWLGDSIYCP
RHVLGKFSGDQWGDVLNLANNHEFEVVTQNGVTLNVVSRRLKGAVLILQTAVANAETPKY
KFVKANCGDSFTIACSYGGTVIGLYPVTMRSNGTIRASFLAGACGSVGFNIEKGVVNFFY
MHHLELPNALHTGTDLMGEFYGGYVDEEVAQRVPPDNLVTNNIVAWLYAAIISVKESSFS
QPKWLESTTVSIEDYNRWASDNGFTPFSTSTAITKLSAITGVDVCKLLRTIMVKSAQWGS
DPILGQYNFEDELTPESVFNQVGGVRLQSSFVRKATSWFWSRCVLACFLFVLCAIVLFTA
VPLKFYVHAAVILLMAVLFISFTVKHVMAYMDTFLLPTLITVIIGVCAEVPFIYNTLISQ
VVIFLSQWYDPVVFDTMVPWMLLPLVLYTAFKCVQGCYMNSFNTSLLMLYQFMKLGFVIY
TSSNTLTAYTEGNWELFFELVHTIVLANVSSNSLIGLIVFKCAKWILYYCNATYFNNYVL
MAVMVNGIGWLCTCYFGLYWWVNKVFGLTLGKYNFKVSVDQYRYMCLHKVNPPKTVWEVF
TTNILIQGIGGDRVLPIATVQSKLSDVKCTTVVLMQLLTKLNVEANSKMHAYLVELHNKI
LASDDVGECMDNLLGMLITLFCIDSTIDLGEYCDDILKRSTVLQSVTQEFSHIPSYAEYE
RAKSIYEKVLADSKNGGVTQQELAAYRKAANIAKSVFDRDLAVQKKLDSMAERAMTTMYK
EARVTDRRAKLVSSLHALLFSMLKKIDSEKLNVLFDQANSGVVPLATVPIVCSNKLTLVI
PDPETWVKCVEGVHVTYSTVVWNIDCVTDADGTELHPTSTGSGLTYCISGDNIAWPLKVN
LTRNGHNKVDVALQNNELMPHGVKTKACVAGVDQAHCSVESKCYYTSISGSSVVAAITSS
NPNLKVASFLNEAGNQIYVDLDPPCKFGMKVGDKVEVVYLYFIKNTRSIVRGMVLGAISN
VVVLQSKGHETEEVDAVGILSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVHNGSGFA
ITSKPSPTPDQDSYGGASVCLYCRAHIAHPGGAGNLDGRCQFKGSFVQIPTTEKDPVGFC
LRNKVCTVCQCWIGYGCQCDSLRQPKPSVQSVAVASGFDKNYLNGYGVAVRLG
Identical sequences F4MIZ5
