SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for NP_001321273.1.80155 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  NP_001321273.1.80155
Domain Number 1 Region: 1691-2004
Classification Level    Classification                                         E-value
Superfamily             Cysteine proteinases                                   1.57e-105
Family                  Calpain large subunit, catalytic domain (domain II)    0.00000427
Domain Number 2 Region: 2003-2149
Classification Level    Classification                                       E-value
Superfamily             Calpain large subunit, middle domain (domain III)    8.63e-30
Family                  Calpain large subunit, middle domain (domain III)    0.00091
Domain Number 3 Region: 1425-1610
Classification Level    Classification                            E-value
Superfamily             Concanavalin A-like lectins/glucanases    0.000000000000114
Family                  Laminin G-like module                     0.036
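The three strong hits above are listed by domain number, not by position along the protein. As a minimal sketch (not part of the SUPERFAMILY output), the following Python snippet puts them in sequence order and reports how adjacent regions relate. The names `Domain`, `domain_length`, and `adjacent_overlaps` are hypothetical helpers introduced for illustration. The coordinates, E-values, and sequence length are copied from this record, and the regions are assumed to be 1-based and inclusive.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    number: int       # "Domain Number" as listed in the record
    start: int        # assumed 1-based, inclusive
    end: int          # assumed 1-based, inclusive
    superfamily: str
    evalue: float     # superfamily-level E-value


# Value of the "Sequence length" field of this record.
SEQUENCE_LENGTH = 2151

# Strong hits copied from the domain assignment details.
DOMAINS = [
    Domain(1, 1691, 2004, "Cysteine proteinases", 1.57e-105),
    Domain(2, 2003, 2149,
           "Calpain large subunit, middle domain (domain III)", 8.63e-30),
    Domain(3, 1425, 1610,
           "Concanavalin A-like lectins/glucanases", 1.14e-13),
]


def domain_length(domain: Domain) -> int:
    """Number of residues covered by an inclusive region."""
    return domain.end - domain.start + 1


def adjacent_overlaps(domains: list[Domain]) -> list[tuple[int, int, int]]:
    """Compare neighbouring domains in sequence order.

    Returns (left number, right number, shared) tuples, where a positive
    `shared` is the count of residues in both regions and a negative
    value is the size of the gap between them.
    """
    ordered = sorted(domains, key=lambda d: d.start)
    result = []
    for left, right in zip(ordered, ordered[1:]):
        shared = left.end - right.start + 1
        result.append((left.number, right.number, shared))
    return result


if __name__ == "__main__":
    for d in sorted(DOMAINS, key=lambda d: d.start):
        print(d.number, d.start, d.end, domain_length(d), d.superfamily)
    for left, right, shared in adjacent_overlaps(DOMAINS):
        kind = "overlap" if shared > 0 else "gap"
        print(f"domains {left} and {right}: {kind} of {abs(shared)} residues")
```

Under these assumptions the order along the protein is domain 3, domain 1, domain 2. Domains 1 and 2 share two residues (2003-2004), and 80 residues separate domain 3 from domain 1.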
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) NP_001321273.1.80155
Sequence length 2151
Comment calpain-type cysteine protease family [Arabidopsis thaliana]; AA=GCF_000001735.3; RF=reference genome; TAX=3702; STAX=3702; NAME=Arabidopsis thaliana; ecotype=Columbia; AL=Chromosome; RT=Major
Sequence
MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI
LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT
QWQSSRAVALLLLLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFI
CRMVFNGNGLDVDEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLY
LGSLVVLLAYSVLYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAG
ISRLFLICFGIHYWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLRE
GFRRKEQNSSSSSSDGCGSSIKRSSSIDAGHTGCTNEANRTAESCTADNLTRTGSSQEGI
NSDKSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYES
STSNSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVML
KEKNLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKW
LKLSRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLL
SPVVCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVP
LMAACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIKGFILWICVVLFAGSVISLGA
IISAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFS
HSSAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMV
KWKDDCWILSRGVYVFFSIGLLLLFGAIAAVIAVKPWTIGVSFLLVLFLMVVTIGVIHLW
ASNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLL
SPPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAA
VSAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQR
SASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEP
PTNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLF
LDSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKAL
LEKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIR
TAQLERRAQQTGISGAVCVLDDEPMISGKHCGQMDSSVCQSQKISFSVTAMIQSDSGPVC
LFGTEFQKKVCWEILVAGSEQGIEAGQVGLRLITKGERQTTVAREWYIGATSITDGRWHT
VTITIDADAGEATCYIDGGFDGYQNGLPLSIGSAIWEQGAEVWLGVRPPIDVDAFGRSDS
DGVESKMHIMDVFLWGKCLSEEEAASLHAAIGMADLDMIDLSDDNWQWTDSPPRVDGWDS
DPADVDLYDRDDVDWDGQYSSGRKRRSGRDFVMSVDSFARRHRKPRMETQEDINQRMRSV
ELAVKEALSARGDKQFTDQEFPPNDRSLFVDTQNPPSKLQVVSEWMRPDSIVKENGSDSR
PCLFSGDANPSDVCQGRLGDCWFLSAVAVLTEVSRISEVIITPEYNEEGIYTVRFCIQGE
WVPVVIDDWIPCESPGKPAFATSRKLNELWVSMVEKAYAKLHGSYEALEGGLVQDALVDL
TGGAGEEIDLRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHVSSSGIVQGHA
YSVLQVREVDGHRLVQIRNPWANEVEWNGPWSDSSPEWTDRMKHKLKHVPQSKEGIFWMS
WQDFQIHFRSIYVCRVYPREMRYSVNGQWRGYSAGGCQDYSSWHQNPQFRLRATGSDASL
PIHVFITLTQGVGFSRTTPGFRNYQSSHDSQLFYIGLRILKTRGRRAAYNIFLHESVGGT
DYVNSREISCEMVLDPDPKGYTIVPTTIHPGEEAPFVLSVFTKASIVLEAL
Identical sequences Q8RVL2
AT1G55350.1 AT1G55350.2 AT1G55350.3 AT1G55350.4 3702.AT1G55350.4-P NP_001319240.1.80155 NP_001321273.1.80155 NP_175932.2.80155 NP_850966.1.80155 NP_850967.1.80155
