SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for gi|427729043|ref|YP_007075280.1| from Nostoc sp. PCC 7524

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|427729043|ref|YP_007075280.1|
Domain Number 1 Region: 4899-5199
Classification Level Classification E-value
Superfamily Subtilisin-like 3.67e-53
Family Subtilases 0.0000882
Further Details:      
 
Domain Number 2 Region: 5274-5383
Classification Level Classification E-value
Superfamily CalX-like 4.71e-24
Family CalX-beta domain 0.0011
Further Details:      
 
Domain Number 3 Region: 4655-4747
Classification Level Classification E-value
Superfamily Hypothetical protein PA1324 7.59e-24
Family Hypothetical protein PA1324 0.01
Further Details:      
 
Domain Number 4 Region: 4752-4852
Classification Level Classification E-value
Superfamily Hypothetical protein PA1324 3.01e-19
Family Hypothetical protein PA1324 0.011
Further Details:      
 
Domain Number 5 Region: 398-714
Classification Level Classification E-value
Superfamily Pectin lyase-like 3.33e-16
Family Virulence factor P.69 pertactin 0.064
Further Details:      
 
Domain Number 6 Region: 5388-5440,5486-5579
Classification Level Classification E-value
Superfamily beta-Roll 0.00000000000000196
Family Serralysin-like metalloprotease, C-terminal domain 0.0011
Further Details:      
 
Weak hits

Sequence:  gi|427729043|ref|YP_007075280.1|
Domain Number - Region: 81-148,285-370
Classification Level Classification E-value
Superfamily Kelch motif 0.000119
Family Kelch motif 0.018
Further Details:      
 
Domain Number - Region: 3536-3596
Classification Level Classification E-value
Superfamily Cna protein B-type domain 0.000149
Family Cna protein B-type domain 0.008
Further Details:      
 
Domain Number - Region: 4572-4639
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00805
Family Fibronectin type III 0.003
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|427729043|ref|YP_007075280.1|
Sequence length 5626
Comment CARDB domain-containing protein,putative collagen-binding protein,Calx-beta domain-containing protein,subtilase family protease [Nostoc sp. PCC 7524]
Sequence
MHTVSHETWQNLTPVPFTYGVSAGGGLATDGVYIYAADFSADSDDDYIDLDGDLIDDPEE
RLDALRLDALVITNGSVRFGRYNPDTDSWESLPTLNLTGVDGDAFSSGNLVNPLFVVGSK
LYYYQFRSGPNIRALYSYDLTSGVEGTWQSVWEKTSADNPLIDANAGIVGLDVDGQPVIL
HHTGGGDYNFARTDDIADGGTHTLLTPSWNFSGAHFPRGGDWEYDTSSDRLYHLSGDQLV
MWSHNDTNYPDGSFLTSTPDGTNPIAQETIVITSLADNLGWNTGSTQAYPGTSLWGNSVT
IANDSLYLIRGETSTDGWPFNEGRGIINNGNFARILPNGWLETLPNTPFNIGKGSSAVYL
NGYLYVTQGDTLTANDPDNVSPLNQEGIRSPGKGFARFAIAASPNVGTTYTVTNTDDSGE
GSLRWAIEQANTNVGTDTILFNIPGIEPQTINVSSQLPEITEAVFLDATSQPTYQGSPVI
VLNGSAAGADAVGLNITAGNSTIRGLAIYGFSGWGMRLTGGGNNTIQGNIIGDGGGGIYI
TSALNLIGGATVTESNAIANNNNIGISISALTATGNQILNNLIQNNDAGVQISDGASNNL
ITENVISNNTNDGIAIVAGESDATNNAIFGNNIYDNGGLGIDLGNDGVTTNDEGDSDGGA
NALQNYPVIISATLEGENTLIAGELNSLPSSTYRIEIFSNSSIDISGNGEGENLLTSLVV
TTDESGNASFSVTLPVMLPEEYYITATATDALGNTSEFSAPIQIINNNLPDLIVGEVTAA
DTAALGETISVSWTITNQGLGSAINSWYDEIYISDDQIFDDNDTYVEYNWSGEYIPLEPG
SSYTLNQDIYIPEYTTGGLRYLLFVTDVYDDQEEADETNNVVAQAINITAPNLVITSVTA
PTTAITGQSIDLTWVVTNQGAAPTSIDNSWYDYIYFSRNDILGDDDDVYITNIWSSDYAS
LPLNPAENYTVEQSVFLPSEALGSGYLLFATDRWEYQGESHETDNVFAQAIDIAAPDLIV
SAATAPPSGVVGEIISVSWTVRNQGTVTASRDWQDRIYISDDETLDGSDTFITSQSITSQ
TPLAADASYSITQDITLPNTTLGNRYLLFVADGNNAQGETDETNNVQAVAIELSAPDLVV
STATAPASGVVGEIISVSWTVRNQGAVTAFRDWQDRIYISDDETFDGSDTFIVSQSITSQ
TPLAADGSYNITRTITLPNTTLGNRYLLFVADGNNAQSETDETNNVRAVAIELGAPDLVV
TAATAPASGVVGEIISVSWTVTNQGTVTASRDWQDRIYISDDETFDGSDIFITSRSITSQ
TPLAADASYSITQDITLPNTTLGNRYLLFVADGNNAQGETDETNNVQAVAIELSAPDLVV
STATAPASGVVGEIISVSWTVRNQGAVTAFRDWQDRIYISDDETFDGSDTFIVSQSITSQ
TPLAADGSYNITRTITLPNTTLGNRYLLFVADGNNAQSETDETNNVRAVAIELGAPDLVV
TAATAPASLTLGGTTDLSWTVSNIGNSPAPSDWFDRVYLSNDTTFDSSDTLLTSQSAAAS
TPLAVGDSYTLSATNVTVPLTDTGDRYLLFVADATSNQGETNENNNVFSLPVTIAGALPG
ARSATVSGDVFLGGNYIELGLSRWGSFGTNATKPSNFYGTNARSQIGMSADFDGFLNGQD
LRFDYFLPGSPEERWVIGYQAGGSTFTASNAARTGSTQISNSVTNTSTSDQLSAVSTGSY
NNTLGITQQIGFRVDDKYFRNIVTLTNISAQTLNSVRYMRSVDPDNTVDLGGSFTTDNIV
QATIAEDGRAIVEARTTSDSDPVFQRTGSRAPIFFYSNDPRAVASTFGFTNTNPYAPLAY
DTPATRGQTIRDDQAITLTFDVGTLLPGEAATFVYYTSLDDRNFEDVIEELERPDLTVTS
VTPGANPAEFGSSLDISWVIANNGLRETTGGWTDRIYLSSDATLSGDDLLLSSLPTGETT
LFPSGNISRTTSINLPFDDNSTAGSYYILVQADALGNQLESVETNNVRAAAIELTLPPLP
DLVISDATAPASLILGQTGSISWTVTNQGVGVASEDWSDRVYLSVDDILDDSDILLLTET
IASQTPLDAASSYTISRNFTPGSTLTPGAYNLLIVTDSDRIQRESDETNNLRFVPLSINA
PDLVVSSITAPVESVSGQPLEISWTVTNQGQAATGGTWTDYVYLVNADTGAFVRNVGNFS
FTGTLAAGASINRTQSYNVPLELAGNFRVVVRTDNNNNIPEGTPNEANNTTIGDRPITIR
LAPIPNLQISSVTTPTTAFSSQQTVVSWQVTNTGNGATSAPIWYDAVYLSLDTTFDDTDV
FLGEAANSSYLNSGESYTNSLSVTLPRGIDSNYYFLVKTDSRNNVNELSNEGDNFGVGGP
TRINLTPPPDLQVTNVNAPAMTFSGQPMNLSWTVTNEGPGRTLETAWYDRIFMSEDEILD
SGDRSLAEIFRNGALNSGESYTASTTVNLPIGVSGNYYFFVRTDSRNQVYENIFENNNVN
YDTTATVITLTPPPDLEVDFVTIPNTARAGSNITINYGVTNYGASETPNFSWQDTFYLSL
DNQFDPTTDIRLGNVTRFGALNPDQGYERTVNFTLSNTLAGNYYLFVTTDSSDQVFELDN
ANNTRRSNNQVQITASPADLVVNTTTVPTTGEAGKSLRVQWSVSNQGTGDTIVTSWTDRI
IASTDGVLGNADDVVLANFNRTGILNPGNSYSREEFVNLPFALEGNYQLFVVTDAGNTVY
EASNENNNASVAIPVTISRQTPDLQVTQINVPATVSSGTPLTISWTVQNLGTGRTNSNFW
YDDVYLSVDPNISASDIKLGSFYRSGALEPTVEYTATANFNLPIDLNGSYYVIVRTDRNN
LVTEGALENNNDQASDSTVAVSLSPVPDLVVDAVDAPEQAIAGQPFSLTWTVTNNGAVTT
GTWYDAFYLSRDQVFDRNSDIYLGFQNRTALGAGESYTTTQAFNLPRGLAGLFYVFAVTD
GGNAIYERNGENNNTNYDGFSMEVILPPPSDLVVTDIVVPTNAVPGQPITINYTVQNQGT
DAALGSWYDAVYISADNQWDIGDALVGQVLRSGTIASGNSYSGTITANLPGVTPGNYYAI
VRSDIRNQVPEINEANNTGVSTNQVSIDAEQLIIDVADTATLRQGQSIYYRFNATAGQTI
RLRLDSQNNQSFNELYVRQGAMPSRGEFDVTGIEPFTADPEILLPITQDGTYYVLAYGAQ
ATSPAEYTIVAEDVPLSILDVSTNRIGNSGSATLSILGARFTEDTIFQLISPDGDVIAAG
QVYLANSTQAFATFDLFNQDVGLYDVRAIQGFDASARLEDVVTVEAGTGFRLSSSLNGQQ
EVRPNRNYLFNVDYGNAGDSDTKAPLLIVQSATNTPLGLELNNLGAGAPLHLLGVSFDGP
LNILRPGDTNSIPVYFNSATNEVNFSISTYSVENSTSIDLNSFEASIRPAGLTDAQWNSF
IADIAPRVQTYGQYANLLNDLSEQLSGTGQPIYDVRELFARAYSENANFFATATLAGELR
NASNNSAIANTEIAAYRLIDGGMELAGITTTNDQGQFQFSYLTNGQYEIIPTAPYVFDNN
RDSIPDSVRPTFTITDTSVVSAGTLYADIPTAAPPIIQESTPSLARDSAGNLHLLWTREG
QLWHAVNNGSGFEAQPLSQAFGSDVKLLINPNLINGTSEGLIATWSSGEGNESEIYYAIG
QATAGGGYQWTAPIQLTNNSLYDGAFDLEITDNGTPLFIWQRQDFSIEDDADLYYGGITV
ENPEFLDPLSNVTILTTVANALQYSDAELEALGIHRVRYAQDLGRITIPSWVPFISGTYE
AQFRADLVGQIDCTLILGANGQLRFKVGDNGELTGEVGGQARWTTNDDCEYEFQNARIRG
VVAGGLNIPLTDYRLGPFAGIKIGPRIELAGGANLIWDAGSDFPSWPSRVEGSFRGSLGA
NVEGKVSIPILGEVKVTGRVLGNFNFKADANGLGPADPFFSVSVLLRAEATVFGRKRFID
WTGTWPGQDLGGQSVDGFMPSDLFGFSEQTFYDETLIFSISTDVDTGTLNDYSDDNASAL
TANLYEEGTPVLTKDNNGILYAGWTNQDGVVISQLNAATGQWLAPSVIAGSAGLANSDLV
LAFDGDGDGLAVWSSQDVSGLNENNSQEEIRNSFIVGGDLVFSVYDATTTTWSSASPLFT
LTGADRSLALGKDTDGNLVASWLHDHNDRQSTLYATIWDADTNVWSAPATIASGFFSGQP
SVSALGAQPIIMWTQDDEAGINVVQANQSLKYSLFDGVAWSTPLDFTYSINEALAQNAIS
IFSDSSDFSELFGSSIPLPSPPEECCDCEEGDPECDDDDDDNNDDDNYDPPVRRPSDPND
ILGPEGFGDERWINARNPLAYTIRFENEPTASAPAQEVIITQYLDADLDWRTFRIDDFGW
GDLRFDLPGDRSFYSNRIDLRATQGFYVDVSASIDVLTGLATWRIATVDPTTGEAPLDAQ
TGFLPINDENGLGEGFVSYSVRTKRDITTGSVIDAEARIVFDTEAPIDTPPIFNTIDVGV
PTSSVTSLPEVSDSPEFLVTWSGSDDPNGSALAGYTIYVSVNAGAFTPWLTNTTLTEATY
IGELGNTYSFYSVAADHAGNVEAPPTTADALIRVAGGLASLGDLVWVDSNANGLRDADEP
GLAGVSINLYDGSNTLVATTTTDTNGIYSFTDLNPGDYVVEFVPGTGYLFSPQNQGSNSA
LDSDADPTSGRSLTVTLNPRDNNLTVDAGVYQFATIQGQKFHDLDGDGVKDPTEPGLSGW
TIYLDANRNGQLDTSEISTVTDANGNYIFTDLKPGTYNVAEVMQPGWQQTFPGTSNSNAA
ISTSASDAELYTPSASVTTTATSASPTASVSSLINLSNFRADPRFTDIDGSGFAVVTIDT
GIDLNHPFFGPDLDGNGIADRIVYQHDFGDRDNNASDLNGHGSHVASIIGSSDNTYTGVA
PGVNLIALKVFRDNGSGYFSDLEKSLQWVVANAQTHNIASVNLSLGDGRNWSTAGSYYGI
GDELAALAAMNIIVTAAAGNSFYEFNSWQGVAYPAADPNTIAVGAVWSDNLGGIWNFSNG
AQDYTTAADRIASFSQRDADLLDVFAPGTRIIGANANGGTSTLTGTSQAAPYIAGVAALA
QQLAVQQLGRQITVAEFRELLATTGVMINDGDNEDDNVTNTGINFARVNVLALAEKILTL
SPEAPTSGTTGSGTTGPGSTPTLGAQSLTHTINLTSGQVVTGIDFGNQRLAEMATLTFSA
AAYSVNEDSTSVTSITITRTSSNGTLTATLTLADGTATAPADYDNTPIIVEFADGEASKI
VTIPIIDDAVVEGNETLSISLGASSSYELGDITTATVTIVDNDIASVTITESDGSTNVTE
GGATDSYTVVLTSQPTADVSITINGDSQVSTDVNTLTFTAANWNVAQTVTVTAVDDEAFE
GNHSSTLTMTAASSDAKYDGIAIASVNVNITDNDTIINGTSGSDTLIGTNGNNIITGFQG
ADILTGGEGKDQFVYTSIRDAGDTITDFVPGTDTIVLTQLFQSLRLSHLNYETATAQGYL
RFGTNSAGGTVLIDPDGFIGRAATTTLLTVQGVSQNDLASVNNFIF
Download sequence
Identical sequences K9QQJ2
gi|427729043|ref|YP_007075280.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]