SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|16329757|ref|NP_440485.1| from Synechocystis sp. PCC 6803

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|16329757|ref|NP_440485.1|
Domain Number 1 Region: 1308-1484
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 8.04e-20
Family Clostridium neurotoxins, the second last domain 0.015
Further Details:      
 
Domain Number 2 Region: 1993-2303
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 6.67e-19
Family Integrin alpha N-terminal domain 0.0032
Further Details:      
 
Domain Number 3 Region: 2300-2635
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 9.16e-17
Family Integrin alpha N-terminal domain 0.0063
Further Details:      
 
Domain Number 4 Region: 3421-3514
Classification Level Classification E-value
Superfamily CalX-like 3.01e-16
Family CalX-beta domain 0.0016
Further Details:      
 
Domain Number 5 Region: 1704-1938
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 0.0000000000000575
Family Integrin alpha N-terminal domain 0.0011
Further Details:      
 
Domain Number 6 Region: 2615-2709,2862-2884,2917-3191
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 0.000000000183
Family Integrin alpha N-terminal domain 0.0034
Further Details:      
 
Domain Number 7 Region: 3172-3330
Classification Level Classification E-value
Superfamily beta-Roll 0.000000000366
Family Serralysin-like metalloprotease, C-terminal domain 0.0055
Further Details:      
 
Weak hits

Sequence:  gi|16329757|ref|NP_440485.1|
Domain Number - Region: 2356-2416,2701-2887
Classification Level Classification E-value
Superfamily Sialidases 0.000186
Family Sialidases (neuraminidases) 0.045
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|16329757|ref|NP_440485.1|
Sequence length 3972
Comment integrin subunit alpha [Synechocystis sp. PCC 6803]
Sequence
MSITLSDRLSAVVQDSAGTSHIVWLEGTNIWHAVYDPVSQTWKNAQAIVNTAGQNIRSLN
LVADSGLIVESGTSNNTLIPGVAVIYQEGLENDSNFFYTAARYDNVGKLQWLEAPQALTA
DQVGDLEPRAVVDNGVITVVGQKVDLVKAQNQAIREDADPYYQTFQIYSNQFSTIPSTPV
TPIAAYAPQITKDGVVQGNYIQRDQNTVAALPSATYQPLTLAETQSSSFQGWGANWTVSQ
TFDTNLSNLKLFKGMAGVPEALLTPIFKKFNITGTLLGSSGNNPNLSFFGGGISTEGILL
NASMAVQFNEDDQDTNNTSSGTAGSSLNPAENTTQLSRKPEISLSAIAATLYTFDKSLGE
DNLSYPLTVETGSLGLTVGFTFPIPIEGTPILFNFLGSAGLGMQWQLSPKDPQTYVSPVG
AVLSPTGGDSDAVLAVLFNVPPLGAITAIGLAVISDITEVVELIFTDVDPDGNEAIELES
LLFSIPIATGIDGGIKIPYVFEALAGIQASLVGTVGGTRSADNGAFQYDRDLSLGFPMNI
TIKFLGFLGGTVGIYPYWTWGDDPSSSASNSDFATLNQASADSPTAIVRGPLLEIQGLQT
NGTPQATDFQVNVKDPLGQTTTVPVFGVITQGNTTILRLESAIPQTNLNDQPFSNIKVTY
QNNAPIPVVNQSANTFTYNYNPISGTNSNYQVTGQQVVIAFNATLNPNIIPDSSLFQVTD
TNGNSISISSVIVTSRSVILTLSGSSSTYSVSYSGSSSSGNVLTTEDNVAISDFKIASNT
PPSTAGQVNRTYSNVPAGTTSAITTNPLLGSTVAQDYAEDSPPALAVTDSGVLLAWSSDT
PPITPISALLSGTTIILTFADTLVNLINTNKATSNPFTVLINNQSVEIQDSSSTGNYVAI
SLNSSTVIDADDTISVSYSIDSTLENSLFNLYLSDATDFALWVPDFTTPVTQAASSTNAP
TLLEAVAISNVITLTFDQILSSSNLPPGSQFKVTVNNSDNFITVINEVTIENNSVTLPLS
EPIGQGDIVTVSYSSGAGTLVNSNNIPVAPFTTSDILTTFANPGTVIKTLLATPSTSSVS
IAYPSSIIGTSGLNYDPAVYFNQATNQVFAAWVNVDSSQIQNQLIPGQYYSNLEIITQAL
QSSTIYFSVASLKDLGDLGPNGQIIPWSVAAPIPHQQTGQNTNVTLGLGPDGSIMAAWLN
TTLDNNGTPTTTIYYSTLNQNDSSPVWSAPVPILPDINPDAFTPLTISTINQQPAIFWTE
SSPASYRQLVLNAAPSVYLRLGERTGDVARNSGQFQAAANGTYSGTFTLGEMGALETTSN
TGDPNPAVLFQGGGVTLDQPVPLTNQAFAIEFWFKLPNLPRESINLVSALGLFGLGLEVD
PNDPNAPPKLTFGLGNDDQSQIQASETFATDVWYYVVGTYDGSSQNLSLYLNGVLAGTLD
DVEFDPFPTAAALALAGASTANNPVYLDEVAFYPKALTASNVNAADLTNANFQNLTGSQI
LEIIAGTNQIGNHYAAQYNQPVPPGPNTYYSVSDGSSWQLPSQINPTPAIVPTQLAGANI
PVFDLVSATPAQASTAIAPNGIADAVYQISLSNPNLTISGIKVTSGNQAWAIGTDDNGTA
LTGNQLGLLLGDTLLNSLNPDTQSFSYPIQGYTENLFLFIDPGSSNTTLGPVEVTVYFEN
NDNTPTPFSVAPYQINMATMGPDQTVTGTATVTEANDSSLALIDSGFIINSDNPAIGYVL
ASAFNSDGTLAYVAVGNRGYSDSQGNVLNNGTVQILFSCSDILSGSGSLSTTILNGNPDG
VLITNIQDAGDNQRNLSLSLATGDIDGDSIPDLVIGAPNVGNFAGAVYVIYGSYLSNQKG
QIIDVTNLSTKPNTMGFVVNGNEAEDLAGFSVVVGNFDGDSYGDIVYGAPYAKDSNGNRV
GQVYLVAGFAQGSAPDSISPTVIYSGKSFDIPNPQPNPPSQTLTVGEGAGFALGVSRRLS
NGPSTFTGSTTTDDLFIGAPNYQIQVVNQWTNQSNLPGQNSQNQQNQQITSDLFPSSSFV
SAGAVYVYNSNQGLSLKTNPWAIYTGPTIPNAQGTATSYFAGTVIGSGLNTDFTDYDGDG
RQDLVIAGPGANTNTGQAFVIGGSTATPSVTTTQSGITTTTTQALNTVSNLVINGGLSGG
KAGTVITSVGDLNHDGYQDLLITAPQGANATGQSYVLFGTLNLSEFGTVFDLNVTANDNK
TTFLLNGDQPFQATGTAAVGVGDINNDGVDDLMLTAPVGQQLYAVYGHPWLADDGSIKLA
DVSSNNGFVIDGYEYSLPSSPFGYVLSFDLVYGNDFELVSAWLQLISPTGQVIWRSNPGE
GVTVTPEAYAIMQSDGNFVIYGQPGGNSGDVIWSSNTAGNSGAVLKLASDGGLYIVDSEG
NIVPNGTLNPGNTSLITNAVTLDENQEINISLDTFLSNGENGGSLFGNGLNIVMVGDVNG
DGFADVISGGSPAGGVLIFGNSTKDLLDAALGTDDLIISVENAQVKEFVALGDFDGDGLA
DFGVIDDQGNFFLVLGSPELGSQGSLVIDSTLPNLSNFNQAWGVGDFNGNGYDDFVLQGP
NSTIAVYGNANGTLTDSSPLTFGNNFPLPSSFTGIDLNGNGIKEIVAGQPNLNPVPNIGG
FGGGLQYFTYEAGNAVLQPTVNPPNASVTEASGLSSWGQISFPNQYAQAGVPSFATLDGW
LYQAFYGINERISTKDSYIYIQRSRDGVSWENLTQVVPLDSNGTPIDLKNLPPSITAYNG
TLYLGFTADNGQVWVAEGVNTNANSGILINAVPINQASNNGPTLVAFNDELYVFFVKDAS
DNDILYSSSSNPGSSSGWDGTSTVLTFSDVNQATNFPLSATVVPGLDGDTLAVAFRSNNS
PATWVGLLNSSDVTNWQGSAELTQVDANSQVSLTVVDGTYYLFFTSSTEASASYATSTDG
LNWGDITLIPWDDGNLGGVASILFNQSFILSLNQSNNESLLFAFSNSLFEPNQASRWGEQ
VRDIGDFNGDGIADLAVLAPGYRNLLQFPILDYPAINNLGGVFIYYGEESGISVNDPPDV
VLAAPDLPQETIFELLEITPTGDVNGDGFDDLLISAPLTPVIAGQFPDVNGDQGVSWVVF
GGTHWGTEYTANSPFGLGNLANNQTNNSQNFNPYGFVTTGLPRSQAGISISGGADVNGDG
FSDFALGAPGNFDNLSYVLFGSDFTNQVNQLGTIGDDVMLGSPTGEIFVAGQGDDQIYTN
GGVDTVYAGPGNDFVTVTDTNFRRLDGGSGNNILKFTGYTNQDWDLTTLSPGLRLKNFNI
LDVRDYGANILTLNALTITQLSANNEVRVLLDANDKINLDSSFSFSEKVYLDNQNYYLYT
SNASAATVLVNVPSNQVTFTATTSNSPSLNLIPTAQPNAPTTTDVTILAATNSDPNQPTR
LFVSNPKVSEAAGEAQFVIERTGDLNKYVLVSYITQDMSGKAGDRYLPIAGQLIFNPGEN
QKNITVKIPTDSVYTADRQFSLLVSLLNDGLEAGDWGDAFALAGDANGAQIRRWNYLAGN
WDNSVMGGLIDFSTTVNSDQAEIHLSVEGLGEFNDFFGYDPLSQTYQSIMFNGATGARLT
NSDSPNAIGGVELKLLDGDRGDADGIVNGLVATNGYAGRTIPGLISNNNRVFWAPTNADG
QVQLRLINSPSQNYEIGWVMVDSADGAIDGLLPDDPGYEEAALARKQAVFSDQANASAQA
LTRSLARQSFTDIEAFARTESQFFGSFSNSNLEANRYYMLYSQQGEEIAFSIDAPLMVET
DSRGYHQLDFNGITTEIASKTLVVPGILNQTVTTNVSISRAGAYENLIVLYKVDSLTGGI
DTDGDRQINLNPGDVGYVQAALTRAQNPATGLSLNAPDEFFSTTQKTISLSGNNIYGMAI
IPNSSIEEVLSKNPTNDPNLGPVALFSFEQANPGGVSQMSRLGSNLFGFEDMVGGGDLDY
NDIILQFSFLTS
Download sequence
Identical sequences L8AFW6 P73139
1148.slr1028 gi|383490553|ref|YP_005408229.1| gi|16329757|ref|NP_440485.1| WP_010871794.1.11876 WP_010871794.1.1889 WP_010871794.1.18904 WP_010871794.1.33690 WP_010871794.1.35395 WP_010871794.1.47586 WP_010871794.1.99424 gi|383321499|ref|YP_005382352.1| gi|16329757|ref|NP_440485.1| gi|383324669|ref|YP_005385522.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]