SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|220930600|ref|YP_002507509.1| from Clostridium cellulolyticum H10

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|220930600|ref|YP_002507509.1|
Domain Number 1 Region: 894-1030
Classification Level Classification E-value
Superfamily Ricin B-like lectins 1.88e-39
Family Ricin B-like 0.0051
Further Details:      
 
Domain Number 2 Region: 31-362
Classification Level Classification E-value
Superfamily Arabinanase/levansucrase/invertase 4.71e-38
Family alpha-L-arabinanase-like 0.028
Further Details:      
 
Domain Number 3 Region: 625-749
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 2.34e-25
Family Family 6 carbohydrate binding module, CBM6 0.0078
Further Details:      
 
Domain Number 4 Region: 495-622
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.91e-24
Family Family 6 carbohydrate binding module, CBM6 0.048
Further Details:      
 
Domain Number 5 Region: 754-881
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.06e-23
Family Family 6 carbohydrate binding module, CBM6 0.015
Further Details:      
 
Domain Number 6 Region: 367-491
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0000000000000861
Family Family 6 carbohydrate binding module, CBM6 0.038
Further Details:      
 
Domain Number 7 Region: 1417-1510
Classification Level Classification E-value
Superfamily E set domains 0.00000000462
Family Cellulosomal scaffoldin protein CipC, module x2.1 0.018
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|220930600|ref|YP_002507509.1|
Sequence length 2073
Comment hypothetical protein Ccel_3240 [Clostridium cellulolyticum H10]
Sequence
MRKKSLVMLSLAIVLSLLLTSISYADVTSSYQNPLMRGADPTIARAADGFYYSCFAVDND
IYLKKADTILGVGTAKSRLAWDKPADFGYVWGPYIYRLDGKWYIYFTSAPENSFGYGHPS
SYVLENTSPDPFEGTWELKGVSANADEDGQVTDKPGLLNTQGYGLACGVVTMGGKTYFTY
TKYFYYPDPNDPTKEKFDECPTIVEMENPWTLKGTEGTLARPVYDWEKQGDSINEGAAVV
ERNGKVYFAYSASSFMNDNYCVGVSTADAQSDLLQESNWTKNPEPALAKSPENSSFGPGS
PLFVKSEDGTEDWLIYHGGPVGGQTGSNRWVRAQRINWNDDGSINLGIPSNPGTVLDRPS
GEEKSETYEAEDASFAGVTRTILSDSSKASGSGVMKYDNSSNGYVEFTVDANMPGSYSLN
FRYNNSTGSNITMSLGVNQNSSRELSFEPNGSNSTNYDLLRVHNVQLNAGHNKIRLSSSE
ANGLVLDAMIIKKSVLYEAENAALSGGALVSTEHSGYSGTGFVGGMYTEGTSAEFTVDAP
YAGNYSVNLRYCNGFSNIDKTLSMYVNGVKVKQIDLFSFGDWSKWSERYDNIELRAGSNT
ITYKYDSGDGGNVNLDYITVTEATTRHYEAENAVLTGNAQKSTDHTGYTGTGFVGGFWTE
GSVEFSVNVETAALYDVKLKYALGFPEDRTMSIYLNGSKVKQVTLPSTGGWDTWSEYLET
LSLNKGNNTISIKRDSDDSGDINIDSIHLDRRISWKYQAEDATLLGGAHPVDDHLWYEGT
GFAGGFETLYESIQFNVNVPNTATYTTTLRYSGAQENDITMSLYVNGTKIKQVSLPPTAD
WDSWGEATETVNLKAGKNVIEYKRDDGDTGRFNIDSMTIDKYSVGDTDLKNRGIVSGTVY
TIKAKHSGKALDVSGYSSEPGALVDQWTYVGGNNQRWEIIDLGTGYYKIKSAHSGLVLDI
VEGSPDMDLCQTTSSPAETQQWILEKVGGYYKIVNKSNGMVLDVSEESYDNGKKVHLYSY
VGKANQLWKIDVASANELFIPVTGITDVPVTATAGTDLTLFANVLPYNATNKSIEWRVKD
AGETGATISGNTLSTVGAGTVLVTAVITDGMPDDNAYTQDYIIQINAVNHAPTAKANIPV
AAVAADDSVSFIASDIAEDEDGDTLTIAEIKTSPDSAMASAILDNGTVTVSGVAPGSTDA
TLVVSDGRGGTVEINVPIEVAAAPDVKIDISQSPVKTIFNALTFGLFFNDSIDVSLSSGK
TEIDHFEYQIVGAESPFDSNGTWTTGNSFSVEPDFMGRVYARAVFTDGAVTETYIKALVV
DKTKPEIGAVYDKDNASIAVTVSDISAGIDTITYQVGSGEVKSVNLTPTAEKDITFEYSF
TISSLPEGQYDVVINAIDNSDNAADTKTLNIVNNGVASAEISPVSESFDINVPADVSTTI
TWNDASSVTDVVYGGNPVTSDSYEVSGDTLTLKSSYLASLGLVNGDTSEFIINFDRGNPV
TFVVSIIDSSDPTPVNRSISVQNDGNGTASANVTSATEGTEVTLTATPNEGYRFKEWQVI
YPTGLAITGNTFTMPNGAVSVKAIFEQNPVVNYTLSVNGSYSNESGAGSYVEGSTVTINA
GSRSNYTFRGWSSGDGVTFANANSTTTTFTMPAKNVTVTASWTYNGSSPYNGTIPSQPAT
KDYTADIKVSGASGNSTLPITVDAKTGTARLDTASRNELVANGGISVITVPSIPDVDTYS
VGIPVPYLSTADWQGSLRVSTNRGSITVPSNMLTGIAEAAGTKAEITIGKGDKATLTEAA
KTAIGDRPLIQLTMYIDGKQMEWNNPNVPVMVTIPYTPSAEELANPERIIIWYIDDNGKA
VAVPNGHYDPASGTVAFDTTHFSSYAIGYSDVSFNDVTENAWYNKAVSFIAARDITAGTG
NGYFSPNAKLTRGEFIVMLMKAYEIAPDGKQSSNFEDAGNKYYTGYLAAAKRLGISEGIG
KNMFAPEKQITRQEMFTMLYNALKTMDKLPQGDTGKSLSSFDDEEMVASWARDAMTLLVE
TGIVGGNAGQLTPASTTTRSEMAQVLYNLIYKN
Download sequence
Identical sequences B8I0X3
WP_015926587.1.84728 394503.Ccel_3240 gi|220930600|ref|YP_002507509.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]