SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for 203119.Cthe_0246 from STRING v9.0.5

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  203119.Cthe_0246
Domain Number 1 Region: 109-232
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 4.25e-27
Family Family 6 carbohydrate binding module, CBM6 0.011
Further Details:      
 
Domain Number 2 Region: 350-641,734-785
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 1.44e-17
Family Integrin alpha N-terminal domain 0.0069
Further Details:      
 
Domain Number 3 Region: 29-99
Classification Level Classification E-value
Superfamily Type I dockerin domain 3.04e-17
Family Type I dockerin domain 0.00073
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) 203119.Cthe_0246
Sequence length 820
Comment (Clostridium thermocellum ATCC 27405)
Sequence
MKKTLVFLTALSLIFTLFISYSLSAGPASTKYGDLNADGKINSTDYNLGKRLILRTISEL
PISNGSVAFDLNGDSKVDSTDLTALKRYLLGVIDKFPVGTDIPSQTQKTRYQAEDAMLYK
AFEETIHAGYDGRSYVNYDNEPGGYIEWNVNVSSSGTYKLIFRYANGSNNNRPMEIRVNS
NLVAGSLDFYPTSAWTVWNDQSIVVTLNAGNNVIRATGIASDGGPNVDYLEVIPTNEPPA
PTPSPTPTVGPTPAGARQMERLDRGLVAVKVNNGVFLSWRMFGTDPSNIAFNLYRNGTKI
NSTPITGATNYVDTGGTTSSTYTVRAVINGQEQEASKPVSVWAQNYLQIPIQPPSSAYEA
NDCSAADLDGDGEYEIVLKWEPNNAKDNSQSGYTDNVYLDAYKLNGTRLWRIDLGRNIRA
GAHYTQFMVYDLDGDGKAEVACKTADGTRDGKGNVIGNPNADYRNSSGYILSGPEYLTVF
DGQTGAAITTVDYDPPRGNVSSWGDNYGNRVDRFLACIAYLDGQRPSLVMCRGYYTRSVL
VAWDFRNGRLTKRWVFDGNNYSGYNGQGNHNLSVADVDGDGRDEIIYGACTIDDNGKGLY
TSGLGHGDALHVGDLNPNRPGLEIWSCFESSGGAALRDARTGEVLFRWHRSSDTGRACAA
DITASSPGAELWAAGSPLFSCTGQNIGTAPSQINFAIWWDGDELRELLDGITISKYGVGT
LFTATGCASNNGTKSTPCLQADLLGDWREEVIFRTSDNRYLRIYTTTATTNRRIYTLMHD
PVYRLGIAWQNVAYNQPPHTSFFIGAGMAEPPKPNIYLVP
Download sequence
Identical sequences A3DC06
gi|125972768|ref|YP_001036678.1| WP_011837785.1.31213 Cth-1006 203119.Cthe_0246

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]