SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|385777439|ref|YP_005686604.1| from Clostridium thermocellum DSM 1313

Domain architecture


Domain assignment details

Strong hits

Sequence:  gi|385777439|ref|YP_005686604.1|
Domain Number 1 Region: 48-199
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          1.67e-27
Family                Cellulose-binding domain family III  0.0027
 
Domain Number 2 Region: 611-715
Classification Level  Classification            E-value
Superfamily           vWA-like                  0.0000945
Family                Integrin A (or I) domain  0.07
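The two strong hits above can be held in a small record type so that domain lengths and sequence coverage are easy to check. The following is a minimal sketch, not part of the SUPERFAMILY tooling: the `DomainHit` class, the `extract` and `coverage` helpers, and the assumption that the reported regions are 1-based and inclusive are all illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One domain assignment (hypothetical container, not a SUPERFAMILY API)."""
    superfamily: str
    family: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive
    superfamily_evalue: float
    family_evalue: float

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1

    def extract(self, sequence: str) -> str:
        # Convert 1-based inclusive coordinates to a Python slice.
        return sequence[self.start - 1:self.end]


# Values copied from the domain assignment details on this page.
HITS = [
    DomainHit("Carbohydrate-binding domain",
              "Cellulose-binding domain family III",
              48, 199, 1.67e-27, 0.0027),
    DomainHit("vWA-like",
              "Integrin A (or I) domain",
              611, 715, 0.0000945, 0.07),
]
SEQUENCE_LENGTH = 1001


def coverage(hits, sequence_length: int) -> float:
    """Fraction of residues covered, assuming the hits do not overlap."""
    return sum(hit.length for hit in hits) / sequence_length


if __name__ == "__main__":
    for hit in HITS:
        print(hit.superfamily, hit.start, hit.end, hit.length)
    print(round(coverage(HITS, SEQUENCE_LENGTH), 3))
```

Under these assumptions the two domains span 152 and 105 residues, so roughly a quarter of the 1001-residue protein has a strong superfamily assignment.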
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): gi|385777439|ref|YP_005686604.1|
Sequence length: 1001
Comment: type 3a cellulose-binding domain-containing protein [Clostridium thermocellum DSM 1313]
Sequence:
MKRKFYLRTFALLIVLCVFVNLSLIAGFNRLFVENVYADEGVEFPVSSNKVYVTLKNIKT
GVPSDTIALKIGIINLNKAININLDDIKLRYYFTNDGCSPIQVNIKLFGTETESFNPGLV
KTSVVTGLSYPGADSYVEIGFTGSVELNCDRKPIYIELDIKENSPYRNFDQSNDFSNNNY
YTPFLPEEFFASGRVPVFMFDPKKSDYVLLTGVLPSETSKIPNPTPTPVVSPSPSPIVPE
GEKILATASGNIIIPKPSPSPVGLFEPADIGFPHEGSIDVGFSPRKKEALILLDSSYESN
DVDDGLTGIFKYCLFSSGDSLYQGDNITIEGDVFTRNTMNVTTPRTKITGKVEYLFRDLN
SYDGPLGGKEIKMEPADAHRYDRYLVPDENDDYSALFSMIQTKVTNLPEKDDAKFLITEN
TVENYISPEVRANPNWRKNAVQFYYDDNDKSVSIEYRSKEAGGFSRKYDNSLPQYLIKQD
KGDAFVLKSNMFFDGNLIISVNGIRQELIDGATSAFIYAYGDIILQGNGATFDDVYLITK
YGNIYIETNNCNVNGIAFAPNGKIVINGQSNNLQGSFVASKIQCEPGNSVFKGPTDDQLE
DIEDALKSTEGFDTIRNSIALLPYIFDEYTRAGIITYSDYANINDSPINDSWKFFDAATE
REEFLNYTLTLSVDEDSKRSNLGDGLRKALDVFNKYSDPEADKYIYIFTSLDPNAYTRSH
LPYGLFETDPAFNTDAAYIYDETVNREGNQYVREIMKLIEEYNNNHVNGKIKLILVDLTN
YIKEFNIKNGAKESEIEVDDLTNLAADLGIDIYDSDEKAYYCPSLEDIQSLSIINELAYR
SNSMPPKLAVENLKISSAQFELSLPSYIKPVELFFKRASNTKESIVNLSGLAASGGKYNI
TYTFSGDELATLTRISDGLKYDLESNGLYMTLIVNSSDDWDDGDNPLTVKGTVDIAGPKI
TYKLFDDKNNDGVRSAGEAEFEVVVPFDNIKFNVEYKKDIN
Identical sequences: gi|385777439|ref|YP_005686604.1| WP_003516421.1.19387 WP_003516421.1.20586 WP_003516421.1.55520 WP_003516421.1.6636 WP_003516421.1.6965
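
The gi-style identifier used throughout this record follows the legacy NCBI pipe-delimited layout (`gi|<number>|<database>|<accession.version>|`), where `ref` denotes RefSeq. A minimal sketch of splitting it into fields is shown here; the function name `parse_gi_identifier` and the returned dictionary keys are hypothetical, and the sketch deliberately does not try to interpret the suffixed `WP_...` identifiers listed above.

```python
def parse_gi_identifier(identifier: str) -> dict:
    """Split a legacy 'gi|number|db|accession.version|' identifier into fields.

    Hypothetical helper for illustration; handles only the four-field form
    that appears on this page.
    """
    # Drop surrounding whitespace and the trailing pipe before splitting.
    fields = identifier.strip().strip("|").split("|")
    if len(fields) != 4 or fields[0] != "gi":
        raise ValueError(f"not a gi-style identifier: {identifier!r}")

    _, gi_number, database, accession_version = fields
    # The version, if present, follows the first dot in the accession.
    accession, _, version = accession_version.partition(".")
    return {
        "gi": int(gi_number),
        "db": database,
        "accession": accession,
        "version": int(version) if version else None,
    }


if __name__ == "__main__":
    print(parse_gi_identifier("gi|385777439|ref|YP_005686604.1|"))
```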
