SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_003512736.1.31213 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_003512736.1.31213
Domain Number 1 Region: 306-813
Classification Level Classification E-value
Superfamily Six-hairpin glycosidases 1.49e-155
Family Cellulases catalytic domain 0.00000000000000115
Further Details:      
 
Domain Number 2 Region: 1003-1146
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 4.87e-39
Family Cellulose-binding domain family III 0.0003
Further Details:      
 
Domain Number 3 Region: 42-198
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 8.33e-37
Family CBM4/9 0.0024
Further Details:      
 
Domain Number 4 Region: 210-304
Classification Level Classification E-value
Superfamily E set domains 6.3e-27
Family E-set domains of sugar-utilizing enzymes 0.00000168
Further Details:      
 
Domain Number 5 Region: 1158-1223
Classification Level Classification E-value
Superfamily Type I dockerin domain 1.24e-19
Family Type I dockerin domain 0.00011
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_003512736.1.31213
Sequence length 1224
Comment cellulose 1,4-beta-cellobiosidase [Ruminiclostridium thermocellum]; AA=GCF_000015865.1; RF=representative genome; TAX=203119; STAX=1515; NAME=Ruminiclostridium thermocellum ATCC 27405; strain=ATCC 27405; AL=Complete Genome; RT=Major
Sequence
MKFRRSICTAVLLAVLLTLLVPTSVFALEDNSSTLPPYKNDLLYERTFDEGLCYPWHTCE
DSGGKCSFDVVDVPGQPGNKAFAVTVLDKGQNRWSVQMRHRGLTLEQGHTYRVRLKIWAD
ASCKVYIKIGQMGEPYAEYWNNKWSPYTLTAGKVLEIDETFVMDKPTDDTCEFTFHLGGE
LAATPPYTVYLDDVSLYDPEYTKPVEYILPQPDVRVNQVGYLPEGKKVATVVCNSTQPVK
WQLKNAAGVVVLEGYTEPKGLDKDSQDYVHWLDFSDFATEGIGYYFELPTVNSPTNYSHP
FDIRKDIYTQMKYDALAFFYHKRSGIPIEMPYAGGEQWTRPAGHIGIEPNKGDTNVPTWP
QDDEYAGIPQKNYTKDVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNWGPYR
DGGMNIPEQNNGYPDILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKIHDFRWTALGMLP
HEDPQPRYLRPVSTAATLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQAALKHPDIY
AEYTPGSGGPGGGPYNDDYVGDEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGEN
GGANGEDNGLWGCFTWGTTQGLGTITLALVENGLPATDIQKARNNIAKAADRWLENIEEQ
GYRLPIKQAEDERGGYPWGSNSFILNQMIVMGYAYDFTGDSKYLDGMFDGISYLLGRNAM
DQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFEDPTINAAVKKDTPP
QKCFIDHTDSWSTNEITVNWNAPFAWVTAYLDEQYTDSETDKVTIDSPVAGERFEAGKDI
NISATVKSKTPVSKVEFYNGDTLISSDTTAPYTAKITGAAVGAYNLKAVAVLSDGRRIES
PVTPVLVKVIVKPTVKLTAPKSNVVAYGNEFLKITATASDSDGKISRVDFLVDGEVIGSD
REAPYEYEWKAVEGNHEISVIAYDDDDAASTPDSVKIFVKQARDVKVQYLCENTQTSTQE
IKGKFNIVNTGNRDYSLKDIVLRYYFTKEHNSQLQFICYYTPIGSGNLIPSFGGSGDEHY
LQLEFKDVKLPAGGQTGEIQFVIRYADNSFHDQSNDYSFDPTIKAFQDYGKVTLYKNGEL
VWGTPPGGTEPEEPEEPAIVYGDCNDDGKVNSTDVAVMKRYLKKENVNINLDNADVNADG
KVNSTDFSILKRYVMKNIEELPYR
Download sequence
Identical sequences A3DCH2
203119.Cthe_0413 WP_003512736.1.31213 WP_003512736.1.60145 gi|125972934|ref|YP_001036844.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]