SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125974310|ref|YP_001038220.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|125974310|ref|YP_001038220.1|
Domain Number 1 Region: 2022-2113
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.0000000000051
Family Pre-dockerin domain 0.00035
Further Details:      
 
Domain Number 2 Region: 2121-2173
Classification Level Classification E-value
Superfamily Type I dockerin domain 0.00000000131
Family Type I dockerin domain 0.00073
Further Details:      
 
Domain Number 3 Region: 1571-1770
Classification Level Classification E-value
Superfamily Fucose-specific lectin 0.00000837
Family Fucose-specific lectin 0.025
Further Details:      
 
Domain Number 4 Region: 941-1025
Classification Level Classification E-value
Superfamily Invasin/intimin cell-adhesion fragments 0.0000565
Family Invasin/intimin cell-adhesion fragments 0.015
Further Details:      
 
Weak hits

Sequence:  gi|125974310|ref|YP_001038220.1|
Domain Number - Region: 1051-1119
Classification Level Classification E-value
Superfamily Invasin/intimin cell-adhesion fragments 0.00487
Family Invasin/intimin cell-adhesion fragments 0.016
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|125974310|ref|YP_001038220.1|
Sequence length 2177
Comment dockerin type I cellulosome protein [Clostridium thermocellum ATCC 27405]
Sequence
MKKKSFISAALIVFLFLSFCLETPNIGAVSAQNAFEDPFVHLINSHLENCETSHEEICDS
TGNEGCEVSHEEICDCTGNESCEAGHEEISDSDEHKACKAGHEACSCKCTSDSDKDNTIS
NLDMNDIEPVGNALFVEKNEENLSNIVTYASLSSVVLAASCSHQFNGSYTVTKEPTCTTT
GTKVGKCTKCGAIVSTVTIPALGHSYGSWTVTKAATCTTDGSQKRTCSRCKNVETQTIKA
TGHTFNGSYTVTKEATCTTTGTKVGKCTKCGATVSTVTIPALGHSYGSWTVTKAATCTTD
GSQKRTCSRCKNVETQTIKATGHTFNGSYTVTKEATCTTAGTKVGKCTKCGTTVSTVEIP
ALGHSYGSWTVTKAATCTTAGTEKRTCTRSGCTASETRSISATGHTFNGSYTVTKEATCT
TAGTKVGKCTKCGTTVSTVEIPALGHSYGSWTVTKAATCTTAGTEKRTCTRSGCTASETR
SISATGHTFNGSYTTIKEPTCTTTGTKVGKCTKCGEVVSSVEIKELGHDFGSWKTIKQAT
CTEKGLREGTCSRCSVRKTEEISPTGHQFNGSFTTVKEPTCTEEGLKVGKCTKCGEVVAT
APIPALGGDHQFNGSFTTVKEPTCTEKGLKEGRCTRCGATVTTAPIPELGHDFGSWKTIK
EATCTEKGLREGTCSRCSVRKTEEISPTGHQFNGSFTTVKKPTCTEEGVREGRCTKCGEV
VATAPIPALGGDHQFNGSFTTVKEPTCTEKGLKEGRCTRCGATVTTTPIPELGHDFGSWK
TIKEATCTEKGLREGTCSRCSVRKTEEISPTGHQFNGSFTTVKEPTCTEEGLKVGKCTKC
GEEVATAPIPALGGAHQFNGSFTTVKKPTCTDPGLAEGRCSRCKTVVATKEIPPLGGSHQ
FNGSYKIIKEATCTEEGLKEGRCTKCGTVISTSVIPPQHKFSKITITPDRITLGGSNATE
SPIYVKVVCSKCNTTVDVTSKAKFSSSNSNVASVVNGYVKSGTQFGTATITADYDGMKAV
CSVQVKPAGGEKLRALCITPKEDTIAEFNKWGSQVKVMAVYDDYEVDITDYVLFTSGDRN
IAYVDEYYGNKYIKSGTKKGTTLITASYEGKKDTCTVKVDMAYEVEEMPFKLGKETNILV
PEDLPVIGGTEVEFSFDHIPGMVKYGEKDFRIAIGIEDKESLDKKWDNFVKYFEDAKNSK
ASAKELRNRMKKLGSKKGSFSIKDDWEPEVEAYGYIEGVFINGIPVATRGSFAVIVEAEY
RGQKQYFIGPVPVYFEIAGGLEMELISDILRVDFETGRIMLNSELKVTPRFELGAGVGLV
KVLTVGGSGEAELEFLIITGSEDYLKVTLTGSLKLKVSSYFFSAEKEIAKGTWVLYESKP
RLRMASPNINAQFDLYNADEYKMMPRDYIERPSEWLGNRRLMRSMATGFTNKELKVLGTN
IYPDAQPQLVNLEDKQVLVWIADNPDRTSANRTMLVYSVYDKNSGIWSEPVAVDDDGTAD
FYPQLAVDGNDLYVVWQNSNKTFAEDVTLEEVVASSEIAVSKFDEVTGTFGAAVRLTEND
VVDMLPQIIVSDGNAYIVWFTNNKNDVFGVDGENSIYYCELKDNEWSSPELLSEGLNAIV
SISAGFIEGSFAVAYALDGDDMLETIDDMEIYIVKPGDKDIRITDNDTMDSAPVFSSFNG
EGALYWYNEGNILYITQIGAEPNRVFSESKPGLKDNFKVVEGSNGETAIIWTNTAKGSST
IFTAIYDEDRAAWSDVVKLSDVTGQVQSPDGVFDDEGNFSIAFSRLYLLEDGNEQADLCI
IKVVPSYNLSIDSVNFDHSKVIPGTQLAIDVEVTNNGEIGVEELVVDILDGDEIINSEAV
QISLKPGESKTATVLMNLPDTIAKKAYSIRVSTVEGEEYNTDDNVKQFTIGYTDISLQLE
IYSEGDIEYVTANIINLSHVPTGATLKVTKGSEDGEVIDTKVIDSTDDIVKYEYQFDKKI
LCADKETEILYFTVVADEEEIYTSDNTRTLVLSVNNDSTDKTTVSGYISVDFDYPPESES
KIKSGFNVKVAGTELSTKTDEKGYFEISGIPGDMREFTLEISKRNYLKRNVTVNGTGKLV
VSTEDNPLILWAGDVERKGVQDNAINMVDVMEISKVFGTRAGDEEYVAELDLNMDGAINL
FDIAIVIRHFNALPSRY
Download sequence
Identical sequences A3DGE8
gi|125974310|ref|YP_001038220.1| WP_011838264.1.31213 203119.Cthe_1806

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]