SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125972773|ref|YP_001036683.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|125972773|ref|YP_001036683.1|
Domain Number 1 Region: 271-413
Classification Level Classification E-value
Superfamily Cysteine proteinases 6.87e-18
Family Transglutaminase core 0.036
Further Details:      
 
Weak hits

Sequence:  gi|125972773|ref|YP_001036683.1|
Domain Number - Region: 439-506
Classification Level Classification E-value
Superfamily Invasin/intimin cell-adhesion fragments 0.00408
Family Invasin/intimin cell-adhesion fragments 0.011
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|125972773|ref|YP_001036683.1|
Sequence length 851
Comment transglutaminase-like protein [Clostridium thermocellum ATCC 27405]
Sequence
MERLSVNDRKNLINLAVFLLVALLSTLLIMAAISMLDLRGNGSPGGENNKRASQNDGSGK
NRNKQDRNNNNKNKKGSGQREDDNGSNWKERAQRRRGERRSFDRGDYAYGQGPESMIPGD
GWEYDIAPDQMPNLDISGNEGLPDMSFGMGGNTFVLKEGKNSHIPAFEVLGVPNYPFVRV
MAMDNYRRSHWSMINEAPELMFLFGEKVDRKFSENTVKIKPVEPSKGYIPVLSGNFEMKY
EFSLLEYKQSGVFYSTGVIENFYEMKYEDPPTEAELINAKTDDDYYYDIYVPEVVERIVD
EVIENCETDYEAIKFVEKFLLENYTYDNTVLNNYGSGDAVVSFLTGKDRVGNHLDFVSAY
AIILRAAGIPCRLALGYKLLPGVKYQVVYADQVYIYPEIKFEDYGWVPMDVFPYDVFYRP
PKETITQITFADGTTKRGETVTVRGTVTDSSGNPIDNMTVLVYLKQYKSEPCISYAKAYV
TNGNFEAVFDIKGDISAGKYHIIADVLENDVYRTSSSDPELKILADTFIDLEERSDIIGN
KLNFAGRIVDFFTYEGIEGLEVHVSFEGMDLVETVVSEEDGKLYKEIEIEVPEDYPYYKN
FFFAGRYLLFYGIEFKGTEIYTPYFTRRGVYMWKIYWINVTVAVVLLLGVVLLCVAIVLR
KKGAFRRDGGKFPVLAAEGPGMIVAADAGAESVERNHGKVYIEFPQIGEGLPDVWGIKEN
LAVVFHDDEGNRGEIGAVFHKKGEYRIKISGKNDEYGARNIRIVDYREEIIAIGKNFLKE
MSAKISGITDFMTLREIHDIIKPNIASERHWVLEDAFMVFEKAVYSDEDIVRSDYEKFYV
FARELGKNSSI
Download sequence
Identical sequences A3DC11
gi|125972773|ref|YP_001036683.1| 203119.Cthe_0251

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]