SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_020457718.1.31213 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_020457718.1.31213
Domain Number 1 Region: 44-382
Classification Level Classification E-value
Superfamily (Trans)glycosidases 2.27e-50
Family beta-glycanases 0.0061
Further Details:      
 
Domain Number 2 Region: 475-614
Classification Level Classification E-value
Superfamily AbfB domain 2.35e-46
Family AbfB domain 0.00016
Further Details:      
 
Domain Number 3 Region: 658-892
Classification Level Classification E-value
Superfamily Arabinanase/levansucrase/invertase 2.41e-21
Family alpha-L-arabinanase-like 0.081
Further Details:      
 
Domain Number 4 Region: 914-982
Classification Level Classification E-value
Superfamily Type I dockerin domain 8.93e-18
Family Type I dockerin domain 0.00043
Further Details:      
 
Domain Number 5 Region: 378-468
Classification Level Classification E-value
Superfamily Glycosyl hydrolase domain 0.000000000101
Family Composite domain of glycosyl hydrolase families 5, 30, 39 and 51 0.021
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_020457718.1.31213
Sequence length 982
Comment alpha-L-arabinofuranosidase [Ruminiclostridium thermocellum]; AA=GCF_000015865.1; RF=representative genome; TAX=203119; STAX=1515; NAME=Ruminiclostridium thermocellum ATCC 27405; strain=ATCC 27405; AL=Complete Genome; RT=Major
Sequence
MNNLKKYTLVAVFVFLTAVCFQHPGITSAATTITIDPDATYQTIEGWGASICWWGNQIGR
WSPDNRNRLIEKIVSPTDGLGYNIFRYNIGGGDNPGHNHMRDYADIQGYQNADRSWNWNA
DAAQRAVLTRLIERGRYYGSEIILEAFSNSPPYWMTKSGCASGTSDGSNNLRDDCYDDFA
DYLTEVVKHFRDAWGITFRTLEPMNEPNSDWWKAGGRQEGCSFSYANQQRIIKEVGEKLK
AKGLTGTTVSAADEASIDTALEGLQSYDATTLSYMSQLNVHSYFGSKRAQLRDLAKSKGL
RIWQSESGPLSFNGDMADSCIMLSKRIVTDLKELQCVAWLDWQIIDGGNWGSIYVDDASQ
TFTLTEKFYMHANYSRFIRPGYTIIGANNEKTIAAISPDKKKLVIVATNDNKSSSANYTF
NLTRFSGVNSTVEVYRTSPSLSLAKSIITASNKIVSDTLPPYSINTYVITLDGGVESVPA
VGLQSYNYPNRYVRHADFDARIDENVTPLEDSQWRLVPGLANSSEGYVSIQSVNYPGYYL
RHWDYDFRLDKNDGTTIFAEDATFKLVPGLADPSCVSFQSYNYPDRYIRHYGYLLKLERI
STDLDRQDATFLIISDDSPGPITDSGYIMAYFKQAPGEYGLNLCYSTDGLHWRNINDGKP
VLYAQMGTKGIRDPYIFRKQDGKFGIVATDMLGTNWGDTSQYIHYWESEDLINFTERLIK
VHNKSNMHAWAPEVFYDENRKQYGIYWAGNTDYNRIYVNYTTDFDTVSDCQVFFDPGYDV
IDAHIVSDKGMYYLFFKDERASGKAIKVARSSSLTPNSFTVFTPNFITSPNTEGPFVFKD
NNSDSWYMYVDIYSNNGIFECWKTNDLNALSWTKVTGISVPPGVRHGSVVKVNRWELETA
ISRKVVTPPAPTPPPVLKGDVNADGVINSSDIMVLKRFLLRTITLTEEMLLNADTNGDGA
VNSSDFTLLKRYILRSIDSFPV
Download sequence
Identical sequences A3DHB4
CmR19 NYSGXRC-11096b 203119.Cthe_2139 gi|125974626|ref|YP_001038536.1| WP_020457718.1.31213

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]