SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_001032458.1.60961 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_001032458.1.60961
Domain Number 1 Region: 1490-1622
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 3.26e-18
Family Galactose-binding domain 0.013
Further Details:      
 
Weak hits

Sequence:  WP_001032458.1.60961
Domain Number - Region: 623-628,657-768,796-870
Classification Level Classification E-value
Superfamily (Trans)glycosidases 0.00155
Family Amylase, catalytic domain 0.034
Further Details:      
 
Domain Number - Region: 976-999
Classification Level Classification E-value
Superfamily WW domain 0.00308
Family WW domain 0.0056
Further Details:      
 
Domain Number - Region: 1293-1360
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0623
Family CBM4/9 0.054
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) WP_001032458.1.60961
Sequence length 1767
Comment endo-alpha-N-acetylgalactosaminidase [Streptococcus pneumoniae]; AA=GCF_000180475.1; RF=na; TAX=869310; STAX=1313; NAME=Streptococcus pneumoniae SPN1041; strain=SPN1041; AL=Contig; RT=Major
Sequence
MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLPADLATALATA
KENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKT
VTPEWKTVEKKEQKGTVTIREEKGVRYNQLSSTAQNDNAGKPALFEKKGLTVDANGNATV
DLTFKDDSEKGKSRFGVFLKFKDTNNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPET
GSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVS
VKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYTLNGHTL
PGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDDAHLINAEMTVRLQVVDNQLH
FDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSG
DDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNAN
YVGIHSSEWQWEKAYKGIVFPEYTKELPSAKVVITEDANADKNVDWQDGAIAYRSIMNNP
QGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLKGYGSEGHDSG
HLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSY
SYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATH
VLAKEINKQGWRFAIEWGHGGEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAW
VGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWEN
GTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDG
SAYLTPWNWDANGKKLSTEKEKMYYFNTQAGTTTWTLPSDWAKSKVYLYKLTDQGKTEEQ
ELTVKDGKITLDLLANQPYVLYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASK
AEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVYVGVDNRSNAKAIITVNTGEK
EVTTYTNKSLALNYVKAYAHNTRRDNATVDDTSYFQNMYAFFTTGSDVSNVTLTLSREAG
DEATYFDEIRTFENNSSMYGDKHDTGQGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLS
EKHDPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEY
EAGSDNTYAFVVGKGEFQSGRRGTQASNLEMHELPNTWTDSKKAKKATFLVTGAETGDTW
VGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGNMLTENALKNYLPTVAMT
NYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIETLKNALVQKKTALVADDFASLTA
PAQAQEGLANAFDGNLSSLWHTSWNGGDVGKPATMVLKEPTEITGLRYVPRGSGSNGNLR
DVKLVVTDESGKEHTFTATDWPDNNKPKDIDFGKTIKAKKIVLTGTKTYGDGGDKYQSAA
ELIFTRPQVAETPLDLSGYEAALTKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVE
YFADYLNQLKDSATKPDAPTVEKPEFKLSSLASEQGKTPDYKQEIARPETPEQILPATGE
SQSDTALFLAGVSLALSALFVVKTKKD
Download sequence
Identical sequences M5N7K4
WP_001032458.1.10169 WP_001032458.1.101806 WP_001032458.1.13705 WP_001032458.1.14218 WP_001032458.1.24627 WP_001032458.1.29269 WP_001032458.1.395 WP_001032458.1.58034 WP_001032458.1.60961 WP_001032458.1.65607 WP_001032458.1.69671 WP_001032458.1.80914 WP_001032458.1.89652 WP_001032458.1.99687 gi|225856106|ref|YP_002737617.1| 488223.SPP_0406

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]