SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSAPLP00000009898 from Anas platyrhynchos 76_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSAPLP00000009898
Domain Number 1 Region: 251-325
Classification Level Classification E-value
Superfamily Starch-binding domain-like 0.0000000000196
Family Rhamnogalacturonase B, RhgB, middle domain 0.024
Further Details:      
 
Domain Number 2 Region: 891-977
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.00000051
Family Carboxypeptidase regulatory domain 0.031
Further Details:      
 
Domain Number 3 Region: 49-123
Classification Level Classification E-value
Superfamily Starch-binding domain-like 0.00000053
Family Rhamnogalacturonase B, RhgB, middle domain 0.028
Further Details:      
 
Domain Number 4 Region: 809-881
Classification Level Classification E-value
Superfamily Starch-binding domain-like 0.0000255
Family Rhamnogalacturonase B, RhgB, middle domain 0.04
Further Details:      
 
Weak hits

Sequence:  ENSAPLP00000009898
Domain Number - Region: 738-796
Classification Level Classification E-value
Superfamily Cna protein B-type domain 0.00353
Family Cna protein B-type domain 0.01
Further Details:      
 
Domain Number - Region: 137-219
Classification Level Classification E-value
Superfamily Starch-binding domain-like 0.00451
Family Rhamnogalacturonase B, RhgB, middle domain 0.024
Further Details:      
 
Domain Number - Region: 415-485
Classification Level Classification E-value
Superfamily Hypothetical protein PA1324 0.0105
Family Hypothetical protein PA1324 0.012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSAPLP00000009898   Gene: ENSAPLG00000010079   Transcript: ENSAPLT00000010599
Sequence length 1149
Comment pep:novel scaffold:BGI_duck_1.0:KB743005.1:1015161:1040505:1 gene:ENSAPLG00000010079 transcript:ENSAPLT00000010599 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MIPLYDKGDFILKIEPPLGWSFEPTSVDIHVDGINDICTKGGDINFVFTGFSVNGKVLSK
GQALGPAGVQVVLRNAGSDVNIQATVTQPGGKFAFFKVLPGEYEIFASHPTWMLKEANTV
VRVTSSNAYAASPLIVAGYNVSGSVRSDGEPMKGVMFLLFSSSVTKEDVVGCNISPVDGF
QSRDESLTYLCNVVSKEDGSFSFLSLPSGKYTVIPFYRGERITFDVAPSRLDFFVEHDSL
QIEPIFHVMGFSVTGRVLNGPEGEGVADATVTLNNQIKVKTKADGSFRLENITTGTYTIH
ARKEHLFFDTITVKIAPNTPQLANIIATGFSVCGRISVTRLPDTVKQMNKYKVTMMPLDK
DKGSLVTTETDPHGAFCFKAKSGIYNIQVIIPEAETRAGLALKPKVFPVTVTDRPVMDVT
FSQFLASVSGKISCLDACGDLMVTLQSVSRQGEKRNLQLSGNTDSVAFTFENVPPGKYKI
SIVHEDWCWKNKSLELEVMEEDVSGVEFRQTGYMLRCSLSHAITLEFYQDGNGPENVGVY
NLSKGVNRFCLSKPGVYEVTPRSCHQFEHEYYTYDTSSPSILTLTAVRHHVLGSIVTDKL
MDVTITIKSSIDSEPDLVLGPLKSVQELRREQQLAEIEARRQEREKKGQEEEGTKPPVQE
MVEELQGPFLYEFSYWARSGEKITVTPSSKELLFYPPYVETVVSGDPHCNDQESCPGKLI
EIHGKAGLFMEGRIHPELEGVEIVIGEKGATSPLITVFTDDKGAYSVGPLHSDLEYTVTA
QKEGFVLTAVEGTVGDFKAFALAGVTFEIKSEDDQALAGVLLSLSGGMFRSNLLTQDNGM
LTFSNLSPGQYYFKPMMKEFRFEPSSQMIEVQEGQNLKIRITGYRTAYSCYGTVSSLNGE
PEQGVSVEAVGQEKCSIYGEDTITDEEGKFRLRGLLPGCVYHVQLKAEGNDHIERALPQH
RAIEVGNSDIGDINIIAFRQINQFDLSGNVITSSEYLSTLCVKLYKSENLDNPIHTVNLG
QSLFFHFPPLLRDGENYVVLLDSTLTKSQYDYTLPQVSFTAIGYHKHITLVFSPTRKLPE
QDIAQGSYIALPLTLLLLLAGYNHDKLIPLLLQLTTRLQGVRALGQTGSDTGGQEDAKRQ
TKKQKTRRT
Download sequence
Identical sequences U3IRM5
ENSAPLP00000009898

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]