SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000035848 from Pan troglodytes 76_2.1.4

Domain assignment details

Strong hits

Sequence: ENSPTRP00000035848

Domain Number 1 Region: 42-228
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  1.11e-36
Family                Laminin G-like module                   0.0053

Domain Number 2 Region: 227-407
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  1.39e-35
Family                Laminin G-like module                   0.003

Domain Number 3 Region: 672-848
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  3.17e-34
Family                Laminin G-like module                   0.0017

Domain Number 4 Region: 858-1062
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  2.88e-29
Family                Laminin G-like module                   0.019

Domain Number 5 Region: 466-524
Classification Level  Classification                          E-value
Superfamily           Fibrinogen C-terminal domain-like       2.05e-11
Family                Fibrinogen C-terminal domain-like       0.0036

Domain Number 6 Region: 438-474
Classification Level  Classification                          E-value
Superfamily           EGF/Laminin                             7.54e-06
Family                EGF-type module                         0.015
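
The strong hits above are numbered by E-value, not by position in the protein. To read them in order along the sequence, the regions can be sorted by start residue. The following is a minimal sketch and not part of the SUPERFAMILY server: the `Hit` container and variable names are hypothetical, and the values are transcribed from the strong-hit listing above.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One strong hit from this page (hypothetical container, not a SUPERFAMILY type)."""
    number: int       # "Domain Number" as shown on the page
    start: int        # first residue of the region
    end: int          # last residue of the region
    superfamily: str  # superfamily-level classification
    evalue: float     # superfamily-level E-value


# Values transcribed from the strong-hit listing for ENSPTRP00000035848.
STRONG_HITS = [
    Hit(1, 42, 228, "Concanavalin A-like lectins/glucanases", 1.11e-36),
    Hit(2, 227, 407, "Concanavalin A-like lectins/glucanases", 1.39e-35),
    Hit(3, 672, 848, "Concanavalin A-like lectins/glucanases", 3.17e-34),
    Hit(4, 858, 1062, "Concanavalin A-like lectins/glucanases", 2.88e-29),
    Hit(5, 466, 524, "Fibrinogen C-terminal domain-like", 2.05e-11),
    Hit(6, 438, 474, "EGF/Laminin", 7.54e-06),
]

# Sort by start residue to get the N-terminal to C-terminal order.
by_position = sorted(STRONG_HITS, key=lambda h: h.start)

for hit in by_position:
    print(f"{hit.start:>4}-{hit.end:<4}  domain {hit.number}  {hit.superfamily}")
```

Sorted this way, the domain numbers appear in the order 1, 2, 6, 5, 3, 4. Some neighbouring regions overlap by a few residues (for example 42-228 and 227-407), so the boundaries are best treated as approximate.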
 
Weak hits

Sequence: ENSPTRP00000035848

Domain Number - Region: 2-52
Classification Level  Classification                                                      E-value
Superfamily           Galactose-binding domain-like                                       0.0187
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.0051
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000035848   Gene: ENSPTRG00000020961   Transcript: ENSPTRT00000038778
Sequence length 1177
Comment pep:known_by_projection chromosome:CHIMP2.1.4:9:39275098:39516477:-1 gene:ENSPTRG00000020961 transcript:ENSPTRT00000038778 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFSDGGRNWKQYRREESIWGFPGNTNADSVVHYRLQPPFEARFLRFLPLAWNPRGRIGMR
IEVYGCAYKSEVVYFDGQSALLYTLDKKPLKPIRDVISLKFKAMQSNGILLHREGQHGNH
ITLELIKGKLVFFLNSGNAKLPSTIAPVTLTLGSLLDDQHWHSVLIELLDTQVNFTVDKH
THHFQAKGDSSYLDLNFEISFGGIPTPGRSRAFTRKSFHGCLENLYYNGVDVTELAKKHK
PQILMMGNVSFSCPQPQTVPVTFLSSRSYLALPGNSGEDKVSVTFQFRTWNRAGHLLFGE
LRRGSGSFVLFLKDGKLKLSLFQPGQSPRNVTAGAGLNDGQWHSVSFSAKWSHMNVVVDD
DTAVQPLVAVLIDSGDTYYFGGCLDNSSGSGCKSPLGGFQGCLRLITIGDKAVDPILVQQ
GALGSFRDLQIDSCGITDRCLPSYCEHGGECSQSWDTFSCDCLGTGYTGETCHSSLYEQS
CEAHKHRGNPSGLYYIDADGSGPLGPFLVYCNMTADAAWTVVRHGGPDAVTLRGAPSGHP
RSAVSFAYAADAGQLRASVNLAERCEQRLALRCGTARRPDSRDGTPLSWWVGRTNETHTY
WGGSLPDAQKCTCGLEGNCIDSQYYCNCDAGRNEWTSDTIVLSQKEHLPVTQIVMTDAGR
PRSEAAYTLGPLLCHGDKSFWNSASFNTETSYLHFLAFHGELTADVCFFFKTTVSSGVFM
ENLGITDFIRIELRAPTEVTFSFDVGNGPCEVTVQSPTPFNDNQWHHVRAERNVKGASLQ
VDQLPQKMQPAPADGHVRLQLNSQLFIGGTATRQRGFLGCIRSLQLNGVALDLEERATVT
PGVEPGCAGHCSTYGHLCRNGGRCREKRRGVTCDCAFSAYDGPFCSNEISAYFATGSSMT
YHFQEHYTLSENSSSLVSSLHRDVTLTREMITLSFRTTRTPSLLLYVSSFYEEYLSVILA
NNGSLQIRYKLDRHQNPDAFTFDFKNMADGQLHQVKINREEAVVMVEVHQSAKKQVILSS
GTEFNAVKSLILGKVLEAAGADPDTRRAATSGFTGCLSAVRFGRAAPLKAALRPSGTSRV
TVRGHVAPMARCAAGAASGSPARELAPRLAGGAGRSGPADEGEPLVNADRRDSAVIGGVI
AVVIFILLCITAIAIRIYQQRKLRKENESKVSKKEEC
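
The Comment line above packs several space-separated `key:value` fields into one string. As an illustration only, the sketch below splits it into a dictionary and unpacks the location field. It assumes the location reads assembly, chromosome, start, end, strand, and that the coordinates are inclusive; `parse_comment` is a hypothetical helper, not a SUPERFAMILY or Ensembl function.

```python
# Comment line copied from this page.
COMMENT = (
    "pep:known_by_projection chromosome:CHIMP2.1.4:9:39275098:39516477:-1 "
    "gene:ENSPTRG00000020961 transcript:ENSPTRT00000038778 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> dict:
    """Split space-separated `key:value` tokens; only the first colon separates key from value."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


fields = parse_comment(COMMENT)

# Assumption: the location field is assembly:chromosome:start:end:strand.
assembly, chrom, start, end, strand = fields["chromosome"].split(":")

# Assumption: start and end are inclusive, hence the +1.
span = int(end) - int(start) + 1

print(fields["gene"], fields["transcript"])
print(assembly, chrom, start, end, strand, span)
```

A strand of `-1` indicates that the transcript is annotated on the reverse strand of chromosome 9 in this assembly.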
Identical sequences: ENSPTRP00000035848