SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000035847 from Pan troglodytes 76_2.1.4

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSPTRP00000035847
Domain Number 1 Region: 38-179
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.06e-45
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00028
Further Details:      
 
Domain Number 2 Region: 153-339
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.02e-35
Family Laminin G-like module 0.0068
Further Details:      
 
Domain Number 3 Region: 783-959
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.17e-35
Family Laminin G-like module 0.0018
Further Details:      
 
Domain Number 4 Region: 338-518
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 4.75e-35
Family Laminin G-like module 0.003
Further Details:      
 
Domain Number 5 Region: 969-1173
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.19e-30
Family Laminin G-like module 0.017
Further Details:      
 
Domain Number 6 Region: 577-635
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 0.0000000000314
Family Fibrinogen C-terminal domain-like 0.0041
Further Details:      
 
Domain Number 7 Region: 549-585
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000088
Family EGF-type module 0.015
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000035847   Gene: ENSPTRG00000033706   Transcript: ENSPTRT00000038777
Sequence length 1288
Comment pep:known_by_projection scaffold:CHIMP2.1.4:GL391077.1:194523:429704:-1 gene:ENSPTRG00000033706 transcript:ENSPTRT00000038777 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MASVAWAVLKVLLLLPTQTWSPVGAGNPPDCDAPLASALPRSSFSSSSELSSSHGPGFSR
LNRRDGAGGWTPLVSNKYQWLQIDLGERMEVTAVATQGGYGSSDWVTSYLLMFSDGGRNW
KQYRREESIWGFPGNTNADSVVHYRLQPPFEARFLRFLPLAWNPRGRIGMRIEVYGCAYK
SEVVYFDGQSALLYTLDKKPLKPIRDVISLKFKAMQSNGILLHREGQHGNHITLELIKGK
LVFFLNSGNAKLPSTIAPVTLTLGSLLDDQHWHSVLIELLDTQVNFTVDKHTHHFQAKGD
SSYLDLNFEISFGGISTPGRSRAFTRKSFHGCLENLYYNGVDVTELAKKHKPQILMMGNV
SFSCPQPQTVPVTFLSSRSYLALPGNSGEDKVSVTFQFRTWNRAGHLLFGELRRGSGSFV
LFLKDGKLKLSLFQPGQSPRNVTAGAGLNDGQWHSVSFSAKWSHMNVVVDDDTAVQPLVA
VLIDSGDTYYFGGCLGNSSGSGCKSPLGGFQGCLRLITIGDKAVDPILVQQGALGSFRDL
QIDSCGITDRCLPSYCEHGGECSQSWDTFSCDCLGTGYTGETCHSSLYEQSCEAHKHRGN
PSGLYSIDADGSGPLGPFLVYCNMTADSAWTVVRHGGPDAVTLRGAPSGHPRSAVSFAYA
AGAGQLRAAVNLAERCEQRLALRCGTARRPDSRDGTPLSWWVGRTNETHTYWGGSLPDAQ
KCTCGLEGNCIDSQYYCNCDAGRNEWTSDTIVLSQKEHLPVTQIVMTDAGQPHSEADYTL
GPLLCRGDKSFWNSASFNTETSYLHFPAFHGELTADVCFFFKTTVSSGVFMENLGITDFI
RIELRAPTEVTFSFDVGNGPCEVTVQSPTPFNDNQWHHVRAERNVKGASLQVDQLPQKMQ
PAPADGHVRLQLNSQLFIGGTATRQRGFLGCIRSLQLNGVALDLEERATVTPGVEPGCAG
HCSTYGHLCRNGGRCREKRRGVTCDCAFSAYDGPFCSNEISAYFATGSSMTYHFQEHYTL
SENSSSLVSSLHRDVTLTREMITLSFRTTRTPSLLLYVSSFYEEYLSVILANNGSLQIRY
KLDRHQNPDAFTFDFKNMADGQLHQVKINREEAVVMVEVNQSAKKQVILSSGTEFNAVKS
LILGKVLEAAGADPDTRRAATSGFTGCLSAVRFGRAAPLKAALRPSGPSRVTVRGHVAPM
ARCAAGAASGSPARELAPRLAGGAGRSGPADEGEPLVNADRRDSAVIGGVIAVVIFILLC
ITAIAIRIYQQRKLRKENESKVSKKEEC
Download sequence
Identical sequences 9598.ENSPTRP00000035848 ENSPTRP00000035847 ENSPTRP00000035847

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]