SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSPTRP00000021263 from Pan troglodytes 76_2.1.4

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSPTRP00000021263
Domain Number 1 Region: 38-176
Classification Level | Classification | E-value
Superfamily | Galactose-binding domain-like | 1.06e-40
Family | Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) | 0.00065
 
Domain Number 2 Region: 150-336
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 8.58e-39
Family | Laminin G-like module | 0.0064
 
Domain Number 3 Region: 351-519
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 1.47e-35
Family | Laminin G-like module | 0.0055
 
Domain Number 4 Region: 894-1098
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 3.34e-29
Family | Laminin G-like module | 0.0061
 
Domain Number 5 Region: 569-591,794-884
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 0.0000000000095
Family | Laminin G-like module | 0.028
 
Domain Number 6 Region: 577-634
Classification Level | Classification | E-value
Superfamily | Fibrinogen C-terminal domain-like | 0.0000000123
Family | Fibrinogen C-terminal domain-like | 0.0053
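
The Region fields above list one or more residue ranges per domain, with commas separating the segments of a discontinuous domain (as in Domain Number 5). The following is a minimal sketch of how such strings might be parsed and summed. The function names parse_region and region_length are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
def parse_region(region):
    """Split a region string such as '569-591,794-884' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the six strong hits listed above.
regions = {
    1: "38-176",
    2: "150-336",
    3: "351-519",
    4: "894-1098",
    5: "569-591,794-884",
    6: "577-634",
}

lengths = {number: region_length(region) for number, region in regions.items()}
```

Under the inclusive-coordinate assumption, a two-segment region is simply the sum of its segment lengths, so discontinuous domains need no special handling beyond the comma split.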
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000021263   Gene: ENSPTRG00000012421   Transcript: ENSPTRT00000023056
Sequence length 1233
Comment pep:known_by_projection chromosome:CHIMP2.1.4:2B:124601369:125477089:1 gene:ENSPTRG00000012421 transcript:ENSPTRT00000023056 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDSLPRLTSVLTLLFSGLWHLGLTATNYNCDDPLASLLSPMAFSSSSDLTGTHSPAQLNW
RVGTGGWSPADSNAQQWLQMDLGNRVEITAVATQGRYGSSDWVTSYSLMFSDTGRNWKQY
KQEDSIWTFAGNMNADSVVHHKLLHSVRARFVRFVPLEWNPSGKIGMRVEVYGCSYKSDV
ADFDGRSSLLYRFNQKLMSTLKDVISLKFKSMQGDGVLFHGEGQRGDHITLELQKGRLAL
HLNLDDSKARLSSSLPSATLGSLLDDQHWHSVLIERVGKQVNFTVDKHTQHFRTKGETDA
LDIDYELSFGGIPVPGKPGTFLKKNFHGCIENLYYNGVNIIDLAKRRKHQIYTGNVTFSC
SEPQIVPITFVNSSGSYLLLPGTPQIDGLSVSFQFRTWNKDGLLLSTELSEGSGTLLLSL
EGGILRLVIQKMTERVAEILTGSNLNDGLWHSVSINARRNRITLTLDDEAAPPAPDSTWV
QIYSGNSYYFGGCPDNLTDSQCLNPIKAFQGCMRLIFIDNQPKDLISVQQGSLGNFSDLH
IDLCSIKDRCLPNYCEHGGSCSQSWTTFYCNCSDTSYTGATCHNSIYEQSCEVYRHQGNT
AGFFYIDSDGSGPLGPLQVYCNITEDKIWTSVQHNNTELTRVRGANPEKPYAMALDYGGS
MEQLEAMIDGSEHCEQEVAYHCRRSRLLNTPDGTPFTWWIGRSNERHPYWGGSPPGVQQC
ECGLDESCLDIQHFCNCDADKDEWTNDTGFLSFKDHLPVTQIVITDTDRSNSEAAWRIGP
LRCYGDRRFWNAVSFYTEASYLHFPTFHAEFSADISFFFKTTALSGVFLENLGIKDFIRL
EISWGTSSRQKGFLGCIRSLHLNGQKMDLEERAKVTSGVRPGCPGHCSSYGSICHNGGKC
VEKHNGYLCDCTNSPYEGPFCKKEVSAVFEAGTSITYMFQEPYPVTKNISLSSSAIYTDS
APSKEKIALSFVTAQAPSLLLFINSSSQDFVAVLLCKNGSLQVRYHLNKEETHVFTIDAD
NFANRRMHHLKINREGRELTIQMDQQLRLSYNFSLEVEFRVIRSLTLGKVTENLGLDSEV
AKANAMGFAGCMSSVQYNHIAPLKAALRHATVAPVTVHGTLTESSCGFMVDSDVNAVTTV
HSSSDPFGKTDEREPLTNAVRSDSAVIGGVIAVVIFIIFCIIGIMTRFLYQHKQSHRTSQ
MKEKEYPENLDSSFRNEIDLQNTVSECKREYFI
Identical sequences ENSPTRP00000021263 ENSPTRP00000021263
