SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000018906 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSNLEP00000018906
Domain Number 1 Region: 38-176
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.06e-40
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00065
Further Details:      
 
Domain Number 2 Region: 150-336
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.98e-38
Family Laminin G-like module 0.0068
Further Details:      
 
Domain Number 3 Region: 351-519
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.3e-35
Family Laminin G-like module 0.0055
Further Details:      
 
Domain Number 4 Region: 726-738,774-936
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.23e-35
Family Laminin G-like module 0.0021
Further Details:      
 
Domain Number 5 Region: 967-1171
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.64e-30
Family Laminin G-like module 0.0063
Further Details:      
 
Domain Number 6 Region: 578-634
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 0.0000000212
Family Fibrinogen C-terminal domain-like 0.0053
Further Details:      
 
Domain Number 7 Region: 548-584
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000331
Family EGF-type module 0.017
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000018906   Gene: ENSNLEG00000015567   Transcript: ENSNLET00000019858
Sequence length 1306
Comment pep:known_by_projection supercontig:Nleu1.0:GL397388.1:1244551:2152292:1 gene:ENSNLEG00000015567 transcript:ENSNLET00000019858 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDSVPRLTSVLTLLFSGLWHLGLTATNYNCDDPLASLLSPMAFSSSSDLTGTHSPAQLNW
RVGAGGWSPADSNAQQWLQMDLGNRVEITAVATQGRYGSSDWVTSYSLMFSDTGRNWKQY
KQEDSIWTFAGNMNADSVVHHKLLHSVRARFVRFVPLEWNPSGKIGMRVEVYGCSYKSDV
ADFDGRSSLLYRFNQKLMSTLKDVISLKFKSMQGDGVLFHGEGQRGDHITLELQKGRLAL
HLNLDDSKARFSSSLPSAILGSLLDDQHWHSVLIERVGKQVNFTVDKHTQHFRTKGETDA
LDIDYELSFGGIPVPGKPGTFLKKNFHGCIENLYYNGVNIIDLAKRRKHQIYTGNVTFSC
SEPQIVPITFVNSSGSYLLLPGTPQIDGLSVSFQFRTWNKDGLLLSTELSEGSGTLLLSL
EGGILRLVIQKMTERVAEILTGSNLNDGLWHSVSINARRNRITLTLDNEAAPPAPDSTWV
QIYSGNSYYFGGCPDNLTDSQCLNPIKAFQGCMRLIFIDNQPKDLISVQQGSLGNFSDLH
IDLCSIKDRCLPNYCEHGGSCSQSWTTFYCNCSDTSYAGATCHNSIYEQSCEVYRHQGNT
AGFFYIDSDGSGPLGPLQVYCNITEDKIWTSVQHNNTELTRVRGANPEKPYAMALDYGGS
MEQLEAMIDGSEHCEQEVAYHCRRSRLLNTPDGTPFTWWIGRSNERHPYWGGSPPGVQQC
ECGLDESCLDIQHFCNCDADKDEWANDTGFLSFKDHLPVTQIVITDTDRSNSEAAWRIGP
LRCYGDRRLWNAVSFYTEASYLHFPTFHAEFSADISFFFKTTALSGVFLENLGIKDFIRL
EISSPSEITFAIDVGNGPMELVVQSPSLLNDNQWHYVRAERNLKETSLQVDNLPRSTRET
SEEGHFRLQLNSQLFVGGTSSRQKGFLGCIRSLHLNGQKLDLEERAKVTSGVRPGCPGHC
SSYGSICHNGGKCVEKHNGYLCDCTNSPYEGPFCKKEVSAVFEAGTSVTYMFQEPYPVTK
NISLSSSAIYTDSAPSKENIAFSFVTAQAPSLLLFINSSSQDFLAVLLCKNGSLQVRYHL
NKEETHVFTIDADNFANRRMHHLKINREGRELTIQMDQQLRLSYNFSPEVEFRVIRSLTL
GKVTENLGLDSEVAKANVMGFAGCMSSVQYNHIAPLKAALRHATVAPVTVHGTLMESSCG
FMVDSDVNAVTTVHSSSDPFGKIDEREPLTNAVRSDSAVIGGVIAVVIFIIFCIIGIMTR
FLYQHKQSHRTSQMKEKEYPENLDSSFRNDIDLQNTVSECKREYFI
Download sequence
Identical sequences G1S078
ENSNLEP00000018906 ENSNLEP00000018906 XP_003276159.1.23891

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]