SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000018784 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSNLEP00000018784
Domain Number 1 Region: 234-437
Classification Level  Classification                                  E-value
Superfamily           Concanavalin A-like lectins/glucanases          9.04e-30
Family                Laminin G-like module                           0.042
 
Domain Number 2 Region: 682-829
Classification Level  Classification                                  E-value
Superfamily           Metalloproteases ("zincins"), catalytic domain  0.0000000000232
Family                Reprolysin-like                                 0.067
 
Domain Number 3 Region: 1524-1598
Classification Level  Classification                                  E-value
Superfamily           Complement control module/SCR domain            0.000000000139
Family                Complement control module/SCR domain            0.0019
 
Domain Number 4 Region: 1590-1647
Classification Level  Classification                                  E-value
Superfamily           Complement control module/SCR domain            0.00000653
Family                Complement control module/SCR domain            0.0065
 
Domain Number 5 Region: 541-631,720-771
Classification Level  Classification                                  E-value
Superfamily           Metalloproteases ("zincins"), catalytic domain  0.0000115
Family                Reprolysin-like                                 0.067
 
Domain Number 6 Region: 1469-1530
Classification Level  Classification                                  E-value
Superfamily           Complement control module/SCR domain            0.0000181
Family                Complement control module/SCR domain            0.0066
 
Domain Number 7 Region: 837-879,1063-1107
Classification Level  Classification                                  E-value
Superfamily           Fibronectin type III                            0.0000609
Family                Fibronectin type III                            0.0048
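
The Region fields above are ranges of residue positions, and some domains (5 and 7) are split into several comma-separated segments. A minimal Python sketch for reading these strings is shown here. It assumes the coordinates are 1-based and inclusive and that segments within one region do not overlap each other. The helper names parse_region, region_length and overlap are hypothetical and not part of SUPERFAMILY.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Parse a region string such as "541-631,720-771" into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total number of residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def overlap(region_a: str, region_b: str) -> int:
    """Number of residue positions shared by two regions."""
    shared = 0
    for a_start, a_end in parse_region(region_a):
        for b_start, b_end in parse_region(region_b):
            shared += max(0, min(a_end, b_end) - max(a_start, b_start) + 1)
    return shared


if __name__ == "__main__":
    # Domain 2 and domain 5 are both assigned to the same superfamily.
    print(region_length("541-631,720-771"))
    print(overlap("682-829", "541-631,720-771"))
```

Under these assumptions, the same helpers can be used to check whether neighbouring assignments, such as domains 3 and 4, share residues at their boundary.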
 
Weak hits

Sequence:  ENSNLEP00000018784
Domain Number - Region: 1729-1758
Classification Level  Classification                                  E-value
Superfamily           Notch domain                                    0.00314
Family                Notch domain                                    0.0067
 
Domain Number - Region: 1417-1467
Classification Level  Classification                                  E-value
Superfamily           Complement control module/SCR domain            0.00393
Family                Complement control module/SCR domain            0.008
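
The split between strong and weak hits follows the superfamily E-value. On this page every strong hit is below 0.0001 and every weak hit is above it, which is consistent with a cut-off of 1e-4. The page itself does not state the cut-off, so the value in the sketch below is an assumption, and classify_hit is a hypothetical helper name. The sketch also shows that Python's float() reads both notations used on this page (9.04e-30 and 0.0000000000232).

```python
STRONG_CUTOFF = 1e-4  # assumed cut-off; not stated on this page


def classify_hit(evalue_text: str, cutoff: float = STRONG_CUTOFF) -> str:
    """Label a superfamily E-value as "strong" or "weak" relative to the cut-off."""
    return "strong" if float(evalue_text) < cutoff else "weak"


if __name__ == "__main__":
    # Superfamily E-values copied from the assignments on this page.
    for text in ["9.04e-30", "0.0000000000232", "0.0000609", "0.00314", "0.00393"]:
        print(text, classify_hit(text))
```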
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000018784   Gene: ENSNLEG00000015448   Transcript: ENSNLET00000019726
Sequence length 1791
Comment pep:known_by_projection supercontig:Nleu1.0:GL397275.1:27034507:27420532:1 gene:ENSNLEG00000015448 transcript:ENSNLET00000019726 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MMCLKILRISLAILAGWALCSANCELGWTRKKSLVEREHLNQVLLDGERCWLGAKVRRPR
ASPQHHLFGVYPSRAGNYLRPYPVGEQEIHHTGRSKPDTEGNAVSLVPRDLTENPAGLRG
AVEQPAAPWVGDSPIGQSELLGDDDAYLGNQRSKESLGEAGIQKGSAMAATTTTAIFTTL
NEPKPESQRRGWSKSRQRRQVWKRLAEDVQGDSDISSHLQPWTKHSLKHRVRKSPPEESN
QNGGEGSYREAETFNSQVGLPILYFSGRRERLLLRPEVLAEIPREAFTVEAWVKPEGGQN
NPAIIAGVFDNCSHTVSDKGWALGIRSGKDKGKRDARFFFSLCTDRMKRATILISHSRYQ
PGTWTHVAATYDGRRMALYVDGTQVASSLDQSGPLNSPFMASCRSLLLGGDSSEDGHYFR
GHLGTLVFWSTALPQSHFQHSSQHSSGEEEATDLVLTASFEPVNTEWVPFRDEKYPRLEV
LKGFEPEPEILSPLRPPLCGQTVCDNVELISQYNGYWPLRGEKVIRYQVVNICDDEGLNP
IVSEEQIRLQHEALNEAFSRYNISWQLSVHQVRNSTLRHRVVLVNCEPSKIGNDHCDPEC
EHPLTGYDGGDCRLQGRCYSWNRRDGLCHVECNNMLNDFDDGDCCDPQVADVRKTCFDPD
SPKRAYMSVKELKEALQLNSTHFLNIYFASSVREDLAGAATWPWDKDAVTHLGGIVLSPA
YYGMPGHTDTMIHEVGHVLGLYHVFKGVSERESCNDPCKETVPSMETGDLCADTAPTPKS
ELCREPEPTSDTCGFTRFPGAPFTNYMSYTDDNCTDNFTPNQVARMHCYLDLVYQQWTES
RKPTPIPIPPMVIGQTNKSLTIHWLPPISGVVYDRASGSMCGACTEDGTFRQYVHTASSR
RVCDSSGYWTPEEAVGPPDVDQPCEPSLQAWSPEVHLYHMNMTVPCPTEGCSLELLFQHP
VQADTLTLWVTSFFMESSQVLFDTEILLENKESVHLGPLDTFCDIPLTIKLHVDGKVSGV
KVYTFDERIEIDAALLTSQPHSPLCSGCRPVRYQVLRDPPFASGLPVVVTHSHRKFMDVE
VTPGQMYQYQVLAEAGGELGEASPPLNHIHGAPYCGDGKVSESLGEECDDGDLVSGDGCS
KVCELEEGFNCVGEPSLCYMYEGDGICEPFERKTSIVDCGIYTPKGYLDQWATRAYSSHE
DKKKCPVSSVTGEPHSLICTSYHPDLPNHRTLTGWFPCVASENETQDDRSEQPEGSLKKE
DEVWLKVCFNRPGVARAIFIFLTTDGLVPGEHRQPTVTLYLTDVRGSNHSLGTYGLSCQH
NPLIINVTHHQNVLFHHTTSVLLNFSSPRVGISAVALRTFSHIGISTPSNCISEDEGQNQ
QEQSCFHRPCGKQDSCPSLLLDHADVVNCTSIGPGLMKCAITCQRGFALQASSGQYIRPM
QKEILLTCSSGHWDQNVSCLPVDCGVPDPSLVNYANFSCSEGTNFLKRCSISCVPPAKLQ
GLSPWLTCLEDGLWSLPEVYCKLECDAPPVILNANLLLPHCLQDNHDVGTICKYECKPGY
YVAESAEGKVRNKLLKIQCLEGGIWEQGSCIPVVCEPPPPVFEGMYECTNGFNLDSQCVL
NCNQEREKLPILCTKEGLWTQEFKLCENLQGECPPPPSELNSVEYKCEQGYGIGAVCSPL
CVIPPSDPVMLPENITADTLEHWMEPVKVQNKWNTGRRQWHPDPVIVQSIQSCEPFQADG
WCDTINNRAYCHYDGGDCCSSTLSSKKVIPFAADCDLDECTCRDPKAEENQ
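
The sequence above is wrapped over several lines. As a rough sketch, the lines can be joined and a domain region cut out in Python as follows. It assumes region coordinates are 1-based and inclusive; join_sequence and slice_region are hypothetical helper names. When the full sequence is joined, its length should equal the stated Sequence length of 1791.

```python
def join_sequence(lines: list[str]) -> str:
    """Join wrapped sequence lines into one string, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence: str, region: str) -> str:
    """Return the residues for a region such as "837-879,1063-1107".

    Assumes 1-based inclusive coordinates; segments are concatenated in order.
    """
    pieces = []
    for part in region.split(","):
        start, end = (int(value) for value in part.strip().split("-"))
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


if __name__ == "__main__":
    # Only the first wrapped line of the sequence, for illustration.
    wrapped = ["MMCLKILRISLAILAGWALCSANCELGWTRKKSLVEREHLNQVLLDGERCWLGAKVRRPR"]
    sequence = join_sequence(wrapped)
    print(len(sequence))
    print(slice_region(sequence, "1-5"))
```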
Identical sequences ENSNLEP00000018784 ENSNLEP00000018784
