SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSNLEP00000016203 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSNLEP00000016203
Domain Number 1 Region: 3133-3291
Classification Level Classification E-value
Superfamily C-type lectin-like 2.97e-41
Family C-type lectin domain 0.00000564
 
Domain Number 2 Region: 143-259
Classification Level Classification E-value
Superfamily C-type lectin-like 8.48e-41
Family Link domain 0.002
 
Domain Number 3 Region: 252-353
Classification Level Classification E-value
Superfamily C-type lectin-like 1.03e-29
Family Link domain 0.003
 
Domain Number 4 Region: 3292-3350
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.000000000125
Family Complement control module/SCR domain 0.0022
 
Domain Number 5 Region: 28-147
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000244
Family V set domains (antibody variable domain-like) 0.012
 
Domain Number 6 Region: 3089-3129
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000286
Family EGF-type module 0.0093
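The strong hits above are listed by E-value rather than by position, so the order of domains along the protein is not obvious at a glance. The following is a minimal sketch, not part of the SUPERFAMILY server, that sorts the six regions by start residue and reports where neighbouring regions share residues. The names Domain, in_sequence_order and overlaps are hypothetical helpers introduced for illustration; the coordinates are copied from the listing above.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    number: int        # "Domain Number" from the listing
    superfamily: str   # superfamily-level classification
    start: int         # first residue of the region (1-based, inclusive)
    end: int           # last residue of the region (1-based, inclusive)


# Regions copied from the strong hits listed above.
DOMAINS = [
    Domain(1, "C-type lectin-like", 3133, 3291),
    Domain(2, "C-type lectin-like", 143, 259),
    Domain(3, "C-type lectin-like", 252, 353),
    Domain(4, "Complement control module/SCR domain", 3292, 3350),
    Domain(5, "Immunoglobulin", 28, 147),
    Domain(6, "EGF/Laminin", 3089, 3129),
]


def in_sequence_order(domains):
    """Return the domains sorted by start residue (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d.start)


def overlaps(domains):
    """Return (first, second, shared_residues) for neighbouring regions that overlap."""
    ordered = in_sequence_order(domains)
    found = []
    for a, b in zip(ordered, ordered[1:]):
        shared = a.end - b.start + 1  # inclusive coordinates
        if shared > 0:
            found.append((a.number, b.number, shared))
    return found


if __name__ == "__main__":
    for d in in_sequence_order(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} {d.superfamily}")
    print(overlaps(DOMAINS))
```

Read this way, the assignments fall into an N-terminal cluster (residues 28-353) and a C-terminal cluster (residues 3089-3350), with a long unassigned stretch between them. Only adjacent regions are compared, which is sufficient for this listing.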
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.



Protein sequence

External link(s) Protein: ENSNLEP00000016203   Gene: ENSNLEG00000013300   Transcript: ENSNLET00000017031
Sequence length 3394
Comment pep:known_by_projection supercontig:Nleu1.0:GL397283.1:15997282:16111119:1 gene:ENSNLEG00000013300 transcript:ENSNLET00000017031 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFINIKSILWMCSTFIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE
FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKVGQDYKGRVSVPTHPEAVGDASLAVVKL
LASDAGLYRCDVMYGIEDTQDTVSLAVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA
TPEQLFAAYEDGFEQCDAGWLSDQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD
VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS
DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKPKEATTIDLSILA
ETASPSLSKEPQMVSDRTTPVIPLVDELPVIPTEFPPVGNIVNFEQKATVQPQAVTDSLA
TKLPTPTGSTKKPWDMDDYSPSASGPLGKLDISEIKEEVLQSTTGVSHYATDSWDGIVED
TQTQESVTQIEQIEVGPLVTSMEILKHIPSKEFPVTETPLVTARMTLESKTEKKTVSTVS
ELVTTGHYGFTLGEEDDEDRTLTVGSDESTLIFDQMPEVITVSKTSEDTIHTQLEDLESV
SASTTVSPLIMPDKNGSSMDDWEERQTSGRITEDFLGKYLSTTPFPPQHHTEIELFPYSG
DKILVEGVSTVIYPSLQTEMTHRRERTETLIPEMRTDTYTDEIQEEITKSSFMGKTEEEF
FSGMKLSTSPSEPIHVTESSVEMTKSFDFPTLTTKLSAEPTEVRDVEEDFTATPGTTKYD
ENITTVLLAHGTLSVEAATVSKWSWDEDNTTSKPLESTEPSASSKLPPALLTTVGMNGKD
KETPSFTEDGADEFTLIPDSIQKQLEEVTDEDIAAHGKFTIRFQPTTSIGIAEKSTLRDS
TTEEKVPPITSTEGQVYVTMEGSALGEVEDVDLSKPVSTVAQFAHTSEVEGLAFVSYSST
QEPTTYVDSSHTIPLSVIPKTDWGVLVPSVPSEDEVLGEPSQDILVIDQTHLEVTISPET
MRTTKITEGTTQEEFPWKEQTAEKPVPTLSSTAWTPKEAVTPLDEQEGDGSAYTVSEDEL
LTGSERVPVLETTPVGKIDHSVSYPPGAITEHKVKTDEVVTLTPRIGPKVSLSPGPEQKY
ETEGSSTTGFTSLSPFSTHVTQLMEETTTEKTSLEDVGLGSGLFEKPKATELIEFSTIKV
TVPSDITTAFSSVARLHTTSAFKPSSMITKKPPLIDREPGEETTSDMVIIGESTSHVPPT
TVEDIVAKETETDIDREYFTTSSPPATQPTRPPTVEDKEAFGPQALSTPQPPARTKFHPD
INVYIIEVRENKTGRMSDLSVIGHPIDSESKEDEPCSEETDPVHDLMAEILPDFPDIIEI
DLYHSEENEEEEEECANATDVTTTPSVQYINGKHLVTTVPKDPEAAEARRGQFESVAPSQ
NFSDSSESDTHPFVVAEMELSTAVQPNESTETTESLEITWKPEIYPEASEHFSGGEPDVF
PTVPFHEEFEHGTAKKGAESVIERVTEVGHQAHEHTEPVSLFPEESSGEIAIDQESQKIA
FARATEVTFGEEVEKSTSVTYTPTIVPSSASAYVSEEEAVTLIGNPWPDNLLSTKESWVE
ATPRQVVELSGSSSIPITEASGEAEEDEDTMFTMVTDLSQRNTTDTLITLDTSRIITESF
FEVPATTIYSVSEQPSAKVMPTKFVSETDTSEWISSTSVEEKKRKEEEGTTGTASTVEVY
SPTQRLDQLILPSELESPNVATSSDSGTRKSFMSLTTPTQSEREMTDSTPVFTETNTLEN
FWAQTTEHSSIHQPGAQEGLTTLPGSPASVFMEQGSGEAAADPETTTVSSFSLNLEHEIQ
AKKEAAGSLSPHVETTFSTEPTGLVLSTVMNREVAENISQTSREVLISERLGEPNYGAEI
RGFSTGFPLEEDFSGDFREYSTVSHPIAKEETVMMEGSGDAAFRDTQISPSTVPTSVHIS
LISDSEGPSSTMVSTSAFPWEEFTSSAEGSGEQLVTVSSSVVPVLPSAVGKFSGTASYII
DEGLGEVGTINEIDRRSTILPTAEVEGTKAPIEKEEVKVSGTISTNFPQTMEPAKLWSRQ
EVNPVRQEIESETTSEEQIQEEKSFESPQNSPATEQTIFDSQTFTETELKTTDYSVLTTK
KTYSDDKEMKEEGTSLVNVSTPDPDANGLESYTTLPEATEKSHFFLATALVTESIPAEHV
VTDSPIKEEESTKPFPKGMRPTIQESGTELLFSGLGSGEEVLPTLPTESVNFTEVEQINN
TLYPHTSQVESTSSDKTEDSNRMENVAKEVGPLVSQTDIFEGSGSVTSTTLIEILSDTGA
EGPTVAPLPFSTDIGHPQNQTLRWAEEIQTSRPQTITEQDSNKNSSTAEITETTTASTDF
LARAYGFEMAKEFVTSAPKPSDLYYEPSGEGSGEVDIVGSFHTSATTQATRQESSTTFVS
DGSLEKHPEVPSAKAVTADGFPTVSVLLPLHSEQNKSSPDPTSTLSNTVSYERSTDGSFQ
DHFREFEDSTLKPNRKKPTENIIIDLDKEDKDLILTITESTILEILPELTSDKNTIIDID
HTKPVYEDILGMQTDIDPEVPSEPHDSNDENNDDSTQVQETYEASVNLSLTEETFEGSGD
VLASYTQATHDESMTYEDRSQLDHMGFNFTTGIPAPSTETELDILLPMATSLPIPRKSAT
VIPEIEGIKAEAKALDDMFESSTLSDGQAIADQSEIIPTLGQFERTQEEYEDKKHAGPSF
QPEFFSGAEEALVDHTPYLSIATTHLMDQSLTEVPKVMEGSSPPYYTDTTLAVSTSAKLS
SQTPSSPLTIYSGSEASGHTEIPQPSALPEIDVGSSVMSPQDSFKESHVIEATFKPSSEE
YLHITEPTSLSPDTKLEPSEDDAKPELLEETESSPTELIAVEGTEILQDFQNKTDGQVSG
EAIKMFPTIKTPEAGTVITTADEIKLEGATQWPHSTSASATYGVEAGVVPWLSPQTSERP
TLSSSPEINPETQAALIRGQDSTTAASEQQVAARILDSNNQATVSPVEFNTEVATPPFSL
LETSNETDFLIGINEESVEGTAIYLPGPDRCKMNPCLNGGTCYPTETSYVCTCVPGYSGD
QCELDFDECHSNPCRNGATCVDGFNTFRCLCLPSYVGALCEQDTETCDYGWHKFQGQCYK
YFAHRRTWDAAERECRLQGAHLTSILSHEEQMFVNRVGHDYQWIGLNDKMFEHDFRWTDG
STLQYENWRPNQPDSFFSAGEDCVVIIWHENGQWNDVPCNYHLTYTCKKGTVACGQPPVV
ENAKTFGKMKPRYEINSLIRYHCKDGFIQRHLPTIRCLGNGRWAIPKITCMNPSAYQRTY
SMKYFKNSSSAKDNSINTSKHDHRWSRRWQESRR
Identical sequences: G1RSI0, ENSNLEP00000016203, XP_003261608.1.23891
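The Comment field in this section packs several key:value tokens into one line. As a rough sketch, it can be split into a dictionary as shown below. The function names parse_comment and parse_location are hypothetical, and the layout of the location token (assembly, sequence region, start, end, strand) is an assumption based on the usual Ensembl FASTA header convention, not something stated on this page.

```python
# Comment string copied from the record above.
COMMENT = (
    "pep:known_by_projection "
    "supercontig:Nleu1.0:GL397283.1:15997282:16111119:1 "
    "gene:ENSNLEG00000013300 transcript:ENSNLET00000017031 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split each whitespace-separated token at its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Parse 'assembly:region:start:end:strand' (assumed field order)."""
    assembly, region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    # In this record the coordinate system name "supercontig" is the key.
    location = parse_location(fields["supercontig"])
    print(fields["gene"], fields["transcript"])
    print(location["end"] - location["start"] + 1, "bp spanned on the assembly")
```

The location key depends on the coordinate system (here "supercontig"), so code handling other records would need to look it up rather than hard-code it.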
