SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000021021 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSNLEP00000021021

Domain Number 1 Region: 191-370
  Classification Level   Classification                  E-value
  Superfamily            Galactose-binding domain-like   1.7e-35
  Family                 APC10-like                      0.0021

Domain Number 2 Region: 1808-1868
  Classification Level   Classification                  E-value
  Superfamily            RING/U-box                      0.00000000006
  Family                 ZZ domain                       0.0082

Domain Number 3 Region: 1766-1817
  Classification Level   Classification                  E-value
  Superfamily            RING/U-box                      0.000000000753
  Family                 ZZ domain                       0.02

Domain Number 4 Region: 82-148
  Classification Level   Classification                  E-value
  Superfamily            EF-hand                         0.00000187
  Family                 Polcalcin                       0.061
 
Weak hits

Sequence: ENSNLEP00000021021

Domain Number - Region: 1091-1181
  Classification Level   Classification                  E-value
  Superfamily            Spermadhesin, CUB domain        0.00236
  Family                 Spermadhesin, CUB domain        0.0089

Domain Number - Region: 2695-2784
  Classification Level   Classification                  E-value
  Superfamily            Spermadhesin, CUB domain        0.0249
  Family                 Spermadhesin, CUB domain        0.011
 

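The strong and weak hits are listed by E-value rather than by position, and two of the RING/U-box regions share residues. The following is a minimal Python sketch, not part of the SUPERFAMILY server, that stores the hits from the tables above, orders them along the sequence, and reports overlapping regions. The `Hit` class and the function names are hypothetical, and regions are assumed to be 1-based and inclusive.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Hit:
    """One domain assignment row (hypothetical container, not a SUPERFAMILY type)."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float   # superfamily-level E-value as listed on the page
    strong: bool    # True for "Strong hits", False for "Weak hits"


# Values copied from the domain assignment tables for ENSNLEP00000021021.
HITS = [
    Hit("Galactose-binding domain-like", 191, 370, 1.7e-35, True),
    Hit("RING/U-box", 1808, 1868, 0.00000000006, True),
    Hit("RING/U-box", 1766, 1817, 0.000000000753, True),
    Hit("EF-hand", 82, 148, 0.00000187, True),
    Hit("Spermadhesin, CUB domain", 1091, 1181, 0.00236, False),
    Hit("Spermadhesin, CUB domain", 2695, 2784, 0.0249, False),
]


def overlap(a: Hit, b: Hit) -> int:
    """Number of residues shared by two inclusive regions (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def overlapping_pairs(hits):
    """Return (hit, hit, shared_residues) for every pair of regions that overlap."""
    return [
        (a, b, overlap(a, b))
        for a, b in combinations(hits, 2)
        if overlap(a, b) > 0
    ]


# Order the hits along the protein instead of by E-value.
by_position = sorted(HITS, key=lambda h: h.start)

if __name__ == "__main__":
    for h in by_position:
        kind = "strong" if h.strong else "weak"
        print(f"{h.start:>5}-{h.end:<5} {h.superfamily} ({kind}, E={h.evalue:g})")
    for a, b, shared in overlapping_pairs(HITS):
        print(f"{a.start}-{a.end} and {b.start}-{b.end} share {shared} residues")
```

With the values above, the only overlapping pair is 1808-1868 and 1766-1817, which share 10 residues (1808-1817).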
Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSNLEP00000021021, Gene: ENSNLEG00000017302, Transcript: ENSNLET00000022077
Sequence length: 2948
Comment: pep:known_by_projection supercontig:Nleu1.0:GL397423.1:2117265:2247290:-1 gene:ENSNLEG00000017302 transcript:ENSNLET00000022077 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
XAAAAGGEGWVPHQDWAAVSGTTPGPGLAAPALPPAAALLEPARLREAAAALLPTPPCES
LVSRHRGALFRWLEERLGRGEESVTLEQFRELLEARGAGCSSEQFEEAFAQFDAEGDGTV
DAENMLEALKNSSGANLQGELSHIIRQLQACSLVPGFTDIFSESKEGLDTHSSMILRFLH
RNRLSSAVMPYPMLEHCNNMCTMRSSVLRESLDQLVQKEKESPGDLTRSPEMDKLKSVAK
CYAYIETSSNSADIDKMTNGETSSYWQSDGSACSHWIRLKMKPDVVLRHLSIAVAATDQS
YMPQQVTVAVGRNASDLQEVRDVHIPSNVTGYVTLLENANVSQLYVQINIKRCLSDGCDT
RIHGLRAVGFQRVKKSGVSVSDASAIWYWSLLTSLVTASMETNPAFVQTVLHNTQKALRH
MPPLSLSPGSTDFSTFLSPNVLEEVDSFLIRITSCCSTPEVELTLLAFALARGSVAKVMS
SLCTITDHLDTQYDASSLILSMASVRQNLLLKYGKPLQLTLQACDVKGKEDKSGPENLLV
EPWTRDGFLTETGKTRASTIFSTGTESAFQVTQIRIMVRRGGIGAQCGLVFAYNSSSDKF
CAEEHFKRFEKYDKWKLQELRQFVKSRIGCSSDDLGEDDPIGWFELEEEWDEADVKLQQC
RVAKYLMVKFLCTRQESAERLGVQGLSISGYLRPARTEAEQSITCAHCRKDTEESVCGAT
LLLRTLQFIQQLAHDLVQQKESGLKYKSFLDFAGLDLQIFWNFYSKLKQNPREECISAQT
LLLQLLQSCFSVLQGDVLAASEEEKTPIQSPKGVEAAKELYTHLCDVVDKVDGDSVPMEI
LKQEVRNTLLNGAAIFFPNRQTRRNHLFTMMNVTEQEHKQSLQLTFRSLCTYFSDKDPGG
LLLLPEKNDLAKMNISEVLAVMDTLLSVAARECELLMLSGAPGEVGSVLFSLFWSVQGSL
LSWCYLQLKSTDSGAKDLAVDLIEKYVGQFLASMRVILESLLSQYNGKTIVERLCNSVFS
MAARQLVIFLLDFCTLDIPHCVLLREFSILTELLKKLCSGPEGGLRKLDVETWQQEQPVV
LHTWTKESAHNYENNCHEVSVFVSPGATYFEVEFDDRCETEKRYDYLEFTDARGRKTRYD
TKVGTDKWPKKVTFKAGPRLQFLFHSDSSHNEWGYKFTVTAYGLPDVAVSWGLDLQLLVS
RLMGRLASQCMALKSVHQLGSNMVVPQAKMALVLSSPLWKPVFRHQVCPELELEASWPTH
PHRNSKEVKNIPDDPCRHFLLDFAQSEPAQNFCGPYSELFKGFIQACRKQAPKTDIVAGS
TIDQAVNATFAALVYRTPDLYEKLQKYVNSGGRIALSEEFAQVYSLADGIRIWMLEMKQK
SLMSLGNEAEEKHSSEATEANPESLAKECIEKSLLLLKFLPTGISSKESCEKLETADETS
HLQPLDKRQRTSSVVEEHFQASVSPTEAAPPATGDWSPGLDTQPKLPSSSGLPAADVSPA
TAEEPLSPSTPTRRPPFTRGRLRLLSFRSMEEARLVPTVKEKYPVLKDVMDFIKDQSLSH
RSVVKVLSLRKAQAQSILEVLKITQYCAESLGQPHCFHPPFILFLLELLTCQKDFTNYFG
HLEGCGADLHKEIRDTYYQLVLFLVKAVKGFSSLNDRSLLPALSCVQTALLHLLDMGWEP
SDLAFFVDIQLPDLLMKMSQENISVHDSVISQWSEEDELADAKQNSEWMDECQDGMFEAW
YEKIAQEDPEKQRKMHMFIARYCDLLNVDISCDGCDEIAPWHRYRCLQCSDMDLCKTCFL
GGVKPEGHGDNHEMVNMEFTCDHCQGLIIGRRMNCNVCDDFDLCYGCYAAKKYSYGHLPT
HSITAHPMVTIRISDRQRLIQPYIHNYSWLLFAALALYSAHLASAEDVDGEKLDPQTRSS
ATTLRSQCMQLVGDCLMKAHQGKGLKALALLGVLPDGDASPEDQALPVTVPTRASEEQLE
KKAVQGAELSEAGNGKRAVHEEVRPVDFKQRNKADKGVSLTKDPSCQTQISDSPADASPP
TGLPDAEDSEVSSQKPIEEKAVTPSPEQVFAECSQKRILGLLAAMLPPLKSGPTVPLIDL
EHVLPLMFQVVISNAGHLNETYHLTLGLLGQLIIRLLPAEVDAAVIKVLSAKHNLFAAGD
SSIVPDGWKTTHLLFSLGAVCLDSRVGLDWACSMAEILRSLNSAPLWRDVIATFTDHCIK
QLPFQLKHTNIFTLLVLVGFPQVLCVGTRCVYMDNANEPHNVIILKHFTEKNRAVIVDVK
TRKRKTVKDYQLVQKGGGQECGDSQAQLSQYSQHFAFIASHLLQSSMDSHCPEAVEATWV
LSLALKGLYKTLKAHGFEETHATFLQTDLLKLLVKKCSKGTGFSKTWLLRDLEILSIMLY
SSKKEINALAEHGDLELDERGDREEEVERPVSSPGDPEQKKLDPLEGLDEPTRICFLVWM
ASLWPGPCFSLNMKMLLSAVLRIHFLKTQSVFDGDELTTDERIRSLAQRWQPSKSLRLEE
QSAKAVDTDMIILPCLSRPVRCDQATAESNPVTQKLISSTESELQQSYAKQRRSKSAALL
HKELNCKSKRAVRDYLFRVNEATAVLYARHVLASLLAEWPSHVPVSEDILELSGPAHMTY
ILDMFMQLEEKHEWEKILQKVLQGCREDMLGTMALAACQFMEEPGMEVQVRESKHPYNNN
TNFEDKVHIPGAIYLSIKFDSQCNTEEGCDELAMSSSSDFQQDRHSFSGSQQKWKDFELP
GDTLYYRFTSDMSNTEWGYKFTVTAGHLGRFQTGFEILKQMLSEERVVPHLPLAKIWEWL
VGVACRQTGHQRLKAIHLLLRIVRCCGHSDLCDLALLKPLWQLFTHMEYGLFEDAGVPGG
LGELHCASGAVSAPALSLQELGVLQDYLLALTTDDHLLRCAAQALQNIAAISLINYPNKA
TRLWNVEC
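To relate the domain regions to the sequence text, the wrapped lines can be joined and sliced by residue number. This is a minimal Python sketch under two assumptions: region coordinates are 1-based and inclusive, and the sequence is wrapped at 60 residues per line, as it appears to be here. The function names are hypothetical.

```python
def join_wrapped(lines):
    """Concatenate wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def extract_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


def locate(position, width=60):
    """Map a 1-based residue number to (line, column), both 1-based,
    for text wrapped at `width` residues per line (assumed 60 here)."""
    line, col = divmod(position - 1, width)
    return line + 1, col + 1


if __name__ == "__main__":
    # Toy input; substitute the sequence lines of the record in practice.
    toy = join_wrapped(["ABCDE", "FGHIJ", "KL"])
    print(extract_region(toy, 4, 7))
    print(locate(191))   # first residue of Domain Number 1
    print(locate(2948))  # last residue, given the stated sequence length
```

Under the 60-residue assumption, residue 2948 falls on the 50th line at column 8, which matches a final line of 8 residues after 49 full lines.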
Identical sequences: ENSNLEP00000021021
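The Comment field in the Protein sequence section looks like an Ensembl-style FASTA description: space-separated `key:value` tokens, with the location token read as coordinate system, assembly, region name, start, end and strand. The Python sketch below splits it on that assumption. The function names and the returned dictionary layout are hypothetical.

```python
COMMENT = (
    "pep:known_by_projection "
    "supercontig:Nleu1.0:GL397423.1:2117265:2247290:-1 "
    "gene:ENSNLEG00000017302 transcript:ENSNLET00000022077 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split 'key:value' tokens on the first colon of each token."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(coord_system, value):
    """Interpret 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "coord_system": coord_system,
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    loc = parse_location("supercontig", fields["supercontig"])
    span = loc["end"] - loc["start"] + 1
    print(fields["gene"], fields["transcript"], fields["gene_biotype"])
    print(f"{loc['name']}:{loc['start']}-{loc['end']} strand {loc['strand']}, {span} bp")
```

Read this way, the gene spans 130026 bp on the reverse strand of supercontig GL397423.1.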