SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSNLEP00000011285 from Nomascus leucogenys 76_1.0

Domain assignment details

Strong hits

Sequence:  ENSNLEP00000011285
Domain Number 1 Region: 2671-2959
Classification Level | Classification | E-value
Superfamily | BEACH domain | 2.49e-126
Family | BEACH domain | 0.0000000151

Domain Number 2 Region: 3030-3229,3392-3424
Classification Level | Classification | E-value
Superfamily | WD40 repeat-like | 4.58e-31
Family | WD40-repeat | 0.0012

Domain Number 3 Region: 2517-2639
Classification Level | Classification | E-value
Superfamily | PH domain-like | 1.05e-22
Family | PreBEACH PH-like domain | 0.017

Domain Number 4 Region: 3438-3498
Classification Level | Classification | E-value
Superfamily | FYVE/PHD zinc finger | 0.0000000000576
Family | FYVE, a phosphatidylinositol-3-phosphate binding domain | 0.0031

Domain Number 5 Region: 1091-1286
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 0.0000000673
Family | Trypanosoma sialidase, C-terminal domain | 0.072

Domain Number 6 Region: 470-514,580-658,852-912
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.00000197
Family | HspBP1 domain | 0.069
 
Weak hits

Sequence:  ENSNLEP00000011285
Domain Number - Region: 1410-1461,1513-1579,1614-1704
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.000141
Family | HspBP1 domain | 0.073

Domain Number - Region: 2345-2404
Classification Level | Classification | E-value
Superfamily | Cyclin-like | 0.00241
Family | Retinoblastoma tumor suppressor domains | 0.014

Domain Number - Region: 117-391
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.0317
Family | Mo25 protein | 0.091
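
The Region values in the hit listings can be read as comma-separated residue ranges, so a discontinuous domain such as 3030-3229,3392-3424 is one domain spread over two segments. The following Python sketch parses those strings, sums the segment lengths, and orders the strong hits from N- to C-terminus. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names parse_region and region_length are illustrative and not part of any SUPERFAMILY tool.

```python
from typing import Dict, List, Tuple

Segment = Tuple[int, int]  # (start, end), assumed 1-based and inclusive


def parse_region(region: str) -> List[Segment]:
    """Split a region string like '3030-3229,3392-3424' into (start, end) pairs."""
    segments: List[Segment] = []
    for part in region.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"segment start after end: {part!r}")
        segments.append((start, end))
    return segments


def region_length(segments: List[Segment]) -> int:
    """Total residues covered, under the inclusive-coordinate assumption."""
    return sum(end - start + 1 for start, end in segments)


# Superfamily name -> region string, copied from the strong hits listed above.
strong_hits: Dict[str, str] = {
    "BEACH domain": "2671-2959",
    "WD40 repeat-like": "3030-3229,3392-3424",
    "PH domain-like": "2517-2639",
    "FYVE/PHD zinc finger": "3438-3498",
    "Concanavalin A-like lectins/glucanases": "1091-1286",
    "ARM repeat": "470-514,580-658,852-912",
}

SEQUENCE_LENGTH = 3507  # "Sequence length" reported for this protein

# Order domains by the start of their first segment (N- to C-terminus).
ordered = sorted(strong_hits.items(), key=lambda item: parse_region(item[1])[0][0])

# Residues covered by strong hits; the segments listed above do not overlap.
covered = sum(region_length(parse_region(region)) for region in strong_hits.values())

if __name__ == "__main__":
    for name, region in ordered:
        print(f"{region:>28}  {name}  ({region_length(parse_region(region))} aa)")
    print(f"strong-hit coverage: {covered / SEQUENCE_LENGTH:.1%}")
```

Sorting by first-segment start gives a linear reading of the domain architecture; it does not attempt to represent domains inserted inside one another.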
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000011285   Gene: ENSNLEG00000009171   Transcript: ENSNLET00000011841
Sequence length 3507
Comment pep:known_by_projection supercontig:Nleu1.0:GL397302.1:16761622:17032939:-1 gene:ENSNLEG00000009171 transcript:ENSNLET00000011841 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MNMVKRIMGRPRQEECSPQDNALGLMHLRRLFTELCHPPRHMTQKEQEEKLYMMLPVFNR
VFGNAPPNTMTEKFSDLLQFTTQVSRLMVTEIRRRASNKSTEAASRAIVQFLEINQSEEA
SRGWMLLTTINLLASSGQKTVDCMTTMSVPSTLVKCLYLFFDLPHVPEAVGGAQNELPLA
ERRGLLQKVFVQILVKLCSFVSPAEELAQKDDLQLLFSAITSWCPPYNLPWRKSAGEVLM
TISRHGLSVNVVKYIHEKECLSTCVQNMQQSDDLSPLEIVEMFAGLSCFLKDSSDVSQTL
LDDFRIWQGYNFLCDLLLRLEQAKEAESKDALKDLVNLITSLTTYGVSELKPAGITTGAP
FLLPGFAVPQPAGKGHSVRNIQAFAVLQNAFLKAKTSFLAQIILDAITNIYMADNANYFI
LESQHTLSQFAEKISKLPEVQNKYFEMLEFVVFSLNYIPCKELISVSILLKSSSSYHCSI
IAMKTLLKFTRHDYIFKDVFREVGLLEVMVNLLHKYAALLKDPTQALNEQGDSRNNSSVE
DQKHLALLVMETLTVLLQGSNTNAGIFREFGGARCAHNIVKYPQCRQHALMTIQQLVLSP
NGDDDMGTLLGLMHSAPPTELQLKTDILRALLSVLRESHRSRTVFRKVGGFVYITSLLVA
MERSLSCPPKNGWEKVNQNQVFELLHTVFCTLTAAMRYEPANSHFFKTEIQYEKLADAVR
FLGCFSDLRKISAMNVFPSNTQPFQRLLEEDVVSIESVSPTLRHCSKLFIYLYKVATDSF
DSRAEQIPPCLTSESSLPSPWGTPALSRKRHAYHSVSTPPVYPPKNVADLKLHVTTSSLQ
SSDAVIIHPGAMLAMLDLLASVGSVTQPEHALDLQLAVANILQSLVHTERNQQVMCEAGL
HARLLQRCSAALADEDHSLHPPLQRMFERLASQALEPMVLREFLRLASPLNCGAWDKKLL
KQYRVHKPSSLSYEPEMRSSMITSLEGLGTDNVFSLHEDNHYRISKSLVKSAEGSTVPLT
RVKCLVSMTTPHDIRLHGSSVTPAFVEFDTSLEGFGCLFLPSLAPHNAPTNNTVTTGLID
GAVVSGIGSGERFFPPPSGLSYSSWFCIEHFSSPPNNHPVRLLTVVRRANSSEQHYVCLT
IVLSAKDRSLIVSTKEELLQNYVDDFSEESSFYEILPCCARFRCGELIIEGQWHHLVLVM
SKGMLKNSTAALYIDGQLVNTVKLHYVHSTPGGSGSANPPVVSTVYAYIGTPPAQRQIAS
LVWRLGPTHFLEEVLPSSNVTTIYELGPNYVGSFQAVCMPCKDAKSEGVVPSPVSLVPEE
KVSFGLYALSVSSLTVARIRKVYNKLDSKAIAKQLGISSHENATPVKLIHNSAGHLNGSA
RTIGASLIGYLGVRTFVPKPVATTLQYIGGAAAILGLVAMASDVEGLYAAVKALVCVVKS
NPLASKEMERIKGYQLLAMLLKKKRSLLNSHILHLTFSLVGTVDSGHETSIIPNSTAFQD
LLCDFEVWLHAPYELHLSLFEHFIELLTESSEASKNAKLMREFQLIPKLLLTLRDMSLSQ
PTIAAISNVLSFLLQGFPNSNDLLRFGQFISSTLPTFAVCEKFVVMEINNEEKLDTGTEE
EFGGLVSANLILLRNRLLDILLKLIYTSKEKTSINLQACEELVKTLGFDWIMMFMEEHLH
STTVTAAMRILVVLLSNQSILIKFKEGLSGGGWLEQTDSVLTNKIGTVLGFNVGRSAGGR
STVREINRDACHFPGFPVLQSFLPKHTNVPALYFLLMALFLQQPVSELPENLQVSVPVIS
CRSKQGCQFDLDSIWTFIFGVPASSGTVVSSIHNVCTEAVFLLLGMLRSMLNSPWQSEEE
GSWLREYPVTLMQFFRYLYHNVPDLASMWMSPDFLCALAATVFPFNIRPYSEMVTDLDDE
VGSPAEEFKAFATDTGMNRSQSEYCNVGTKTYLTNHPAKKFVFDFMRVLIIDNLCLTPAS
KQTPLIDLLLEASPERSTRTQQKEFQTYILDSVMDHLLAADVLLGEDASLPITSGGSYQV
LVNNVFYFTQRVVDKLWQGMFNKESKLLIDFIIQLIAQSKRRSQGLSLDAVYHCLNRTIL
YQFSRAHKTVPQQVALLDSLRVLTVNRNLILGPGNHDQEFISCLAHCLINLHVGSNVDGF
GLEAEARMTTWHIMIPSDIEPDGSYSQDISEGRQLLIKAVNRVWTELIHSKKQVLEELFK
VTLPVNERGHVDIATARPLIEEAALKCWQNHLAHEKKCISRGEALAPTTQSKLSRVSSGF
GLSKLTGSRRNRKESGLNKHSLSTQEISQWMFTHIAVVRDLVDTQYKEYQERQQNALKYV
TEEWCQIECELLRERGLWGPPIGSHLDKWMLEMTEGPCRMRKKMVRNDMFYNHYPYVPET
EQETNVAKPARYRRAVSYDSKEYYMRLASGNPAIVQDAIVESSEGEAAQQEPEHGEDTIA
KVKGLVKPPLKRSRSAPDGGDEENQEQLQDQIAEGSSIEEEEKTDNATLLRLLEEGEKIQ
HMYRCARVQGLDTSEGLLLFGKEHFYVIDGFTMTATREIRDIETLPPNMHEPIIPRGARQ
GPSQLKRTCSIFAYEDIKEVHKRRYLLQPIAVEVFSGDGRNYLLAFQKGIRNKVYQRFLA
VVPSLTDSSESVSGQRPNTSVEQGSGLLSTLVGEKSVTQRWERGEISNFQYLMHLNTLAG
RSYNDLMQYPVFPWILADYDSEEVDLTNPKTFRNLAKPMGAQTDERLAQYKKRYKDWEDP
NGETPAYHYGTHYSSAMIVASYLVRMEPFTQIFLRLQGGHFDLADRMFHSVREAWYSASK
HNMADVKELIPEFFYLPEFLFNSNNFDLGCKQNGTKLGDVILPPWAKGDPREFIRVHREA
LECDYVSAHLHEWIDLIFGYKQQGPAAVEAVNVFHHLFYEGQVDIYNINDPLKETATIGF
INNFGQIPKQLFKKPHPPKRVRSRLNGDNAGISVLPGSTSDKIFFHHLDNLRPSLTPVKE
LKEPVGQIVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSCRLGTYESDKAVTVYECLS
EWGQILCAICPNPKLVITGGTSTVVCVWEMGTSKEKAKTVTLKQALLGHTDTVTCATASL
AYHIIVSGSRDRTCIIWDLNKLSFLTQLRGHRAPVSALCINELTGDIVSCAGTYIHVWSI
NGNPIVSVNTFTGRSQQIICCCMSEMNEWDTQNVIVTGHSDGVVRFWRMEFLQVPETPAP
EPAEVLEIQEDCPEAQIGQEAQDEDSSDSEADEQSVSQDPKDTPSQPSSTSHRPRAASCR
ATAAWCTDSGSDDSRRWSDQLSLDEKDGFIFVNYSEGQTRAHLQGPLSHPHPNPIEVRNY
SRLKPGYRWERQLVFRSKLTMHTAFDRKDNAHPAEVTALGVSKDHSRILVGDSRGRVFSW
SVSDQPGRSAADHWVKMKADSCSGCSVRFSLTERRHLQDCGQLFCQKCSRFQSEIKRLKI
SSPVRVCQNCYYNLQHERGSEDGPRNC
Identical sequences ENSNLEP00000011285
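
The Comment line of the protein record appears to follow the Ensembl FASTA header layout: space-separated key:value tokens, with the location token laid out as coordinate system, assembly, sequence region, start, end, strand. Below is a minimal Python sketch for pulling these fields apart, assuming that layout holds for this entry; the helper names parse_ensembl_comment and parse_location are hypothetical.

```python
from typing import Dict, Union

# Comment string copied from the protein record above.
COMMENT = (
    "pep:known_by_projection "
    "supercontig:Nleu1.0:GL397302.1:16761622:17032939:-1 "
    "gene:ENSNLEG00000009171 transcript:ENSNLET00000011841 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_ensembl_comment(comment: str) -> Dict[str, str]:
    """Map each token's key (text before the first ':') to the remainder."""
    fields: Dict[str, str] = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> Dict[str, Union[str, int]]:
    """Split 'assembly:seq_region:start:end:strand' (assumed layout)."""
    assembly, seq_region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "seq_region": seq_region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_ensembl_comment(COMMENT)
# The location token is keyed by its coordinate system, here "supercontig".
location = parse_location(fields["supercontig"])
# Genomic span of the gene model, assuming inclusive coordinates.
span = location["end"] - location["start"] + 1

if __name__ == "__main__":
    print(fields["gene"], fields["transcript"], fields["gene_biotype"])
    print(location["seq_region"], location["strand"], span)
```

Because the location key varies with the coordinate system (for example chromosome versus supercontig), a more general parser would need to detect that token by its five colon-separated parts and not by a fixed key.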
