SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000011281 from Nomascus leucogenys 76_1.0

Domain assignment details

Strong hits

Sequence:  ENSNLEP00000011281
Domain Number 1 Region: 2688-2976
Classification Level Classification E-value
Superfamily BEACH domain 2.62e-126
Family BEACH domain 0.0000000151
 
Domain Number 2 Region: 3047-3246,3409-3441
Classification Level Classification E-value
Superfamily WD40 repeat-like 3.4e-31
Family WD40-repeat 0.0012
 
Domain Number 3 Region: 2534-2656
Classification Level Classification E-value
Superfamily PH domain-like 1.11e-22
Family PreBEACH PH-like domain 0.017
 
Domain Number 4 Region: 3455-3515
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000576
Family FYVE, a phosphatidylinositol-3-phosphate binding domain 0.0031
 
Domain Number 5 Region: 1091-1286
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000673
Family Trypanosoma sialidase, C-terminal domain 0.072
 
Domain Number 6 Region: 470-514,580-658,852-910
Classification Level Classification E-value
Superfamily ARM repeat 0.00000238
Family HspBP1 domain 0.088
 
Weak hits

Sequence:  ENSNLEP00000011281
Domain Number - Region: 1410-1461,1513-1579,1614-1704
Classification Level Classification E-value
Superfamily ARM repeat 0.000141
Family HspBP1 domain 0.073
 
Domain Number - Region: 2345-2405
Classification Level Classification E-value
Superfamily Cyclin-like 0.0018
Family Retinoblastoma tumor suppressor domains 0.014
 
Domain Number - Region: 122-390
Classification Level Classification E-value
Superfamily ARM repeat 0.0374
Family Mo25 protein 0.091
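
The Region fields above list one or more residue ranges per domain, with commas separating the segments of a discontinuous domain. A minimal Python sketch for reading them follows; the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
def parse_region(region):
    """Split a string such as '3047-3246,3409-3441' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Count residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


if __name__ == "__main__":
    # Region strings copied from the strong hits listed above.
    for region in ("2688-2976", "3047-3246,3409-3441", "470-514,580-658,852-910"):
        print(region, parse_region(region), region_length(region))
```

Under that assumption, the BEACH domain (2688-2976) spans 289 residues, and the two-segment WD40 repeat-like domain covers 233.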
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000011281   Gene: ENSNLEG00000009171   Transcript: ENSNLET00000011836
Sequence length 3524
Comment pep:known_by_projection supercontig:Nleu1.0:GL397302.1:16761622:17032939:-1 gene:ENSNLEG00000009171 transcript:ENSNLET00000011836 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MNMVKRIMGRPRQEECSPQDNALGLMHLRRLFTELCHPPRHMTQKEQEEKLYMMLPVFNR
VFGNAPPNTMTEKFSDLLQFTTQVSRLMVTEIRRRASNKSTEAASRAIVQFLEINQSEEA
SRGWMLLTTINLLASSGQKTVDCMTTMSVPSTLVKCLYLFFDLPHVPEAVGGAQNELPLA
ERRGLLQKVFVQILVKLCSFVSPAEELAQKDDLQLLFSAITSWCPPYNLPWRKSAGEVLM
TISRHGLSVNVVKYIHEKECLSTCVQNMQQSDDLSPLEIVEMFAGLSCFLKDSSDVSQTL
LDDFRIWQGYNFLCDLLLRLEQAKEAESKDALKDLVNLITSLTTYGVSELKPAGITTGAP
FLLPGFAVPQPAGKGHSVRNIQAFAVLQNAFLKAKTSFLAQIILDAITNIYMADNANYFI
LESQHTLSQFAEKISKLPEVQNKYFEMLEFVVFSLNYIPCKELISVSILLKSSSSYHCSI
IAMKTLLKFTRHDYIFKDVFREVGLLEVMVNLLHKYAALLKDPTQALNEQGDSRNNSSVE
DQKHLALLVMETLTVLLQGSNTNAGIFREFGGARCAHNIVKYPQCRQHALMTIQQLVLSP
NGDDDMGTLLGLMHSAPPTELQLKTDILRALLSVLRESHRSRTVFRKVGGFVYITSLLVA
MERSLSCPPKNGWEKVNQNQVFELLHTVFCTLTAAMRYEPANSHFFKTEIQYEKLADAVR
FLGCFSDLRKISAMNVFPSNTQPFQRLLEEDVVSIESVSPTLRHCSKLFIYLYKVATDSF
DSRAEQIPPCLTSESSLPSPWGTPALSRKRHAYHSVSTPPVYPPKNVADLKLHVTTSSLQ
SSDAVIIHPGAMLAMLDLLASVGSVTQPEHALDLQLAVANILQSLVHTERNQQVMCEAGL
HARLLQRCSAALADEDHSLHPPLQRMFERLASQALEPMVLREFLRLASPLNCGAWDKKLL
KQYRVHKPSSLSYEPEMRSSMITSLEGLGTDNVFSLHEDNHYRISKSLVKSAEGSTVPLT
RVKCLVSMTTPHDIRLHGSSVTPAFVEFDTSLEGFGCLFLPSLAPHNAPTNNTVTTGLID
GAVVSGIGSGERFFPPPSGLSYSSWFCIEHFSSPPNNHPVRLLTVVRRANSSEQHYVCLT
IVLSAKDRSLIVSTKEELLQNYVDDFSEESSFYEILPCCARFRCGELIIEGQWHHLVLVM
SKGMLKNSTAALYIDGQLVNTVKLHYVHSTPGGSGSANPPVVSTVYAYIGTPPAQRQIAS
LVWRLGPTHFLEEVLPSSNVTTIYELGPNYVGSFQAVCMPCKDAKSEGVVPSPVSLVPEE
KVSFGLYALSVSSLTVARIRKVYNKLDSKAIAKQLGISSHENATPVKLIHNSAGHLNGSA
RTIGASLIGYLGVRTFVPKPVATTLQYIGGAAAILGLVAMASDVEGLYAAVKALVCVVKS
NPLASKEMERIKGYQLLAMLLKKKRSLLNSHILHLTFSLVGTVDSGHETSIIPNSTAFQD
LLCDFEVWLHAPYELHLSLFEHFIELLTESSEASKNAKLMREFQLIPKLLLTLRDMSLSQ
PTIAAISNVLSFLLQGFPNSNDLLRFGQFISSTLPTFAVCEKFVVMEINNEEKLDTGTEE
EFGGLVSANLILLRNRLLDILLKLIYTSKEKTSINLQACEELVKTLGFDWIMMFMEEHLH
STTVTAAMRILVVLLSNQSILIKFKEGLSGGGWLEQTDSVLTNKIGTVLGFNVGRSAGGR
STVREINRDACHFPGFPVLQSFLPKHTNVPALYFLLMALFLQQPVSELPENLQVSVPVIS
CRSKQGCQFDLDSIWTFIFGVPASSGTVVSSIHNVCTEAVFLLLGMLRSMLNSPWQSEEE
GSWLREYPVTLMQFFRYLYHNVPDLASMWMSPDFLCALAATVFPFNIRPYSEMVTDLDDE
VGSPAEEFKAFATDTGMNRSQSEYCNVGTKTYLTNHPAKKFVFDFMRVLIIDNLCLTPAS
KQTPLIDLLLEASPERSTRTQQKEFQTYILDSVMDHLLAADVLLGEDASLPITSGGSYQV
LVNNVFYFTQRVVDKLWQGMFNKESKLLIDFIIQLIAQSKRRSQGLSLDAVYHCLNRTIL
YQFSRAHKTVPQQVALLDSLRVLTVNRNLILGPGNHDQEFISCLAHCLINLHVGSNVDGF
GLEAEARMTTWHIMIPSDIEPDGSYSQDISEGRQLLIKAVNRVWTELIHSKKQVLEELFK
VTLPVNERGHVDIATARPLIEEAALKCWQNHLAHEKKCISRGEALAPTTQSKLSRVSSGF
GLSKLTGSRRNRKESGLNKHSLSTQEISQWMFTHIAVVRDLVDTQYKEYQERQQNALKYV
TEEWCQIECELLRERGLWGPPIGSHLDKWMLEMTEGPCRMRKKMVRNDMFYNHYPYVPET
EQETNVASEIPSKQPETPDDIPQKKPARYRRAVSYDSKEYYMRLASGNPAIVQDAIVESS
EGEAAQQEPEHGEDTIAKVKGLVKPPLKRSRSAPDGGDEENQEQLQDQIAEGSSIEEEEK
TDNATLLRLLEEGEKIQHMYRCARVQGLDTSEGLLLFGKEHFYVIDGFTMTATREIRDIE
TLPPNMHEPIIPRGARQGPSQLKRTCSIFAYEDIKEVHKRRYLLQPIAVEVFSGDGRNYL
LAFQKGIRNKVYQRFLAVVPSLTDSSESVSGQRPNTSVEQGSGLLSTLVGEKSVTQRWER
GEISNFQYLMHLNTLAGRSYNDLMQYPVFPWILADYDSEEVDLTNPKTFRNLAKPMGAQT
DERLAQYKKRYKDWEDPNGETPAYHYGTHYSSAMIVASYLVRMEPFTQIFLRLQGGHFDL
ADRMFHSVREAWYSASKHNMADVKELIPEFFYLPEFLFNSNNFDLGCKQNGTKLGDVILP
PWAKGDPREFIRVHREALECDYVSAHLHEWIDLIFGYKQQGPAAVEAVNVFHHLFYEGQV
DIYNINDPLKETATIGFINNFGQIPKQLFKKPHPPKRVRSRLNGDNAGISVLPGSTSDKI
FFHHLDNLRPSLTPVKELKEPVGQIVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSCR
LGTYESDKAVTVYECLSEWGQILCAICPNPKLVITGGTSTVVCVWEMGTSKEKAKTVTLK
QALLGHTDTVTCATASLAYHIIVSGSRDRTCIIWDLNKLSFLTQLRGHRAPVSALCINEL
TGDIVSCAGTYIHVWSINGNPIVSVNTFTGRSQQIICCCMSEMNEWDTQNVIVTGHSDGV
VRFWRMEFLQVPETPAPEPAEVLEIQEDCPEAQIGQEAQDEDSSDSEADEQSVSQDPKDT
PSQPSSTSHRPRAASCRATAAWCTDSGSDDSRRWSDQLSLDEKDGFIFVNYSEGQTRAHL
QGPLSHPHPNPIEVRNYSRLKPGYRWERQLVFRSKLTMHTAFDRKDNAHPAEVTALGVSK
DHSRILVGDSRGRVFSWSVSDQPGRSAADHWVKMKADSCSGCSVRFSLTERRHLQDCGQL
FCQKCSRFQSEIKRLKISSPVRVCQNCYYNLQHERGSEDGPRNC
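
The Comment line of this record packs several `key:value` fields into one whitespace-separated string. The sketch below shows one way to split it in Python. The function names are hypothetical, and reading the `supercontig` value as assembly, sequence name, start, end, and strand is an assumption based on how the field looks, not something this page documents.

```python
def parse_comment(comment):
    """Split whitespace-separated 'key:value' tokens at the first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Assumed layout: assembly:name:start:end:strand."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


COMMENT = (
    "pep:known_by_projection "
    "supercontig:Nleu1.0:GL397302.1:16761622:17032939:-1 "
    "gene:ENSNLEG00000009171 transcript:ENSNLET00000011836 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    print(fields["gene"], fields["transcript"])
    print(parse_location(fields["supercontig"]))
```

Splitting at the first colon only keeps the multi-part location value intact, so it can be parsed separately.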
Identical sequences ENSNLEP00000011281 ENSNLEP00000011281
