SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSDORP00000010446 from Dipodomys ordii 76_1

Domain assignment details

Strong hits

Sequence: ENSDORP00000010446

Domain Number 1  Region: 2688-2814
  Classification Level  Classification  E-value
  Superfamily           BEACH domain    6.41e-50
  Family                BEACH domain    0.00000628

Domain Number 2  Region: 3083-3262,3395-3440
  Classification Level  Classification    E-value
  Superfamily           WD40 repeat-like  5.01e-30
  Family                WD40-repeat       0.0013

Domain Number 3  Region: 2895-2967
  Classification Level  Classification  E-value
  Superfamily           BEACH domain    2.62e-26
  Family                BEACH domain    0.00016

Domain Number 4  Region: 2533-2655
  Classification Level  Classification           E-value
  Superfamily           PH domain-like           1.11e-22
  Family                PreBEACH PH-like domain  0.017

Domain Number 5  Region: 3448-3517
  Classification Level  Classification                                           E-value
  Superfamily           FYVE/PHD zinc finger                                     7.67e-20
  Family                FYVE, a phosphatidylinositol-3-phosphate binding domain  0.0015

Domain Number 6  Region: 1090-1286
  Classification Level  Classification                            E-value
  Superfamily           Concanavalin A-like lectins/glucanases    0.000000158
  Family                Trypanosoma sialidase, C-terminal domain  0.085
 
Weak hits

Sequence: ENSDORP00000010446

Domain Number -  Region: 470-658
  Classification Level  Classification    E-value
  Superfamily           ARM repeat        0.000576
  Family                Armadillo repeat  0.036

Domain Number -  Region: 2343-2402
  Classification Level  Classification                           E-value
  Superfamily           Cyclin-like                              0.00214
  Family                Retinoblastoma tumor suppressor domains  0.014

Domain Number -  Region: 117-411
  Classification Level  Classification    E-value
  Superfamily           ARM repeat        0.0269
  Family                Armadillo repeat  0.03
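
The Region values in the strong and weak hits are written as one or more start-end ranges separated by commas (for example 3083-3262,3395-3440 for a domain split into two segments). The following is a minimal Python sketch for working with such strings. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly. The function names parse_region, region_length and extract_region are hypothetical helpers and not part of any SUPERFAMILY tool.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '3083-3262,3395-3440' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start < 1 or start > end:
            raise ValueError(f"invalid segment: {part!r}")
        segments.append((start, end))
    return segments


def region_length(region: str) -> int:
    """Total number of residues covered, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def extract_region(sequence: str, region: str) -> str:
    """Concatenate the residues of every segment (assumed 1-based, inclusive)."""
    # Convert each 1-based inclusive pair to a 0-based half-open Python slice.
    return "".join(sequence[start - 1:end] for start, end in parse_region(region))
```

Under these assumptions, region_length("3083-3262,3395-3440") adds the lengths of both segments of the WD40 repeat-like domain, and extract_region can be applied to the protein sequence once its line breaks have been removed.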
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Protein sequence

External link(s)
  Protein: ENSDORP00000010446
  Gene: ENSDORG00000011115
  Transcript: ENSDORT00000011117
Sequence length: 3525
Comment: pep:known_by_projection genescaffold:dipOrd1:GeneScaffold_4650:6547:170566:-1 gene:ENSDORG00000011115 transcript:ENSDORT00000011117 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MNMVKRIMGRPRQEECSPQDNALGLMHLRRLFTELCHPPRHMTQKEQEEKLYMMLPVFNR
VFGNAPPNTMTEKFSDLLQFTTQVSRLMVTEIRRRASNKSTEAASRAIVQFLEINQSEEA
SRGWMLLTTINLLASSGQKTVDCMTTMSVPSTLVKCLYLFFDLPHVPEAGGGAQNELPLA
ERRGLLQKVFVQILVKLCSFVSPAEELAQKDDLQLLFSAITSWCPPYNLPWRKSAGEVLM
TISRHGLSVNVVKYIHEKECLSTCVQNMQQSDDLSPLEIVEMFAGLSCFLKDSSDVSQTL
LDDFRIWQGYNFLCDLLLRLEQAKEAESKDALKDLVHLITSLTTYGVSELKPAGITTGAP
FLLPGFAVPQPAGKGHSVRNIQAFAVLQNAFLKAKTNFLAQIILDAITNIYMADNANYFI
LESQHTLSQFAEKISKLPEVQNKYFEMLEFVVFSLNYIPCKELISVSILLKSSSSYHCSI
VAMKTLLKFTRHDYVFKDVFREVGLLEVMVTLLHKYAALLKDPAQALNEQGDSRNNSSVE
DQKHLALLVMETLTVLLQGSNTNAGIFREFGGARCAHNIVKYPQCRQHALMTIQQLVLSP
NGDDDMGTLLGLMHSAPPTELQLKTDILRALLSVLRESHRSRTVFRKVGGFVYITSLLVA
MERSLSSPPKNGWEKVNQNQVFELLHTVFCTLTAAMRYEPANSHFFKTEIQYEKLADAVR
FLGCFSDLRKISVMNVFPSNTQPFQRLLEEDVTSVDSVSPTLRHCSKLFIYLYKVATDSF
DSRAEQIPPCLTSESSLPSPWGTPALSRKRHAYHSVSTPPVYPAKNVADLKLHGTASPLQ
NSDAVIIHPGAMLAMLDLLASVGSVTQPEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEFLRLASPLNCGAWDKKLL
KQYRVHKPSSLSYEPEMRSSMITSLEGLGSDNVFSFHEDNHYRISKSLVKSAEGSTVPLT
RVKCLVSMTTPHDIRLHGSSVTPAFVEFDTSLEGFGCLFLPSLAPHNAPTNNAVTTGLTD
GAVVSGLGSGERFFPPPSGLSYSSWFCIEHFSSPPNNHPVRLLTVVRRANSSEQHFVCLA
VVLSAKDRSLIVSTKEELLQNYVDDFSEESSFYEILPCCARFRCGELIVEGQWHHLVLVM
SKGMLKNSTAALYLDGQLVSTVKLHYVHSTPGGSGSVNPPVVSTVYAYIGTPPAQRQIAS
LVWRLGPTHFLEEVLPPSSVTSIYELGPNYVGSFQAVCMPCKDAKSEGIIPSPVSLVPEE
KVSFGLYALSVSSLTVARIRKVYNKLDSKAIAKQXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXVRTFVPKPVATTLQYIGGAAAILGLVAMASDVEGLYAAVKALVCVVKS
NPLASKEMERIKGYQLLAMLLKKKRSLLNSHILHLTFSLVGTVDSGHETSIIPNSTAFQD
LLCDFEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXFGQFISSTLPTFAVCEKFVVMEINNEEKLDTTEEE
FGSLVSANLILLRNRLLDILLKLVYTSKEKTSINLXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXSNQSILIKFKEGLSGGGWLEQTDSVTNKIGTVLGFNVGRSAGGRWT
VREINRDACHFPGFPVLQSFLPKHTNVPALYFLLMALFLQQPVSELPENLQVSVPVISSR
CKQGCQFDLDSIWTFIFGVPASSGTVVSSIHNVCTEAAFLLLGMLRSMLNSXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXATVFPFNIRPYSEMVTDLDDEVG
SPAEEFKAFAADTGMNRSQSEYCNVGTKTYLTNHPAKKFVFDFMRVLIIDNLCLTPASKQ
TPLIDLLLEASPERSTRTQQKEFQTYILDSVMDHLLAADVLLGEDASLPITSGGSYQVLV
NNVFYFTQRVVDKLWQGMFNKESKLLIDFIIQLIAQSKRRSQGLSLDAVYHCLNRTILYQ
FSRAHKTVPQQVALLDSLRVLTVNRNLILGPGNHDQEFISCLAHCLINLHVGSNVDGFGL
EAEARMTTWHIMIPSDIEPDGGYSQDISEGRQLLXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEKKCVSRGEALVPTTQSKLSRVSSGFGL
SKLTGSRRNRKESGLNKHSLSTQEISQWMFTHIAVVRDLVDTQYKEYQERQQNALKYVTE
EWCQIECELLRERGLWGPPIGSHLDKWMLEMTEGPCRMRKKMVRNDMFYNHYPYVPETEQ
ETNVASEIPSKQPETPDDTIPQKKPARYRRAISYDSKEYYMRLASGNPAIVQDTIVESSE
GEAAQQEPEHGEDTIAKVKGLVKPPLKRSRSAPDGGDEENQDQLQDQIAESSSIEEEEKT
DNATLLRLLEEGEKIQHMYRCARVQGLDTSEGLLLFGKEHFYVIDGFTMTATREIRDIET
LPPNMHEPIIPRGARQGPSQLKRTCSIFAYEDIKEVHKRRYLLQPIAVEVFSGDGRNYLL
AFQKGIRNKVYQRFLAVVPSLTDSSESVSGQRPNTSVEQGSGLLSTLVGEKSVTQRWERG
EISNFQYLMHLNTLAGRSYNDLMQYPVFPWILADYDSEEVDLTNPKTFRNLAKPMGAQTD
ERLAQYKKRYKDWEDPNGETPAYHYGTHYSSAMIVASYLVRMEPFTQIFLRLQXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXALECDYVSAHLHEWIDLIFGYKQQGPAAVEAVNVFHHLFYEGQVD
IYNINDPLKETATIGFINNFGQIPKQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXLKEPVGQIVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSCRL
GNYESDKAVTVYECLSEWGQILCAICPNPKLVITGGTSTVVCVWEMGTSKEKAKTLTLKQ
ALLGHTDTVNCATASLAYHIIVSGSQDRTCIIWDLNKLSFLTQLRGHRAPVSALCINELT
GDIVSCAGTYIHVWSINGNPIVSVNTFTGGSQHIVCCCVSEMNEWDTQNVIVTGHSDGVV
RFWRMEFLQVPETPAPEPVEVLEMQENCPEAQIXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXYRWERQLVFRSKLTMHTAFDRKDNAHPAEVTALGISKD
HSRILIGDSRGRVFSWSVSDQPGRSAADHWVKDEGGDSCSGCSVRFSLTERRHHCRNCGQ
LFCQKCSRFQSEIKRLKISSPVRVCQNCYYNLQHERSSEDGPRNC
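
The sequence above contains long runs of X, the conventional one-letter code for an unknown amino acid. As a rough sketch, the Python below locates those runs so they can be compared with the domain regions. It reports 1-based inclusive coordinates, which is an assumption chosen to match the Region notation on this page. The names clean_sequence, unknown_runs and unknown_fraction are hypothetical.

```python
import re
from typing import List, Tuple


def clean_sequence(text: str) -> str:
    """Remove line breaks and other whitespace from a wrapped sequence."""
    return "".join(text.split())


def unknown_runs(sequence: str) -> List[Tuple[int, int]]:
    """Return (start, end) for each run of 'X', 1-based and inclusive."""
    # match.start() is 0-based, so add 1; match.end() is exclusive in
    # 0-based terms, which equals the inclusive 1-based end position.
    return [(m.start() + 1, m.end()) for m in re.finditer(r"X+", sequence)]


def unknown_fraction(sequence: str) -> float:
    """Fraction of residues that are 'X' (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    return sequence.count("X") / len(sequence)
```

A typical use would be to pass the wrapped sequence text through clean_sequence first and then call unknown_runs on the result.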
Identical sequences: ENSDORP00000010446
