SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSCAFP00000013654 from Canis familiaris 76_3.1

Domain assignment details

Strong hits

Sequence:  ENSCAFP00000013654
Domain Number 1 Region: 2671-2959
Classification Level | Classification | E-value
Superfamily | BEACH domain | 2.49e-126
Family | BEACH domain | 0.0000000151
 
Domain Number 2 Region: 3030-3229,3392-3424
Classification Level | Classification | E-value
Superfamily | WD40 repeat-like | 5.19e-32
Family | WD40-repeat | 0.001
 
Domain Number 3 Region: 2517-2639
Classification Level | Classification | E-value
Superfamily | PH domain-like | 1.05e-22
Family | PreBEACH PH-like domain | 0.017
 
Domain Number 4 Region: 3432-3500
Classification Level | Classification | E-value
Superfamily | FYVE/PHD zinc finger | 6.72e-20
Family | FYVE, a phosphatidylinositol-3-phosphate binding domain | 0.0015
 
Domain Number 5 Region: 1088-1286
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 0.0000000686
Family | Trypanosoma sialidase, C-terminal domain | 0.085
 
Domain Number 6 Region: 470-514,580-658,852-911
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.00000205
Family | Armadillo repeat | 0.072
 
Weak hits

Sequence:  ENSCAFP00000013654
Domain Number - Region: 1410-1461,1513-1579,1614-1704
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.000132
Family | HspBP1 domain | 0.069
 
Domain Number - Region: 2345-2404
Classification Level | Classification | E-value
Superfamily | Cyclin-like | 0.00204
Family | Retinoblastoma tumor suppressor domains | 0.01
 
Domain Number - Region: 117-410
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.0691
Family | Armadillo repeat | 0.056
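
The Region fields above can list several comma-separated segments (for example 3030-3229,3392-3424), and hits are split into strong and weak by superfamily E-value. The following is a minimal Python sketch for working with these values, not the server's own code. It assumes that coordinates are 1-based and inclusive, and that the strong/weak cutoff is 1e-4. Both are assumptions that happen to be consistent with the hits listed here. The function names are hypothetical.

```python
from typing import List, Tuple

# Assumed cutoff: a superfamily E-value below this counts as a strong hit.
# This is an assumption that matches the listing above, not a documented constant.
STRONG_EVALUE_CUTOFF = 1e-4


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Parse 'start-end,start-end,...' into (start, end) pairs (assumed 1-based, inclusive)."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total number of residues covered by all segments of a region."""
    return sum(end - start + 1 for start, end in parse_region(region))


def is_strong(evalue: float, cutoff: float = STRONG_EVALUE_CUTOFF) -> bool:
    """Classify a hit as strong when its E-value is below the assumed cutoff."""
    return evalue < cutoff


def extract(sequence: str, region: str) -> str:
    """Concatenate the residues of every segment, converting to 0-based slices."""
    return "".join(sequence[start - 1:end] for start, end in parse_region(region))


print(region_length("2671-2959"))            # BEACH domain: 289 residues
print(region_length("3030-3229,3392-3424"))  # WD40 repeat-like: 233 residues
print(is_strong(2.05e-6), is_strong(0.000132))  # True False
```

Under these assumptions, the split WD40 repeat-like assignment covers 200 + 33 = 233 residues, and the ARM repeat hit at 0.000132 falls just on the weak side of the cutoff.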
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSCAFP00000013654   Gene: ENSCAFG00000009247   Transcript: ENSCAFT00000014761
Sequence length 3509
Comment pep:known_by_projection chromosome:CanFam3.1:32:8434080:8610578:-1 gene:ENSCAFG00000009247 transcript:ENSCAFT00000014761 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MNMVKRIMGRPRQEECSPQDNALGLMHLRRLFTELCHPPRHMTQKEQEEKLYMMLPVFNR
VFGNAPPNTMTEKFSDLLQFTTQVSRLMVTEIRRRASNKSTEAASRAIVQFLEINQSEEA
SRGWMLLTTINLLASSGQKTVDCMTTMSVPSTLVKCLYLFFDLPHVPEAVGGAQNELPLA
ERRGLLQKVFVQILVKLCSFVSPAEELAQKDDLQLLFSAITSWCPPYNLPWRKSAGEVLM
TISRHGLSVNVVKYIHEKECLSTCVQNMQQSDDLSPLEIVEMFAGLSCFLKDSSDVSQTL
LDDFRIWQGYNFLCDLLLRLEQAKEAESKDALKDLVNLITSLTTYGVNELKPAGITTGAP
FLLPGFAVPQPAGKGHSVRNIQAFAVLQNAFLKAKTNFLAQIILDAITNIYMADNANYFI
LESQHTLSQFAEKISKLPEVQSKYFEMLEFVVFSLNYIPCKELISVSILLKSSSSYHCSI
IAMKTLLKFTRHDYIFKDVFREVGLLEVMVNLLHKYAALLKDPTQALNEQGDSRNNSSIE
DQKHLALLVMETLTVLLQGSNTNAGIFREFGGARCAHNIVKYPQCRQHALMIIQQLVLSP
NGEDDMGTLLGLMHSAPPTELQLKTDILRALLSVLRESHRSRTVFRKVGGFVYITSLLVA
MERSLSSPPKNGWEKVNQNHVFELLHTVFCTLTAAMRYEPANSHFFKTEIQYEKLADAVR
FLGCFSDLRKISPMNVFPSNTQPFQRLLEEDVISMDTVSPTLRHCSKLFIYLYKVATDSF
DSRAEQIPPCLTSESSLPSPWGTPALSRKRHAYHSVSTPPVYPPKNIVDLKLHVATTSLQ
SSDAVIIHPGAMLAMLDLLASVGSVTQPEHALDLQLAVANILQSLVHTERNQQVMCEAGL
HARLLQRCSAALADEDHSLHPPLQRMFERLASQALEPMVLREFLRLASPLNCGAWDKKLL
KQYRVHKPSSLSYEPEMRSSLITSMEGLGSDNVFSLHEDNHYRISKSLVKSAEGSTVPLT
RVKCLVSMTTPHDIRLHGSSVTPAFVEFDTSLEGFGCLFLPSLAPHNAPTNNAVTTGLTD
GAVVSGIGSGERFFPPPSGLSYSSWFCIEHFSSPPNNHPVRLLTVVRRANSSEQHYVCLA
IILSAKDRSLIVSTKEELLQNYVDDFSEESSFYEILPCCARFRCGELIVEGQWHHLVLVM
SKGMLKNSTAALYIDGQLVNTVKLHYVHSTPGGSGSANPPVVSTVYAYVGTPPAQRQIAS
LVWRLGPTHFLEEVLPPSNVTTIYELGPNYVGSFQAVCMPCKDAKCEGVVPSPVSLVPEE
KVSFGLYALSVSSLTVARIRKVYNKLDSKAIAKQLGISSHENATPVKLIHNSAGHLNGPA
RTVGATLIGYLGVRTFVPKPVATTLQYIGGAAAILGLVAMASDVEGLYAAVKALVCVVKS
NPLASKEMERIKGYQLLAMLLKKKRSLLNSHILHLTFSLVGTVDSGHETSIIPNSTAFQD
LLCDFEVWLHAPYELHLSLFEHFIELLTESSEASKNAKLMREFQLIPKLLLTLRDMSLSQ
PTIAAISNVLSFLLQGFPNSNDLLRFGQFISSTLPTFAVCEKFVVMEINNEEKLDTGTEE
EFGGLVSANLILLRNRLLDILLKLVYTSKEKTSINLQACEELVKTLGFDWIMMFMEEHLH
STTVTAAMRILVVLLSNQSILIKFKEGLSGGGWLEQTDSVLTNKIGTVLGFNVGRSAGGR
STVREINRDACHFPGFPVLQSFLPKHTNVPALYFLLMALFLQQPVSELPENLQVSVPVIS
SRCKQGCQFDLDSIWTFIFGVPASSGTVVSSIHNVCTEAAFLLLGMLRSMLNSPWQSEEE
GSWLREYPVTLMQFFRYLYHNVPDLASMWMSPDFLCALAATVFPFNIRPYSEMVTDLDDE
VGSPAEEFKAFAADTGMNRSQSEYCNVGTKTYLTNHPAKKFVFDFMRVLIIDNLCLTPAS
KQTPLIDLLLEASPERSTRTQQKEFQTYILDSVMDHLLAADVLLGEDASLPITSGGSYQV
LVNNVFYFTQRVVDKLWQGMFNKESKLLIDFIIQLIAQSKRRSQGLSLDAVYHCLNRTIL
YQFSRAHKTVPQQVALLDSLRVLTVNRNLILGPGNHDQEFISCLAHCLINLHVGSNVDGF
GLEAEARMTTWHIMIPSDIEPDGGYSQDISEGRQLLIKAVNRVWTELIHSKKQALEELFK
VTLPVNERGHVDIAIARPLIEEAGLKCWQNHLAHEKKCISRGEALVPTTQSKLSRVSSGF
GLSKLTGSRRNRKESGLNKHSLSTQEISQWMFTHIAVVRDLVDTQYKEYQERQQNALKYV
TEEWCQIEYELLRERGLWGPPIGSHLDKWMLEMTEGPCRMRKKMVRNDMFYNHYPYVPET
EQETNVAKPARYRRAISYDSKEYYMRLASGNPAIVQDAIVESSEGEAAQQEPEHGEDTIA
KVKGLVKPPLKRSRSAPDGGDEESQEQLQDQIAEGSSIEEEEKTDNATLLRLLEEGEKIQ
HMYRCARVQGLDTSEGLLLFGKEHFYVIDGFTMTATREIRDIETLPPNMHEPIIPRGARQ
GPSQLKRTCSIFAYEDIKEVHKRRYLLQPIAVEVFSGDGRNYLLAFQKGIRNKVYQRFLA
VVPSLTDSSESVSGQRPNTSVEQGSGLLSTLVGEKSVTQRWERGEISNFQYLMHLNTLAG
RSYNDLMQYPVFPWILADYDSEEVDLTNPKTFRNLAKPMGAQTDERLAQYKKRYKDWEDP
NGETPAYHYGTHYSSAMIVASYLVRMEPFTQIFLRLQGGHFDLADRMFHSVREAWYSASK
HNMADVKELIPEFFYLPEFLFNSNNFDLGCKQNGTKLGDVILPPWAKGDPREFIRVHREA
LECDYVSAHLHEWIDLIFGYKQQGPAAVEAVNVFHHLFYEGQVDIYNINDPLKETATIGF
INNFGQIPKQLFKKPHPPKRVRSRLNGDSMGASAPPGSSSDKIFFHHLDNLRPSLTPVKE
LKEPVGQIVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSCRLGTYESDKVVTIYECLS
EWGQILCAICPNPKLVITGGTSTVVCVWEMGTSKEKAKTLTLKQALLGHTDTVTCATASL
AYHIIVSGSRDRTCIIWDLNKLSFLTQLRGHRAPVSALCINELTGDIVSCAGTYIHVWSI
NGNPIVSVNTFTGRSQQIVCCCVSEMNEWDTQNVIVTGHSDGVVRFWRMEFLQVPETPAP
EPVEVLEMQEDCPEAQIGQEAQDEDSSDSEADEQSTSQDPKDVPSQPSSTSHRPRAASCR
AAAVWCTDSGSDDSRRWSDQLSLDEKDGFIFVNYSEGQTRAHLQGPLNHPHPSPIEVRNY
SRLKPGYRWERQLVFRSKLTMHTAFDRKDNAHPAEITALGVSKDHSRILVGDSRGRVFSW
SVSDQPGRSAADHWVKDEGGDSCSGCSVRFSLTERRHHCRNCGQLFCQKCSRFQSEIKRL
KISSPVRVCQNCYYNLQHERGSEDGPRNC
Identical sequences E2RLD7
ENSCAFP00000013654 XP_013965420.1.84170
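
The Comment field above packs several key:value tokens into one line, with the genomic location given as a colon-separated string. The following is a rough Python sketch of how that line might be split into fields. It assumes the location token always uses the key chromosome and the order assembly:name:start:end:strand, as it does in this record. The function name parse_comment is hypothetical.

```python
from typing import Any, Dict

COMMENT = (
    "pep:known_by_projection chromosome:CanFam3.1:32:8434080:8610578:-1 "
    "gene:ENSCAFG00000009247 transcript:ENSCAFT00000014761 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> Dict[str, Any]:
    """Split a comment line into a dict of fields plus a parsed 'location' entry."""
    fields: Dict[str, Any] = {}
    for token in comment.split():
        # Split on the first colon only; the location value contains further colons.
        key, _, value = token.partition(":")
        fields[key] = value

    # Assumed layout of the location value: assembly:name:start:end:strand
    assembly, name, start, end, strand = fields["chromosome"].split(":")
    fields["location"] = {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    return fields


record = parse_comment(COMMENT)
loc = record["location"]
print(record["gene"], loc["name"], loc["end"] - loc["start"] + 1)
# ENSCAFG00000009247 32 176499
```

Read this way, the gene lies on the reverse strand of chromosome 32 in the CanFam3.1 assembly and the listed coordinates span 176499 bases.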
