SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for F7C6C4 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  F7C6C4
Domain Number 1 Region: 81-440,687-1090,1120-1143,1252-1296
Classification Level Classification E-value
Superfamily ARM repeat 6.34e-44
Family Armadillo repeat 0.075
Further Details:      
 
Weak hits

Sequence:  F7C6C4
Domain Number - Region: 2190-2304,2377-2397,2749-3111
Classification Level Classification E-value
Superfamily ARM repeat 0.000157
Family GUN4-associated domain 0.069
Further Details:      
 
Domain Number - Region: 53-63
Classification Level Classification E-value
Superfamily Formin homology 2 domain (FH2 domain) 0.000175
Family Formin homology 2 domain (FH2 domain) 0.13
Further Details:      
 
Domain Number - Region: 18-82
Classification Level Classification E-value
Superfamily beta-sandwich domain of Sec23/24 0.0392
Family beta-sandwich domain of Sec23/24 0.021
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) F7C6C4
Sequence length 3135
Comment (tr|F7C6C4|F7C6C4_MONDO) Huntingtin {ECO:0000313|Ensembl:ENSMODP00000004422} KW=Complete proteome; Reference proteome OX=13616 OS=Monodelphis domestica (Gray short-tailed opossum). GN=HTT OC=Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis.
Sequence
MATLEKLMKAFESLKSFQQQQQQPPAPPPPPPPPPQPPQAQPLPQPQQSPQQPPPPPPPP
PPPGPSGVEEPAHRPKKELSTTKKDRVNHCLTICENIVAQSLRNSPEFQKLLGIAMELFL
LCSDDAESDVRMVADECLNKVIKALMESNLPRLQLELYKEIKKNGASRSLRAALWRFAEL
AHLVRPQKCRPYLVNLLPCLTRISKRTEESVQETLAAAIPKIMASFGNFANDNEIKVLLK
AFIANLKSSSPTIRRTAAGSAVSICQHSRRTQYFYTWLLNVLLGLLVPVEEEHSTLFILG
VLLTLRYLIPLLQQQVKDMSLKGSFGVTRKETEISPSTDQLIQVYELTLHYTQHQDHNVV
TGALELLQQLFKTPPVELLQALTTAGGFGQVNVTKEEFSRSRSGSIVELIAGGGSSCSPV
LSRKQKGKVLLGEEEGLEDDPESRSEVSSATFAASMKSEITGELASSSGVSTPVSTSSAA
DSTGHDIITEQPRSQHTLQSDPVDLTGCDLTSAATDGDEEDMLSRSSSQISAVPSDPAVD
MNDGTQASSPISDSSQTTTEGPDSAVTPSDSSEIVLDGAESQYSGMQIGQLQDEDDETSN
VLPDETPDSFRDSTIALQQPHLLKSTGHSRQPSDSSVDRFLSKDEAVELGDHESKPSRVK
GDIGHFTDTDLAPLVHCVRLLSASFLLTGEKGALVPDRDVRVSVKALAVSCVGAAVALHP
ESFFSKLYKTPLETMEHPEDQYVSDVLNYIDHGDPQIRGATAILCGTIIYSILNKSRFSV
ENWLTAVRNSTGNTFSLVDCIPLLQKSLKDESSVTCKLACTAVRHCVMSLCSSCFSELGL
QLIIDVLTLRNSSYWLVRTELLDTLAEIDFRLISFLEAKADHLHKGSHHYTGLLKLQDRV
LNNVVIYLLGDEDPRVRHVAASSLIRLVPKLFYNCDQGQTDPVVAVARDQSNVYLKLLMH
ETQPPSHFSVSTITRTYRGYNLLPSITDVTMENNLSRVIAAVSHALTTSTTRALTFGCCE
ALCLLSTAFPVCIWSLGWHCGVPLLSPSDESRKSCTVGMATTVLTLLSSAWFPLDLSAHQ
DAIILAGNLLAASAPKSLKNPWTTEDEANPGAMKQEEPWPALGDRVLVQMVEQLFSHLLK
AINICAHVLDDVTPGPAIKAALPSLTNPPSLSPIRRKGKEKEPGEQASVPLSPKKGNEAS
PASRSSDPSGPAITSKSSTLGSFYHLPSYLKLYDVLKATHANYKVTLDLQNSSEKFGGFL
RAALDVLSQILELATLQDIGKCVEEILGYLKSCFNREPTMATVCVQQLLKTLFGTNLASQ
YDGLSSKPSKSQGKAQRLGSSSLRPGLYHYCFMAPYTHFTQALADASLRNMVQADQEHDT
SGWFDVLQKVSTQLKTNLTSVTKHRADKNAIHNHIRLFEPLVIKALKQYTTTTSVQLQRQ
VLDLLAQLVQLRVNYCLLDSDQVFIGFVLKQFEYIEVGQFRESEAIIPNIFFFLVLLSYE
RYHSKQIIGIPKIIQLCDGIMASGRKAVTHAIPALQPIVHDLFVLRGTNKADAGKELETQ
KEVAVSMLLRLIQYHQVLEMFILVLQQCHKENEDKWKRLSRQIADIILPMLAKQQMHIDS
HEALGVLNTLFEILAPSSLRPVDMLLRSMFVTPNTMASVSTVQLWISGILAILRVLISQS
TEDIVLSRIQELSFSPHLISCQIIKKLRDGGSSLPTPEDQSEVKQAAKCLPEETFSRFLL
QLVGILLEDIVMKQLKVEMSEQQHTFYCQELGTLLMCLIHIFKSGMFRRITAAATRLFTG
DGSDGSFYTLESLSELVRSMIPTHPSLVLLWCQILLLVNYTNYNWWSEVHQTPKRHSLSS
TKLLSPQMSSDGEDSHLASTLGVCNREIVRRGALILFCDYVCQNLYDSEHLTWLIVNHIQ
DLISLSHEPPVQDFISAVHRNSAASGLFIQAIQSRCENLSSPTTLKKTLQCLEGIHLSQS
GAVLMLYVDKLLCTPFRVLARMVDTLACRRVEMLLAANLQNSMSQLPVEELNRIQEYLQK
SGLAQRHQRLYSLLDRFRHTVAPETASPSPVVTSHPLDGENHLSLEMINPDQDWYLSLVK
FQCCTKSDSALLEGAELVNRIPPGELTPFMLSKEFNLCLLAPCLSLGVREISSGQSSSLF
ETARSVTLDRVASLVQQLPSSHQVFQPLLPIETSAYWNQLSDLFGNAVIYQSVTTLACAL
AQYLVLLSKLPSHLQLPPEKESDILKFVVAALEALSWHLIHEQMPLSMDLQAVLDCCCLT
LQLPALWNMLSSIEYVTHVCSLIHCVRFILEAIAIQPGDQLLSPERRKNTPRGISEDEID
SNMQVPRYITAACEMIAEMVAALQTVLSLGHKRNNGIPAFLTPVLKNIIISLARLPLVNS
YTRVPPLVWKLGWSPKPGGEFGTTLPEIPVEFLQEKEIFKEFIYRINTLGWTSRTQFEET
WATLLGVLVTQPIVMDQEESQQEEDTERTQINVLAVQAITSLVLSAMTLPVAGNPAVSCL
EQQPRNKALKALDTRFGRKLSVIRGIVEQEIQAMVSKRDNIPTHHLYQAWDPVPSLSPAM
SGALISHEKLLLQINTEREMGNMSYNLGQVSIHSMWLGNNITPLREEEWDEDEEDDGDLP
APSSPPTSPINSRKHRAGVDIHSCSQFLLELYSQWILPSSSAKRTPVILISEVVRSLLAV
SDLFTERNQFEMMYLTLTELRRVHPSEDEILIQYLVPATCKAAAVLGMDKAVAEPVSRLL
ETTLRSTHLPSKIGALHGILYVLECDLLDETAKQLIPIISDYLLSNLRGIAHCVNLHSQQ
HVLVTCAAAFYLMENYPLDVGPEFSAAVIQMCGVMLSASEESTPAIIYHCVLRGLERLLL
SEQLSRLDGESLVKLSVDRVNVHSPHRAMAALGLMLTCMYTGKEKISPGRTSDPNPTAPD
SESVIVAMERVSVLFDRIRKGFPCEARVVARILPQFLDDFFPPQDVMNKVIGEFLSNQQP
YPQFMATVVYKVFQTLHTTGQSSMVRDWVMLSLSNFTQRTPVAMAMWSLSCFFVSASTSQ
WVSAILPHIISRMGKSEQVDINLFCLVAIDFYRHQIDEELDRRAFQSVFEVVASPGNPYH
RLLTCLQNVHKIAAC
Download sequence
Identical sequences F7C6C4
ENSMODP00000004422 XP_001364862.1.35504 ENSMODP00000004422

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]