SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for XP_001364862.1.35504 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  XP_001364862.1.35504
Domain Number 1 Region: 81-440,687-1090,1120-1143,1252-1296
Classification Level Classification E-value
Superfamily ARM repeat 6.34e-44
Family Armadillo repeat 0.075
Further Details:      
 
Weak hits

Sequence:  XP_001364862.1.35504
Domain Number - Region: 2190-2304,2377-2397,2749-3111
Classification Level Classification E-value
Superfamily ARM repeat 0.000157
Family GUN4-associated domain 0.069
Further Details:      
 
Domain Number - Region: 53-63
Classification Level Classification E-value
Superfamily Formin homology 2 domain (FH2 domain) 0.000175
Family Formin homology 2 domain (FH2 domain) 0.13
Further Details:      
 
Domain Number - Region: 18-82
Classification Level Classification E-value
Superfamily beta-sandwich domain of Sec23/24 0.0392
Family beta-sandwich domain of Sec23/24 0.021
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) XP_001364862.1.35504
Sequence length 3135
Comment PREDICTED: huntingtin isoform X3 [Monodelphis domestica]; AA=GCF_000002295.2; RF=representative genome; TAX=13616; STAX=13616; NAME=Monodelphis domestica; AL=Chromosome; RT=Major
Sequence
MATLEKLMKAFESLKSFQQQQQQPPAPPPPPPPPPQPPQAQPLPQPQQSPQQPPPPPPPP
PPPGPSGVEEPAHRPKKELSTTKKDRVNHCLTICENIVAQSLRNSPEFQKLLGIAMELFL
LCSDDAESDVRMVADECLNKVIKALMESNLPRLQLELYKEIKKNGASRSLRAALWRFAEL
AHLVRPQKCRPYLVNLLPCLTRISKRTEESVQETLAAAIPKIMASFGNFANDNEIKVLLK
AFIANLKSSSPTIRRTAAGSAVSICQHSRRTQYFYTWLLNVLLGLLVPVEEEHSTLFILG
VLLTLRYLIPLLQQQVKDMSLKGSFGVTRKETEISPSTDQLIQVYELTLHYTQHQDHNVV
TGALELLQQLFKTPPVELLQALTTAGGFGQVNVTKEEFSRSRSGSIVELIAGGGSSCSPV
LSRKQKGKVLLGEEEGLEDDPESRSEVSSATFAASMKSEITGELASSSGVSTPVSTSSAA
DSTGHDIITEQPRSQHTLQSDPVDLTGCDLTSAATDGDEEDMLSRSSSQISAVPSDPAVD
MNDGTQASSPISDSSQTTTEGPDSAVTPSDSSEIVLDGAESQYSGMQIGQLQDEDDETSN
VLPDETPDSFRDSTIALQQPHLLKSTGHSRQPSDSSVDRFLSKDEAVELGDHESKPSRVK
GDIGHFTDTDLAPLVHCVRLLSASFLLTGEKGALVPDRDVRVSVKALAVSCVGAAVALHP
ESFFSKLYKTPLETMEHPEDQYVSDVLNYIDHGDPQIRGATAILCGTIIYSILNKSRFSV
ENWLTAVRNSTGNTFSLVDCIPLLQKSLKDESSVTCKLACTAVRHCVMSLCSSCFSELGL
QLIIDVLTLRNSSYWLVRTELLDTLAEIDFRLISFLEAKADHLHKGSHHYTGLLKLQDRV
LNNVVIYLLGDEDPRVRHVAASSLIRLVPKLFYNCDQGQTDPVVAVARDQSNVYLKLLMH
ETQPPSHFSVSTITRTYRGYNLLPSITDVTMENNLSRVIAAVSHALTTSTTRALTFGCCE
ALCLLSTAFPVCIWSLGWHCGVPLLSPSDESRKSCTVGMATTVLTLLSSAWFPLDLSAHQ
DAIILAGNLLAASAPKSLKNPWTTEDEANPGAMKQEEPWPALGDRVLVQMVEQLFSHLLK
AINICAHVLDDVTPGPAIKAALPSLTNPPSLSPIRRKGKEKEPGEQASVPLSPKKGNEAS
PASRSSDPSGPAITSKSSTLGSFYHLPSYLKLYDVLKATHANYKVTLDLQNSSEKFGGFL
RAALDVLSQILELATLQDIGKCVEEILGYLKSCFNREPTMATVCVQQLLKTLFGTNLASQ
YDGLSSKPSKSQGKAQRLGSSSLRPGLYHYCFMAPYTHFTQALADASLRNMVQADQEHDT
SGWFDVLQKVSTQLKTNLTSVTKHRADKNAIHNHIRLFEPLVIKALKQYTTTTSVQLQRQ
VLDLLAQLVQLRVNYCLLDSDQVFIGFVLKQFEYIEVGQFRESEAIIPNIFFFLVLLSYE
RYHSKQIIGIPKIIQLCDGIMASGRKAVTHAIPALQPIVHDLFVLRGTNKADAGKELETQ
KEVAVSMLLRLIQYHQVLEMFILVLQQCHKENEDKWKRLSRQIADIILPMLAKQQMHIDS
HEALGVLNTLFEILAPSSLRPVDMLLRSMFVTPNTMASVSTVQLWISGILAILRVLISQS
TEDIVLSRIQELSFSPHLISCQIIKKLRDGGSSLPTPEDQSEVKQAAKCLPEETFSRFLL
QLVGILLEDIVMKQLKVEMSEQQHTFYCQELGTLLMCLIHIFKSGMFRRITAAATRLFTG
DGSDGSFYTLESLSELVRSMIPTHPSLVLLWCQILLLVNYTNYNWWSEVHQTPKRHSLSS
TKLLSPQMSSDGEDSHLASTLGVCNREIVRRGALILFCDYVCQNLYDSEHLTWLIVNHIQ
DLISLSHEPPVQDFISAVHRNSAASGLFIQAIQSRCENLSSPTTLKKTLQCLEGIHLSQS
GAVLMLYVDKLLCTPFRVLARMVDTLACRRVEMLLAANLQNSMSQLPVEELNRIQEYLQK
SGLAQRHQRLYSLLDRFRHTVAPETASPSPVVTSHPLDGENHLSLEMINPDQDWYLSLVK
FQCCTKSDSALLEGAELVNRIPPGELTPFMLSKEFNLCLLAPCLSLGVREISSGQSSSLF
ETARSVTLDRVASLVQQLPSSHQVFQPLLPIETSAYWNQLSDLFGNAVIYQSVTTLACAL
AQYLVLLSKLPSHLQLPPEKESDILKFVVAALEALSWHLIHEQMPLSMDLQAVLDCCCLT
LQLPALWNMLSSIEYVTHVCSLIHCVRFILEAIAIQPGDQLLSPERRKNTPRGISEDEID
SNMQVPRYITAACEMIAEMVAALQTVLSLGHKRNNGIPAFLTPVLKNIIISLARLPLVNS
YTRVPPLVWKLGWSPKPGGEFGTTLPEIPVEFLQEKEIFKEFIYRINTLGWTSRTQFEET
WATLLGVLVTQPIVMDQEESQQEEDTERTQINVLAVQAITSLVLSAMTLPVAGNPAVSCL
EQQPRNKALKALDTRFGRKLSVIRGIVEQEIQAMVSKRDNIPTHHLYQAWDPVPSLSPAM
SGALISHEKLLLQINTEREMGNMSYNLGQVSIHSMWLGNNITPLREEEWDEDEEDDGDLP
APSSPPTSPINSRKHRAGVDIHSCSQFLLELYSQWILPSSSAKRTPVILISEVVRSLLAV
SDLFTERNQFEMMYLTLTELRRVHPSEDEILIQYLVPATCKAAAVLGMDKAVAEPVSRLL
ETTLRSTHLPSKIGALHGILYVLECDLLDETAKQLIPIISDYLLSNLRGIAHCVNLHSQQ
HVLVTCAAAFYLMENYPLDVGPEFSAAVIQMCGVMLSASEESTPAIIYHCVLRGLERLLL
SEQLSRLDGESLVKLSVDRVNVHSPHRAMAALGLMLTCMYTGKEKISPGRTSDPNPTAPD
SESVIVAMERVSVLFDRIRKGFPCEARVVARILPQFLDDFFPPQDVMNKVIGEFLSNQQP
YPQFMATVVYKVFQTLHTTGQSSMVRDWVMLSLSNFTQRTPVAMAMWSLSCFFVSASTSQ
WVSAILPHIISRMGKSEQVDINLFCLVAIDFYRHQIDEELDRRAFQSVFEVVASPGNPYH
RLLTCLQNVHKIAAC
Download sequence
Identical sequences F7C6C4
ENSMODP00000004422 ENSMODP00000004422 XP_001364862.1.35504

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]