SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000049950 from Pan troglodytes 76_2.1.4

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSPTRP00000049950
Domain Number 1 Region: 1439-1553,1628-1685,1799-1889,1935-1993,2080-2501
Classification Level  Classification  E-value
Superfamily           ARM repeat      1.08e-62
Family                HEAT repeat     0.051
 
Domain Number 2 Region: 77-453,485-552
Classification Level  Classification    E-value
Superfamily           ARM repeat        3.63e-17
Family                Armadillo repeat  0.065
 
Domain Number 3 Region: 901-978,1073-1175,1258-1559,1608-1663
Classification Level  Classification  E-value
Superfamily           ARM repeat      4.99e-11
Family                PHAT domain     0.049
 
Weak hits

Sequence:  ENSPTRP00000049950
Domain Number - Region: 2435-2680
Classification Level  Classification     E-value
Superfamily           ARM repeat         0.000771
Family                MIF4G domain-like  0.044
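The Region fields above list each domain as comma-separated residue ranges. A minimal sketch of how such a string might be parsed and its covered length counted follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function names are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
def parse_regions(region):
    """Parse a region string such as '77-453,485-552' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(segments):
    """Count residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# Region strings copied from the strong hits listed above.
strong_hits = {
    1: "1439-1553,1628-1685,1799-1889,1935-1993,2080-2501",
    2: "77-453,485-552",
    3: "901-978,1073-1175,1258-1559,1608-1663",
}

for number, region in strong_hits.items():
    segments = parse_regions(region)
    print(number, len(segments), covered_residues(segments))
```

Note that domains 1 and 3 share some residues (for example 1439-1553 lies inside 1258-1559), so the per-domain counts should not simply be summed to estimate total coverage of the 2741-residue sequence.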
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000049950   Gene: ENSPTRG00000005349   Transcript: ENSPTRT00000009835
Sequence length 2741
Comment pep:known_by_projection chromosome:CHIMP2.1.4:12:101581742:101690150:1 gene:ENSPTRG00000005349 transcript:ENSPTRT00000009835 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MKTKPVSHKTENTYRFLTFAERLGNVNIDIIHRIDRTASYEEEVETYFFEGLLKWRELNL
TEHFGKFYKEVIDKCQSFNQLVYHQNEIVQSLKTHLQVKNSFAYQPLLDLVVQLARDLQM
DFYPHFPEFFLTITSILETQDTELLEWAFTSLSYLYKYLWRLMVKDMSSIYSMYSTLLAH
KKLHIRNFAAESFTFLMRKVSDKNALFNLMFLDLDKHPEKVEGVGQLLFEMCKGVRNMFH
SCTGQAVKLILRKARPVIKTETQLPWMLIGETLTLDSTVSYISKEHFGTCFECLQESLLD
LHTKVTKTNCCQSSEQIKRLLETYLILVKHGSGAKITTPADVCKVLSQTLQVASLSTSCW
ETLLDVISALILGENVSLPETLIKETIEKIFESRFEKHLIFSFSEVMFAMKQFEQLFLPS
FLSYIVNCFLIDDAVVKDEALAILAKLILNKAAPPTAGSMAIEKYPLVFSPQMVGSYIKQ
KKTRSKGRNEQFPVLDHLLSIIKLPSNKDTTYLSQSWAALVVLPHIRPLEKEKVIPLVTG
FIEALFMTVDKGSFGKGNLFVLCQAVNTLLSLEESSELLQLVPVERVKNLVLTFPLEPSV
LLLTDLYYQRLALCGCKGPLSQEALMELFPKLQANISTGVSKIRLLTVRILNHFDVQLPE
SMEDDGLSERQSVFAILRQAELVPATVNDYREKLLHLRKLRHDVVQTAVPDGPLQEVPLR
YLLGMLYINFSALWDPVIELISSHAHEMENKQFWKVYYEHLEKAATHAEKELQNDMTDEK
SVGDESWEQTQEGDVGALYHEQLALKTDCQERLDHTNFRFLLWRALTKFPERVEPRSREL
SPLFLRFINNEYYPADLQVAPTQDLRRKGKGMVAEEIEEEPAAGDDEELEEEAVPQDESS
QKKKTRRAAAKQLIAHLQVFSKFSNPRALYLESKLYELYLQLLLHQDQMVQKITLDCIMT
YKHPHVLPYRENLQRLLEDRSFKEEIVHFSISEDNAVVKTAHRADLFPILMRILYGRMKN
KTGSKTQGKSASGTRMAIVLRFLAGTQPEEIQIFLDLLFEPVKHFKNGECHSAVIQAVED
LDLSKVLPLGRQHGILNSLEIVLKNISHLISAYLPKILQILLCMTATVSHILDQREKIQL
RFINPLKNLRRLGIKMVTDIFLDWESYQFRTEEIDAVFHGAVWPQISRLGSESQYSPTPL
LKLISIWSRNARYFPLLAKQKPGHPECDILTNVFAILSAKNLSDATASIVMDIVDDLLNL
PDFEPTETVLNLLVTGCVYPGIAENIGESITIGGRLILPHVPAILQYLSKTTISAEKVKK
KKNRAQVSKELGILSKISKFMKDKEQSSVLITLLLPFLHRGNIAEDTEVDILVTVQNLLK
HCVDPTSFLKPIAKLFSVIKNKLSRKLLCTVFETLSDFESGLKYITDVVKLNAFDQRHLD
DINFDVRFETFQTITSYIKETQIVDVNYLIPVMHNCFYNLELGDMSLSDNASMCLMSIIK
KLAALNVTEKDYREIIHRSLLEKLRKGLKSQTESIQQDYTTILSCLIQTFPNQLEFKDLV
QLTHYHDPEMDFFENMKHIQIHRRARALKKLAKQLMEGKVVLSSKSLQNYIMPYAMTPIF
DEKMLKHENITTAATEIIGAICKHLSWSAYMYYLKHFIHVLQTGQINQKLGVSLLVIVLE
AFHFDHKTLEEQMGKIENEENTIEAIELPEPEAMELERVDEEEKEYTCKSLSDSGQPGTP
DPADSGGTSAKESECITKPVSFLPQNKEEIERTIKNIQGTITGDILPRLHKCLASTTKRE
EEHKLVKSKVVNDEEVVRVPLAFAMVKLMQSLPQEVMEANLPSILLKVCALLKNRAQEIR
DIARSTLAKIIEDLGVHFLQYVLKELQTTLVRGYQVHVLTFTVHMLLQGLTNKLQVGDLD
SCLDIMIEIFNHELFGAVAEEKEVKQILSKVMEARRSKSYDSYEILGKFVGKDQVTKLIL
PLKEILQNTTSLKLARKVHETLRRITVGLIVNQEMTAESILLLSYGLISENLPLLTEKEK
NPVAPAPDPRLPPQSCLLLPPTPVRGGQKAVVSRKTNMHIFIESGLRLLHLSLKASKIKS
SGEHVLEMLDPFVSLLIDCLGSMDVKVITGALQCLIWVLRFPLPSIETKAEQLTKHLFLL
LKDYAKLGAARGQNFHLVVNCFKCVTILVKKVKSYQITEKQLQVLLAYAEEDIYDTSRQA
TAFGLLKAILSRKLLVPEIDDVMRKVSKLAVSAQSEPARVQCRQVFLKYILDYPLGDKLR
PNLEFMLAQLNYEHETGRESTLEMIAYLFDTFPQGLLHENCGMFFIPLCLMTINDDSATC
KKMASMTIKSLLGKISLEKKDWLFDMVTTWFGAKKRLNRQLAALICGLFVESEGVDFEKR
LGTVLPVIEKEIDPENFKDIMEETEEKAADRLLFSFLTLITKLIKECNIIQFTKPAETLS
KIWSHVHSHLRHPHNWVWLTAAQIFGLLFASCQPEELIQKWNTKKTKKHLPEPVAIKFLA
SDLDQKMKSISLASCHQLHSKFLDQSLGEQIVKNLLFAAKVLYLLELYCEDKQSKIKEDL
EEQEALEDGVACADEKAESDGEEKEEVKEELGRPATLLWLIQKLSRIAKLEAAYSPRNPL
KRTCIFKFLGAVAMDLGIDKVKPYLPMIIAPLFRELNSTYSEQDPLLNNLFVTNPDIAAK
KKMKKHKNKSEAKKRKIEFLRPGYKAKRQKSHSLKDLAMVE
Identical sequences ENSPTRP00000049950 9598.ENSPTRP00000049950 ENSPTRP00000049950
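The Comment field above packs several key:value annotations into one line. The sketch below shows one way to split it into fields. It assumes that the chromosome entry follows the layout assembly:name:start:end:strand and that the genomic coordinates are inclusive; neither is stated on this page. parse_comment is a hypothetical helper name.

```python
def parse_comment(comment):
    """Split a space-separated 'key:value' comment line into a dict.

    Only the first ':' in each token separates key from value, so values
    that themselves contain ':' (such as the chromosome entry) stay intact.
    """
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


comment = (
    "pep:known_by_projection "
    "chromosome:CHIMP2.1.4:12:101581742:101690150:1 "
    "gene:ENSPTRG00000005349 transcript:ENSPTRT00000009835 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

fields = parse_comment(comment)

# Assumed layout of the location value: assembly:name:start:end:strand
assembly, chrom, start, end, strand = fields["chromosome"].split(":")
span = int(end) - int(start) + 1  # assumes inclusive coordinates

print(fields["gene"], assembly, chrom, strand, span)
```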
