SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000018867 from Equus caballus 69_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSECAP00000018867
Domain Number 1 Region: 3670-3834,3887-4060
Classification Level Classification E-value
Superfamily Protein kinase-like (PK-like) 5.2e-84
Family Phosphoinositide 3-kinase (PI3K), catalytic domain 0.00038
 
Domain Number 2 Region: 2126-2517
Classification Level Classification E-value
Superfamily ARM repeat 9.22e-21
Family PBS lyase HEAT-like repeat 0.084
 
Domain Number 3 Region: 72-488,702-820
Classification Level Classification E-value
Superfamily ARM repeat 2.56e-17
Family Clathrin adaptor core protein 0.044
 
Domain Number 4 Region: 698-805,850-871,903-1206,1344-1420
Classification Level Classification E-value
Superfamily ARM repeat 3.02e-12
Family Armadillo repeat 0.069
 
Weak hits

Sequence:  ENSECAP00000018867
Domain Number - Region: 2929-2985,3088-3119
Classification Level Classification E-value
Superfamily TPR-like 0.0238
Family Tetratricopeptide repeat (TPR) 0.033
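
The Region fields above list one or more comma-separated start-end segments per domain. As a minimal sketch of how such a field could be read, the Python below splits a region string into segments and counts the residues it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_region and covered_residues are hypothetical and not part of any SUPERFAMILY tool.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '72-488,702-820' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(region: str) -> int:
    """Count residues in a region, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the strong hits listed above.
strong_hits = {
    1: "3670-3834,3887-4060",
    2: "2126-2517",
    3: "72-488,702-820",
    4: "698-805,850-871,903-1206,1344-1420",
}

for number, region in strong_hits.items():
    print(number, parse_region(region), covered_residues(region))
```

Under the inclusive-coordinate assumption, domain 1 spans 339 residues across its two segments and domain 2 spans 392.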
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000018867   Gene: ENSECAG00000020168   Transcript: ENSECAT00000022809
Sequence length 4137
Comment pep:known chromosome:EquCab2:9:35369306:35569831:1 gene:ENSECAG00000020168 transcript:ENSECAT00000022809 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MAHSGSGLQSFLMQLQESLSAADRCSAAMAGYHLIRGLGQECMLSTGPAVLALQTSLVFS
KDFGLLVFVRKSLSIDEFRDCREEILKFLYIFLEKIGQKITPYSLDIKNTCTSVYTKDKA
AKCKIPALDLLIKLLQTLRSSRLMDEFRVGELFTKFYGELALKTKIPDTVLEKIYELLGV
LGEVHPSEMISNSEQLFRAFLGELKSQMTSTVREPKLPVLAGCLKGLSSLMCNFTKSMEE
DPQTSREIFDFALKAIRPQIDLKRYAVPLAGLCLFTLHASQFSTCLLENYVSLFEVLSKW
CGHTNIELKKAAHSALESFLKQVSFMVAKDAERHKNKLQYFMEQFYGIIRNMDSNSKDLS
IAIRGYGLFAGPCKVINAKDVDFMYVELIQRCKQLFLTQTDTVDDHIYQMPSFLQSIASV
LLYLDTIPEVYTPVLEHLMVVQIDSFPQYSPKMQPVCCRAIVKVFLALAEKGPVLWNCIS
TVVHQGLIRICSKPVVFQKGAGSESEDYHTSEEARTGKWKMPTYKDYLDLFRYLLSCDQM
MDSLLADEAFLFVNSSLHSLNRLLYDEFVKSVLKIVEKLDLTLEKQNVGEQEDETEATGV
WVIPTSDPAANLHPAKPKDFSAFINLVEFCREILPEKHVEFFEPWVYSFAYELILQSTRL
PLISGFYKLLSVAVRNAKKMKYFEGVGPKSQKQSPEDLEKYSCFALFAKFSKEVSIKMKQ
YKDELLASCLTFILSLPHDIIELDVRAYVPALQMAFKLGLSYTPLAEVGLNALEEWSGYI
CKHVIQPYYKDILPSLDGYLKTSVLSDETKNSWQVSALSRAAQKGFNKVVLKHLTKTKSI
SSNEALSLEEVRIRVVRILGSLGGQINKNLVTAASSDEMMKKCVAWDREKRLRFAVPFME
MKPVIYLDLFLPRVTELALSASDRQTKVAACEFLHSMVMFMLGKATQMPEDGQGSPPMYQ
LYKRTFPVLLRLACDVDQVTRQLYEPLVMQLIHWFTNNKKFESQDTVALLETILDGIVDP
VDSTLRDFCGQCIQEFLKWSIKQTTPQQQEKSPVNTKSLFKRLYSFALHPNAFKRLGASL
AFNNIYREFREEESLVEQFVFEALVTYMESLALAHTDEKSLGTIQQCCDAIDHLSLIIEK
KHVSLNKAKKRRLPRGFPPATSLCLLDVVQWLLANCGRPQTECRHKSIELFYKFVTLLPG
NKSPFLWLKDIIKKEDISFLINTFEGGGRGCDRPSGILAQPTLFHLQGPFSLRAALQWMD
MLLAALECYNTFIEEKTLEAPKVLGTETQSSLWKAVAFFLESIAMHDIMAAEKYFGTGAT
GNRPSPQEGERYNYSKCTIVVRIMEFTTTLLSTSPEGWKLLEKDVCNTNLMKLLVKTLCE
PSSIGFNIGDVAVMNYLPSVCTNLMKALKKSPYKDILEMHLKEKITAQSIEELCAVDLYC
PDACVDRARLASVVSACKQLHRAGFLCVIIPSQSADQHHSIGTKLLSLVYKSIAPGDEQQ
CLPSLDPNCKRLASGLLELAFAFGGLCEHLVSLLLDTTVLSMPSRGGSQKNIVSFSHGEY
FYSLFSETINTELLKNLDLAVLELMKSSVDNPKMVSNVLNGMLDQSFRDRTSEKHQGLKL
ATIILQNWKKCDSWWAKDSAPESKMAVLTLLAKILQIDSSVCFNTNHCMFPEVFTTYVSL
LADSKLDLHLKATGQAIILLPFFTSLTGGSLEDLKVVLENLIVSNFPMKSEEFPPGTLQY
SNYVDCMKKFLDALELSESPMLLQLMTEILCREQQHVMEELFQSTFKKIARKSSCITQLG
LLESVYRMFRRDDLLSNITRQAFVDRSLLTLLWHCSLNALREFFSKIVVEAINVLKSRFI
KLNESAFDTQITKKMGYYKMLDVMYSRLPKDDVHSKESKINQVFHGSCITEGSELTKTLI
KLCYDAFTENMAGENQLLERRRLYHCAAYNCAISVVCCVFNELKFYQGFLFTEKPEKNLL
IFENLIDLKRCYTFPIEVEVPMERKKKYLEIRKEAREAASGDSDGPRYISSLSYLADSSL
SEEMSQFDFSTGVQSYSYSSQDPKSTTAHFRRQKHKESMIQDDILELEMDELNQHECMAT
MTALIKHMQRNQILPKEEEGSVPRNLPPWMKFLHDKLGNPSISLNIRLFLAKLVINTEEV
FRPYARYWLSPLLQLVVSGNNGGEGIHYMVVEIVVIILSWTGLATPIGVPKDEVLANRLL
HFLMKHVFHQKRAVFRHNLEIIKTLVECWKDCLSIPYRLIFEKFSSTDPNSKDNSVGIQL
LGIVMANNLPPYDPKCGIESIKYFQALVNNMSFVKYREVYAAAAEVLGLVLRYITERENI
LEESVCELVIKQLKQHQNTMEDKFIVCLNKAVKNFPPLADRFMNTVFFLLPKFHGVMKTL
CLEVVLCRAEEITDLYLQLKSKDFIQVMRHRDDERQKVCLDIIYKMMAKLKPVELRELLN
PVVEFISHPSPVCREQMYNILMWIHDNYRDPESQADDDSQEIFKLAKDVLIQGLIDENPG
LQLIIRNFWSHETRLPSNTLDRLLALNSLYSPKIEAHFLSLATDFLLEMTSVSPDYSNPM
FEHPLSECKFQEYTIDSDWRFRSTVLTPMFIETQASQSALQTRTQEGSLSARGVMTGQIR
ATQQQYDFTPTQNTDGRSSFNWLTGNSIDPLVDFTVSSSSDSLSSSLLFAHKRSEKSQRV
PLKSVGPDFGKKRLGLPGDEVDNKAKGGEDNRAEILRLRRRFLKDREKLSLIYARKGVAE
QKREKEIKSELKMKHDAQVILYRSYRQGDLPDIQIKYSSLITPLQAVAQRDPIIAKQLFG
SLFSGIIKEMDKYKTMSEKNNITQKLLQDFNNFLNTTVSFFPPFISCIQEISCQHADLLS
LNPASVSASCLASLQQPVGVRLLEEALLHLLPEEPPAKRVRGRPCLYPDFVRWMELAKLY
RSIGEYDILRGIFNSEIGTKQVTQNALLAEARNDYSEAVKQYNEALNKQEWVDGEPMEAE
KDFWELASLDCYNQLAEWKSLAYCSTVSVDSANPPDLNKMWNEPFYQETYLPYMIRSKLK
LLLQGEGDQSLLTFIDEAVSKELQKVLVELHYSQELSLLYILQDDVDRAKYYIENCIRIF
MQSYSSIDVLLERSRLTKLQSLQTLIEIQEFISFISKQGNLSSQIPLKRLLKTWTNRYPD
AKMDPMNIWDDIITNRCFFLSKIEEKLTLPPDDHSMNTDGDEDSSDRMKVQEQEEDIYSL
IKSCKFSMKMKMIESARKQKNFSLAMKLLKELHKESKTRDDWLVKWVQSYCRLSHSRSQT
QNRPEQILTVLKTVSLLDENTSSYLSKNIPVSRDHNILLGTTYRIIANALSSDPTCLAEI
GESKARRILELSGSSLENAEEVIAGLYQRVLHHLSEAVRIAEEEAQPFTRGQEPAVGVID
AYMTLVDFCDQQLRKEEESSSVTESVQLQMYPALVVDKMLKALRLHSNEARLKFPRLLQI
IEQYPEETLSLMTKEISSIPCWQFIGWISHMVALLDKEEAVAVHRTVEEIADNYPQAMVY
PFIISSESYSFKDTSTGYKNKEFVERIKIKLDQGGVIQDFINALEQLSHPEMLFKDWTDD
IKVELEKNPVNRKNIEKMYEKMYATLGDPQAPGLGAFRRRFIQAFGKEFDKHFGRGGSKL
PGMKPREFSDITNSLFSKMCEVSKPPGNLKECSPWMSDFKVEFLRSELEIPGQYDGKGKP
VPEYHARIAGFDERIKVMASMRKPKRIIIRGHDEREYPFLVKGGEDLRQDQRIEQLFEVM
NVILSQDATCSQRSMQLKTYQVIPMTSRLGLIEWIENTFTLKELLLSNMSQEEKAACTSD
PKAPPFEYRDWLTKMSGKCDVGAYMLMYKGASRTETVTSFRKRESKVPADLLKRAFVKMS
TSPEAFLTLRSHFARSHALICISHWILGIGDRHLNNFLVSMETGGVIGIDFGHAFGSATQ
FLPVPELMPFRLTRQFINLMLPMKETGVMYSIMVHALRAFRSQSNLLANTMDVFVKEPSF
DWKNFEQKMLKKGGSWIQEINVTEKNWYPRQKIHYAKRKLAGANPAVITCDELLLGHEKV
AAFGDYVAVARGSEDHNIRAQELESDLSEEAQVKCLIDQATDPNILGRTWIGWEPWM
Identical sequences F6RGR8
ENSECAP00000018867 ENSECAP00000018867 9796.ENSECAP00000018867
