SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000007042 from Equus caballus 69_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSECAP00000007042
Domain Number 1 Region: 1048-1312
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 1.72e-42
Family Extended AAA-ATPase domain 0.01
 
Domain Number 2 Region: 1738-1993
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.36e-38
Family Extended AAA-ATPase domain 0.08
 
Domain Number 3 Region: 1361-1633
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 7.11e-34
Family Shikimate kinase (AroK) 0.072
 
Domain Number 4 Region: 660-720,816-978
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.29e-33
Family Extended AAA-ATPase domain 0.081
 
Domain Number 5 Region: 299-590
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 3.78e-29
Family Extended AAA-ATPase domain 0.016
 
Domain Number 6 Region: 2050-2295
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.52e-17
Family Extended AAA-ATPase domain 0.059
 
Domain Number 7 Region: 5386-5588
Classification Level Classification E-value
Superfamily vWA-like 1.33e-13
Family Integrin A (or I) domain 0.051
 
Weak hits

Sequence:  ENSECAP00000007042
Domain Number - Region: 3045-3112
Classification Level Classification E-value
Superfamily S-adenosylmethionine decarboxylase 0.002
Family Bacterial S-adenosylmethionine decarboxylase 0.0058
 
Domain Number - Region: 39-207
Classification Level Classification E-value
Superfamily ARM repeat 0.0141
Family Armadillo repeat 0.098
 
Domain Number - Region: 2368-2628,2848-2890
Classification Level Classification E-value
Superfamily ARM repeat 0.0214
Family Clathrin adaptor core protein 0.028
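
The Region fields above list one or more residue ranges per domain, with commas separating the segments of a discontinuous domain (for example 660-720,816-978). A minimal Python sketch for reading these strings is given here. The function names are hypothetical and not part of SUPERFAMILY, and treating both range ends as inclusive is an assumption rather than something this page states.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '660-720,816-978' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(region: str) -> int:
    """Count residues covered by a region, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the strong hits listed above, keyed by domain number.
strong_hits = {
    1: "1048-1312",
    2: "1738-1993",
    3: "1361-1633",
    4: "660-720,816-978",
    5: "299-590",
    6: "2050-2295",
    7: "5386-5588",
}

# Order the domains by the first residue of their first segment,
# i.e. by position along the sequence rather than by E-value.
ordered = sorted(strong_hits, key=lambda n: parse_region(strong_hits[n])[0][0])

for number in ordered:
    region = strong_hits[number]
    print(number, region, covered_residues(region))
```

Sorting by start position rather than by E-value gives the order in which the domains occur along the 5598-residue sequence.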
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000007042   Gene: ENSECAG00000008291   Transcript: ENSECAT00000009278
Sequence length 5598
Comment pep:known chromosome:EquCab2:10:42033306:42191852:-1 gene:ENSECAG00000008291 transcript:ENSECAT00000009278 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MEHLLLEVAAAPLRLIAAKNEKSRSELGRFLAKQVWTPQDRQCILSTLAQLLLDKDCTLL
IGRQLRPLLLDLLERNAEAIKAGGQVNHDLHERLCVSMSKLIGNHPDVLPFALRYFKDSS
PVFQRLFLESSDANPVRYGRRRMKLRDLMEAAYKFLQQEQCVFRELWDWSVCVPLLRSHD
TLVRWYTANCLALVTCMNEEHKLSFLKKIFNSEELIHFRLRLLEEAQLQDLEKALVLANP
ETSLWRKEKDLQYLQGHLVSADLSSRVTAVCGVVLPGQPPTFGDQAGNRNSSREQELAFR
SYVLVESVCKNLQTLAMAVASQNAVLLEGPIGCGKTSLVEHLAAMTGRRKPPQLLKVQLG
DQTDSKMLLGMYRCTDVPGEFVWQPGTLTQAATKGHWILLEDIDYAPLDVVSVLIPLLEN
GELLIPGQGDCLKVASGFQFFATRRLLSCGGNWYRPLNSYATLLDKYWTKIHLDNMDKTE
LNEVLQNRYPSLLTATDHLLDIYLQLTGEKHRSQSDSSAASEQAPEEVSEARRENKRLSL
EGRELSLRDLLNWCNRIAHSFDSLSSSASLSIFQEALDCFTAMLSKHTSKLKMAEVIGSR
LNISKKKAEFFCQLYKPEIVINELDVQVGRVRLLRKQSDAIHIQRETLTFAATRPSSVLI
EQLAVCVSKGEPVLLVGETGTGKTSTVQYLAHITGHRLRVVNMNQQSDTADLLGGYKPVD
PKLIWLPLREAFEELFAQTFSKKQNFTFLGHIQTCYRQKRWHDLLRLMQHVHKSAVNKDG
KESEAGLLLKEKWEAFGLRLDHAQQQMKMTENALLFAFVEGTLAQAVKQGEWILLDEINL
AAPETLECLSGLLEGSSGSLVLLDRGDTEPLVRHPDFRLFACMNPATDVGKRNLPAGIRN
RFTELYVEELESKEDLQILIVDYLKGLSVNKSTVQGIINFYTAVRKDSGTKLVDGTGHKP
HYSLRTLCRALRFAASNPCSNIQRSLYEGFCLGFLTQLDRASHPIVQKLICQHIVPGNVK
SLLKQPIPEPKGGRLIQVEGYWISVGDKEPTIDETYVLTSSVKLNLRDIVRVVSAGTYPV
LIQGETSVGKTSLIRWLAAATGNDCVRINNHEHTDIQEYIGCYTSDSSGKLVFKEGVLID
AMRRGYWIILDELNLAPTDVLEALNRLLDDNRELLITETQEVVKAHPRFMLFATQNPPGL
YGGRKMLSRAFRNRFVELHFDELPSSELETILHKRCSLPPSYCSRLVKVMLDLQSYRRSS
SVFAGKQGFITLRDLFRWAERYRLAEQTEKEYDWLQHLANDGFMLLAGRVRKQEEVDVIR
EVLQKHFKKKLCPQSLFSKENVLKSLSKLSTQTSTLESKFNHIVWTEGMRRLAMLVGRAL
EFGEPVLLVGDTGCGKTTICQVFAALANQKLYSVNCHLHMETSDFLGGLRPVRQKPKDKE
EIDTSRLFEWHDGPLVLAMKEDGFFLLDEISLADDSVLERLNSVLEVEKSLVLAEKGSLE
DKENEVELLTAGKKFRILATMNPGGDFGKKELSPALRNRFTEIWCPQSTSREDLIQIISR
NLRPGLSLGRTDHKGADIAEVMLDFIDWLTHQEFGRRCVVSIRDILSWVNFMNTMEEEAA
LKRPETISTVTSFVHAACLVYIDGIGSGVTSSGFGTALLARKECLKFLIKKLSKIVRLTE
CQKSELKIYDRLKAKEFTGIDNLWGIYPFFIPRGPVLHRNNIADYALSAGTTAMNAQRLL
RATKLNKPILLEGSPGVGKTSLVGALAKASGNILVRINLSEQTDITDLFGADLPVEGGKG
GEFAWRDGPLLAALKAGHWVVLDELNLASQSVLEGLNACFDHRGEIYVPELGMSFQVQHE
KTKIFGCQNPFRQGGGRKGLPRSFLNRFTQVFVDPLTVIDMEFIASTLFPAIDKNVVKKM
VAFNNQIDHEVTVEKKWGQKGGPWEFNLRDLFRWCQLMLVDQSPGCYDPGQHVFLVYGER
MRTREDKEKVIAIFKDVFGSNSNPYMGTRLFHITPYNVQLGYSVLSRGSYVPPPSRRPLL
LLHQSLQSLESIMKCVQMSWMVILVGPASVGKTSLVQLLAHLTGHPLKIMAMNSAMDTTE
LLGGFEQVDLIRPWRQLLEKVEGTVRALLRDSLLISADDAEVVLRAWSHFLLTYKPRCLG
EGGKGVTMEIVNKLEAVLLLMQRLNNKINSYSKAEFAKLVEEFRSFGVKLMQTTSGRSHG
TFEWIDSMLVRALKSGDWLLMDNVNFCNPSVLDRLNALLEPGGVLTVSERGMIDGCTPTV
TPNPNFSYIFLSMDPLEGEIRAMRNRGLEICISGEGDGSIPDDLDLKVLLHSLGLVGDGV
CDTLLALHTQTQSAIVGSIASSVSTLIQTAILIVQYLQRGLSLDRAFYEACLETYVHSQH
SPANRKLVQALLEKCVSSLRAHETWGNSILAMGLWPDSLPSALFAAEDSHLSMVRSDGQI
LAYCLNRMSMKTSSWARVQPLTLPDLEKIIQSSNPESLKFSSVEVDTYWIDEPDVLAMAV
KLLIERATNQDWMLRVKWLYHLAKNIPQGLESIQIHLEASAASLRNFYSNSLSAGISNVI
KVLQPNITDEFVIPLDPRWNVQALDIIRNSMDFDPQTDQPEQLFPLLESIANKTIIYLDR
EKRIYTETNLVSVGGKKLRNSVLRMSFEFHKDPESYHSLPHEIVVNLAAFFELCDALILL
WVQSSQGMVSDASVHEILDSLWWRDRFWTVADTVKVDAPGLALLALHWHWVLKHLVRQIP
QLLKSHEDKYYKEIQTVSEHIQNCLGSPTGGFIGIKKLQKFLGRPFPFKHKLVVECFSQL
KVVNRALAIRERMPALGESGWREDVSRLEMVASKWTLKRSLLQAWGLVLRANILEDVNPD
ELKNLVNGLCLELKAKGISLGFLEKKHNEASSLSQPDFTSLIRLTRSVQLWPAMEYLAML
WQYKVTADFMTQACLRGSSKTQQPQIDEEISHLITFCLKHTSVAPQELRDLWAILHHQKV
STEEILSLWSELFNSTLMSFWSSTVTTNPEYWLTWSPLPDVQQREVPKSLLDSTLKGPGS
LSKAVFSKCCFEVLTSSCRASPWDVNGLPILSSSHVTLGEWVERAQQLRDVSSVLWTNMA
VPSVAEFRCMDSQLQGLVLCRHLAGLAELLPEPRRQEYMQNCEQLLLGDSQAFQHVGQTL
GDMAGQEALPKELLCLLLTSLRHLFGEGDGKRNLPEPARRGSLWVSLGLLQIQTWLPQAR
FDPAVKREYKLKYAKEELHRLQCEWKTRNLSSQLQTGRDLEDEVSINYSHPHVRLLRQRI
DQLENLICSLSKKQAFRPQLPAYESLVQEIHHYVSSIAKASAIQDLLTRLLEALHMDGPR
SSQVAQNLLKEEASWQQSHHQFRKRLAEEFALYPDSVAPLQASILQLQHGMRLVASEVHA
SFHNSVVSADRLGALATSLLAFPSVGPTFPTYYAHADALCSVKSEEVLRGLGKLILRRSG
GKELEGKGQSSCPTREQLLMNALLYLRSHVLCKGELDQRALQLFRHVCQEIINEWDEQER
IAQEKAEQESSLYRYRSRKSRTALSEEEEEEREFRKQFPLHEKDFADVLVEPTLEEKGTS
DGQGEAAASDPTLLSQSSMQAVMLIHQQLCLSFARSLWYQQTVPPHEAEHYLSLFLSCYQ
TGASLVTHFYPLMGVELNDQLLGSQLLACTVSHNTLFGEAASDLKVKPDGPYDFYQHPNI
PEARHCQPVLQGFSEAVSQLLQDWPEHPALEQLLVVMDRIRSFPLSSPISKFLNGLEILL
AKAQDWEENASRALSLRKHLDLVSQMIIRWRKLELNCWSMSLDNTMKRHTEKSTKHWFSI
YQMLEKHMQEQTEEQEDDKQMTLMLLVSTLQAFIEGSSLGEFHVRLQMLLVFHCHVLLMP
QVEGKDSLCSVLWNLYHYYKQFFDQVQAKIVELRSPLEKELKEFVKISKWNDVSFWSIKQ
SVEKTHRTLFKFMKKFEAVLSEPCRSSLVENDKEEQPDFLSQPTDAALSEASPIQSLNRA
LRETLLARPAAVQPTVPEQCQGAPFSLEGELLRRLPKLMRRMRKLCLTLMKESCLPHLVE
GLDQFTGEVISSVSELQSLKVEPSAEKEKQRSEAKHILMRKQRALADLFKHLAKTGLSYR
KGLAWARLKNPQEVLRLRPLDLRSALSVVSSTQEADSRLLREILSSWDGCQKYFYRSLAR
HARLSATLAAPAKEMGIGNVERCKGFSAHLMKMLIRQRHSLTTLTEQWILLRNLLSCVQE
IHGRLTGPLVYPVAFPPQDGVQQWTERLQHLAMQSQILLEQLSWLLQCCPSAGLAPDQGD
AHAQGQPFAPYPEGPEVSTGQLSGAVPDLIPSDLRYPSPVPGNQLPSGCRMRKQDQLWQQ
STARLTEMLKTIKTVKADIDKIRQQSCETLFHSWKDFEVCSSGLSCLSQVSAHLQGLESL
FILPGMEVEQTDQQMALVESLEYVRGEINKATDDFTTWKTHLLASGSQGGNQILDEGFVE
DFSEQMETAIRAILYAIQNLAERSSKKIDEDTDQTKPQEEDADFERLQSGHLTKLLEDDF
WADVSTLHVQKIISAVSELLERLKSYGEDGTASKHMFFSQSCCLLVRLLPMLSRYSDLVL
FFLTNALATHRSTAKLLSVLAQVFTELAQKGFCLPKEFMEDSAGEGATEFHDYEGGGIGE
GEGMKDVSDRIENEEQVQDTFQKGQEKDKEDPDSKSDIKGEDNAIEMSEDFDGKMHDGEL
EEQEEEDEKSDSEGGDLDKQMGDLNGEEADKLDERLWGDDDDEEDEEEEDGKTEETGPGM
DEEDCELVAKDDNLDAGKSNRDKKQQDKKEEKEEAEAADDGQGQDKINEQIDEREYDENE
VDPYHGNQETLPEPEALDLPDDLNLDSENKNSDEDTDHEEGEEENPLEIKEKPVDTEEGG
HEAEEINEETEPDQNEGQGQHEPEEGPSEDDSDEGEEEMDTGANDQDKDTAEHPEENSEE
EQQSLEDKDKEASEESAENGVSVDRGLQPQEKEEGENSDAEEQVPEATERKEHASCGQTG
LESVQSAQAVELAGAAPEKEQGNEEHGSGAADANQAEGHESNFIARLASQKQTRKNTQSF
KRKPGQADNERSMGDHNEHVHKRLRTVDTDSHTEQGPYQPQAQVEDADAFEHIKQGNDPY
DAQTYDVATKEQQQSAKDSSKDQEEEEIEDAFMDMEEQEELTAVDPEQLKPEEFKSGTMA
SPSFDEMEMETQTVKTEEGQDPRTDRSHKETENEKPERSRDSTIHTAPQFLVDTIFQPLL
KDVSELRQELERQLETWQPHESGHPEDERAAAEMWQSYLVLTAPLSQQLCEQLRLILEPT
QAAKLKGDYRTGKRLNMRKVIPYIASQFRKDKIWLRRTKPSKRHYQICLAIDDSSSMVDN
HTKQLAFESLAVIGNALTLLEVGQIAVCSFGESVKLLHPFHEQFGDYSGSQILRLCKFQQ
KKTKIAQFLESAANMFAAAQQLSHNITPETAQLLLVVSDGRGLFLEGKDRVLAAVQAARN
ANIFVIFVVLDNPSSRDSILDIKVPIFKGPGEMPEIRSYMEEFPFPFYIILRDVNALPET
LSDALRQWFELVTASDHP
Identical sequences F6XJW2
9796.ENSECAP00000007042 ENSECAP00000007042 ENSECAP00000007042
