SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000031469 from Pan troglodytes 76_2.1.4


Domain assignment details

Strong hits

Sequence:  ENSPTRP00000031469
Domain Number 1 Region: 1048-1318
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 6.18e-43
Family Extended AAA-ATPase domain 0.011
 
Domain Number 2 Region: 1736-1993
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.92e-39
Family Extended AAA-ATPase domain 0.08
 
Domain Number 3 Region: 659-720,816-978
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 1.67e-33
Family Extended AAA-ATPase domain 0.077
 
Domain Number 4 Region: 1362-1614
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.86e-30
Family Shikimate kinase (AroK) 0.072
 
Domain Number 5 Region: 302-591
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 4.45e-29
Family Extended AAA-ATPase domain 0.023
 
Domain Number 6 Region: 2051-2107,2220-2311
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 3.13e-19
Family Extended AAA-ATPase domain 0.049
 
Domain Number 7 Region: 5381-5584
Classification Level Classification E-value
Superfamily vWA-like 6.67e-14
Family Integrin A (or I) domain 0.045
 
Weak hits

Sequence:  ENSPTRP00000031469
Domain Number - Region: 39-207
Classification Level Classification E-value
Superfamily ARM repeat 0.00453
Family Armadillo repeat 0.066
 
Domain Number - Region: 2476-2627
Classification Level Classification E-value
Superfamily ARM repeat 0.0173
Family Clathrin adaptor core protein 0.054
 
Domain Number - Region: 3064-3109
Classification Level Classification E-value
Superfamily S-adenosylmethionine decarboxylase 0.0365
Family Bacterial S-adenosylmethionine decarboxylase 0.0065
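
The region strings of the strong hits listed earlier can be converted into residue counts with a short script. The sketch below is illustrative only and is not part of SUPERFAMILY. It assumes that region coordinates are 1-based and inclusive, and that comma-separated ranges are segments of a single discontinuous domain. The names parse_region, region_length and STRONG_HITS are hypothetical helpers introduced here; the region values and sequence length are copied from this page.

```python
def parse_region(region):
    """Split a region string such as '659-720,816-978' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Count residues in a region, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Strong-hit regions copied from this page, keyed by domain number.
STRONG_HITS = {
    1: "1048-1318",
    2: "1736-1993",
    3: "659-720,816-978",
    4: "1362-1614",
    5: "302-591",
    6: "2051-2107,2220-2311",
    7: "5381-5584",
}
SEQUENCE_LENGTH = 5594  # "Sequence length" as reported on this page

lengths = {number: region_length(region) for number, region in STRONG_HITS.items()}
total_assigned = sum(lengths.values())  # regions do not overlap, so a plain sum suffices
coverage = total_assigned / SEQUENCE_LENGTH
```

Under these assumptions the seven strong-hit domains cover 1650 of the 5594 residues, roughly 29% of the sequence.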
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000031469   Gene: ENSPTRG00000018423   Transcript: ENSPTRT00000034049
Sequence length 5594
Comment pep:known_by_projection chromosome:CHIMP2.1.4:6:90277499:90456929:-1 gene:ENSPTRG00000018423 transcript:ENSPTRT00000034049 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MEHFLLEVAAAPLRLIAAKNEKSRSELGRFLAKQVWTPQDRQCVLSTLAQLLLDKDCTVL
VGRQLRPLLLDLLERNAEAIKAGGQINHDLHERLCVSMSKLIGNHPDVLPFALRYFKDTS
PVFQRLFLESSDANPVRYGRRRMKLRDLMEAAFKFLQQEQSVFRELWDWSVCVPLLRSHD
TLVRWYTANCLALVTCMNEEHKLSFLKKMFNSDELIHFRLRLLEEAQLQDLEKALVLANP
EVSLWHKQKELQYLQGHLVSSDLSPRVTAVCGVVLPGQLPAPGELGGNRSSSREQELALR
SYVLVESVCKNLQTLAMAVASQNAVLLEGPIGCGKTSLVEYLAAVTGRTKPPQLLKVQLG
DQTDSKMLLGMYRCTDVPGEFVWQPGTLTQAATMGHWILLEDIDYAPLDVVSVLIPLLEN
GELLIPGRGDCLKVAPGFQFFATRRLLSCGGNWYRPLNSHATLLDKYWTKIHLDNLDKRE
LNEVLQSRYPSLLAVVDHLLDIYIQLTGEKHHSWSDSSVGCEQAPEEVSEARRENKRPTL
EGRELSLRDLLNWCNRIAHSFDSSSLSASLNIFQEALDCFTAMLSEHTSKLKMAEVIGSK
LNISRKKAEFFCQLYKPEIVINELDLQVGRVRLLRKQSEAVHIQREKFTFAATRPSSVLI
EQLAVCVSKGEPVLLVGETGTGKTSTVQYLAHITGHRLRVVNMNQQSDTADLLGGYKPVD
HKLIWLPLREAFEELFAQTFSKKQNFTFLGHIQTCYRQKRWHDLLRLMQHVHKSAVNKDG
KDSETGLLIKEKWEAFGLRLNHAQQQMKMTENTLLFAFVEGTLAQAVKKGEWILLDEINL
AAPEILECLSGLLEGSSGSLVLLDRGDTEPLVRHPDFCLFACMNPATDVGKRNLPPGIRN
RFTELYVEELESKEDLQVLIIDYLKGLSVNKNTVQGIINFYTALRKESGTKLVDGTGHRP
HYSLRTLCRALRFAASNPCGNIQRSLYEGFCLGFLTQLDRASHPIVQKLICQHIVPGNVK
SLLKQPIPEPKGGRLIQVEGYWIAVGDKEPTIDETYILTSSVKLNLRDIVRVVSAGTYPV
LIQGETSVGKTSLIQWLAAATGNHCVRINNHEHTDIQEYIGCYTSDSSGKLVFKEGVLID
AMRKGYWIILDELNLAPTDVLEALNRLLDDNRELLVTETQEVVKAHPRFMLFATQNPPGL
YGGRKVLSRAFRNRFVELHFDELPSSELETILHKRCSLPPSYCSKLVKVMLDLQSYRRSS
SVFAGKQGFITLRDLFRWAERYRLAEQTEKEYDWLQHLANDGYMLLAGRVRKQEEVDVIQ
EVLEKHFKKKLCPQSLFSKENVLKLLGKLSTQISTLECNFGHIVWTEGMRRLAMLVGRAL
EFGEPVLLVGDTGCGKTTICQVFAALANQKLYSVSCHLHMETSDFLGGLRPVRQKPNDKE
EIDTSRLFEWHDGPLVQAMKEDGFFLLDEISLADDSVLERLNSVLEVEKSLVLAEKGSPE
DKDSEVELLTAGKRIILATMNPGGDGKKGLSPALRNRFTEIWCPQSTSREDLIQIINHNL
RPGLCLGRTDPKGSDIPEVMLDFIDWLTHQEFGRKCVVSIRDILSWVNFMNKMGEEAALK
RPEIISTVTSFVHAACLVYIDGIGSGVTSSGFGTALLARKECLKFLIKRLAKIVRLTEYQ
KNELKIYDRMKAKEFTGIDNLWGIHPFFIPRGPVLHRNNIADYALSAGTTAMNAQRLLRA
TKLKKPILLEGSPGVGKTSLVGALAKASGNTLVRINLSEQTDITDLFGADLPVEGGKGGE
FAWRDGPLLAALKAGHWVVLDELNLASQSVLEGLNACFDHRGEIYVPELGMSFQVQHEKT
KIFGCQNPFRQGGGRKGLPRSFLNRFTQVFVDPLTVIDMEFIASTLFPAIEKNIVKKMVA
FNNQIDHEVTVEKKWGQKGGPWEFNLRDLFRWCQLMLVDQSPGCYDPGQHVFLVYGERMR
TEEDKKKVIAVFKDVFGSNSNPYMGTRLFRITPYDVQLGYSVLSRGSCVPHPSRHPLLLL
HQSFQPLESIMKCVQMSWMVILVGPASVGKTSLVQLLAHLTGHTLKIMAMNSAMDTTELL
GGFEQVDLIRPWRRLLEKVEGTVRALLRDSLLISADDAEVVLRAWSHFLLTYKPKCLGEG
GKAITMEIVNKLEAVLLLMQRLNNKINSYCKAEFAKLVEEFRSFGVKLTQLASGHSHGTF
EWVDSMLVQALKSGDWLLMDNVNFCNPSVLDRLNALLEPGGVLTISERGMIDGSTPTITP
NPNFRLFLSMDPVHGDISRAMRNRGLEIYISGEGDASTPDNLDLKVLLHSLGLVGNSVCD
ILLALHTETRSTVVGSPTSSVSTLIQTAILIVQYLQRGLSLDRAFSEACWEVYVCSQHSP
ANRKLVQALLEKHVSSLRAHETWGDSILGMGLWPDSVPSALFATEDSHLSTVRRDGQILA
YCLNRMSMKTSSWTRSQPFTLQDLEKIMQSPSPENLKFNAVEVNTYWIDEPDVLVMAVKL
LIERATNQDWMLRVKWLYHLAKNIPQGLESIQIHLEASAASLRNFYSHSLSGAVSNVFKI
LQPNTTDEFVIPLDPRWNVQALDMIRNLMDFDPQTDQPDQLFALLESAANKTIIYLDREK
RVFTEANLVSVGSKKLRESVLRMSFEFHQDPESYHTLPHEIVVNLAAFFELCDALVLLWV
QSSQGMVSDASVNEILGSLRWRDRFWTVADTVKVDAPGLALLALHWHWVLKHLVHQIPRL
LMNYEDKYYKEVQTVSEHIQNCLGSQTGGFAGIKKLQKFLGRPFPFKDKLVVECFSQLKV
LNKVLAIREQMSALGESGWQEDINRLQVVASQWTLKKSLLQAWGLILRANILEDVSLDEL
KNFVHAQCLELKAKGLSLGFLEKKHDEASSLSHPDLTSVIHLTRSVQLWPAMEYLAMLWR
YKVTADFMAQACLRRCSKNQQPQINEEISHLISFCLYHTPVTPQELRDLWSLLHHQKVSP
EEITSLWSELFNSMFMSFWSSTVTTNPEYWLMWNPLPGMQQREAPKSVLDSTLKGPGNLN
RPIFSKCCFEVLTSSWRASPWDVSGLPILSSSHVTLGEWVERTQQLQDISSMLWTNMAIS
SVAEFRRTDSQLQGQVLFRHLAGLAELLPESRRQEYMQNCEQLLLGSSQAFQHVGQTLGD
MAGQEVLPKELLCQLLTSLHHFVGEGESKRSLPEPAQRGSLWVSLGLLQIQTWLPQARFD
PAVKREYKLNYAKEELHQLQCEWKTRNLSSQLQTGRDLEDEVVVSYSHPHVRLLRQRMDR
LDNLTCHLLKKQAFRPQLPAYESLVQEIHHYVTSIAKAPAVQDLLTRLLQALHMDGPRSA
QVAQSLLKEEASWQQSHHQFRKRLSEEYTFYPDAVSPLQASILQLQHGMRLVASELHTSL
HSSMVGADRLGTLATALLAFPSVGPTFPTYYAHADTLCSVKSEEVLRGLGKLILKRSGGK
ELEGKGQKACPTREQLLMNALLYLRSHVLCKGELDQRALQLFRHVCQEIISEWDEQERIA
QEKAEQESSLYRYRSRNSRTALSEEEEEEREFRKQFPLHEKDFADILVQPELEENKGTSD
GQEEEAGTNPALLSQNSMQAVMLIHQQLCLNFARSLWYQQTLPPHEAKHYLSLFLSCYQT
GASLVTHFYPLMGVELNDRLLGSQLLACTLSHNTLFGEAPSDLMVKPDGPYDFYQHPNVP
EARQCQPVLQGFSEAVSHLLQDWPEHPALEQLLVVMDRIRSFPLSSPISKFLNGLEILLA
KAQDWEENASRVLSLRKHLDLISQMIIRWRKLELNCWSMSLDNTMKRHTEKSTKHWFSIY
QMLEKHMQEQTEEQEDDKQMTLMLLVSTLQAFIEGSSLGEFHVRLQMLLVFHCHVLLMPQ
VEGKDSLCSVLWNLYHYYKQFFDRVQAKIVELRSPLEKELKEFVKISKWNDVSFWSIKQS
VEKTHRTLFKFMKKFEAVLSEPCRSSLVESDKEEQPDFLPRPTDGAASERSSIQNLNRAL
RETLLAQPAAGQATIPEWCQGAAPSGLEGELLRRLPKLRKRMRKMCLTFMKESPLPRLVE
GLDQFTGEVISSVSELQSLKVEPSAEKEKQRSEAKHILMQKQRALSDLFKHLAKIGLSYR
KGLAWARSKNPQEMLHLHPLDLQSALSIVSSTQEADSRLLTEISSSWDGCQKYFYRSLAR
HARLNAALATPAKEMGMGNVERCRGFSAHLMKMLVRQRRSLTTLSEQWIILRNLLSCVQE
IHRRLMGPQAYPVAFPPQDGVQQWTERLQHLAMQCQILLEQLSWLLQCCPSVGPAPGHGN
VQVLGQPPAPCLEGPELSKGQLCGVVLDLIPSNLSYPSPLPGSQLPSGCRMRKQDHLWQQ
STTRLTEMLKTIKTVKADVDKIRQQSCETLFHSWKDFEVCSSALSCLSQVSVHLQGLESL
FIIPGMEVEQRDSQMALVESLEYVRGEISKAMADFTTWKTHLLTSGSQGGNQTLDEGFVE
DFSEQMEIAIRAILCAIQNLEGRKNEKAEENTDQASPQEEYVGFERLQSGHLTKLLEDDF
WADVSTLHVQKIISAISELLERLKSYGEDGTAAKHLFFSQSCSLLVRLVPVLSSYSDLVL
FFLTMSLATHRSTAKLLSVLAQVFTELAQKGFCLPKEFMEDSAGEGATEFHDYEGGGIGE
GEGMKDVSDQIENEEQVEDTFQKGQEKDKEDPDSKSDIKGEDNAIEMSEDFDGKMHDGEL
EEQEEDDEKSDSEGGDLDKHMGDLNGEEADKLDERLWGDDDEEEDEEEEDNKTEETGPGM
DEEDSELVAKDDNLDSGNSNKDKSQQDKKEEKEEAEADDGGQGEDKINEQIDEREYDENE
VDPYHGNQEKVPEPEALDLPDDLNLDSEDKNGGEDTDNEEGEEENPLEIKEKPEEAGHEA
EERGETETDQNESQSPQEPEEGPSEDDEAEGEEEMDTGADDQDGDAAQHPEEHSEEQQQS
VEEKDKEADEEGGENGPADQGFQPQEEEEREDSDTEEQVPEALERKEHASCGQTGVENMQ
NTQAMELAGAAPEKEQGKEEHGSGAADANQAEGHESNFIAQLASQKHTRKNTQSFKRKPG
QADNERSMGDHNERVHKRLRTVDTDSHAEQGPAQQPQAQVEDADAFEHIKQGSDAYDAQT
YDVASKEQQQSAKDSGKDQEEEEIEDTLMDTEEQEEFKAADVEQLKPEEIKSGTTAPLGF
DEMEVEIQTVKTEEDQDPRTDKAHKETENEKPERSRESTIHTAHQFLMDTIFQPFLKDVN
ELRQELERQLEMWQPRESGNPEEEKVAAEMWQSYLILTAPLSQRLCEELRLILEPTQAAK
LKGDYRTGKRLNIRKVIPYIASQFRKDKIWLRRTKPSKRQYQICLAIDDSSSMVDNHTKQ
LAFESLAVIGNALTLLEVGQIAVCSFGESVKLLHPFHEQFSDYSGSQILRLCKFQQKKTK
IAQFLESVANMFAAAQQLSQNISSETAQLLLVVSDGRGLFLEGKERVLAAVQAARNANIF
VIFVVLDNPSSRDSILDIKVPIFKGPGEMPEIRSYMEEFPFPYYIILRDVNALPETLSDA
LRQWFELVTASDHP
Identical sequences ENSPTRP00000031469
