SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000000226 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000000226
Domain Number 1 Region: 1818-1941,1969-2006,2035-2116
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 1.27e-31
Family Tandem AAA-ATPase domain 0.019
Further Details:      
 
Domain Number 2 Region: 1506-1547,1663-1867
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 6.18e-23
Family Tandem AAA-ATPase domain 0.032
Further Details:      
 
Domain Number 3 Region: 2147-2366
Classification Level Classification E-value
Superfamily DNase I-like 9.03e-19
Family DNase I-like 0.06
Further Details:      
 
Domain Number 4 Region: 2-182
Classification Level Classification E-value
Superfamily Cysteine proteinases 0.0000000000785
Family M48USP-like 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000000226   Gene: ENSGACG00000000172   Transcript: ENSGACT00000000226
Sequence length 2368
Comment pep:novel scaffold:BROADS1:scaffold_230:6653:13853:1 gene:ENSGACG00000000172 transcript:ENSGACT00000000226 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VRASHSQASIRYGRSRNRQCTCNSLTFLAFLHENESITRADLDHVLDKGNTMYRETRKQV
TNHIYLTTDELTDEVPARSATHHADMTHLSRYGTLGEPLPGAVDSFLDLESGLSCLLSDV
KYALLLMRLLCIAVFRTKSGRYGFFDPHSRTARGLPLPAGSRTPGTAVMVTFSRLSDMID
RLKKSHRMMDTQSSCNYELKPVEFYNVNTVNLNGVPADESNSPTATDQTSSPELTPHTPA
VNQPEVLITTSSHDAPHNESESTAQVINPVLSDPSDDILTAESSDIIFTSSVRQQEVMNS
NEPELSTQSTFLPSHDASLNEPNISSSVANTVASDLSHVLLQGISCKLTKCDKNKRRKIK
RRLMISEKTQQRKESKKRKEREKYASNKGFKTEKKSHAKSKYQNNPEFRQKKINNTRDNV
DFQQKRKEYISNRYRENADFRQKKKQIFVSRYRNNPDFRQKKKQIFVSRYRNNPDFRQKK
KQIFVSHYRNNPDFRQKKKQTFVSRYRNNPDFREKKKQHFTNRYQDNAEFRKEKKLSFTA
FYKNSIDFRQKKREKERSHIIQRYTHDQAFRLRHKQLMKQRMKDRYKNNPTYKSMRNMRC
AIKIKRKYRQINKPTEESDNSLIKEAISVFRSQIKSGPTYVCTVCHKAAFPNQVKPCKRL
NYVRNPDVVAACLTGKYVHVCDDQCRDEQQCNVPDERKEEWICHTCQQSSMGVMPTLAVA
NNLELADIPPELCDPTYWERHLIAKCIAFAKIVPLPKGRQRAIHGNVVCVPSEVQETVEA
LPRLRSESQVMRVKLKRRLCYRGHQLFQTVTWSKLVQALHKLKRIHPQYQDITIRDEAEL
CDPTLEIDMCEISALCETETELDGEQDVDMLSCDREQPEQQRDDEEQEGEMPNGGFALES
CLQPCDVSEEILCFSEGIYSVAPAESNSPVGFFKTPKLEAMAFPVQFPTGQNTLDEGRRM
KVTPSSYFKARLFCVDDRFARDTNYLFFAQFVTEIHLATSSMTIQLRKGKPMTRDGRKIT
SGMLQNKREVEKLVRNKDAVRFMQPLRGTPAYWEKTTRDLFAMIRQLGTPTFFCTFSAAE
MRWPEVIEAIKRQQGEEVNFEGLDWSAKCDILRSNPVTTMRMFDKRVEALFRDVLLSPAQ
PLGKVVDYFYRVEFQHRGSPHIHCLIWVEGAPVFEEDDDHTVSAFVSKYITAQLPDQLTQ
PELYKKVTEVQIHSKKHSRTCFRSPSSGCRFGFPKPPSRKTMISRPGEGVAPLQLQIAKS
KLQPLNLLLNEPETASLSLQQLLVKCNLTLQEYEGCLNVINKSSAVILKREPKDCWVNGY
NAHLLEAWDANIDVSYILNAYSCIMYLTSYITKKESGLSEYLKTVIENSTKDHVNECDEM
REIMQAYSKKREVSAQECVTRACGINMKKCSRGVIFVPTDDNALKMSRPMSYLENTTLES
VNVWMTSLTDKYKSRPETPEFEQMCLADFAATCRIVYGQQKKGKDVLPLLNDMGFVQKRK
NDKPAIIRFYRCSQEKYPEKFYGTLLKLYIPHRSDLELKRRHFPTYESFYKSGCVQLPGS
DHPEYVRHIVKRNKDKYEKNSEDIENAVEEFEQNRGVIDEWCNLAPESEVERLECIEELE
AREPDHENVQENVPEYTNQGNAATEARAIRELPTFDPTLLRQMYQNLNQKQACVFYAVRD
WCVKRVCGLDPEQFFFYVNGGAGTGKSHLIKCIYSEASKILCKLPSHSEEVDISNPTVLL
TAFTGTAAFNISGSTLHSLLKLPRSLKPPFQGLGNQLDEVRSELLNAEILIIDEVSMVSK
PLFAYVDARLKQIKGSTKPFGGMSVIAVGDFYQLPPVRQSKTLCVYEPCEIDLWQEHFQT
ITLTEIMRQKDDVAFAEMLNRIRVKGKSDELSQADRALLSQTITEPSLCPPDVLHIFATN
KQVDSHNSVTLALLHSNITNIDADDYKKDPRTGRMARQAQPYKGTRNELPDTLNVAEGAR
VMLTRNIDVSQGLVNGSFATLVRSGVAHVTMLGLKMDDQTAGRNYRNRAPGGPDDVVYIE
RAEDNLKQKGVVRRQFPVRLAFACTIHKVQGMTRTSAVVSLKHIFEPGMAYVAISRVTSL
SGLHMLDLDESKIYANPEITGALETMRQVNLDDVMPLLRIKETSSRHDTLTIVHHNTEGL
PSHIIDIRSHHELCLADVLCLTETHLQGSFVAQSLHLEGYNMFKRNRHLSYTNVPQIANR
GGGGVAVYVKSHIQVREKQYVHNVTDLEFVALKVEAPVRALIAAVYRPPDYSVRSFLSNL
GSLLDSLEIMDCQPIIVCGDFNENLLHDGKPILELFQSRGFAQVITNATTDKNTLLDLIF
ISQPQRCLHSGVMRTYYSYHNPVYCIMS
Download sequence
Identical sequences G3N4K6
ENSGACP00000000226 69293.ENSGACP00000000226 ENSGACP00000000226

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]