SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000001104 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000001104
Domain Number 1 Region: 1830-1953,1981-2018,2053-2134
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 1.51e-32
Family Tandem AAA-ATPase domain 0.021
Further Details:      
 
Domain Number 2 Region: 1519-1559,1675-1879
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 6.27e-23
Family Tandem AAA-ATPase domain 0.032
Further Details:      
 
Domain Number 3 Region: 2166-2386
Classification Level Classification E-value
Superfamily DNase I-like 9.55e-19
Family DNase I-like 0.06
Further Details:      
 
Domain Number 4 Region: 2-153
Classification Level Classification E-value
Superfamily Cysteine proteinases 0.000000000235
Family M48USP-like 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000001104   Gene: ENSGACG00000000857   Transcript: ENSGACT00000001104
Sequence length 2387
Comment pep:novel scaffold:BROADS1:scaffold_364:9741:17234:-1 gene:ENSGACG00000000857 transcript:ENSGACT00000001104 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VRASHSQASIRYGRSRNRQCTCNSLTFLAFLHENESITRADLDHVLDKGNTMYRETRKQV
TNHIYLTTDELTDEVPARSATHHADMTQLSRYGTLGEPLPGAVDSFLDLESGLSCLLSDV
KYALLLMRLLCIAVFRTKSGRYGFFDPHSRTARGLPLPAGSRTPGTAVMVTFSRLSDMID
RLKKSHRMMDTQSSCNYELKPVEFYTESSDIIFTSSVRRQEVINSNETELSSQSTFLPSH
DASLNEPSISSSVENTVASDLSDVLLHGISCKLTKCDKNERRKMKRRLMISVKTQQRKEI
QKRKEREKYASNKAYKADKISHAKSQYQNNPEFRQKKINNTRDNVDLKQKRKEYISSRYT
ENADFRQKKKHIFVTRYRNNAEFRENKRKILVSRYRNNPDFREKQKEILVSRYRNNPDFR
EKQKEILVSRYRNNPDFREKQKEILVSRYRNNPDFREKQKEILVSRYRNNPDFREKQKEI
LVSRYRNNPDFREKQKEILVSRYRNNPDFREKQKEILVSRYRNNPEFREKQKQHFTDRYR
DNAEFRKEKKMYFTAFYKNSIDFRERKRSHIIQRYAHDQAFRLRHKQLMKQRMKDRYKNN
PTYKSMRNMRCAMKIKRKYRGINKPTEESDNSLIKEAISVFRSQIKSGPTYVCTVCHKAS
FPNQVKPCKRLNYVRNPDVVAACLTGKYVHVCDDECRDEQQCNVPDERKEEWICHTCHNH
LKDGVMPTLAVANNLELADIPPELCDLNILERHLIAKCIAFAKIVPLPKGRQRAIHGNVV
CVPSEVQETVEALPRLRSESQVMRVKLKRRLCYRGHQLFQTVTWSKLVQALHKLKRIHPQ
YQDITIRDEAELCDPTLEIDMCEKSALCETETELDGEQDVDMLSCDGEQPEQQRDDKEQE
GDMPNGGFALESCLQPCDVSEEILCFSEGIYSVAPAESNSPVGFFKTPKLEAMAFPVQFP
TGQNTLDEGRRMKVTPSSYFKARLFCVDDRFARDTNYLFFAQFVTEIHLATSSMTIQLRK
GKPMTRDGRKITSGMLQNKREVEKLVRNKDAVRFMQPLRGTPAYWEKTTRDLFAMIRQLG
TPTFFCTFSAAEMRWPEVIEAIKRQQGEEVNFEGLDWSAKCDILRSNPVTTMRMFDKRVE
ALFRDVLLSPAQPLGKVVDYFYRVEFQHRGSPHIHCLIWVEGAPVFEEDDDHTVSAFVSK
YITAQLPDQLTQPELYKKVTEVQIHSKKHSRTCFRSPSSGCRFGFPKPPSRKTIISRPGE
GVAPLQLQIAKSKLQPLNLLLNEPETASLSLQQLLAKCNLTLQEYEGCLNVINKSSAVIL
KREPKDCWVNGYNAHLLEAWDANIDVSYILNAYSCIMYLTSYITKKESGLSEYLKTVIEN
STKDHVNECDEMREIMQAYSKKREVSAQECVTRACGINMKKCSRGVIFVPTDDNALEMSR
PMSYLENTTLESVNVWMTSLTDKYKSRPETPEFEQMCLADFAATCRIVYGQQKKGKDVLP
LLNDMGFVQKRKNNKPAIIRFYRCSQEKYPEKFYGTLLKLYIPHRSDLELKRRHFPTYES
FYKSGCVQLPGSDHPEYVRHIVKRNKDKYEKNSEDIENAVEEFEQNRGVIDEWCNLAPES
EVERLECIEELEAREPDHENVQENVPEYTNQANAATEARAIREPPAFDPTVLRQMYQNLN
QKQACVFYAVRDWCVKRVCGLNPEQFFFYVNGGAGTGKSHLIKCIYSEASKILCKLPSHS
EEVDISNPTVLLTAFTGTAAFNISGSTLHSLLKLPRSLKPPFQGLGNQLDEVRSELLNAE
ILIIDEVSMVSKPLFAYVDARLKQIKGSTKPFGGMSVIAVGDFYQLPPVRQSKTLCVYEP
CEIDLWQEHFQTITLTEIMRQKDDVAFAEMLNRIRVKGKSDELSQADRALLSQTITEPSL
CPPDVLHIFATNKQVDSHNSVTLALLHSNITNIDADDYKKDPRTGRMARQAQPYKGNRNE
LPDTLNVAEGARVMLTRNIDVSQGLVNGSFATLVRVITAEQSGVAHVTMLGLKMDDQTAG
RNYRNRAPGGPDDVVYIERAEDNLKQKGVVRRQFPVRLAFACTIHKVQGMTRTSAVVSLK
HIFEPGMAYVAISRVTSLSGLHMLDLDESKIYANPEITGALETMRQVNLDDMMPLLRIKE
ASSRHDTLTIVHHNTEGLPSHIIDIRSHHELCLADVLCLTETHLQGSFVAESLHLEGYNM
FKRNRHLSYTNVPQIANRGGGGVAVYVKSHIQVREKQYVHNVTDLEFVALKVEAPVRALI
AAVYRPPDYSVRSFLSNLGSLLDSLEIMDCQPIIVCGDFNENLFSNTGKPILELFQSRGF
AQVITAATTDKNTLLDLIFISQPQRCLHSGVMRTYYSYHNPVYCVMS
Download sequence
Identical sequences G3N729
ENSGACP00000001104 ENSGACP00000001104 69293.ENSGACP00000001104

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]