SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSONIP00000002021 from Oreochromis niloticus 76_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSONIP00000002021
Domain Number 1 Region: 1029-1292
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  2.33e-42
Family                Extended AAA-ATPase domain                            0.014
 
Domain Number 2 Region: 1713-1961
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  1.3e-39
Family                Extended AAA-ATPase domain                            0.045
 
Domain Number 3 Region: 1344-1593
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  5.45e-38
Family                Extended AAA-ATPase domain                            0.044
 
Domain Number 4 Region: 642-702,798-961
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  1.73e-32
Family                Shikimate kinase (AroK)                               0.06
 
Domain Number 5 Region: 279-494,523-574
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  1.26e-26
Family                Extended AAA-ATPase domain                            0.024
 
Domain Number 6 Region: 2028-2085,2199-2291
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  4.17e-18
Family                ABC transporter ATPase domain-like                    0.067
 
Domain Number 7 Region: 5293-5497
Classification Level  Classification                                        E-value
Superfamily           vWA-like                                              2.09e-14
Family                Integrin A (or I) domain                              0.029
 
Weak hits

Sequence:  ENSONIP00000002021
Domain Number - Region: 3308-3343,3397-3499,3568-3620,3692-3713,3783-3905,4665-4685
Classification Level  Classification                                        E-value
Superfamily           ARM repeat                                            0.00144
Family                Clathrin adaptor core protein                         0.05
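
The Region strings in the hit tables above can be converted into residue counts with a few lines of Python. This is a minimal sketch, assuming coordinates are 1-based and inclusive (so a segment a-b spans b - a + 1 residues) and that comma-separated segments are pieces of one discontinuous domain. The names `parse_region` and `region_length` are illustrative and are not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '642-702,798-961' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Count residues in a region, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the strong hits listed on this page,
# keyed by domain number.
strong_hits = {
    1: "1029-1292",
    2: "1713-1961",
    3: "1344-1593",
    4: "642-702,798-961",
    5: "279-494,523-574",
    6: "2028-2085,2199-2291",
    7: "5293-5497",
}

SEQUENCE_LENGTH = 5507  # "Sequence length" reported on this page

# Total residues assigned to strong hits, and the fraction of the protein
# they cover (the strong-hit regions on this page do not overlap).
strong_total = sum(region_length(r) for r in strong_hits.values())
strong_fraction = strong_total / SEQUENCE_LENGTH

for number, region in sorted(strong_hits.items()):
    print(f"Domain {number}: {region_length(region)} residues ({region})")
print(f"Strong hits cover {strong_total} of {SEQUENCE_LENGTH} residues "
      f"({strong_fraction:.1%})")
```

The same helpers work on the weak-hit region string, which simply has more segments.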
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSONIP00000002021   Gene: ENSONIG00000001610   Transcript: ENSONIT00000002020
Sequence length 5507
Comment pep:known_by_projection scaffold:Orenil1.0:GL831328.1:373267:424685:1 gene:ENSONIG00000001610 transcript:ENSONIT00000002020 gene_biotype:protein_coding transcript_biotype:protein_coding
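
The Comment line has the layout of an Ensembl peptide FASTA header: space-separated `key:value` tokens, where the location token carries the coordinate system, assembly, sequence region name, start, end, and strand. The sketch below splits it into fields under that assumption. The helper names `parse_comment` and `parse_location` are hypothetical.

```python
def parse_comment(comment: str) -> dict[str, str]:
    """Split space-separated 'key:value' tokens on the first colon of each."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(coord_system: str, value: str) -> dict:
    """Unpack 'assembly:name:start:end:strand' (assumed Ensembl layout)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "coord_system": coord_system,
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


# Comment text copied from this page.
comment = (
    "pep:known_by_projection "
    "scaffold:Orenil1.0:GL831328.1:373267:424685:1 "
    "gene:ENSONIG00000001610 transcript:ENSONIT00000002020 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

fields = parse_comment(comment)
# In this record the location token is keyed by its coordinate system,
# "scaffold"; other records may use a different key such as "chromosome".
location = parse_location("scaffold", fields["scaffold"])
# Genomic span of the annotated locus, assuming inclusive coordinates.
span = location["end"] - location["start"] + 1

print(fields["gene"], fields["transcript"], location["name"], span)
```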
Sequence
MENLVLSLSSLGTIARHVDKSHSQLSQYLSKQIWSQQDRQCILDCLAQLLLEKDYTLLIA
RHLRPLTLDLLERNAERVKAGGSINHDLHERLCVALSKLLSISPDAQTFGARYLDNAPPV
FQRLFFTSEESSTVQYGPRRMKLRDLMGATLRFLQSDCAKFRMLWDWSPCMSLLLTSDVM
VRWYTAHCLALVSHMTDNQKTIFLRKVLTSDEILHMKMKGLEETQQLEVEKALVLANQGY
VTWCQEKANKFTRGQVVSEDLSQNVVAVCGVVLPRIVPRQPEQINQKDLVLVDSTCRNLR
RLALAVASQKPVLLEGPIGCGKTALVEFMAAVTGHAKTTEILKVQLGDQTDSKVLLGMYR
CTDIPGKFIWQPGTLTQAVSSGQWILLEDIDHAPLDVISALLPLMENKKLMIPGREDCID
VAPGFQFFATRRMYYSGGSWHRPQNSHAVLLDKYWTKLQMGNMTREELKKVLISRYPRLS
VVSDHLLEIFCQLTGERHSELDTNSLAMPDNSQQHNSDDKSTQLRALSLRDLLKWCERIS
VNFDCTSSATAQHVFLEALDCFTAMLSHPESRLRMAEIIGSKLNISREKAQHFCQMYQPG
ILLTELEASVGRVTLIRKQTEAVQLSVDNHTFAATRPSAVLLEQLAVCVAKGEPVLLVGE
TGTGKTSTVQQLARITGHRLRVVNMNQQSDTADLLGGYKPVDIKQILLPLREAFEDLFSQ
TYSRKQNLTFLGHVQTCFRGKRWQDLLKLMDHVCKSALTKELQEKSDAALLQEQWEALAS
KLNQTQQQIRACETAMVFAFVEGTLAQAVKKGHWILLDEINLAAAETLECLSGLLEGNTR
SLVLLDRGDTVEPLVRHPDFRLFACMNPATDVGKRNLPLGLRNRFTELYVEELENESDLR
ILISDYLKCLNPHRSVISGIISFYLTVRKEASSRLVDGRGHRPHYSLRTLCRALKYVALN
PCNNVQRSLYEGFCLSFLTQLDRSSHPVVQKLVCQHILMGNTKCLKQSIPAPSGRPCVEM
EGYWVSQGEMEPGLDPSYILTPSVKLNLRDLARVVSAGTHPVLIQGETSVGKTSLIRWLA
AATGNQCVRINNHEHTDVQEYIGCYSSDDRGKLVFKEGVLIDAMRKGYWIILDELNLAPT
DVLEALNRLLDDNRELFVAETQEVISAHPRFMLFATQNPPGLYGGRKVLSRAFRNRFVEL
HFDELPSAELEIILHQRCSLPPSYCTKLVKVMQDLQSLRKGSSVFAGKHGFITLRDLFRW
ADRYRLEEQAEASRDWLQHLADDGYMLLAGRVRKPEEEAIILSILEKHFKRTVNPEYLFS
QKQVASQFSPFIDSIAGVPEEFRHVVWTQDMRRLAVLLGRALRFGESVLLVGNTGCGKTT
ICQLFAALAGQKFFSVNCHLHMETSDFLGGLRPVRHTHQNDGEDGRLFEWHDGPLVQAMK
EGGVFLMDEISLADDSVLERLNSVLETEKSLVLAEKGSGDRGDVELIKAAANFRLVATMN
PGGDFGKKELSPALRNRFTEIWCPQTNNRSDLVKIIQHNLRSGLSLDGYEPCGDVAELML
DFIEWLTQQDFGCRCILSVRDILSWVNFLNAVYEDEAEWDLRLDTVTAFIHAACLVYIDG
IGSGTTASCADGVLLARRLCLSFLQQRLSKMTKLDQDVMDALRVYDSNLPREPQWGEDFF
GIDPFYIALGPHTESRNLSNYAIAAGTTAVNAQRILRALKLQRPVLLEGSPGVGKTSLVA
ALAKASGNHLVRINLSEQTDVTDLFGTDLPVEGGKGGEFAWRDGPLLAALKAGHWVVLDE
LNLASQSVLEGLNACFDHRAEIYIPELAMSFQVQHDKTKIFGCQNPFTQGGGRKGLPKSF
LNRFTQVFVDQLTTKDMEFIGDLIFPSIDKEIVIKMVEFSNRLVQEVSKDRHWGQKGSPW
EFNLRDLFRWCQLMQADQAPGFFNPGQHVALVYADRMRTESDKAQVLSVFRKVFGEDFEP
YCGPRELHITPFNLQVGYSVLHRSGGANVAPDTPLSITHQCLRPLESLMKCVEMGWMTIL
VGPTASGKTSLVRLLSLLTGHRLRIMAMNSAMDTTELLGGFEQVDIMRPWQQVLESVDYI
VAMVIRRGLMSLDGGIQDTEFLLQTWGLFCHWLKQEGLQRTGGTINSEALNKLEVIILLL
QKLNTKLKVFTDMSKLQMDFTLLKERLAQLEDGWTNGGFEWLDGMLVQALQSGDWLLMDN
VNFCNASVLDRLNALLEPGGSLVINERGVIDGSTPKITPHPNFRLFLTMDAVHGELSRAM
RNRGVEIYIPGEHEAVCWDTLDLKTLLHTAGVTGDCVCDLLIEIHKSVKSAIWDSPASSV
ASLLHAATLLSCQLQRGVDLPSALQHACGEAYSFCQHNSANQKKAQQVIEQHLSILDTEG
WGCGLLCAGVWPDSFPSALLSTEDSCFSTVIRDGQVLLFCLNTLSLQGKRRSQPLSLSDL
QRVLQNSGVHEGLVFSGAVGNLEDGNALRLIPTAVRLLTERASHGDWIMRSSWLSHLGKS
HKYAPDAALVQVEAGNRALKAVFGSKLAAKGKSLAELLQPHSTDEYRILVDMRWNKQYLD
ILANKCNFEDEERYTEFLEALNAVANRIVLMMDREERSVVCTCAVNLSGPNSVLQIATAF
SKGTVDIGHLPHPVLMHLQTFFELWGSFILQAVQTGQPYISDHIMFEILQLLQWLDRFWD
VCSTLTPDAQGISLLSLHWQWVQKHLILRLPQLLLGDADATFEELQATSTAIQTVLSTPV
SVATALKGIQKTLGKTLPFKLESVAKCASQLRLLSKALDVSDLKLDNNMIHSKQELVRLQ
GAGLGLDIKESLLEAWGLVLLANNQLDPQKSGVMERLVSSVEQQECCMTSRGLLEVCSMS
EHDESVDGVPLSVEELQQLSRRAQLWPLMEELAVLNQLRLSTDLLHLALSPSDQEAEDCL
RRDLSRHLRYCLQSTPMNVRQLQPLWFLLSSDRVSTEELPFVWCELLSEALSSLWTCSIT
SNPEHWLKWDPLNPETQTDPKQHQGTDNMGPALLSKAVRSHCVLEVLSCSKTGGSREPSG
EAFLSLSNVRLGDWKGRIQQLQDLSAVLWDNMSIRALADFRGTDFRLQGAVLRRQLQSMA
DVLPPTLQEGYMESCESLVEDPDPLIAAQEIQTVFRDFLPPKLLPDGLVEACLTCLQQYV
QSKVQGSNSLARSGVFWVNMGLLQIKVWTPQTIFDPAVKRAYKLNYAQQELALLQEEWKG
RSLWSQLMTGAQLEESSTTDGFQHPRIRYLWIRMQQVKEQIAELSRKQACRPHEPQYGQL
YQELQHYLCSIGQTSAIQDLLSQLLNTLESSSSSKSKSAVQALLKEEAVWQNSQKRFCQR
LLEDYPLYPDVVGPVRTGILQLRYGMRLVASQVAALLTPVPGLPRLVSCLLAFPFLSPSL
PSYLVRADFLCSRICMDTLRSLVKLLPQQDTACVIPQSCTLLLSALLYVQCHTLSTGEFS
HKARSIFRHICQALVNEWDERERRRKEHEEMEASLYRSRSRLHGSGLTEDEQEERDFRRQ
FPQFNKDFADIVSEPSLEGAADTSLEGLEDKTHEDSFEMKALSPATLNTVIQIHQRLCLG
YAQSLWYKSTAPANHSKEHIKALLSSYQIASPVMSHFYHLIDSDLDQQLTGSELLLSTIL
QNTVQGSGGAEGLTITPDGPYDFYQHPNMSEASLCLPVLEQLSVAVKQRLEDWPGHPALV
QLNVVMERIKAFNLASPVAKFLNGLEILLSKAQDWENNASRSVSLRKELEPVTQLIIQWR
KLELNCWSRSLDNAMKRHTEKSTKHWFSVYQLLERYLEEQRTESCTAEEVEHLSLSSVSS
TLQAFLEGSTLGEFHTRLNMLLTFHCHLLLVPPQPGQEPLSSLLWNLYKYYNQFSQGIQT
KITQHRQTIEKDLKDFVKICKWNDVNFWSIKQSVEKTHRTLFKFIKQFEEALNGPSIPAL
VEHGSGASLDSVDANPQEMPIHRVHQLLKSTLPLKSRILQTDGEEEEEEFKASSLQRSLP
VLSRKMKKMCIQLLKKNPVPELAEDLDCFTGEVISNLRDLQGLTINTSADKDKQKAEIKH
ILQQKQRALSDLFKMLTEIGLSYRKGLIWNRTAKAEKALYMQPLEMTTALSAVKTQENAE
SMLFAELLTAWDGCQKYFYQSWARNTALQTALQHAAKELGLGNVERCRGFSSHLFKLLLK
QRRRLAKLTEQWVHLRRLTDSVQGIKAHLQSQSEEQGCTLPPQASLQDWVKRGQALTAQC
NTLFQQLAWLLHCCPEDLQKEEVSSCKQHTLRCPSPLAAQRQPPGCLMRRGDAAWCQLHQ
RVTTMLEQTQSLKVELDCAAQQISDGVLHTWNYFTECCSVFNRLGAIGAQMPIVEQVFTS
DVCVGTPDSQPTVIQSLQYVRGQVEATVTEFTTWRIHVLSLGHDDTDHLHTFSTEFSAEL
EANINTLLCSVQTLVKKRERVQQKGAKEDKEEPLEDLLKPGHLTRLLEEELEAEVEALRV
GDVSSGLERLLSHLKTHRDSSQPPHFQVNRACRMLVRLEPLLGIYSDLVCYYLAVSLGAH
RSTGKLLSVLASIFTELAQKGFCLPQELMAGDGEGAEQFHDYEGGGIGEGKGTKDVSDKI
ENEDQLEDTFKDGEEKTEQHEKEDIKSEDNAVEMSEDFDGQMCDADEKEPGDDEESDKED
DEELDKKMGDLGEGQTDTLDEKMWGDDDDEEEEEGSDKEEESGQGMDQGESELVAKDDNL
DAADPKKDHKNQNKDQQIDEEEKDMINEQGDEREFDENEVDPYHGQQDKRPEPEAMDLPE
ELDLDQGDKEGDDDDDGDEGEEENPFNIDDRGMEADQDEEEDGEKDGGENEQGAEELGKD
QNGEENDGEPETAEEKEGGEEAEKDDSADKDEEEKDEERERESGRDEDTKTSSNEKGHEP
KEEEEGGGEDEELPECAERKWHDTDGQTGEDNIQSDTAVELAGEASERDEAKEEHGSGAA
DASQSEGHQSKLTATIASHRQTQSQTQSLKRKPGQADSERSIGDYNKPVNKRLRTVERSK
ESLENNRQSDTQQVSDLYEHIKHGDPNYDTQTYDVASAEQESTAGPRGQDEDDKEEDLSM
ETEDQEVDLQAAEVQELKPEQLDSSKASQTGLDPGEMEVQRQGEEQEDERWKEKQQAEEE
ERAGRSTDSTIHIVPELLLDSAQEKAQKRDPEETRREMELQLEAWHKLAPGTQEEEAAAA
SMWHQYQTLTSALSQQLCEQLRLILEPTQAAKLKGDYRTGKRLNMRKVIPYIASQFRKDK
IWLRRTKPSKREYQICLAVDDSSSMVDNHSKQLAFESLAVIINALTLLEVGQVSVCSFGE
QVQLLHPFQHQFNDESGARILRLCQFQQKKTRIAQFLETSVNMFLAARQQIPGSMNSETA
QLLVIVSDGRGLFLEGKERVMAAVRAARSAGIFIIFMVLDNPNSRDSILDIKVPIFKGPG
ELPEIHSYMDEFPFPFYVILRDINALPATLSDALRQWFELVTVAEHP
Identical sequences I3IZI1
ENSONIP00000002021 ENSONIP00000002021
