SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSGACP00000021680 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSGACP00000021680

Domain Number 1 Region: 3944-4310
Classification Level  Classification                             E-value
Superfamily           RCC1/BLIP-II                               9.16e-92
Family                Regulator of chromosome condensation RCC1  0.00086

Domain Number 2 Region: 4438-4786
Classification Level  Classification                             E-value
Superfamily           Hect, E3 ligase catalytic domain           1.83e-85
Family                Hect, E3 ligase catalytic domain           0.0000941

Domain Number 3 Region: 356-578
Classification Level  Classification                             E-value
Superfamily           RCC1/BLIP-II                               1.7e-52
Family                Regulator of chromosome condensation RCC1  0.013

Domain Number 4 Region: 559-734
Classification Level  Classification                             E-value
Superfamily           RCC1/BLIP-II                               3.27e-46
Family                Regulator of chromosome condensation RCC1  0.012

Domain Number 5 Region: 3367-3463,3528-3661,3688-3760
Classification Level  Classification                             E-value
Superfamily           WD40 repeat-like                           6.87e-30
Family                WD40-repeat                                0.01

Domain Number 6 Region: 2015-2191
Classification Level  Classification                             E-value
Superfamily           Concanavalin A-like lectins/glucanases     8.31e-29
Family                SPRY domain                                0.0046
 
Weak hits

Sequence: ENSGACP00000021680

Domain Number - Region: 2696-2754
Classification Level  Classification                             E-value
Superfamily           UBA-like                                   0.0474
Family                UBA domain                                 0.017
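
The region strings in the domain assignments above can name several comma-separated segments (Domain 5), and neighbouring domains can share residues (Domains 3 and 4). The Python sketch below shows one way to parse those strings, compute domain lengths, and find overlaps. It assumes coordinates are 1-based and inclusive, which this page does not state. The helper names `parse_region`, `region_length`, and `overlap` are hypothetical and not part of any SUPERFAMILY tool.

```python
from itertools import combinations

# Strong-hit regions copied from the domain assignment details above.
STRONG_HITS = {
    1: "3944-4310",
    2: "4438-4786",
    3: "356-578",
    4: "559-734",
    5: "3367-3463,3528-3661,3688-3760",
    6: "2015-2191",
}


def parse_region(region):
    """Parse a region string such as '3367-3463,3528-3661' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments):
    """Total residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


def overlap(a, b):
    """Number of residues shared between two lists of segments."""
    shared = 0
    for s1, e1 in a:
        for s2, e2 in b:
            shared += max(0, min(e1, e2) - max(s1, s2) + 1)
    return shared


parsed = {num: parse_region(region) for num, region in STRONG_HITS.items()}

for num, segments in parsed.items():
    print(f"Domain {num}: {region_length(segments)} residues in {len(segments)} segment(s)")

# Report every pair of strong hits that shares at least one residue.
for (n1, s1), (n2, s2) in combinations(parsed.items(), 2):
    shared = overlap(s1, s2)
    if shared:
        print(f"Domains {n1} and {n2} overlap by {shared} residues")
```

Under the inclusive-coordinate assumption, Domains 3 and 4 share residues 559-578, which is the only overlap among the strong hits.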
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000021680   Gene: ENSGACG00000016410   Transcript: ENSGACT00000021721
Sequence length 4810
Comment pep:known_by_projection group:BROADS1:groupXIV:3743206:3774676:1 gene:ENSGACG00000016410 transcript:ENSGACT00000021721 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MVPCQTHVLLKWQEHFNSSWAAEDSVQTARRHGAAVLYNKLLLNKEVVTLAQPVQELVGP
RLPDFECESSASAEKEEYLSSLLHSQRWLAHRMLTQTSYTLGLHHRLVVLQRIYYALHRK
YHDKFRVQLPSQSTESGTECGQLELASEPCRPGGASKVKSGTDVLIEMGVRTGLSLLFSL
LQQNWRYAASVHPESVLCNDVLATASSVLTSLPPLSLANENKIPSVGLDCLSQVSDFLKK
TSVSSGTGGADPTGRRLALELLLGLAMQRGSLKFLLEWVEVALAASTLSSQNSAGVGFDV
IHQTLLQMKQYSGFRGDSVNTQVPKKDADGLCRLSQAALCLFEEICNLASYCLCSCGTNA
AAPGSESDTVVVYVWGSNSSHQLAEGTLEKILLPKLTQGFSDAQMIEAGQYCTFSVSADG
SVKACGKGSYGRLGLGDSNNQSMPKKLVLEPHRNMKKVSSSKGSDGHTLAITLEGEVFSW
GDGEYGKLGHGNSATQKYPKIIQGPLFGKVVVCVSAGYRHSAAVTNDGELYTWGEGDFGR
LGHSDSQSRNVPTLVKDISGVGQVACGSSHTVAVAQDGRTVWSFGGGDNGKLGHGDTNRV
YRPKVIEALHGFIIRKVCAGSQSSLALTSAGQVFAWGCGSCLGCGSSETTSLRPRFIEDL
SITKIIDISCGDSHCLALTHENEVYAWGNNTMGQCGQGHTSTPITKPKKVLGLEGVSIQQ
ITAGTSHSLAWTADPTDRQLVAWHRPFCVDLEESTFTYLRNFLESYCDGIGNDTPPAPFL
SKRDHHQFILLCMKLLSIHLSLAHAGGTGAMVLGAQGRPLRNLLFRLIDTNMPDSIQQAV
LNTLSIGASLLLPPLRERTELLLSLLPQGPQSLNVPSKGQKRLQLDMVLSSLQDQSHVSS
LLGYSHFGEGTSLGPPLTPVMSARLPSSSSSVESYNPLHLAEVLLRTLLLSIGVYTERAF
GELEKNSDKQPSSDRQGQTDPPCHFHQLLSGLHKHLLAHCYIHPNPEDDSSVTLLREHLY
RLLPCAAETLRRSTKLLKDSSLDKQIIKKLHVVLYNSVAGNLLCQVMYSLLLLPLGMVQP
LLSHLLALLEQLNDFNRLLPETGLLEEQELRMMAHFGIFLTDKGEENTCLGEQQPEEESK
WVWLLDLQRSVALAVGRCLGGMLQGPPPSLQEKTSDFWLSNVLLRNGLETDFEQLDSSMA
WLTEVVLLGSSDARLADLSLNEDTRTLLELALGSSRGTAGGLWRNMEEYAHRKEWESGGS
SGDCLLQTVCRCSLAALLKHTGLQNQACWQDRYEPCEMLLNVYETVDKIRSTLLAHKNYP
RITQVQTAVKQTTQTPQSPDVDRQEPGTSVEQVGKDQQQQSEGHCSWEVQVPPPPTANHN
QEEVAEEDTFGSSQVLESFMATREAITSTLPSSMESPVSPERELDPKQQREKQRESEESL
GFPAACHLVVSRCLYLLLGIRPACVEPSVRENSEAVNTGARLVSNRQLCLRVFVVHLCVV
VFFWKSTTEFLMKAADGQKQRLMGFLLKPKKEYSTGLHSSGRSVPGLPLCSLGIMKDAWD
RLRQCISPTSPMSHCGSNTLPILNQVFHFVCGSLIHVSSSSPTPPGAQTCTQADPKAIYF
AILQQQHRAELRLEAFCQMSSFLSRMEERSSGFIVSPQFPALLQSVQLQFLSGCFALGTQ
IIASGANYEMQHYMSGTHSASMDTQRELQSAAHTFYQQVVRVLKQRLLSEKERPGSYQHL
LLATMFALNFSYKPVDLVLVIKCGILEILSMLTNNSCALMNQSWFAASKSGPILLSGAVR
LACARLLQILTVAASSCEDLLPMDVSHALMEVLREQLQNILYTFQQQQTAERLTAEGADL
DFCSKSAKPIGALNSPSVRVVLGAESSKVVESQLADFLVFLRRVLSLRVMKRLPTFVQWI
DPIMAIVSHKCSSGSSCFQNLRTELLAFHVLEKVLPACSEPVQIQQIVEQLFQLLSVYMW
KEPLAEKSHEEMAEKEKLMSLQNPGSYDECIPIGDFSFDPHKTICCSLESGNIISHGSGG
KGYGLATTAITAGCFIWKFYITKENRGNEATCIGVSRWPVKDHNHHTTTNMWLYRAYSGN
LYHGGELVRTLPSFTQGDTVTCILDMEAHTISFAKNDKEPKLAFEGVVASELYPCVLFYS
SNPGEKVALRDLKMRGMPSNYLPGEPLCSPRTTVLLESTVQLLRRLHQCDQWTSHINQYI
HTHLELIGPLLKEDDFNLGVGPTHSETELNEGNFAGDDTVDFLGHRRSLSEAKLAALCTE
VWPVLALIGGVDSGLRAGGLCMHKPSGRRALLLGVLKEGSSLAKLQWEEADLSVSSMNSW
SPSDTPIVSLEPCGTSCCDVTSLGGLKPTVLLDLIYLMGLLEEQGWQGTYPASKRKYLEE
NTRESELVETDPCTSSSSTKEASKIVDHQELTCLPQPLQNSQTSKVDAFAHELRAVRVSY
LLIGALKSLTVIFSCDKLSNLLLVPKNDPSNSTISPLSSPNLDQAKASSNQWDESAELRS
VLQYVVQSMVKWAVRPCPIKQSVSLVDLERAQVMIYKGALSRLQEDKEHKARVLKAPPSL
ANQTEPLSHPRTTTWRPLCRQTCRQMALIISTPSFPSVSSITQPADHSVSVIPASKGQRH
MVLTRFPTLTGLVNAPPVLHPCFSLASSASCDAFTEQQTSFMEPGPCSTQARSSLGKNKR
RKKAVGPQYKIIIRLIEMGFSMRHIYWAMEAADVAGDLDSQTIEVLASWMLEHPLTEEHH
AAESARQEGAPDTPSPDGPETVQCPERPTTQINESLLPGLDWIERENFLDVHLTRNRPPP
ARRRRSGPSHRISFRRAANTIFSDLPSPIPLPELYQHHTSDSDWPEQFHPYAAEESELGY
MDDPYHEETYEDLLTPSFFSLERDTLQIVEVTQTQALTSEEHSQMVKCELCSTLTLQFNN
HVKRRHPGCGQSAARKGYDSTGAYVDVWFKGECGSNFPFYLLCSSCREKYLAANQSGASS
KYERIKGLTSDLIGQLDGTSDDDWEISHRDECDADKLTGLEDFGLLLRPLGLTEKKLVPD
PIAFSEPDPLGALVYSSADPTRAAASKVLLSGCIQKCEFSKKTSLSLGHQAVSLRDSHDR
LKALRRVTSTAQILLAYSMVMRALSQTAFSASAYSHSSGLESLGLADIRILVRLMTLAAG
GRAHTCVDRQSGVKGLSTERNNSTCLSFLTSAIGSVVSRSPAAYRQLVEICTQELLAAAT
GVNIGTISDPKQRSRLNSSQTAGHQGAQQRKDYIEQTPPTFRVTQNLVSLLTEKGVNCHG
PAHPSARVGPLELANALAACVLSARLTSKHRQWAAKQLLQALAGTGRDGPNRPQTYSDLA
GDLRKSPLKRLEGHYNKVTSCSWNSDQNLLATCSQDKVVQLWNISQNNVELHTTFSCITS
KMDRSSSERATRCHPMSPVYWSTGGNFLAAPDNKHITIMNIRGSHCHVEAQVSRVTALCW
AHTFSLSVLDQATSCQNRPAESLLVARLDGSLCWLQVTMQQIDLHVTSTELTRCHRTEAA
PQCVAWHSEDKPFAVGYPNGMILFATTEAYENEQPVVLSVFQDSVVSLKWDPTGHLLLCL
GRSEVVKILGRSEGNWVTLHSLFHSSIVNIAEWCPLAGRVPDPRLMVAAGCQNGSVHVWT
LPQGGTFVSLPNILNSSQRQDKSTDVKEKAKCVFVLHGHITAVKWLSFCSSGLALVSGGI
GGLLNIWSLQDGSVLQTATGLGSVVSTTWIPNLGVAACFGRSKDVLLICCTPDWISQNHV
LASCRMVLRSQNILGLNQAPCLAVFLERLPLLLQEQYSYERTHVAAGDQLVHSAFLQSLA
SLSVGLSLEKHLCRYPRPPHHTSPDAHSCPPEWSWLATYATTVRSAEAIASGTAFPESFN
LSEFQEVDVPEISKALDNSKWSFRMDEQLMSWATTRPEDWQLGGKSEVYLWGNGRYGQLA
GMGTNLMMPTLAPSLLQTQQVVCGQNCSFLVQSNGTVLAVGEGQYGRLGQGNSDDLYVPT
IISAFQGYVVTQLVTSCGSDGHSMALTETGEVFSWGDGDFGKLGHGNSERQRRPKQIEAL
QGEEVIQLSCGFRHSAVVTADGKLFTFGSGESGRLGQRSTSNKLLPERVAALEGYHVGQV
SCGLNHTLVLSLDGMVVWAFGDGDYGKLGTGSSTAKYYPQKVEQLCNRGIKKVSCGTQFS
VALACYGHVYTFGQERLIGLPDSMMRNKSRPQLVPSLEGLFIEDIAVGCEHVLALSSTGD
VYTWGCNSEGQLGLGHSNPVKEPTLVTTLQGKNIRQISAGRCHSSAWTTPSTSIKNSGGS
GSFQLGLPQLVPPQYNTLKDCSPDVLSMRLRVLYHFSDLMYKSWRLLNLDAKNPVFTSRY
SSGTTAIIRGDLRGLLSPKVNTLPLVRSIGRTMTQGKTYGPQITVKRISTRGRSSKPIFV
QIAKQVVGLNPLELRLPSRAWKVKLVGEGADDAGGVFDDTITEMCQELQSGVVDLLIHSP
NSFADVGCNTDRFLFNPAALLEDHMVQFRFLGILMAVAIRTKKPLDLHLAPWVWKQLCSM
PLGEPDLEEVDLLTYRTLQGILHLDNSGITEDNFHVMIPLDSFMAHSANGRLVPVVPGGQ
NISLTFGNRTEYVERALDYRLHEMDSQVAAVREGMSTIVPVPLLSLLTAQQLEQLVCGLP
EVSVEMLKKLVRYRDIAESHELIGWFWQSLEEFTNEERVLFLRFVSGRSRLPSNPADITQ
KFQIIKVDRPINGLPTAQTCFFLFRLPPYTSQAILAERLRYSIHNCPSIDMDNYMLTHNT
DPADSSDTED
Identical sequences G3PVP1
ENSGACP00000021680 69293.ENSGACP00000021680 ENSGACP00000021680
