SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000020436 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000020436
Domain Number 1 Region: 1724-1861
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 1.29e-25
Family Gelsolin-like 0.0015
Further Details:      
 
Domain Number 2 Region: 2122-2185
Classification Level Classification E-value
Superfamily VHP, Villin headpiece domain 1.96e-20
Family VHP, Villin headpiece domain 0.00055
Further Details:      
 
Domain Number 3 Region: 1846-1961
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 4.44e-18
Family Gelsolin-like 0.0043
Further Details:      
 
Domain Number 4 Region: 1408-1520
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 1.11e-17
Family Gelsolin-like 0.0016
Further Details:      
 
Domain Number 5 Region: 1943-2093
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 0.0000000000000028
Family Gelsolin-like 0.0049
Further Details:      
 
Domain Number 6 Region: 1556-1598,1626-1668
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 0.000000000000257
Family Gelsolin-like 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000020436   Gene: ENSGGOG00000026771   Transcript: ENSGGOT00000022314
Sequence length 2185
Comment pep:known_by_projection chromosome:gorGor3.1:10:32704883:32999446:-1 gene:ENSGGOG00000026771 transcript:ENSGGOT00000022314 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MKRKERIARRLEGIENDTQPILLQSCTGLVTHRLLEEDTPRYMRASDPASPHIGRSNEEE
ETSDSSLEKQTRSKYCTETSGVHGDSPYGSGTMDTHSLESKAERIARYKAERRRQLAEKY
GLTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYPGTETMGLRTCAG
ESKDYALHGGDGASDPEVLLNIENQRRGQEAHDLSPAAESSSTFSFSGRDSSFTEVPRSP
KHAHSSSLQQAASRSPSFGDPQLPPEARPSTGKPKHEWFLQKDSEGDTPSLINWPSRVKV
REKLVKEESARNSPELASESVTQRRHQPAPVHYVSFQSEHSAFDRVPSKAAGSTRQPIRG
YVQPADTGHTAKLVMPETPENASECSWVASATQNVPKPPTLTVLQGDGRDSPVLHICESG
VRFTEALEQSKKTLLALEGDGLVRSPEDPSRNEDFGKPAVSTVTLEHQKELENVAQPPQA
QHQPTERTGRSEMVLYIQSEPVSQDAKPTGHNKEASKKRKVRTRSLSDFTGPPQLQALKY
KDPASRRELELPSSKTEGPYGEISMLDTKVSVAQLRSAFLASANACRRPELKSRVERSAE
GPGLPTGVERERGSRKPRRYFSPGESRKTSERFRTQPITSAERKESDRCTSHSETPTVDD
EEKVDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRNAAVEQRLRRLQDRSLTQPITTE
EVVIAATEPIPASCSGGTHPVMARLPSPTVARSAVQPARLQASAHQKALAKDQTNEGKEL
AEQGEPDSSTLSLAEKLALFNKLSQPVSKAISTRNRIDTRQRRMNARYQTQPVTLGEVEQ
VQSGKLIPFSPAVNTSVSTVASTVAPMYAGDLRTKPPLDHNASATDYKFSSSVENSDSPV
RSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRDSGMEKYGSFEEAEASYPILNRA
REGDSHKESKYAVPRKGSLERANPPITHLGDEPKEFSMAKMNAQGNLDLRDRLPFEEKVE
VENVMKRKFSLRAAEFGEPTSEQTGTAAGKTIAQTAAPVSWKPQDSSEQPQEKLCKNPCA
MFAAGEIKTPTGEGLLDSPSKTMSIKERLALLKKSGEEDWRNRLSRRQEGGKAPASSLHT
QEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKTRGRGAANDSTQFTVAGRMVNEA
EAGSSTLNRSKCRFRSVQWAPVCSKVSLPLSRFTADLFGTLTSFACLLIAGRMHETVLTV
TGKSVKEVMKPDDDETFAKFYRNVDYNMLRSPVELDEDFDVIFDPYAPKMTKGFSSQGNM
TKPTRKSDYSKQKLKQLKSLEFLILAYTFLKTSVYFFFFKHIKAFACPVSSNSSFSEVTL
AGLASKENFSNVSLRSVNLTEQNSNNSAVPYKRLMLLQIKGRRHVQTRLVEPQASVLNSG
DCFLLLSPHCCFLWVGEFANVIEKAKASELATLIQTKRELGCRATYIQTIEEGINTHTHA
AKDFWKLLGGQTSYQSAGDPKEDELYEAAIIETNCIYRLMDDKLVPDDDYWGKIPKCSLL
QPKEVLVFNFGSEVYVWHGKEVTLAQRKIAFQLAKHLWNGTFDYENCDINPLDPGECNPL
IPRKGQGRPDWAIFGRLTEHNETILFKEKFLDWTELKRPNEKNPGELAQHKEDPRADVKA
YDVTRMVSMPQTTAGTILDGVNVGRGYGLVEGHDRRQFEITSVSVDVWHILEFDYSRLPK
QSIGQFHEGDAYVVKWKFMVSTAVGSRQKGEHSVRAAGKEKCVYFFWQGRHSTVSEKGTS
ALMTVELDEERGAQKVQVLQGKEPPCFLQCFQGGMVVHSGRREEEEENVQSEWRLYCVRG
EVPVEGNLLEVACHCSSLRSRTSMVVLNVNKALIYLWHGCKAQAHTKEVGRTAANKIKEQ
CPLEAGLHSSSKVTIHECDEGSEPLGFWDALGRRDRKAYDCMLQDPGSFNFAPRLFILSS
SSGDFAATEFVYPARAPSVVSSMPFLQEDLYSAPQPALFLDNHHEVYLWQGWWPIENKIT
GSARICRASDRKSTMETVLQYCKGKNLKKPAPKSYLIHAGLEPLTFTNMFPSWEHREDIA
EITEMDTEVSNQITLVEDVLAKLCKTIYPLADLLARPLPEGVDPLKLEIYLTDEDFEFAL
DMTRDEYNALPAWKQVNLKKAKGLF
Download sequence
Identical sequences ENSGGOP00000020436 ENSGGOP00000017756

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]