SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000017756 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000017756
Domain Number 1 Region: 1635-1771
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 2.8e-26
Family Gelsolin-like 0.0013
Further Details:      
 
Domain Number 2 Region: 2032-2095
Classification Level Classification E-value
Superfamily VHP, Villin headpiece domain 1.96e-20
Family VHP, Villin headpiece domain 0.00055
Further Details:      
 
Domain Number 3 Region: 1756-1871
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 4.21e-18
Family Gelsolin-like 0.0043
Further Details:      
 
Domain Number 4 Region: 1319-1431
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 8.72e-18
Family Gelsolin-like 0.0016
Further Details:      
 
Domain Number 5 Region: 1853-2003
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 0.00000000000000257
Family Gelsolin-like 0.0049
Further Details:      
 
Domain Number 6 Region: 1467-1509,1537-1579
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 0.000000000000257
Family Gelsolin-like 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000017756   Gene: ENSGGOG00000026771   Transcript: ENSGGOT00000031414
Sequence length 2095
Comment pep:known_by_projection chromosome:gorGor3.1:10:32704883:32803035:-1 gene:ENSGGOG00000026771 transcript:ENSGGOT00000031414 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MKRKERIARRLEGIENDTQPILLQSCTGLVTHRLLEEDTPRYMRASDPASPHIGRSNEEE
ETSDSSLEKQTRSKYCTETSGVHGDSPYGSGTMDTHSLESKAERIARYKAERRRQLAEKY
GLTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYPGTETMGLRTCAG
ESKDYALHGGDGASDPEVLLNIENQRRGQELSATRQAHDLSPAAESSSTFSFSGRDSSFT
EVPRSPKHAHSSSLQQAASRSPSFGDPQLPPEARPSTGKPKHEWFLQKDSEGDTPSLINW
PSRVKVREKLVKEESARNSPELASESVTQRRHQPAPVHYVSFQSEHSAFDRVPSKAAGST
RQPIRGYVQPADTGHTAKLVMPETPENASECSWVASATQNVPKPPTLTVLQGDGRDSPVL
HICESKAEEEEGEGEGEEKEEGVRFTEALEQSKKTLLALEGDGLVRSPEDPSRNEDFGKP
AVSTVTLEHQKELENVAQPPQAQHQPTERTGRSEMVLYIQSEPVSQDAKPTGHNKEASKK
RKVRTRSLSDFTGPPQLQALKYKDPASRRELELPSSKTEGPYGEISMLDTKVSVAQLRSA
FLASANACRRPELKSRVERSAEGPGLPTGVERERGSRKPRRYFSPGESRKTSERFRTQPI
TSAERKESDRCTSHSETPTVDDEEKVDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRN
AAVEQRLRRLQDRSLTQPITTEEVVIAATEPIPASCSGGTHPVMARLPSPTVARSAVQPA
RLQASAHQKALAKDQTNEGKELAEQGEPDSSTLSLAEKLALFNKLSQPVSKAISTRNRID
TRQRRMNARYQTQPVTLGEVEQVQSGKLIPFSPAVNTSVSTVASTVAPMYAGDLRTKPPL
DHNASATDYKFSSSVENSDSPVRSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRD
SGMEKYGSFEEAEASYPILNRAREGDSHKESKYAVPRKGSLERANPPITHLGDEPKEFSM
AKMNAQGNLDLRDRLPFEEKVEVENVMKRKFSLRAAEFGEPTSEQTGTAAGKTIAQTAAP
VSWKPQDSSEQPQEKLCKNPCAMFAAGEIKTPTGEGLLDSPSKTMSIKERLALLKKSGEE
DWRNRLSRRQEGGKAPASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKT
RGRGAANDSTQFTVAGRMVKKAGRMHETVLTVTGKSVKEVMKPDDDETFAKFYRNVDYNM
LRSPVELDEDFDVIFDPYAPSSNSSFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSAV
PYKRLMLLQIKGRRHVQTRLVEPQASVLNSGDCFLLLSPHCCFLWVGEFANVIEKAKASE
LATLIQTKRELGCRATYIQTIEEGINTHTHAAKDFWKLLGGQTSYQSAGDPKEDELYEAA
IIETNCIYRLMDDKLVPDDDYWGKIPKCSLLQPKEVLVFNFGSEVYVWHGKEVTLAQRKI
AFQLAKHLWNGTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTEHNETILFKEK
FLDWTELKRPNEKNPGELAQHKEDPRADVKAYDVTRMVSMPQTTAGTILDGVNVGRGYGL
VEGHDRRQFEITSVSVDVWHILEFDYSRLPKQSIGQFHEGDAYVVKWKFMVSTAVGSRQK
GEHSVRAAGKEKCVYFFWQGRHSTVSEKGTSALMTVELDEERGAQVQVLQGKEPPCFLQC
FQGGMVVHSGRREEEEENVQSEWRLYCVRGEVPVEGNLLEVACHCSSLRSRTSMVVLNVN
KALIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAGLHSSSKVTIHECDEGSEPLGFWDA
LGRRDRKAYDCMLQDPGSFNFAPRLFILSSSSGDFAATEFVYPARAPSVVSSMPFLQEDL
YSAPQPALFLDNHHEVYLWQGWWPIENKITGSARICRASDRKSTMETVLQYCKGKNLKKP
APKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLVEDVLAKLCKTIYPL
ADLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF
Download sequence
Identical sequences ENSGGOP00000017756

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]