SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000004060 from Pan troglodytes 69_2.1.4

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSPTRP00000004060
Domain Number 1 Region: 1753-1889
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 3.04e-26
Family Gelsolin-like 0.0013
Further Details:      
 
Domain Number 2 Region: 2151-2214
Classification Level Classification E-value
Superfamily VHP, Villin headpiece domain 1.96e-20
Family VHP, Villin headpiece domain 0.00055
Further Details:      
 
Domain Number 3 Region: 1874-1989
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 4.44e-18
Family Gelsolin-like 0.0043
Further Details:      
 
Domain Number 4 Region: 1437-1549
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 1.31e-17
Family Gelsolin-like 0.0014
Further Details:      
 
Domain Number 5 Region: 1971-2122
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 1.43e-17
Family Gelsolin-like 0.0045
Further Details:      
 
Domain Number 6 Region: 1585-1627,1655-1697
Classification Level Classification E-value
Superfamily Actin depolymerizing proteins 0.0000000000000467
Family Gelsolin-like 0.011
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000004060   Gene: ENSPTRG00000002392   Transcript: ENSPTRT00000004399
Sequence length 2214
Comment pep:known chromosome:CHIMP2.1.4:10:29898307:30179348:-1 gene:ENSPTRG00000002392 transcript:ENSPTRT00000004399 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MKRKERIARRLEGIENDTQPILLQSCTGLVTHRLLEEDTPRYMRASDPASPHIGRSNEEE
ETSDSSLEKQTRSKYCTETSGVHGDSPYGSGTMDTHSLESKAERIARYKAERRRQLAEKY
GLTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYPGTETMGLRTCAG
ESKDYALHGGDGASDPEVLLNIENQRRGQELSATRQAHDLSPAAESSSTFSFSGRDSSFT
EVPRSPKHAHSSSLQQAASRSPSFGDPQLPPEARPSTGKPKHEWFLQKDSEGDTPSLINW
PSRVKVREKLVKEESARNSPELASESVTQRRHQPAPVHYVSFQSEHSAFDRVPSKAAGST
RQPIRGYVQPADTGHTAKLVTPETPENASECSWVASATQNVPKPPSLTVLEGDGRDSPVL
HICESKAEEEEGEGEGEEKEEDVHFTEALEQSKKTLLALEGDGLVRSPEDPSRNEDFGKP
DVSTVTLEHQKELENVAQPPQAPHQPTERTGRSEMVLYIQSEPVSQDAKPTGHNREASKK
RKVRTRSLSDFTGPPQLQALKYKDPASRRELELPSSKTEGPYGEISMLDTKVSVAQLRSA
FLASANACRRPELKSRVERSAEGPGLPTGVERERGSRKPRRYFSPGESRKTSERFRTQPI
TSAERKESDRCTSHSETPTVDDEEKVDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRN
AAVEQRLRRLQDRSLTQPITTEEVVIAATEPIPASCSGGTHPVMARLPSPTVARSAVQPA
RLQASAHQKALAKDQTNEGKELAEQGEPDSSTLSLAEKLALFNKLSQPVSKAISTRNRID
TRQRRMNARYQTQPVTLGEVEQVQSGKLIPFSPAVNTSVSTVASTVAPMYAGDLRTKPPL
DHNASATDYKFSSSIENSDSPVRSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRD
SGMEKYGSFEEAEASYPILNRAREGDSHKESKYAVPRRGSLERANPPITHLGDEPKEFSM
AKMNAQGNLDLRDRLPFEEKVEVENVMKRKFSLRAAEFGEPTSEQTGTAAGKTIAQTTAP
VSWKPQDSSEQPQEKLCKNPCAMFAAGEIKTPTGEGLLDSPSKTMSIKERLALLKKSGEE
DWRNRLSRRQEGGKAPASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKS
RGRGAANDSTQFTVAGRMVKKGLASPTAITPVASPICSKTRGTTPVSKPLEDIEARPDMQ
LESDLKLDRLETFLRRLNNKVGGMHETVLTVTGKSVKEVMKPDDDETFAKFYRNVDYNMP
RSPVELDEDFDVIFDPYAPKLTSSVAEHKRAVRPKRRVQASKNPLKMLAAREDLLQEYTE
QRLNVAFMESKRMKVEKMSSNSNFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSAVPY
KRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCFLWVGEFANVIEKAKASELA
TLIQTKRELGCRATYIQTIEEGINTHTHAAKDFWKLLGGQTSYQSAGDPKEDELYEAAII
ETNCIYRLLDDKLVPDDDYWGKIPKCSLLQPKEVLVFDFGSEVYVWHGKEVTLAQRKIAF
QLAKHLWNGTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTEHNETILFKEKFL
DWTELKRPNEKNPGELAQHKEDPRADVKAYDVTRMVSMPQTTAGTILDGVNVGRGYGLVE
GHDRRQFEITSVSVDVWHILEFDYSRLPKQSIGQFHEGDAYVVKWKFMVSTAVGSRQKGE
HSVRAAGKEKCVYFFWQGRHSTVSEKGTSALMTVELDEERGAQVQVLQGKEPPCFLQCFQ
GGMVVHSGRREEEEENVQSEWRLYCVRGEVPVEGNLLEVACHCSSLRSRTSMVVLNVNKA
LIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAGLHSSSKVTIHECDEGSEPLGFWDALG
RRDRKAYDCMLQDPGSFNFAPRLFILSSSSGDFAATEFVYPARAPSVVSSMPFLQEDLYS
APQPALFLVDNHHEVYLWQGWWPIENKITGSARIRWASDRKSAMETVLQYCKGKNLKKPA
PKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLVEDVLAKLCKTIYPLA
DLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF
Download sequence
Identical sequences H2Q1S4
ENSPTRP00000004060 ENSPTRP00000004060 9598.ENSPTRP00000004060 XP_001136112.1.37143

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]