SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for XP_001136112.1.37143 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence: XP_001136112.1.37143

Domain Number 1 Region: 1753-1889
Classification Level | Classification | E-value
Superfamily | Actin depolymerizing proteins | 3.04e-26
Family | Gelsolin-like | 0.0013

Domain Number 2 Region: 2151-2214
Classification Level | Classification | E-value
Superfamily | VHP, Villin headpiece domain | 1.96e-20
Family | VHP, Villin headpiece domain | 0.00055

Domain Number 3 Region: 1874-1989
Classification Level | Classification | E-value
Superfamily | Actin depolymerizing proteins | 4.44e-18
Family | Gelsolin-like | 0.0043

Domain Number 4 Region: 1437-1549
Classification Level | Classification | E-value
Superfamily | Actin depolymerizing proteins | 1.31e-17
Family | Gelsolin-like | 0.0014

Domain Number 5 Region: 1971-2122
Classification Level | Classification | E-value
Superfamily | Actin depolymerizing proteins | 1.43e-17
Family | Gelsolin-like | 0.0045

Domain Number 6 Region: 1585-1627,1655-1697
Classification Level | Classification | E-value
Superfamily | Actin depolymerizing proteins | 0.0000000000000467
Family | Gelsolin-like | 0.011
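
The domain assignments are listed by superfamily E-value rather than by position, so the order along the protein is not obvious at a glance. The following Python sketch (not part of the SUPERFAMILY server; the names `Domain`, `parse_region`, and `overlap` are hypothetical) re-sorts the six assignments by start residue and counts the residues shared by neighbouring regions. It assumes the regions are 1-based and inclusive, which fits Domain 2 ending at residue 2214, the stated sequence length.

```python
from dataclasses import dataclass


def parse_region(text):
    """Parse a region string such as '1585-1627,1655-1697' into (start, end) pairs."""
    return tuple(
        tuple(int(x) for x in segment.split("-"))
        for segment in text.split(",")
    )


@dataclass(frozen=True)
class Domain:
    number: int
    region: str        # region string exactly as listed in the assignment
    superfamily: str
    evalue: float      # superfamily-level E-value

    @property
    def segments(self):
        return parse_region(self.region)

    @property
    def start(self):
        return self.segments[0][0]


def overlap(a, b):
    """Count residues shared by two domains (assumes 1-based inclusive coordinates)."""
    shared = 0
    for s1, e1 in a.segments:
        for s2, e2 in b.segments:
            shared += max(0, min(e1, e2) - max(s1, s2) + 1)
    return shared


# Values copied from the assignment details above.
DOMAINS = [
    Domain(1, "1753-1889", "Actin depolymerizing proteins", 3.04e-26),
    Domain(2, "2151-2214", "VHP, Villin headpiece domain", 1.96e-20),
    Domain(3, "1874-1989", "Actin depolymerizing proteins", 4.44e-18),
    Domain(4, "1437-1549", "Actin depolymerizing proteins", 1.31e-17),
    Domain(5, "1971-2122", "Actin depolymerizing proteins", 1.43e-17),
    Domain(6, "1585-1627,1655-1697", "Actin depolymerizing proteins", 0.0000000000000467),
]

# Order the assignments along the sequence and compare each with its neighbour.
ordered = sorted(DOMAINS, key=lambda d: d.start)
order = [d.number for d in ordered]
adjacent_overlaps = {
    (a.number, b.number): overlap(a, b)
    for a, b in zip(ordered, ordered[1:])
}

for (left, right), shared in adjacent_overlaps.items():
    print(f"Domain {left} / Domain {right}: {shared} shared residues")
```

Sorted this way, the assignments read as five Gelsolin-like domains followed by the Villin headpiece domain. Under the inclusive-coordinate assumption, Domains 1 and 3 share 16 residues and Domains 3 and 5 share 19; the other neighbouring pairs do not overlap.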
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) XP_001136112.1.37143
Sequence length 2214
Comment PREDICTED: supervillin isoform X9 [Pan troglodytes]; AA=GCF_000001515.7; RF=representative genome; TAX=9598; STAX=9598; NAME=Pan troglodytes; AL=Chromosome; RT=Major
Sequence
MKRKERIARRLEGIENDTQPILLQSCTGLVTHRLLEEDTPRYMRASDPASPHIGRSNEEE
ETSDSSLEKQTRSKYCTETSGVHGDSPYGSGTMDTHSLESKAERIARYKAERRRQLAEKY
GLTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYPGTETMGLRTCAG
ESKDYALHGGDGASDPEVLLNIENQRRGQELSATRQAHDLSPAAESSSTFSFSGRDSSFT
EVPRSPKHAHSSSLQQAASRSPSFGDPQLPPEARPSTGKPKHEWFLQKDSEGDTPSLINW
PSRVKVREKLVKEESARNSPELASESVTQRRHQPAPVHYVSFQSEHSAFDRVPSKAAGST
RQPIRGYVQPADTGHTAKLVTPETPENASECSWVASATQNVPKPPSLTVLEGDGRDSPVL
HICESKAEEEEGEGEGEEKEEDVHFTEALEQSKKTLLALEGDGLVRSPEDPSRNEDFGKP
DVSTVTLEHQKELENVAQPPQAPHQPTERTGRSEMVLYIQSEPVSQDAKPTGHNREASKK
RKVRTRSLSDFTGPPQLQALKYKDPASRRELELPSSKTEGPYGEISMLDTKVSVAQLRSA
FLASANACRRPELKSRVERSAEGPGLPTGVERERGSRKPRRYFSPGESRKTSERFRTQPI
TSAERKESDRCTSHSETPTVDDEEKVDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRN
AAVEQRLRRLQDRSLTQPITTEEVVIAATEPIPASCSGGTHPVMARLPSPTVARSAVQPA
RLQASAHQKALAKDQTNEGKELAEQGEPDSSTLSLAEKLALFNKLSQPVSKAISTRNRID
TRQRRMNARYQTQPVTLGEVEQVQSGKLIPFSPAVNTSVSTVASTVAPMYAGDLRTKPPL
DHNASATDYKFSSSIENSDSPVRSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRD
SGMEKYGSFEEAEASYPILNRAREGDSHKESKYAVPRRGSLERANPPITHLGDEPKEFSM
AKMNAQGNLDLRDRLPFEEKVEVENVMKRKFSLRAAEFGEPTSEQTGTAAGKTIAQTTAP
VSWKPQDSSEQPQEKLCKNPCAMFAAGEIKTPTGEGLLDSPSKTMSIKERLALLKKSGEE
DWRNRLSRRQEGGKAPASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKS
RGRGAANDSTQFTVAGRMVKKGLASPTAITPVASPICSKTRGTTPVSKPLEDIEARPDMQ
LESDLKLDRLETFLRRLNNKVGGMHETVLTVTGKSVKEVMKPDDDETFAKFYRNVDYNMP
RSPVELDEDFDVIFDPYAPKLTSSVAEHKRAVRPKRRVQASKNPLKMLAAREDLLQEYTE
QRLNVAFMESKRMKVEKMSSNSNFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSAVPY
KRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCFLWVGEFANVIEKAKASELA
TLIQTKRELGCRATYIQTIEEGINTHTHAAKDFWKLLGGQTSYQSAGDPKEDELYEAAII
ETNCIYRLLDDKLVPDDDYWGKIPKCSLLQPKEVLVFDFGSEVYVWHGKEVTLAQRKIAF
QLAKHLWNGTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTEHNETILFKEKFL
DWTELKRPNEKNPGELAQHKEDPRADVKAYDVTRMVSMPQTTAGTILDGVNVGRGYGLVE
GHDRRQFEITSVSVDVWHILEFDYSRLPKQSIGQFHEGDAYVVKWKFMVSTAVGSRQKGE
HSVRAAGKEKCVYFFWQGRHSTVSEKGTSALMTVELDEERGAQVQVLQGKEPPCFLQCFQ
GGMVVHSGRREEEEENVQSEWRLYCVRGEVPVEGNLLEVACHCSSLRSRTSMVVLNVNKA
LIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAGLHSSSKVTIHECDEGSEPLGFWDALG
RRDRKAYDCMLQDPGSFNFAPRLFILSSSSGDFAATEFVYPARAPSVVSSMPFLQEDLYS
APQPALFLVDNHHEVYLWQGWWPIENKITGSARIRWASDRKSAMETVLQYCKGKNLKKPA
PKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLVEDVLAKLCKTIYPLA
DLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF
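
The Comment field and the wrapped sequence above can be cross-checked with a few lines of Python. This is a minimal sketch, not SUPERFAMILY code: `parse_comment` is a hypothetical helper that assumes the comment is a free-text description followed by `;`-separated `KEY=value` tags, and the length check assumes each of the 36 sequence lines before the last holds 60 residues.

```python
def parse_comment(comment):
    """Split a comment into its description and a dict of KEY=value tags."""
    description, *fields = [part.strip() for part in comment.split(";")]
    tags = {}
    for field in fields:
        key, _, value = field.partition("=")
        tags[key] = value
    return description, tags


# Comment text copied from the record above.
COMMENT = (
    "PREDICTED: supervillin isoform X9 [Pan troglodytes]; "
    "AA=GCF_000001515.7; RF=representative genome; TAX=9598; "
    "STAX=9598; NAME=Pan troglodytes; AL=Chromosome; RT=Major"
)

description, tags = parse_comment(COMMENT)

# Final line of the wrapped sequence, copied from the record above.
LAST_LINE = "DLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF"

# Assumption: the 36 preceding lines are full 60-residue lines.
total_length = 36 * 60 + len(LAST_LINE)

print(description)
print(tags["NAME"], tags["TAX"])
print(total_length)
```

Under those assumptions the computed total agrees with the stated sequence length of 2214.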
Identical sequences: H2Q1S4, 9598.ENSPTRP00000004060, XP_001136112.1.37143, ENSPTRP00000004060
