SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for H2Q1S4 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence: H2Q1S4

Domain  Region               Level        Classification                 E-value
1       1753-1889            Superfamily  Actin depolymerizing proteins  3.04e-26
                             Family       Gelsolin-like                  0.0013
2       2151-2214            Superfamily  VHP, Villin headpiece domain   1.96e-20
                             Family       VHP, Villin headpiece domain   0.00055
3       1874-1989            Superfamily  Actin depolymerizing proteins  4.44e-18
                             Family       Gelsolin-like                  0.0043
4       1437-1549            Superfamily  Actin depolymerizing proteins  1.31e-17
                             Family       Gelsolin-like                  0.0014
5       1971-2122            Superfamily  Actin depolymerizing proteins  1.43e-17
                             Family       Gelsolin-like                  0.0045
6       1585-1627,1655-1697  Superfamily  Actin depolymerizing proteins  4.67e-14
                             Family       Gelsolin-like                  0.011
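The strong hits are listed by E-value, not by position, and domain 6 has a region split into two segments. A minimal Python sketch of reading the region strings and ordering the domains along the sequence is given here. The function names are hypothetical and are not part of any SUPERFAMILY tool, and the length calculation assumes the coordinates are 1-based and inclusive.

```python
from typing import Dict, List, Tuple

Segment = Tuple[int, int]


def parse_region(region: str) -> List[Segment]:
    """Parse a region string such as '1585-1627,1655-1697' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_length(segments: List[Segment]) -> int:
    """Number of residues covered, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# Region strings copied from the strong hits for H2Q1S4.
regions: Dict[int, str] = {
    1: "1753-1889",
    2: "2151-2214",
    3: "1874-1989",
    4: "1437-1549",
    5: "1971-2122",
    6: "1585-1627,1655-1697",
}

parsed = {number: parse_region(text) for number, text in regions.items()}

# Order the domain numbers by the start of their first segment.
order = sorted(parsed, key=lambda number: parsed[number][0][0])

for number in order:
    print(number, parsed[number], covered_length(parsed[number]))
```

Sorting by start position places the five gelsolin-like assignments before the villin headpiece assignment, which ends at the last residue of the sequence.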
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) H2Q1S4
Sequence length 2214
Comment (tr|H2Q1S4|H2Q1S4_PANTR) Supervillin {ECO:0000313|Ensembl:ENSPTRP00000004060} KW=Complete proteome; Reference proteome OX=9598 OS=Pan troglodytes (Chimpanzee). GN=CK820_G0018843 OC=Catarrhini; Hominidae; Pan.
Sequence
MKRKERIARRLEGIENDTQPILLQSCTGLVTHRLLEEDTPRYMRASDPASPHIGRSNEEE
ETSDSSLEKQTRSKYCTETSGVHGDSPYGSGTMDTHSLESKAERIARYKAERRRQLAEKY
GLTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYPGTETMGLRTCAG
ESKDYALHGGDGASDPEVLLNIENQRRGQELSATRQAHDLSPAAESSSTFSFSGRDSSFT
EVPRSPKHAHSSSLQQAASRSPSFGDPQLPPEARPSTGKPKHEWFLQKDSEGDTPSLINW
PSRVKVREKLVKEESARNSPELASESVTQRRHQPAPVHYVSFQSEHSAFDRVPSKAAGST
RQPIRGYVQPADTGHTAKLVTPETPENASECSWVASATQNVPKPPSLTVLEGDGRDSPVL
HICESKAEEEEGEGEGEEKEEDVHFTEALEQSKKTLLALEGDGLVRSPEDPSRNEDFGKP
DVSTVTLEHQKELENVAQPPQAPHQPTERTGRSEMVLYIQSEPVSQDAKPTGHNREASKK
RKVRTRSLSDFTGPPQLQALKYKDPASRRELELPSSKTEGPYGEISMLDTKVSVAQLRSA
FLASANACRRPELKSRVERSAEGPGLPTGVERERGSRKPRRYFSPGESRKTSERFRTQPI
TSAERKESDRCTSHSETPTVDDEEKVDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRN
AAVEQRLRRLQDRSLTQPITTEEVVIAATEPIPASCSGGTHPVMARLPSPTVARSAVQPA
RLQASAHQKALAKDQTNEGKELAEQGEPDSSTLSLAEKLALFNKLSQPVSKAISTRNRID
TRQRRMNARYQTQPVTLGEVEQVQSGKLIPFSPAVNTSVSTVASTVAPMYAGDLRTKPPL
DHNASATDYKFSSSIENSDSPVRSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRD
SGMEKYGSFEEAEASYPILNRAREGDSHKESKYAVPRRGSLERANPPITHLGDEPKEFSM
AKMNAQGNLDLRDRLPFEEKVEVENVMKRKFSLRAAEFGEPTSEQTGTAAGKTIAQTTAP
VSWKPQDSSEQPQEKLCKNPCAMFAAGEIKTPTGEGLLDSPSKTMSIKERLALLKKSGEE
DWRNRLSRRQEGGKAPASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKS
RGRGAANDSTQFTVAGRMVKKGLASPTAITPVASPICSKTRGTTPVSKPLEDIEARPDMQ
LESDLKLDRLETFLRRLNNKVGGMHETVLTVTGKSVKEVMKPDDDETFAKFYRNVDYNMP
RSPVELDEDFDVIFDPYAPKLTSSVAEHKRAVRPKRRVQASKNPLKMLAAREDLLQEYTE
QRLNVAFMESKRMKVEKMSSNSNFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSAVPY
KRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCFLWVGEFANVIEKAKASELA
TLIQTKRELGCRATYIQTIEEGINTHTHAAKDFWKLLGGQTSYQSAGDPKEDELYEAAII
ETNCIYRLLDDKLVPDDDYWGKIPKCSLLQPKEVLVFDFGSEVYVWHGKEVTLAQRKIAF
QLAKHLWNGTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTEHNETILFKEKFL
DWTELKRPNEKNPGELAQHKEDPRADVKAYDVTRMVSMPQTTAGTILDGVNVGRGYGLVE
GHDRRQFEITSVSVDVWHILEFDYSRLPKQSIGQFHEGDAYVVKWKFMVSTAVGSRQKGE
HSVRAAGKEKCVYFFWQGRHSTVSEKGTSALMTVELDEERGAQVQVLQGKEPPCFLQCFQ
GGMVVHSGRREEEEENVQSEWRLYCVRGEVPVEGNLLEVACHCSSLRSRTSMVVLNVNKA
LIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAGLHSSSKVTIHECDEGSEPLGFWDALG
RRDRKAYDCMLQDPGSFNFAPRLFILSSSSGDFAATEFVYPARAPSVVSSMPFLQEDLYS
APQPALFLVDNHHEVYLWQGWWPIENKITGSARIRWASDRKSAMETVLQYCKGKNLKKPA
PKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLVEDVLAKLCKTIYPLA
DLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF
Identical sequences H2Q1S4
9598.ENSPTRP00000004060 ENSPTRP00000004060 XP_001136112.1.37143
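To pull the residues of an assigned domain out of the sequence, the region coordinates have to be converted to string indices. The sketch below does this for domain 2 (region 2151-2214) using only the last two lines of the sequence shown above. The helper name `extract` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the region ending at the stated sequence length of 2214.

```python
def extract(fragment: str, first_pos: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position first_pos of the full sequence."""
    last_pos = first_pos + len(fragment) - 1
    if start < first_pos or end > last_pos or start > end:
        raise ValueError("region lies outside the supplied fragment")
    return fragment[start - first_pos : end - first_pos + 1]


# Last two lines of the H2Q1S4 sequence as listed on this page.
tail = (
    "PKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLVEDVLAKLCKTIYPLA"
    "DLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF"
)

SEQUENCE_LENGTH = 2214  # "Sequence length" field of this entry

# Position of the first residue of the tail within the full sequence.
first_pos = SEQUENCE_LENGTH - len(tail) + 1

# Domain 2: VHP, Villin headpiece domain, region 2151-2214.
vhp = extract(tail, first_pos, 2151, 2214)
print(len(vhp), vhp)
```

With the full sequence in hand, the same helper can be called with `first_pos=1`. A split region such as that of domain 6 would need one call per segment.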
