SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSGACP00000021643 from Gasterosteus aculeatus 76_1

Domain assignment details

Strong hits

Sequence:  ENSGACP00000021643
Domain  Region     Level        Classification               E-value
1       491-656    Superfamily  A middle domain of Talin 1   1.29e-72
                   Family       A middle domain of Talin 1   0.0000003
2       759-895    Superfamily  I/LWEQ domain                8.44e-52
                   Family       I/LWEQ domain                0.00000202
3       2300-2489  Superfamily  I/LWEQ domain                1.02e-49
                   Family       I/LWEQ domain                0.00014
4       662-789    Superfamily  I/LWEQ domain                5.23e-47
                   Family       I/LWEQ domain                0.00000502
5       1844-1978  Superfamily  alpha-catenin/vinculin-like  1.26e-44
                   Family       VBS domain                   0.00000966
6       1231-1366  Superfamily  alpha-catenin/vinculin-like  1.33e-41
                   Family       VBS domain                   0.026
7       197-311    Superfamily  Second domain of FERM        1.7e-31
                   Family       Second domain of FERM        0.00000145
8       1079-1213  Superfamily  alpha-catenin/vinculin-like  3.45e-28
                   Family       VBS domain                   0.01
9       312-402    Superfamily  PH domain-like               6.39e-23
                   Family       Third domain of FERM         0.0000191
10      1476-1561  Superfamily  I/LWEQ domain                0.0000000000000104
                   Family       I/LWEQ domain                0.0099
11      1700-1820  Superfamily  alpha-catenin/vinculin-like  0.0000000926
                   Family       VBS domain                   0.043
12      2010-2137  Superfamily  alpha-catenin/vinculin-like  0.0000022
                   Family       VBS domain                   0.026
13      81-137     Superfamily  Ubiquitin-like               0.00000797
                   Family       Ubiquitin-related            0.066
14      1588-1665  Superfamily  alpha-catenin/vinculin-like  0.0000173
                   Family       VBS domain                   0.044
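
The strong hits are listed in order of superfamily E-value, not by position in the protein. As an illustration only, the sketch below reorders them by region start to read the architecture from N- to C-terminus, and flags neighbouring regions that overlap. The region coordinates and superfamily names are transcribed from the listing above; the tuple layout and the function names order_by_position and adjacent_overlaps are assumptions made for this example and are not part of SUPERFAMILY.

```python
# Strong hits transcribed from the listing above.
# Tuple layout (an assumption for this sketch):
# (domain number, region start, region end, superfamily)
STRONG_HITS = [
    (1, 491, 656, "A middle domain of Talin 1"),
    (2, 759, 895, "I/LWEQ domain"),
    (3, 2300, 2489, "I/LWEQ domain"),
    (4, 662, 789, "I/LWEQ domain"),
    (5, 1844, 1978, "alpha-catenin/vinculin-like"),
    (6, 1231, 1366, "alpha-catenin/vinculin-like"),
    (7, 197, 311, "Second domain of FERM"),
    (8, 1079, 1213, "alpha-catenin/vinculin-like"),
    (9, 312, 402, "PH domain-like"),
    (10, 1476, 1561, "I/LWEQ domain"),
    (11, 1700, 1820, "alpha-catenin/vinculin-like"),
    (12, 2010, 2137, "alpha-catenin/vinculin-like"),
    (13, 81, 137, "Ubiquitin-like"),
    (14, 1588, 1665, "alpha-catenin/vinculin-like"),
]


def order_by_position(hits):
    """Return the hits sorted by region start (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def adjacent_overlaps(hits):
    """Return (domain, domain) pairs of positional neighbours whose regions overlap."""
    ordered = order_by_position(hits)
    return [
        (left[0], right[0])
        for left, right in zip(ordered, ordered[1:])
        if right[1] <= left[2]  # next region starts before the previous one ends
    ]


if __name__ == "__main__":
    for number, start, end, superfamily in order_by_position(STRONG_HITS):
        print(f"{start:>4}-{end:<4}  domain {number:>2}  {superfamily}")
    print("Overlapping neighbours:", adjacent_overlaps(STRONG_HITS))
```

On this listing the only neighbouring pair reported as overlapping is domains 4 and 2 (regions 662-789 and 759-895).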
 
Weak hits

Sequence:  ENSGACP00000021643
Domain  Region     Level        Classification               E-value
-       927-989    Superfamily  I/LWEQ domain                0.0019
                   Family       I/LWEQ domain                0.0072
-       2111-2233  Superfamily  I/LWEQ domain                0.0222
                   Family       I/LWEQ domain                0.0067
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Protein sequence

External link(s) Protein: ENSGACP00000021643   Gene: ENSGACG00000016368   Transcript: ENSGACT00000021684
Sequence length 2546
Comment pep:known_by_projection group:BROADS1:groupII:15069480:15102391:1 gene:ENSGACG00000016368 transcript:ENSGACT00000021684 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MVVLSLKICIRQCNVVKTMQFEPSTAVYDACRIIRERVPEAQTGQASDYGLFLSDEDPRK
GIWLESGRTLDYYMLRNGDILEYKKKQRPQKIKMLDGAVKTIMVDDSKTVGELLVTICSR
IGITNYEEYSLIQETVEEKKEDGMGTLKKDRTLLRDERKMEKLKAKLHTDDDLNWLDHSR
TFREQGVDENETLLLRRKFFYSDQNVDSRDPVQLNLLYVQARDDILNGSHPVSFDKACEF
GGIQAQIQFGPHIEHKHKQGFLDLKEFLPKEYIKQRGAEKKIFQEHKNCGEMTEIEAKVK
YVKLARSLRTYGVSFFLVKEKMKSKNKLVPRLLGITKESVLRVEEKTKDVVQEWPLTTVK
RWAASPKSFTLDFGEYQESYYSVQTTEGEQISQLIAGYIDIILKKKQSKDRFGLEGDEES
TMLEESVSPKKSTILQQQFNRVGRVEHGSVALPGVIRSGSIGTESLSMGTMPSAQQQITM
GQMHRGHMPPLSSAQQALMGTINTSMQAVQKAQIDLGEVDNLPPLGQDMASKVWIQNKMD
ESKHEIHSQVDAITAGTASVVNLTAGDPTDTDYTAVGCAITTISSNLTEMSKGVKLLAAL
MEDDVGGGNDLMRAARTLAGAVSDLLKAVEPASGEPRQTVLTAAGSIGQASGDLLRRIGE
SEADERFQDILMNLAKAVANAAAMLVLKAKNVAQVAEDTGLQNRVIAAATQCALSTSQLV
ACAKVPQHVVSPTISSPVCQEQLIEAGKLVDRSVESCVQACLSATEDGDLLKQVSAAASV
VSQALGDLLQHVRQYTSRGEPIGRYDQATDTIMTVTESIFTSMGDAGEMVRQARVLAQAT
SDLVNAMRSDAEAEVDVDNSKKLLAAAKLLADATARMVEAAKGAAAYPENEDQQQRLREA
AEGLRVATNAAAQNAIKKKLFNRLENAAKQAAAAATQTIAAAQNAAASNKNTAAHQQLVQ
SCKAVADHIPQLVQGVRGSQAKPEDLSAQLALIIASQNFLQPGSKMVTSAKSSVPTVTDQ
AAAMQLGQCAKNLATCLAELRTSVQKAHEACGPMEIDSALTAIQTLRSELQDAMLTAMNS
KLKPLPGESLEKCAQDLGSTSKSVGSSMAQLLTCAAQGNEHYTGIAARETYQALRTLAQA
ARGVAASTTDPKAAAAMLDSARDVMEGSALLVHEAKQALVSPGDAESQQRLAQVAKAVSH
SLNSCVNCLPGQKDVDMALKSIGETSKKLLIETIPPASKSFQEAQSELNQTAADLNQSAG
EVVHASRGSSSQLAVASGKFSEDFDEFLDAGIEMAGHTQKKDDQVQVIGNLKNISMASSK
LLLAAKSLSVDPAAANAKNLLAAAARAVTDSINQLITLCTQQAPGQNECDNALRELEAVR
GMLDNPNEPVSDLSYFECIESVMENSKVLGESLAGISQNCKTGDVPSFGDCVGSASKALC
GLTEAAGQASYLVGVSDPNSQAGHQGLVDPIQFAKANQAIQMACQNLVDPGSSPSQVLSA
ATIVAKHTSALCNACRLASSKTTNPVAKRHFVQSAKEVANSTANLVKTIKALDGDFSDEN
RKKCRVATAPLIEAVENLTAFASNPEFASVAAKISNEGFAAQEPILQSARSMLDSSTYLL
KTARSLVINPKDPPTWSVLAGHSRTVSDSIKSLITAIRDKAPGQRECDSSIDNINKCIRD
IEQASLAAVSQNLASRDDISLEALQEQLTSTVQEIGHLIDPVSTAARGEASQLGHKVTQL
ARYFEPLIMASVGVTSKLRDHQQQMTFLDQTKTLAESALQMLYAAKEGGGNPKASHTHDA
IAEAAQLMKEAVDDIMVTLNEAASEVGMVGGMVESIAEAMARLDEGTPPEPEGSFVDYQT
SMVKHSKAIAVSAQEMMTKSVTCPDELGGLASQVTVDYSQLAVQGRLAAHTAEPEEIGFQ
IKNRVQELGHGCIFLVQKAGAVQITPSDGFTKRELIECARAVTEKVSLVLSALQTGNKGT
QACITAASAVSGIIADLDTTIMFATAGTLNAENDESFADHRENILKTAKALVEDTKMLVS
GAASGQDRLAQAAQSSAKTITQLTDVVKLGAASIGADDPETQVVLINAIKDVAKALSELI
SATKCAAGKAADDPSMYQLKSAAKVMVTNVTSLLKTVKAVEDEATRGTRALEATIECIRQ
ELVVFQSRDAPNKTTTPEEFIRMTKGITMATAKAVAAGNSARQEDVIHTANLSRKAISDM
LTTCKQASYHPEVSEEVKSRALMFGSECTTGYIDLLEQVLFVLQRPTGDQKQQLAVCSKR
VAGAVTELIQTAEAMKGSEWVDPEDPTVIAETELLGAAASIEAAAKKLEQLKPRAKPKQA
DETLNFEEQILEAAKSIAAATSALVKSASAAQRELVAQGKVGLIPANAVDDGQWSQGLIS
AARMVAAATSNLCEAANASVQGQASEEKLISSAKQVAASTAQLLVACKVKADQDSEAMRR
LQIAGNAVKKASDNLVRAAQKAAFDKADEANVVVKTKFVGGIAQIIAAQEEMLRKERELE
EARKKLAQIRQQQYKFLPSELREDDN
Identical sequences G3PVK4
ENSGACP00000021643 69293.ENSGACP00000021643 ENSGACP00000021643