SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000035272 from Mus musculus 63_37 (longest transcript per gene)

Domain assignment details

Strong hits

Sequence: ENSMUSP00000035272

Domain  Region     Classification Level  Classification               E-value
1       491-656    Superfamily           A middle domain of Talin 1   3.4e-70
                   Family                A middle domain of Talin 1   0.000000312
2       756-891    Superfamily           I/LWEQ domain                6.67e-53
                   Family                I/LWEQ domain                0.00000208
3       661-785    Superfamily           I/LWEQ domain                1.09e-51
                   Family                I/LWEQ domain                0.0000047
4       2296-2486  Superfamily           I/LWEQ domain                9.81e-51
                   Family                I/LWEQ domain                0.00012
5       1840-1974  Superfamily           alpha-catenin/vinculin-like  1.49e-46
                   Family                VBS domain                   0.00000766
6       1227-1363  Superfamily           alpha-catenin/vinculin-like  8.63e-42
                   Family                VBS domain                   0.031
7       197-311    Superfamily           Second domain of FERM        8.37e-32
                   Family                Second domain of FERM        0.0000013
8       1076-1209  Superfamily           alpha-catenin/vinculin-like  8.47e-30
                   Family                VBS domain                   0.01
9       312-402    Superfamily           PH domain-like               8.88e-25
                   Family                Third domain of FERM         0.0000172
10      1471-1557  Superfamily           I/LWEQ domain                0.0000000000000118
                   Family                I/LWEQ domain                0.0092
11      2006-2133  Superfamily           alpha-catenin/vinculin-like  0.0000000251
                   Family                VBS domain                   0.013
12      1698-1816  Superfamily           alpha-catenin/vinculin-like  0.00000022
                   Family                VBS domain                   0.063
13      81-137     Superfamily           Ubiquitin-like               0.000000737
                   Family                Ubiquitin-related            0.062
14      946-1044   Superfamily           I/LWEQ domain                0.0000863
                   Family                I/LWEQ domain                0.0028
 
Weak hits

Sequence: ENSMUSP00000035272

Domain  Region     Classification Level  Classification               E-value
-       1585-1661  Superfamily           alpha-catenin/vinculin-like  0.000204
                   Family                VBS domain                   0.041
-       2108-2229  Superfamily           I/LWEQ domain                0.0955
                   Family                I/LWEQ domain                0.0063
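
The regions listed under strong and weak hits are not all disjoint, so it can help to order them along the sequence and check for overlaps. The following is a minimal sketch, not part of the SUPERFAMILY server: the `Hit` type, the `find_overlaps` function, and the labels `w1` and `w2` for the two unnumbered weak hits are hypothetical names chosen for illustration, and the regions are assumed to be 1-based and inclusive.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    label: str        # domain number from the report; "w1"/"w2" are made-up labels
    start: int        # first residue of the region (assumed 1-based, inclusive)
    end: int          # last residue of the region (assumed inclusive)
    superfamily: str


# Regions copied from the strong and weak hit listings above.
HITS = [
    Hit("1", 491, 656, "A middle domain of Talin 1"),
    Hit("2", 756, 891, "I/LWEQ domain"),
    Hit("3", 661, 785, "I/LWEQ domain"),
    Hit("4", 2296, 2486, "I/LWEQ domain"),
    Hit("5", 1840, 1974, "alpha-catenin/vinculin-like"),
    Hit("6", 1227, 1363, "alpha-catenin/vinculin-like"),
    Hit("7", 197, 311, "Second domain of FERM"),
    Hit("8", 1076, 1209, "alpha-catenin/vinculin-like"),
    Hit("9", 312, 402, "PH domain-like"),
    Hit("10", 1471, 1557, "I/LWEQ domain"),
    Hit("11", 2006, 2133, "alpha-catenin/vinculin-like"),
    Hit("12", 1698, 1816, "alpha-catenin/vinculin-like"),
    Hit("13", 81, 137, "Ubiquitin-like"),
    Hit("14", 946, 1044, "I/LWEQ domain"),
    Hit("w1", 1585, 1661, "alpha-catenin/vinculin-like"),
    Hit("w2", 2108, 2229, "I/LWEQ domain"),
]


def find_overlaps(hits: List[Hit]) -> List[Tuple[str, str, int]]:
    """Return (label_a, label_b, shared_residues) for every overlapping pair."""
    ordered = sorted(hits, key=lambda h: h.start)
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start > a.end:
                break  # hits are sorted by start, so no later hit can overlap a
            shared = min(a.end, b.end) - b.start + 1
            pairs.append((a.label, b.label, shared))
    return pairs


for label_a, label_b, shared in find_overlaps(HITS):
    print(f"{label_a} and {label_b} share {shared} residues")
```

On the regions above this reports two overlapping pairs: domains 3 and 2, and domain 11 with the weak hit at 2108-2229.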
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSMUSP00000035272, Gene: ENSMUSG00000052698, Transcript: ENSMUST00000039662
Sequence length: 2542
Comment: pep:known chromosome:NCBIM37:9:67068945:67387199:-1 gene:ENSMUSG00000052698 transcript:ENSMUST00000039662
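
The comment line packs several fields into one string. The sketch below splits it into named parts, assuming the usual Ensembl FASTA description layout of `type:status`, then `coord_system:assembly:name:start:end:strand`, then `key:value` pairs. The function name `parse_ensembl_comment` and the dictionary keys are hypothetical, chosen only for this illustration.

```python
from typing import Any, Dict


def parse_ensembl_comment(comment: str) -> Dict[str, Any]:
    """Split an Ensembl-style description line into a dictionary.

    Assumes whitespace-separated tokens, where the first token is
    'type:status' and a token starting with 'chromosome:' carries
    assembly, name, start, end and strand.
    """
    tokens = comment.split()
    seq_type, _, status = tokens[0].partition(":")
    record: Dict[str, Any] = {"seq_type": seq_type, "status": status}
    for token in tokens[1:]:
        key, _, value = token.partition(":")
        if key == "chromosome":
            assembly, name, start, end, strand = value.split(":")
            record["location"] = {
                "assembly": assembly,
                "chromosome": name,
                "start": int(start),
                "end": int(end),
                "strand": int(strand),
            }
        else:
            record[key] = value
    return record


COMMENT = (
    "pep:known chromosome:NCBIM37:9:67068945:67387199:-1 "
    "gene:ENSMUSG00000052698 transcript:ENSMUST00000039662"
)

record = parse_ensembl_comment(COMMENT)
location = record["location"]
# Genomic span of the locus, counting both end coordinates.
span = location["end"] - location["start"] + 1
print(record["gene"], location["chromosome"], location["strand"], span)
```

Under these assumptions the entry maps to chromosome 9 of assembly NCBIM37 on the reverse strand (-1).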
Sequence
MVALSLKICVRHCNVVKTMQFEPSTAVYDACRVIRERVPEAQTGQASDYGLFLSDEDPRK
GIWLEAGRTLDYYMLRNGDILEYKKKQRPQKIRMLDGSVKTVMVDDSKTVGELLVTICSR
IGITNYEEYSLIQETIEEKKEEGTGTLKKDRTLLRDERKMEKLKAKLHTDDDLNWLDHSR
TFREQGVDENETLLLRRKFFYSDQNVDSRDPVQLNLLYVQARDDILNGSHPVSFEKACEF
GGFQAQIQFGPHVEHKHKPGFLDLKEFLPKEYIKQRGAEKRIFQEHKNCGEMSEIEAKVK
YVKLARSLRTYGVSFFLVKEKMKGKNKLVPRLLGITKDSVMRVDEKTKEVLQEWPLTTVK
RWAASPKSFTLDFGEYQESYYSVQTTEGEQISQLIAGYIDIILKKKQSKDRFGLEGDEES
TMLEESVSPKKSTILQQQFNRTGKAEHGSVALPAVMRSGSSGPETFNVGSMPSPQQQVMV
GQMHRGHMPPLTSAQQALMGTINTSMHAVQQAQDDLSELDSLPPLGQDMASRVWVQNKVD
ESKHEIHSQVDAITAGTASVVNLTAGDPADTDYTAVGCAITTISSNLTEMSKGVKLLAAL
MDDDVGSGEDLLRAARTLAGAVSDLLKAVQPTSGEPRQTVLTAAGSIGQASGDLLRQIGE
NETDERFQDVLMSLAKAVANAAAMLVLKAKNVAQVAEDTVLQNRVIAAATQCALSTSQLV
ACAKVVSPTISSPVCQEQLIEAGKLVDRSVENCVRACQAATSDSELLKQVSAAASVVSQA
LHDLLQHVRQFASRGEPIGRYDQATDTIMCVTESIFSSMGDAGEMVRQARVLAQATSDLV
NAMRSDAEAEIDMENSKKLLAAAKLLADSTARMVEAAKGAAANPENEDQQQRLREAAEGL
RVATNAAAQNAIKKKIVNRLEVAAKQAAAAATQTIAASQNAAISNKNPSAQQQLVQSCKA
VADHIPQLVQGVRGSQAQAEDLSAQLALIISSQNFLQPGSKMVSSAKAAVPTVSDQAAAM
QLSQCAKNLATSLAELRTASQKAHEACGPMEIDSALNTVQTLKNELQDAKMAAAESQLKP
LPGETLEKCAQDLGSTSKGVGSSMAQLLTCAAQGNEHYTGVAARETAQALKTLAQAARGV
AASTNDPEAAHAMLDSARDVMEGSAMLIQEAKQALIAPGDTESQQRLAQVAKAVSHSLNN
CVNCLPGQKDVDVALKSIGEASKKLLVDSLPPSTKPFQEAQSELNQAAADLNQSAGEVVH
ATRGQSGELAAASGKFSDDFDEFLDAGIEMAGQAQTKEDQMQVIGNLKNISMASSKLLLA
AKSLSVDPGAPNAKNLLAAAARAVTESINQLIMLCTQQAPGQKECDNALRELETVKGMLE
NPNEPVSDLSYFDCIESVMENSKVLGESMAGISQNAKTGDLPAFGECVGIASKALCGLTE
AAAQAAYLVGISDPNSQAGHQGLVDPIQFARANQAIQMACQNLVDPGSSPSQVLSAATIV
AKHTSALCNACRIASSKTANPVAKRHFVQSAKEVANSTANLVKTIKALDGDFSEDNRNKC
RIATTPLIEAVENLTAFASNPEFASIPAQISSEGSQAQEPILVSAKTMLESSSYLIRTAR
SLAINPKDPPTWSVLAGHSHTVSDSIKSLITSIRDKAPGQRECDYSIDGINRCIRDIEQA
SLAAVSQSLATRDDISVEALQEQLTSVVQEIGHLIDPIATAARGEAAQLGHKVTQLASYF
EPLILAAVGVASKMLDHQQQMTVLDQTKTLAESALQMLYAAKEGGGNPKAQHTHDAITEA
AQLMKEAVDDIMVTLNEAASEVGLVGGMVDAIAEAMSKLDEGTPPEPKGTFVDYQTTVVK
YSKAIAVTAQEMMTKSVTNPEELGGLASQMTTDYGHLALQGQMAAATAEPEEIGFQIRTR
VQDLGHGCIFLVQKAGALQVCPTDSYTKRELIECARSVTEKVSLVLSALQAGNKGTQACI
TAATAVSGIIADLDTTIMFATAGTLNAENGETFADHRENILKTAKALVEDTKLLVSGAAS
TPDKLAQAAQSSAATITQLAEVVKLGAASLGSNDPETQVVLINAIKDVAKALSDLIGATK
GAASKPADDPSMYQLKGAAKVMVTNVTSLLKTVKAVEDEATRGTRALEATIEYIKQELTV
FQSKDIPEKTSSPEESIRMTKGITMATAKAVAAGNSCRQEDVIATANLSRKAVSDMLIAC
KQASFYPDVSEEVRTRALRYGTECTLGYLDLLEHVLVILQKPTPELKHQLAAFSKRVAGA
VTELIQAAEAMKGTEWVDPEDPTVIAETELLGAAASIEAAAKKLEQLKPRAKPKQADETL
DFEEQILEAAKSIAAATSALVKSASAAQRELVAQGKVGSIPANAADDGQWSQGLISAARM
VAAATSSLCEAANASVQGHASEEKLISSAKQVAASTAQLLVACKVKADQDSEAMKRLQAA
GNAVKRASDNLVRAAQKAAFGKADDDDVVVKTKFVGGIAQIIAAQEEMLKKERELEEARK
KLAQIRQQQYKFLPTELREDEG
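
To pull out the residues behind one of the assigned regions, the region string can be turned into a slice of the sequence. This is a minimal sketch that assumes the regions are 1-based and inclusive. The helper `extract_region` is a hypothetical name, and only the first 180 residues of the sequence above are included to keep the example short, which is enough for the Ubiquitin-like region 81-137.

```python
def extract_region(sequence: str, region: str) -> str:
    """Return the residues of a 'start-end' region (assumed 1-based, inclusive)."""
    start_text, end_text = region.split("-")
    start, end = int(start_text), int(end_text)
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {region} lies outside a sequence of length {len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# First three 60-residue lines of the protein sequence shown above.
SEQUENCE_PREFIX = (
    "MVALSLKICVRHCNVVKTMQFEPSTAVYDACRVIRERVPEAQTGQASDYGLFLSDEDPRK"
    "GIWLEAGRTLDYYMLRNGDILEYKKKQRPQKIRMLDGSVKTVMVDDSKTVGELLVTICSR"
    "IGITNYEEYSLIQETIEEKKEEGTGTLKKDRTLLRDERKMEKLKAKLHTDDDLNWLDHSR"
)

ubiquitin_like = extract_region(SEQUENCE_PREFIX, "81-137")
print(len(ubiquitin_like), ubiquitin_like)
```

The same call works on the full 2542-residue sequence for any of the other regions, once the lines are joined into a single string.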
Identical sequences: E9PUM4, ENSMUSP00000035272, NP_001074711.2.92730, ENSMUSP00000039633
