SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000019446 from Equus caballus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000019446
Domain Number 1 Region: 491-656
Classification Level Classification E-value
Superfamily A middle domain of Talin 1 1.44e-69
Family A middle domain of Talin 1 0.00000032
Further Details:      
 
Domain Number 2 Region: 756-891
Classification Level Classification E-value
Superfamily I/LWEQ domain 2.94e-53
Family I/LWEQ domain 0.00000202
Further Details:      
 
Domain Number 3 Region: 661-785
Classification Level Classification E-value
Superfamily I/LWEQ domain 5.1e-52
Family I/LWEQ domain 0.00000457
Further Details:      
 
Domain Number 4 Region: 2298-2488
Classification Level Classification E-value
Superfamily I/LWEQ domain 6.08e-51
Family I/LWEQ domain 0.00012
Further Details:      
 
Domain Number 5 Region: 1842-1976
Classification Level Classification E-value
Superfamily alpha-catenin/vinculin-like 5.57e-47
Family VBS domain 0.00000732
Further Details:      
 
Domain Number 6 Region: 1229-1365
Classification Level Classification E-value
Superfamily alpha-catenin/vinculin-like 1.8e-41
Family VBS domain 0.028
Further Details:      
 
Domain Number 7 Region: 197-311
Classification Level Classification E-value
Superfamily Second domain of FERM 8.37e-32
Family Second domain of FERM 0.0000013
Further Details:      
 
Domain Number 8 Region: 1078-1211
Classification Level Classification E-value
Superfamily alpha-catenin/vinculin-like 5.81e-29
Family VBS domain 0.012
Further Details:      
 
Domain Number 9 Region: 312-402
Classification Level Classification E-value
Superfamily PH domain-like 8.88e-25
Family Third domain of FERM 0.0000172
Further Details:      
 
Domain Number 10 Region: 1473-1559
Classification Level Classification E-value
Superfamily I/LWEQ domain 0.00000000000000955
Family I/LWEQ domain 0.0092
Further Details:      
 
Domain Number 11 Region: 2008-2135
Classification Level Classification E-value
Superfamily alpha-catenin/vinculin-like 0.0000000298
Family VBS domain 0.015
Further Details:      
 
Domain Number 12 Region: 1700-1818
Classification Level Classification E-value
Superfamily alpha-catenin/vinculin-like 0.000000173
Family VBS domain 0.063
Further Details:      
 
Domain Number 13 Region: 81-137
Classification Level Classification E-value
Superfamily Ubiquitin-like 0.000000737
Family Ubiquitin-related 0.062
Further Details:      
 
Domain Number 14 Region: 946-1047
Classification Level Classification E-value
Superfamily I/LWEQ domain 0.0000785
Family I/LWEQ domain 0.0025
Further Details:      
 
Weak hits

Sequence:  ENSECAP00000019446
Domain Number - Region: 1589-1663
Classification Level Classification E-value
Superfamily alpha-catenin/vinculin-like 0.000345
Family VBS domain 0.041
Further Details:      
 
Domain Number - Region: 2110-2231
Classification Level Classification E-value
Superfamily I/LWEQ domain 0.00968
Family I/LWEQ domain 0.0047
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000019446   Gene: ENSECAG00000020488   Transcript: ENSECAT00000023476
Sequence length 2544
Comment pep:known_by_projection chromosome:EquCab2:1:129230635:129422024:-1 gene:ENSECAG00000020488 transcript:ENSECAT00000023476 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MVALSLKICVRHCNVVKTMQFEPSTAVYDACRVIRERVPEAQTGQASDYGLFLSDEDPRK
GIWLEAGRTLDYYMLRNGDILEYKKKQRPQKIRMLDGSVKTVMVDDSKTVGELLVTICSR
IGITNYEEYSLIQETIEEKKEEGTGTLKKDRTLLRDERKMEKLKAKLHTDDDLNWLDHSR
TFREQGVDENETLLLRRKFFYSDQNVDSRDPVQLNLLYVQARDDILNGSHPVSFEKACEF
GGFQAQIQFGPHVEHKHKPGFLDLKEFLPKEYIKQRGAEKRIFQEHKNCGEMSEIEAKVK
YVKLARSLRTYGVSFFLVKEKMKGKNKLVPRLLGITKDSVMRVDEKTKEVLQEWPLTTVK
RWAASPKSFTLDFGEYQESYYSVQTTEGEQISQLIAGYIDIILKKKQSKDRFGLEGDEES
TMLEESVSPKKSTILQQQFNRTGKVEHGSVALPAVMRSGSSGPETFNVGSMPSPQQQVMV
GQMHRGHMPPLTSAQQALMGTINTSMHAVQQAQDDLSELDSLPPLGQDMASRVWVQNKVD
ESKHEIHSQVDAITAGTASVVNLTAGDPADTDYTAVGCAITTISSNLTEMSKGVNLLAAL
MDDEVGSGEDLLRAARTLAGAVSDLLKAVQPTSGEPRQTVLTAAGSIGQASGDLLRQIGE
NETDERFQDVLMSLAKAVANAAAMLVLKAKNVAQVAEDTVLQNRVIAAATQCALSTSQLV
ACAKVVSPTISSPVCQEQLIEAGKLVDRSVENCVRACQAATDDSELLKQVSAAASVVSQA
LHDLLQHVRQFASRGEPIGRYDQATDTIMCVTESIFSSMGDAGEMVRQARVLAQATSDLV
NAMRSDAEAEIDMENSKKLLAAAKLLADSTARMVEAAKGAAANPENEDQQQRLREAAEGL
RVATNAAAQNAIKKKIVNRLEVAAKQAAAAATQTIAASQNAAVSNKNPAAQQQLVQSCKA
VADHIPQLVQGVRGSQAQTEDLSAQLALIISSQNFLQPGSKMVSSAKAAVPTVSDQAAAM
QLSQCAKNLATSLAELRTASQKAIAHEACGPMEIDSALSTVQTLKNELQDAKMAAVESQL
KPLPGETLEKCAQDLGSTSKAVGSSMAQLLTCAAQGNEHYTGVAARETAQALKTLAQAAR
GVAASTSDPAAAHAMLDSARDVMEGSAMLIQEAKQALIAPGDAESQQRLAQVAKAVSHSL
NNCVNCLPGQKDVDVALKSIGESSKKLLVDSLPPSTKPFQEAQSELNQAAADLNQSAGEV
VHATRGQTGELAAASGKFSDDFDEFLDAGIEMAGQAQTKEDQIQVIGNLKNISMASSKLL
LAAKSLSVDPGAPNAKNLLAAAARAVTESINQLITLCTQQAPGQKECDNALRELETVKGM
LDNPNEPVSDLSYFDCIESVMENSKVLGESMAGISQNAKTGDLPAFGECVGIASKALCGL
TEAAAQAAYLVGISDPNSQAGHQGLVDPIQFARANQAIQMACQNLVDPGSSPSQVLSAAT
IVAKHTSALCNACRIASSKTANPVAKRHFVQSAKEVANSTANLVKTIKALDGDFSEENRN
KCRIATAPLIEAVENLTAFASNPEFVSVPAQISSEGSQAQEPILVSAKTMLESSSYLIRT
ARSLAINPKDPPTWSVLAGHSHTVSDSIKSLITSIRDKAPGQRECDYSIDGINRCIRDIE
QASLAAVSQSLATRDDISVEALQEQLTSVVQEIGHLIDPIATAARGEAAQLGHKVTQLAS
YFEPLILAAVGVASKILDHQQQMTVLDQTKTLAESALQMLYAAKEGGGNPKAQHTHDAIT
EAAQLMKEAVDDIMVTLNEAASEVGLVGGMVDAIAEAMSKLDEGTPPEPKGTFVDYQTTV
VKYSKAIAVTAQEMMTKSVTNPEELGGLASQMTSDYGHLALQGQMAAATAEPEEIGFQIR
TRVQDLGHGCIFLVQKAGALQVCPTDSYTKRELIECARAVTEKVSLVLSALQAGNKGTQA
CITAATAVSGIIADLDTTIMFATAGTLNAENNETFADHRENILKTAKALVEDTKLLVSGA
ASTPDKLAQAAQSSAATITQLAEVVKLGAASLGSDDPETQVVLINAIKDVAKALSDLIGA
TKGAASKPADDPSMYQLKGAAKVMVTNVTSLLKTVKAVEDEATRGTRALEATIEYMKQEL
TVFQSKEIPEKTSSPEESIRMTKGITMATAKAVAAGNSCRQEDVIATANLSRKAVADMLT
ACKQASFHPDVSEEVRTRALRYGTECTLGYLDLLEHVLVILQKPTPELKHQLAAFSKRVA
GAVTELIQAAEAMKGTEWVDPEDPTVIAETELLGAAASIEAAAKKLEQLKPRAKPKQADE
TLDFEEQILEAAKSIAAATSALVKSASAAQRELVAQGKVGSIPANAADDGQWSQGLISAA
RMVAAATSSLCEAANASVQGHASEEKLISSAKQVAASTAQLLVACKVKADQDSEAMRRLQ
AAGNAVKRASDNLVRAAQKAAFGKADDDDVVVKTKFVGGIAQIIAAQEEMLKKERELEEA
RKKLAQIRQQQYKFLPTELREDEG
Download sequence
Identical sequences F6YFZ7
ENSECAP00000019446

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]