SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000011940 from Gorilla gorilla 76_3.1


Domain assignment details

Strong hits

Sequence: ENSGGOP00000011940

Domain  Region               Superfamily                          Superfamily E-value  Family                               Family E-value
1       3777-4015            Plakin repeat                        8.11e-88             Plakin repeat                        0.000000022
2       173-396              Calponin-homology domain, CH-domain  1.39e-86             Calponin-homology domain, CH-domain  0.000000605
3       3198-3436            Plakin repeat                        1.57e-86             Plakin repeat                        0.0000000533
4       2537-2779            Plakin repeat                        2.22e-83             Plakin repeat                        0.000000076
5       2867-3106            Plakin repeat                        1.83e-77             Plakin repeat                        0.000000502
6       3535-3772            Plakin repeat                        1.31e-76             Plakin repeat                        0.000000843
7       4088-4129,4164-4358  Plakin repeat                        1.31e-69             Plakin repeat                        0.000000325
8       412-528              Spectrin repeat                      5.47e-22             Spectrin repeat                      0.0042
9       1289-1474            Spectrin repeat                      2.43e-20             Spectrin repeat                      0.0039
10      755-857              Spectrin repeat                      6.48e-17             Spectrin repeat                      0.005
11      4035-4083            Plakin repeat                        0.000000000222       Plakin repeat                        0.0093
12      661-754              Spectrin repeat                      0.00000000072        Spectrin repeat                      0.0061
 
Weak hits

Sequence: ENSGGOP00000011940

Domain  Region               Superfamily                          Superfamily E-value  Family                               Family E-value
-       1179-1288            Spectrin repeat                      0.000104             Spectrin repeat                      0.015
-       862-944              Spectrin repeat                      0.000272             Spectrin repeat                      0.0098
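
The hits above are listed by score rather than by position, so the order of domains along the protein is not obvious at a glance. The following Python sketch re-sorts them by start coordinate and handles the split region of domain 7. It is illustrative only: the names `Hit`, `parse_region`, `ordered_hits`, and `covered_residues` are hypothetical helpers, not part of SUPERFAMILY, and the residue count assumes the region coordinates are 1-based and inclusive, which this page does not state.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    label: str                       # domain number as shown on the page ("-" for weak hits)
    superfamily: str
    segments: List[Tuple[int, int]]  # one (start, end) pair per contiguous segment


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Turn a region string such as '4088-4129,4164-4358' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


# (label, superfamily, region) copied from the listings on this page.
STRONG_HITS = [
    ("1", "Plakin repeat", "3777-4015"),
    ("2", "Calponin-homology domain, CH-domain", "173-396"),
    ("3", "Plakin repeat", "3198-3436"),
    ("4", "Plakin repeat", "2537-2779"),
    ("5", "Plakin repeat", "2867-3106"),
    ("6", "Plakin repeat", "3535-3772"),
    ("7", "Plakin repeat", "4088-4129,4164-4358"),
    ("8", "Spectrin repeat", "412-528"),
    ("9", "Spectrin repeat", "1289-1474"),
    ("10", "Spectrin repeat", "755-857"),
    ("11", "Plakin repeat", "4035-4083"),
    ("12", "Spectrin repeat", "661-754"),
]
WEAK_HITS = [
    ("-", "Spectrin repeat", "1179-1288"),
    ("-", "Spectrin repeat", "862-944"),
]


def ordered_hits(rows) -> List[Hit]:
    """Build Hit records and sort them by the start of their first segment."""
    hits = [Hit(label, sf, parse_region(region)) for label, sf, region in rows]
    return sorted(hits, key=lambda h: h.segments[0][0])


def covered_residues(hits: List[Hit]) -> int:
    """Sum segment lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for h in hits for start, end in h.segments)


if __name__ == "__main__":
    for hit in ordered_hits(STRONG_HITS + WEAK_HITS):
        spans = ",".join(f"{s}-{e}" for s, e in hit.segments)
        print(f"{spans:<22}{hit.superfamily:<38}domain {hit.label}")
```

Sorting on the first segment's start keeps a split domain such as domain 7 in a single row; a caller that needs each segment separately could flatten `segments` before sorting.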
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Protein sequence

Protein: ENSGGOP00000011940
Gene: ENSGGOG00000012217
Transcript: ENSGGOT00000012283
Sequence length: 4440
Comment: pep:known_by_projection chromosome:gorGor3.1:8:144105188:144144018:-1 gene:ENSGGOG00000012217 transcript:ENSGGOT00000012283 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MVAGMLMPRDQLRAIYEMLFREGVMVAKKDRRPRSLHPHVPGVTNLQVMRAMASLRARGL
VRETFAWCHFYWYLTNEGIAHLRQYLHLPPEIVPASLQRVRRPVAMVMPARRTPHVQAVQ
GPLGSPPKRGPLPAEEQRVYRRKELEEVSPETPVVPATTQRTLARPGPEPAPATDERDRV
QKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNV
QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSEDMTAKEK
LLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPMLIDMNKVYRQTNLENLDQAF
SVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLYDAMPRVPDVQDGVRANELQLRWQE
YRELVLLLLQWMRHHTAAFEERRFPSSFEEIEILWSQFLKFKEMELPAKEADKNRSKGIY
QSLEGAVQAGQLKVPPGYHPLDVEKEWGKLHVAILEREKQLRSEFERLECLQRIVTKLQM
EAGLCEEQLNQADALLQSDVRLLAAGKVPQRAGEVERDLDKADSMIRLLFNDVQTLKDGR
HPQGEQMYRRVYRLHERLVAIRTEYNLRLKAGVAAPATQVTQVTLQSVQRRPELEDSTLR
YLQDLLAWVEENQHRVDGAEWGVDLPSVEAQLGSHRGLHQSIEEFRAKIERARSDEGQLS
PATRGAYRDCLGRLDLQYAKLLNSSKARLRSLESLHSFVAAATKELMWLNEKEEEEVGFD
WSDRNTNMTAKKESYSALMRELELKEKKIKELQNAGDRLLREDHPARPTVESFQAALQTQ
WSWMLQLCCCIEAHLKENAAYFQFFSDVREAEGQVQKLQEALRRKYSCDRSATVTRLEDL
LQDAQDEKEQLNEYKGHLSGLAKRAKAIVQLKPRHPAHPMRGRLPLLAVCDYKQVEVTVH
KGDECQLVGPAQPSHWKVLSSSGSEAAVPSVCFLVPPPNQEAQEAVTRLEAQHQALVTLW
HQLHVDMKSLLAWQSLRRDVQLIRSWSLATFRTLKPEEQRQALHSLELHYQAFLRDSQDA
GGFGPEDRLMAEREYGSCSHHYQQLLQSLEQGAQEESRCQRCISELKDIRLQLEACETRT
VHRLRLPLDKEPARECAQRITEQQKAQAEVEGLGKGVARLSAEAEKVLALPEPSPAAPTL
RSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQGAEEVLRAHEEQLKEAQAVPATLP
ELEATKASLKKLRAQAEAQQPMFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVAQ
LLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLQDARRRQEQIQAMPLADSQA
VREQLRQEQALLEEIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKP
KVQSGSESVIQEYVDLRTRYSELTTLTSQYIKFISETLRRMEEEERLAEQQRAEERERLA
EVEAALEKQRQLAEAHAQAKAQAEREAKELQQRMQEEAVRREEAAVDAQQQKRSIQEELQ
QLRQSSEAEIQAKARQAEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE
EAEAQKRQAQEEAERLRRQVQDESQRKRQAEAELASRVKAEAEAAREKQRALQALEELRL
QAEXAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALA
EEAKRQRQLAEEDAARQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLA
EDEAFQRRRLEEQAAQHKADIEERLAQLRKASDSELERQKGLVEDTLRQRRQVEEEILAL
KASFEKAAAGKAELELELGRIRSNAQDTLRSKEQAELEAARQRQLAAEEERRRREAEERV
QKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAE
EKAHAFVGVLDRLRSEAEAARRAAEEAEEARVQAEREAAQSRQQVEEAERLKQSAEEQAQ
AQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKKFAEQTLRQKAQVEQ
ELTTLRLQLEETDHQKNLLDEELQRLKAEATEAARQRSQVEEELFSVRVQMEELSKLKAR
IEAENRTLILRDKDNTQRFLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRA
LAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAEETQGFQR
TLEAERQRQLEMSAEAERLKLRVAEMSRAQARAEEDAQRFRKQAEEIGEKLHRTELATQE
KVTLVQTLEIQRQQSDHDAERLREAIAELEREKEKLQQEAKLLQLKSEEMQTVQQEQLLQ
ETQALQQSFLSEKDSLLQRERFIEQEKAKLEQLFQDEVAKAQQLREMEQERQRLVASMEE
ARRRQHEAEEGVRREELLAEENQRLREQLQRLEEQHRAALAHSEEVTASQVAATKTLPNG
RDALDGPAAEAEPEHSFDGLRRKVPAQRLQEAGILSAEELQRLAQGHTTVDELARREDVR
HYLQGRSSIAGLLLKPTNEKLSVYAALQRQLLSPGTALILLEAQAASGFLLDPVRNRRLT
VNEAVKEGVVGPELHHKLLSAEPERAVTGYTDPYTGQQISLFQAMQKDLIVREHGIRLLE
AQIATGGVVDPVHSHRVPVDVAYRRGYFDEEMNRVLADPSDDTKGFFDPNTQENLTYLQL
LERCVEDPETGLRLLPLTDKAAKGGELVYTDSEARDVFEKATVSAPFGKFQGKTVTIWEI
INSEYFTAEQRRDLLRQFRTGRITVEKIIKIIITVVEEQEQKGRLCFEGLRSLVPAAELL
ESGVIDRELYQQLQRGERSVREVAEVDTVRRALRGANVIAGVWLEEAGQKLSIYNALKKD
LLPSDMAVALLEAQAGTGHIIDPATSARLTVDEAVRAGLVGPEFHEKLLSAEKAVTGYRD
PYTGQSVSLFQALKKGLIPREQGLRLLDAQLSTGGVVDPSKSHRVPLDVACARGCLDEET
SRALSAPRADAKAYSDPSTGEPVTYSELQQRCRPDQLTGLSLLPLSEKAARIRQEELYSE
LQARETFEKTPVEVPVGGFKGRTVTVWELISSEYFTAEQRQELLRQFRTGKVTVEKVIKI
LITIVEEVETLRQERLSFSGLRAPVPASELLASGVLSRAQFEQLKDGKTTVKDLSELGSV
RTLLQGSGCLAGIYLEDTKEKVSIYEAMRRGLLRASTAALLLEAQAATGFLVDPVRNQRL
YVHEAVKAGVVGPELHEQLLSAEKAVTGYRDPYSGSTISLFQAMQKGLVLRQHGIRLLEA
QIATGGIIDPVHSHRVPVDVAYQRGYFNEEMNRVLADPSDDTKGFFDPNTHENLTYRQLL
ERCVEDPETGLRLLPLKGAEKAEVVETTQVYTEEETRRAFEETQIDIPGGGSHGGSTMSL
WEVMQSDLIPEEQRAQLMADFQAGRVTKERMIIIIIEIIEKTEIIRQQGLASYDYVRRRL
TAEDLFEARIISLETYNLLREGSRSLREALEAESAWRYLYGTGSVAGVYLPGSRQTLSIY
QALKKGLLSAEVARLLLEAQAATGFLLDPVKGERLTVDEAVRKGLVGPELHDRLLSAERA
VTGYRDPYTEQTISLFQAMKKELIPTEEALRLLDAQLATGGIVDPRLGFHLPLEVAYQRG
YLNKDTHDQLSEPSEVRSYVDPSTDERLSYTQLLRRCRRDDGTGQLLLPLSDARKLTFRG
LRKQITVEELVRSQVMDEATALQLREGLTSIEEVTKNLQKFLEGTSCIAGVFVDATKERL
SVYQAMKKGIIRPGTAFELLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKLLSA
ERAVTGYKDPYSGKLISLFQAMKKGLILKDHGIRLLEAQIATGGIIDPEESHRLPVEVAY
KRGLFDEEMNEILTDPSDDTKGFFDPNTEENLTYLQLMERCITDPQTGLCLLPLKEKKRE
RKTSSKSSVRKRRVVIVDPETGKEMSVYEAYRKGLIDHQTYLELSEQECEWEEITISSSD
GVVKSMIIDRRSGRQYDIDDAIAKNLIDRSALDQYRAGTLSITEFADMLSGNAGGFRSRS
SSVGSSSSYPISPAVSRTQLASWSDPTEETGPVAGILDTETLEKVSITEAMHRNLVDNIT
GQRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRINLAQKAFCGFEDPRTKTK
MSAAQALKKGWLYYEAGQRFLEVQYLTGGLIEPDTPGRVPLDEALQRGTVDARTAQKLRD
VGAYSKYLTCPKTKLKISYKDALDHSMVEEGTGLRLLEAAAQSTKGYYSPYSVSGSGSTA
GSRTGSRTGSRAGSRRGSFDATGSGFSMTFSSSSYSSSGYGRRYASGSSASLGGPESAVA
Identical sequences ENSGGOP00000011940 ENSGGOP00000025069
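
The Comment line in this section packs several fields into one string. The Python sketch below splits it into a dictionary. It assumes the usual Ensembl FASTA header layout, in which the first field is `sequence_type:status`, the second is `coord_system:assembly:name:start:end:strand`, and the rest are `key:value` pairs; that layout is an assumption about the source of the comment, not something stated on this page. `parse_ensembl_comment` is a hypothetical helper name.

```python
# Comment string copied from this page.
COMMENT = (
    "pep:known_by_projection chromosome:gorGor3.1:8:144105188:144144018:-1 "
    "gene:ENSGGOG00000012217 transcript:ENSGGOT00000012283 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_ensembl_comment(comment: str) -> dict:
    """Split an Ensembl-style header comment into named fields.

    Assumed layout: 'type:status coord_system:assembly:name:start:end:strand key:value ...'
    """
    fields = comment.split()
    seq_type, _, status = fields[0].partition(":")
    coord_system, assembly, name, start, end, strand = fields[1].split(":")
    record = {
        "seq_type": seq_type,
        "status": status,
        "location": {
            "coord_system": coord_system,
            "assembly": assembly,
            "name": name,
            "start": int(start),
            "end": int(end),
            "strand": int(strand),
        },
    }
    # Remaining fields are simple key:value pairs (gene, transcript, biotypes).
    for field in fields[2:]:
        key, _, value = field.partition(":")
        record[key] = value
    return record


if __name__ == "__main__":
    rec = parse_ensembl_comment(COMMENT)
    loc = rec["location"]
    print(rec["gene"], loc["name"], loc["start"], loc["end"], loc["strand"])
```

Under that assumed layout, the strand value of -1 would place the gene on the reverse strand of chromosome 8 in the gorGor3.1 assembly.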
