SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000020513 from Equus caballus 69_2

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSECAP00000020513

Domain Number 1 Region: 2770-2933
Superfamily: Concanavalin A-like lectins/glucanases (E-value 2.08e-37)
Family: Laminin G-like module (E-value 0.0000000783)

Domain Number 2 Region: 2939-3120
Superfamily: Concanavalin A-like lectins/glucanases (E-value 9.31e-35)
Family: Laminin G-like module (E-value 0.000000025)

Domain Number 3 Region: 2338-2525
Superfamily: Concanavalin A-like lectins/glucanases (E-value 9.71e-31)
Family: Laminin G-like module (E-value 0.0012)

Domain Number 4 Region: 2522-2713
Superfamily: Concanavalin A-like lectins/glucanases (E-value 1.06e-26)
Family: Laminin G-like module (E-value 0.015)

Domain Number 5 Region: 2153-2329
Superfamily: Concanavalin A-like lectins/glucanases (E-value 1.32e-23)
Family: Laminin G-like module (E-value 0.001)

Domain Number 6 Region: 410-538,723-738
Superfamily: Growth factor receptor domain (E-value 0.00000000000863)
Family: Growth factor receptor domain (E-value 0.0063)

Domain Number 7 Region: 815-867
Superfamily: EGF/Laminin (E-value 0.000000000391)
Family: Laminin-type module (E-value 0.013)

Domain Number 8 Region: 865-918
Superfamily: EGF/Laminin (E-value 0.00000000352)
Family: Laminin-type module (E-value 0.0025)

Domain Number 9 Region: 1420-1471
Superfamily: EGF/Laminin (E-value 0.00000000391)
Family: Laminin-type module (E-value 0.014)

Domain Number 10 Region: 757-809
Superfamily: EGF/Laminin (E-value 0.0000000053)
Family: Laminin-type module (E-value 0.0038)

Domain Number 11 Region: 1477-1529
Superfamily: EGF/Laminin (E-value 0.000000019)
Family: Laminin-type module (E-value 0.019)

Domain Number 12 Region: 918-964
Superfamily: EGF/Laminin (E-value 0.0000000363)
Family: Laminin-type module (E-value 0.004)

Domain Number 13 Region: 287-346
Superfamily: EGF/Laminin (E-value 0.000000114)
Family: Laminin-type module (E-value 0.033)

Domain Number 14 Region: 1060-1108
Superfamily: EGF/Laminin (E-value 0.000000201)
Family: Laminin-type module (E-value 0.012)

Domain Number 15 Region: 1887-2115
Superfamily: Methyl-accepting chemotaxis protein (MCP) signaling domain (E-value 0.00000188)
Family: Methyl-accepting chemotaxis protein (MCP) signaling domain (E-value 0.0083)

Domain Number 16 Region: 967-1016
Superfamily: EGF/Laminin (E-value 0.00000446)
Family: Laminin-type module (E-value 0.011)

Domain Number 17 Region: 1014-1057
Superfamily: EGF/Laminin (E-value 0.0000117)
Family: Laminin-type module (E-value 0.01)

Domain Number 18 Region: 1527-1566
Superfamily: EGF/Laminin (E-value 0.0000335)
Family: Laminin-type module (E-value 0.012)

Domain Number 19 Region: 344-416
Superfamily: EGF/Laminin (E-value 0.0000335)
Family: Laminin-type module (E-value 0.022)
 
Weak hits

Sequence: ENSECAP00000020513

Domain Number - Region: 1378-1406
Superfamily: EGF/Laminin (E-value 0.000335)
Family: Laminin-type module (E-value 0.04)

Domain Number - Region: 90-169
Superfamily: Galactose-binding domain-like (E-value 0.000453)
Family: APC10-like (E-value 0.053)

Domain Number - Region: 1638-1875
Superfamily: Methyl-accepting chemotaxis protein (MCP) signaling domain (E-value 0.000667)
Family: Methyl-accepting chemotaxis protein (MCP) signaling domain (E-value 0.0091)

Domain Number - Region: 1119-1163
Superfamily: EGF/Laminin (E-value 0.000893)
Family: Laminin-type module (E-value 0.023)

Domain Number - Region: 552-585
Superfamily: Kinase-associated protein B-like (E-value 0.0876)
Family: Kinase-associated protein B-like (E-value 0.0091)
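
Most regions listed above are a single residue range, but Domain Number 6 (410-538,723-738) is split into two segments. The sketch below shows one way such region strings could be parsed and their covered length computed. The function names are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state explicitly.

```python
def parse_region(region):
    """Split a region string such as '410-538,723-738' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total residues covered by all segments.

    Assumption: coordinates are 1-based and inclusive, so a segment
    start-end covers end - start + 1 residues.
    """
    return sum(end - start + 1 for start, end in parse_region(region))


if __name__ == "__main__":
    # Domain Number 6 is the only discontinuous region on this page.
    print(parse_region("410-538,723-738"))
    print(region_length("410-538,723-738"))
```

Under the inclusive-coordinate assumption, a two-segment region simply sums the lengths of its segments; the gap between them is not counted.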
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000020513   Gene: ENSECAG00000023844   Transcript: ENSECAT00000024681
Sequence length 3123
Comment pep:known chromosome:EquCab2:10:75632384:75807336:1 gene:ENSECAG00000023844 transcript:ENSECAT00000024681 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPGAARLLLVLLLGGGLGGGRAQRPQQQRQPQAHQQRGLFPAVLNLASNAFITTNATCGE
KGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPFQRHPITNAIDGKNTWWQSPSIKNGIE
YHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDVEYKPWQYHAVTDTECLTLY
NIHPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARY
IRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDP
VTNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDENVA
RRNLSLNIHGKYVGGGVCINCTRNTAGINCETCVDGFFRPKGVSPNYPRPCQPCHCDPVG
SLNEVCVKDEKHARRGLAPGSCHCKPGFRGVSCDRCARGYFGYPDCKPCNCSGAGSTNED
PCFGPCTCKENVEGGDCSRCKSGFFNLQEDNHKGCDECFCSGVSNRCQSSYWTYGNIQDM
SGWYLTDISGHIRVAPRQGDLHPPQQISISSSEARRALPQSYYWSAPAPYLGNKLTAAGG
QLTFTISYDLEEEEEDSERVLQLMIILEGKDFRISTAQDEVYLQPSEEHIHALSLKEEFF
TIHGSSSPVSRKEFMTVLASLKRVLVQITYSLGVDAIFRLGSVALESAVPYPTDGSVAAA
VEVCQCPAGYTGSSCESCWPRHRRVNGTIFGGLCEPCQCFGHAESCDDITGECLNCKDHT
GGPYCDRCLPGFYGDPTKGTSEDCQPCACPLNTPSNNFSPTCHLDRSLGLICDKCPVGYT
GPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKPGTTGRYCEL
CADGYFGDAVGAKNCQPCGCNVNGSFSEICHTRTGQCECKPNVQGRRCDECKPKTFGVRS
GRGCVPCNCNSFGSKSFDCEENGQCWCQPGVTGKKCDHCAHGYFNFQEGGCTACDCSHLG
NNCDPKTGQCICPPNTIGEKCSKCAPNTWGHSITTGCKACNCSTVGSLDFQCNINTGQCK
CRPKFSGTRCTECNRGHWNFPHCTACACFLPGTDGSTCDSETRRCSCVDQTGQCTCKVNV
EGVHCDRCQPGKFGLDAKNPLGCSSCYCFRATTQCSEAKGLIRTWVTLKPEQTILPLVDE
ALQHTTTKGIAFQHPEIVADMDLVRQDLHLEPFYWKLPEQFEGKKLMAYGGKLKYAIYFE
AREETGFSTYNPQVIIRGGTPAHARIIIRHMAAPLIGQLTRHEIEMTEKEWKYYGDDPRI
SRTVTREDFLDVLYDIHYILIKATYGNIMRQSRISEISMEVAEQGRITPETPPARLIERC
DCPPGYSGLSCETCTPGFYRLRSEPGGRTPAPTLGTCVPCQCNGHSSLCDPETSICQNCQ
HHTAGDFCERCAVGYYGIVKGLPSDCQRCACPLISSSNNFSPSCVMEGLNDYRCSACPRG
YEGQYCERCAPGYTGSPSSPGGSCQECECDPHGSLPVPCDPVTGLCTCRPGATGRKCDGC
KHWHAREGAECVFCGDECTGLLLGDLARLEQMAMSINLTGPLPAPYKILYGLENTTQELK
HLLSPQRAPERLIQLAEGNLNTLVTEMNELLTRATKVTADAEQTGQDAERTNTRASSLTD
FIKELAQDAEAVNEKAVKLNETLGTQDKAFERNLQALQKEIDQMMTELRRKTLDTQKEVA
EDELVAAEGLLKKVQKLFGEPRGKNEEMEKDLREKLADYKNKVDDAWDLLREATDKIREA
NRLSAANQKNMTALEEKKEAIASGKRQTENTLKEGNDILDEANRLADEINSVIDYVEDIQ
TKLPPMSEELKDKIDDLSQEMKDRKLAEKVSQAESHAAQLNDSSAVLDGILDEAKNISFN
ATAAFKAYSNIKDYIDEAEKIAKEAKGLAHEATKLATGPQGSLKENAKGSLQKSFGTLNE
AKRLANDVKENDDRLNGLIARLENANERNGDLLRALNDTLGKLSAIPNDTAAKLQAVKDK
ARQANDTAKDVLAQIKDLHQNLDGLKKNYNQLADSVAKTNAVIKDPSKNSKIFADADATV
KNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYK
PEIKKGSYNNIIVNVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEY
PDLTIDDSYWYRIEASRTGRNGTISVRALDGPKASIVPSTHHAVSPPGYTILDVDANAML
FVGGLTGKLKKADAVRVITFTGCMGETYFDSKPIGLWNFREKEGDCKGCTVSPQVEDSEG
TIQFDGEGYALVSRPIRWYPNISTVMFKFRTFSSSALLMYLATRDLKDFMSVELTDGHIK
VSYDLGSGMASVVSNQNHNDGKWKSFTLSRIQKQANISIVDIDTNQEENMATASSGNNFG
LDLKADDKIYFGGLPTLRNLRLFNRPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGVTK
GCSLENVYTVSFPRPGFVELAPVPLDVGTEINLSFSTRNESGIILLGSGGTPAPPRRKRR
QTGQAYYAIFLNRGRLEVHLSTGTRTMRKIVVRPEPSLFHDGREHSVHVERTRGIFTVQV
DEDRRHMQNLTVEQAIEVKKLFVGGAPAEFQPSPLRNIPPFEGCVWNLVINSVPMDFARS
VSFKNADIGRCAHQKPHEDEDGAVPAEIVIQPEPVPTPASPTPTPVLAHGPCAAESEPAL
LIGSKQFGLSKNSHIAIAFDDTKVKNRLTIEFEVRTEAESGLLFYMARINHADFATVQLR
NGLPYFSYDLGSGDTNTMIPTKINDGQWHKIKITRIKQEGILYVDDASNRTVSPKKADIL
DVVGMLYVGGLPINYTTRRIGPVTYSIDGCIRNLQMAEAPADLEQPTSSFRVGTCFANAQ
KGTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRTTGVLLGISSQKMDGMGIEMIDEKLMF
HVDNGAGRFTAVYDAGIPGQLCDGQWHKVTANKIKHRIELTVDGNQVEAQSPNQASTSAD
TNDPVFVGGFPDGLNQFGLTTNIRFRGCIRSLRLTKGTGKPLEVNFAKALELRGVQPVSC
PAN
Identical sequences: F6R9F5, 9796.ENSECAP00000022822, ENSECAP00000020513, ENSECAP00000022822
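
The Comment line in this section packs several fields into space-separated key:value tokens. The following sketch shows how it might be split into a dictionary. The function name is hypothetical, and the reading of the chromosome token as assembly, sequence name, start, end and strand is an assumption based on the usual Ensembl FASTA header layout rather than anything stated on this page.

```python
def parse_comment(comment):
    """Split a space-separated 'key:value' comment line into a dict.

    Only the first ':' in each token separates key from value, so the
    chromosome token keeps its full colon-delimited location string.
    """
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(chromosome_value):
    """Break the chromosome value into named parts.

    Assumption: the order is assembly:name:start:end:strand.
    """
    assembly, name, start, end, strand = chromosome_value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


COMMENT = (
    "pep:known chromosome:EquCab2:10:75632384:75807336:1 "
    "gene:ENSECAG00000023844 transcript:ENSECAT00000024681 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

if __name__ == "__main__":
    info = parse_comment(COMMENT)
    print(info["gene"], info["transcript"])
    print(parse_location(info["chromosome"]))
```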
