SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSGACP00000010657 from Gasterosteus aculeatus 69_1

Domain assignment details

Strong hits

Sequence:  ENSGACP00000010657
Domain  Region     Superfamily                             Superfamily E-value  Family                 Family E-value
1       2870-3032  Concanavalin A-like lectins/glucanases  2.85e-36             Laminin G-like module  0.00000746
2       2705-2869  Concanavalin A-like lectins/glucanases  1.76e-32             Laminin G-like module  0.00000494
3       2297-2479  Concanavalin A-like lectins/glucanases  8.49e-29             Laminin G-like module  0.0016
4       2108-2285  Concanavalin A-like lectins/glucanases  2.51e-28             Laminin G-like module  0.0025
5       2478-2670  Concanavalin A-like lectins/glucanases  1.45e-21             Laminin G-like module  0.0082
6       718-770    EGF/Laminin                             0.00000000268        Laminin-type module    0.0063
7       776-828    EGF/Laminin                             0.00000000268        Laminin-type module    0.019
8       250-309    EGF/Laminin                             0.0000000753         Laminin-type module    0.05
9       826-880    EGF/Laminin                             0.000000131          Laminin-type module    0.0052
10      1437-1488  EGF/Laminin                             0.000000173          Laminin-type module    0.019
11      1378-1430  EGF/Laminin                             0.000000363          Laminin-type module    0.03
12      928-977    EGF/Laminin                             0.00000067           Laminin-type module    0.0087
13      307-364    EGF/Laminin                             0.00000195           Laminin-type module    0.034
14      377-434    EGF/Laminin                             0.00000279           Laminin-type module    0.032
15      879-922    EGF/Laminin                             0.00000307           Laminin-type module    0.012
16      446-483    EGF/Laminin                             0.0000126            Laminin-type module    0.012
17      1021-1069  EGF/Laminin                             0.000013             Laminin-type module    0.015
18      975-1018   EGF/Laminin                             0.0000502            Laminin-type module    0.013
 
Weak hits

Sequence:  ENSGACP00000010657
Domain  Region     Superfamily                                                 Superfamily E-value  Family                                                      Family E-value
-       54-137     Galactose-binding domain-like                               0.000172             APC10-like                                                  0.071
-       1486-1524  EGF/Laminin                                                 0.000558             Laminin-type module                                         0.011
-       1326-1358  EGF/Laminin                                                 0.00184              EGF-type module                                             0.066
-       1067-1124  EGF/Laminin                                                 0.00307              Laminin-type module                                         0.021
-       682-720    EGF/Laminin                                                 0.00419              EGF-type module                                             0.063
-       1742-1833  Methyl-accepting chemotaxis protein (MCP) signaling domain  0.0178               Methyl-accepting chemotaxis protein (MCP) signaling domain  0.026
-       1933-2101  Bacterial hemolysins                                        0.034                HBL-like                                                    0.057
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by the dcGO Predictor.

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSGACP00000010657; Gene: ENSGACG00000008026; Transcript: ENSGACT00000010678
Sequence length: 3062
Comment: pep:novel group:BROADS1:groupXVIII:7114170:7217010:-1 gene:ENSGACG00000008026 transcript:ENSGACT00000010678 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
GLFPAVLNLASMADISANATCGTLGPEMFCKLVEHVPGQPIRNPQCRICNQRSSNPFEQH
PIEYATDGTNRWWQSPSIKNGMEYHYVTITLDLKQVFQIAYIILKAANSPRPGNWVLERS
IDGVTFEPWQYYAITDTECLTRFSINPRTGPPSYTRDDEVICTSFYSKIHPLENGEIHTS
LINGRPSAADPSPTLLNFTSARYIRLVFQRIRTLNADLMTLTLHDPRDSDPIVTRRYYYS
IKDISVGGMCICYGHAKACPLNTATKKFSCECEHNTCGESCDRCCPGYNQQPWMAGTFLT
RHVCEKCNCHDKADECYFNQTVADLSLSLNIQGQNSGGGVCIGCRDNTTGINCETCVDGF
YRPAGVNAEDEDPCIPCSCDPLGSISQSCLPVSSKATPIQPAGSCPCKEGFGGLQCDRCA
VGYTGFPFCQRCNCSMEGSTNNDPCVTPCMCKENVEGDNCDRCKPGFYNLQGDNLRGCEK
CSCMGVSSRCSASSWTYQDETTLTGWHLLGETGGRVWSVHRQTPPYLSVRHSDVVDDLGS
AYYWNAPGLYLGNKLSAYGGTLVYTVSYATDQQEQTAIRVTSQPDLVIEGGGIKIIDRRF
GQPVYPSSPRTNRIDLLPENFLVSESVQPISRRDFLSVLANVTSVMVRASYSTEPSAAYR
LHAFSMQVANPSAGGERRASAVESCSCPPGYAGTSCEACIPGFRRINGNLYNGVCEACYC
HGHTSQCHELTGHCLDCAHHTTGPHCDSCLPGYYGNASRGSPADCQPCACPLHLPSNNFS
PTCHVGVEGELLCDQCQPGYTGPRCDRCSNGYYGQPNVPGGSCRPCDCHGNLDLSKPGSC
DPITGQCLRCRQGYGSASCDSCADGYYGDAIVAKNCQPCQCHTNGSESEVCDKETGRCQC
MKDVTGRQCDECIPKTHGLSTGGRCLPCNCNSFGSKSFDCDETGQCRCQPGVAGPKCDRC
SRGFFNFQEGGCTPCNCSHVGNNCDTNTGQCICPPNTIGGRCDYCSPNHWGHDIITGCKA
CGCSVIGSVTQQCNVNTGCCMCHNSFRGEKCNECQIGYRDFPQCTQCECNVSGSDSQTCD
PERAVCACADRTGKCNCKANVEGDNCDRCKPDTFGLSVRNPLGCSNCYCYGLTRSCTEAQ
GLIRMWLALKPEQTVLPLVDKSNTVETRRGVSFQHPEILARTELVTPTLAEPYYWKLPEQ
FTGSMITAYGGKLKYAVYYEARDETGPSSYDPQVIINGGPNRNILMTRHAPGLQIGQLTR
HEIDMTEHEWKYADGRSMTREDFMDILFYVDYILIKASHGNLMRHISEISLTVAEEGTPN
KDSEKAHQIEKCDCPIGYSGFSCEECAAGFYRLRAGSPTSASASRLPTAAGMGSCVQCQC
SGHSSTCDPETSICLHNCQGNTEGDRCERCSAGFYGVVRGFHDDCKPCACPLTNPENNFS
PTCVTEGYDDYQCTACPEGYEGKYCERCATGYHGNPRMPGGHCEECKCSSWGALPGPCDP
VTGQCRCTVGASGTSCDQCMERHVCGPAGIISCDDECSGLLISDMDRLHRIIADVTLTTP
LPPPYKVLYRFENMTEELKHMLSPPKALERLLQLADSNLGSLVVEMDQLHSRSTKVSADG
EQVEDDADRIHKRAEDLEHFISSIQMSLDLELKAADLNRTLSRRDGTPEKSLKEMKEEIQ
AMLAEMRERQLGGKKSIAEEEMGLAEQLYQKVKRLFGDPHQTTEDLKAEIKEKLSDHEGK
LKEAQDLLHSAQGKTRQAGKLAEQNQANLTALERKRSAVNGLRQEAQKILGEGERLLDEA
NQLSGDINEEKEDLEKMAKELNPLHDQLQDRVSHLSGGLGDGGLASRVHDAERHAEQLNE
SAAILDGILAEAKNLSFNATAAFKAYSNIKANVDAAEKEAKAAKQRASEALALVSDPEVK
EAARGALQKSHRLLNQAKQLQNDVKENTDSVAGLKGRVKAARDKAKDLLKAVNGTMATLN
AIPDGMTAHTQSVRARKSDSSQTHRNTLQGDFSISSEVLTQSYNIGEINQKLTFGFSVSL
FFSIPTAIIHAAGAKVKVLEDEADRLLEKLEPIKKLQDNLRRNISQIKELINQARKQANS
IKVSVSSGGDCLRSYRPDIRKGRYNTIILNVKTTTPDNLLFYLGSVQYVDFLALEMRQGK
VNFLWDVGSGVGRVEYPHHTLHDGNWHRIEASRNGLNGTISVYPLEGPMAGMMPTPASAN
SPTAFTILDVDQKAYLFVGGTNGAAKIADVVRTTTFSGCMGETFLDGKPIGLWNYREREG
DCKGCVVSPQRSDGEGTVQLDGEGYAAVGRPTRWNPNVSSVTFKFRTFSSESLLMYLATE
DMKDFMSLELSQGKVKVNFDLGSGVGSATSANRHNDGLWKSLTMSRNKKQVHKATVTVVD
IDSGVEEKIVSSSQGSATGLNLKENQKLYFGGLPTVGNYRSEVTLKRYAGCLREIEISRT
PYSLLRSSDYTGITKGCNVDKLYTVSFSKPGYMELAGLSLAVGTEISLSFSTLVDTGTIM
LAVGGASPISLQARREGHIPYLSVLLNKGSLEVLLFTGSHSPRRVTRRPEQGALNDGREH
SLRIERLAGRSFVVQVDEETRREAALPNDQPVSLQRLFLGGIPAKVEQTSNRANVPFQGC
IWNLMVNAVLSDFSLPVSFENAEIGQCPSLAPEGSKAQLSHKPKALTDSCASAAAPTVLD
NAYQFGLTRNSHMKFAFDDAKVREWLILEFELRTKEDSGLVLYMARINHADFVAIQIKDG
QVCLGYDLGSGNISGCVPFSINDGNWHTIRVSRNKQRGLLLVDGRYSKHMNSPKKADLLD
VVGMLYVGGFPENYTSKRIGPILYSINGCIRGLKMVGGAMNMAAPTSSHMIARCFVATET
GTYFDGTGYLKAVSSYRVGLDVSIAFEFRTSRTSGVLLAISNQGNDGLGLEIVGGKLLFH
VDNGAGRISAEHAPEGEGFCDGQWHSVTAEKLRHRAQLVVDGRQSQAESSNARSNTCDTN
DPIYVGGYPDGVRQAALSTRTSFKGCLRNLRITKASKTMEVQFNKALEIKGVQPLSCPAV
AA
Identical sequences: G3NZ85, ENSGACP00000010657, 69293.ENSGACP00000010657
