SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000021239 from Gasterosteus aculeatus 76_1

Domain assignment details

Strong hits

Sequence: ENSGACP00000021239

Domain  Region   Superfamily                             Superfamily E-value  Family                 Family E-value
1       169-360  Concanavalin A-like lectins/glucanases  1.12e-25             Laminin G-like module  0.013
2       385-586  Concanavalin A-like lectins/glucanases  7.28e-24             Laminin G-like module  0.0057
3       773-814  EGF/Laminin                             0.00000000000271     EGF-type module        0.0064
4       619-664  EGF/Laminin                             0.000000126          EGF-type module        0.036
5       655-696  EGF/Laminin                             0.000000126          EGF-type module        0.01
6       689-729  EGF/Laminin                             0.000000429          EGF-type module        0.021
7       1-143    Concanavalin A-like lectins/glucanases  0.000000515          Laminin G-like module  0.028
8       364-403  EGF/Laminin                             0.000000774          EGF-type module        0.025
9       737-778  EGF/Laminin                             0.00000727           EGF-type module        0.023

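The strong hits above appear in order of increasing superfamily E-value, not in order along the protein. The following is a minimal Python sketch for reordering them by start position and reporting where neighbouring regions share residues. It assumes the regions are 1-based, inclusive residue coordinates; the names Domain, by_position and overlaps are illustrative and are not part of SUPERFAMILY.

```python
from typing import List, NamedTuple, Tuple


class Domain(NamedTuple):
    number: int        # "Domain Number" as shown in the listing
    start: int         # first residue of the region (assumed 1-based)
    end: int           # last residue of the region (assumed inclusive)
    superfamily: str


# Regions copied from the strong hits listed for ENSGACP00000021239.
DOMAINS = [
    Domain(1, 169, 360, "Concanavalin A-like lectins/glucanases"),
    Domain(2, 385, 586, "Concanavalin A-like lectins/glucanases"),
    Domain(3, 773, 814, "EGF/Laminin"),
    Domain(4, 619, 664, "EGF/Laminin"),
    Domain(5, 655, 696, "EGF/Laminin"),
    Domain(6, 689, 729, "EGF/Laminin"),
    Domain(7, 1, 143, "Concanavalin A-like lectins/glucanases"),
    Domain(8, 364, 403, "EGF/Laminin"),
    Domain(9, 737, 778, "EGF/Laminin"),
]


def by_position(domains: List[Domain]) -> List[Domain]:
    """Return the domains sorted by start coordinate."""
    return sorted(domains, key=lambda d: d.start)


def overlaps(domains: List[Domain]) -> List[Tuple[int, int, int]]:
    """Return (first, second, shared_residues) for neighbouring regions that overlap.

    Only regions adjacent in positional order are compared, which is
    sufficient for this listing but not for arbitrary nested regions.
    """
    ordered = by_position(domains)
    found = []
    for left, right in zip(ordered, ordered[1:]):
        shared = left.end - right.start + 1
        if shared > 0:
            found.append((left.number, right.number, shared))
    return found


if __name__ == "__main__":
    for d in by_position(DOMAINS):
        print(f"{d.start:>4}-{d.end:<4} domain {d.number}: {d.superfamily}")
    for first, second, shared in overlaps(DOMAINS):
        print(f"domains {first} and {second} share {shared} residues")
```

Read this way, the assignment runs from a Concanavalin A-like region at the N-terminus through alternating Concanavalin A-like and EGF/Laminin regions to a run of EGF/Laminin regions towards the C-terminus. Several of the neighbouring EGF/Laminin boundaries overlap by a few residues.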

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000021239   Gene: ENSGACG00000016101   Transcript: ENSGACT00000021280
Sequence length 852
Comment pep:known_by_projection group:BROADS1:groupIII:9387955:9393439:-1 gene:ENSGACG00000016101 transcript:ENSGACT00000021280 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LSIELTDGHLSLRSLRGQGSSTLVQELPEHLSGNKWHTVEASLGGVVSLIRLLCTEGNCT
RDHSAKVQLLEPASALPEAGAVRQSLFIGAVGGSRASARAAGEAGYPSSFLGCFRDVFVD
SRLVWPAVSAEASDVQENITAGCSDKDKCEDGPCQNRGRCVSRGWRSYRCECHRPYEGTN
CEDEYITARFGNKDLESYAVFSLDDDPNDAVTISMFIRTRQASGILLILANSTSQYLRLW
QEEGRVKVQVNNFETLIGRGAVNDGHFHLVTVKLEGMAAVLIQSAQSRGSMPIRPIRTHP
GDLVFVGGLLDARASASFGGYFKGCVQDLRINSKPLQFYPIATPVESYSLERLVNVARGC
SSENACAVNPCLNRGVCYSMWDDFICNCPPNTAGQRCEEVLWCELSPCPAMAVCQPLSQG
FECLSNVTFSVESSVLHYQTNGQIKRGLRSVSLRFRTRQASATLIHAQRDSDYLTVSLLN
AHVVMELQSGAGKDLHKATVQSKGLIRDGEWHTLALSMENQSQHSRWILTVDGEEKERGV
STTAAGNLDFIRERADIFLGGLSVDTGVNLTGCLGPVEIGGLALPFYLNTELKLPRPQGE
KFARTNAGREPRHGCWGASVCAPNPCNNQGVCEDLFDLHRCTCSSEWTGPLCQHPTDSCF
SSPCVFGHCTNVPGAFECACEPGFSGERCEVEVDMCEYSRCSQGASCLRGFKSYACLCPQ
NLTGEYCELPQLPVSTCTGTRWDYSCFNGGNCSEADDSCFCPSGFTGQWCEKDVDECVSD
PCMNGGFCINYVNSFECVCDMNYSGVHCQIDVSDFYLYLFLGLWQNLFQLVSYLVIRLDD
DPEIDWGFYIND
Identical sequences G3PUF2
ENSGACP00000021239 69293.ENSGACP00000021239 ENSGACP00000021239
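To relate a listed region back to the sequence, the wrapped lines can be joined and sliced by residue range. The sketch below is a hedged illustration: it assumes regions are 1-based and inclusive, it includes only the first three sequence lines (180 of the 852 residues) for brevity, and join_wrapped and slice_region are hypothetical helper names.

```python
from typing import Iterable

# First three wrapped lines of the protein sequence (residues 1-180 only).
WRAPPED = [
    "LSIELTDGHLSLRSLRGQGSSTLVQELPEHLSGNKWHTVEASLGGVVSLIRLLCTEGNCT",
    "RDHSAKVQLLEPASALPEAGAVRQSLFIGAVGGSRASARAAGEAGYPSSFLGCFRDVFVD",
    "SRLVWPAVSAEASDVQENITAGCSDKDKCEDGPCQNRGRCVSRGWRSYRCECHRPYEGTN",
]


def join_wrapped(lines: Iterable[str]) -> str:
    """Concatenate wrapped sequence lines into one string."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence: str, region: str) -> str:
    """Return the residues covered by a 'start-end' region.

    Coordinates are assumed to be 1-based and inclusive.
    """
    start, end = (int(part) for part in region.split("-"))
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {region} lies outside 1-{len(sequence)}")
    return sequence[start - 1:end]


if __name__ == "__main__":
    partial = join_wrapped(WRAPPED)
    segment = slice_region(partial, "1-143")  # region of domain 7
    print(len(segment), segment[:10])
```

With the full sequence joined in the same way, the length of the result can be compared against the stated sequence length of 852 before slicing any of the other regions.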