SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000013049 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000013049
Domain Number 1 Region: 676-891
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.69e-104
Family Thrombospondin C-terminal domain 0.000000116
Further Details:      
 
Domain Number 2 Region: 559-673
Classification Level Classification E-value
Superfamily TSP type-3 repeat 1.12e-24
Family TSP type-3 repeat 0.0000244
Further Details:      
 
Domain Number 3 Region: 491-561
Classification Level Classification E-value
Superfamily TSP type-3 repeat 0.0000000000000183
Family TSP type-3 repeat 0.00035
Further Details:      
 
Domain Number 4 Region: 196-232
Classification Level Classification E-value
Superfamily Assembly domain of cartilage oligomeric matrix protein 0.000000000275
Family Assembly domain of cartilage oligomeric matrix protein 0.0017
Further Details:      
 
Domain Number 5 Region: 26-122
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000264
Family Laminin G-like module 0.031
Further Details:      
 
Domain Number 6 Region: 422-502
Classification Level Classification E-value
Superfamily TSP type-3 repeat 0.000000445
Family TSP type-3 repeat 0.00023
Further Details:      
 
Domain Number 7 Region: 243-298
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000318
Family EGF-type module 0.021
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000013049
Domain Number - Region: 333-372
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000246
Family EGF-type module 0.015
Further Details:      
 
Domain Number - Region: 380-422
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00671
Family EGF-type module 0.051
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000013049   Gene: ENSGACG00000009878   Transcript: ENSGACT00000013074
Sequence length 903
Comment pep:known_by_projection group:BROADS1:groupXIII:9656284:9662076:-1 gene:ENSGACG00000009878 transcript:ENSGACT00000013074 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VYDLLVSPDCLPDLLQGSLKNKGRDEAFLLSSFRIHSKAPTSLYSVINPKDNSKYLEVSV
QAKLSKVTMRYQRTDGRFVTTSFNHGSLADGQDHHVMLHARGLQGGPPRLNIYVDCRLVH
SLDDLPATFGSLPPGPNRVALRTLQSSSKDELTDLKLVIEDTIDNVATLQDCSMDQRESL
QLLSIQGARTEHDQASMEELRSMFSEMKELLIKQIKETTFLRNTITECLACGRGPASQPL
TQCPPGTCFRQNMCVLSESGAFQCASCPEGFTGDGVHCDDVDECKFNPCYPGVRCVNTAP
GFRCEKCPLGYNGPEINGVGVSYAKSRKQVCDDIDECLGPPKDGSCTENSHCYNTIGSFR
CGECKAGFTGDQVGGCHGTRLCPNGQPSPCHTNGECVVERDGSISCVCGVGWAGNGYVCG
KDTDIDAYPDITLQCSDTNCKQDNCISVPNSGQEDADRDGLGDSCDDDADSDGIINIDDN
CWLVPNVDQKNSDKDLLGDACDNCRTVENPLQRDTDQDGLGDDCDDDMDGDGIKNALDNC
QRVPNRDQEDRDNDVVGDACDSCPDVPNPNQSDSDDDLVGDTCDDNIDSDGDGHQNTKDN
CPTFINSAQLDTDKDGMSSRACWNIDGSVLSRQVPNPDQKDTDDNNVGDACEGDFDKDSV
IDIIDHCPENAEVSLTDFRAYQTVVLDPEGDSQIDPNWVVLNQGMEIVQTMNSDPGLAVG
YKAFSGVDFEGTFHVNTVTDDDYTGFIFGYQDSSSFYVVMWKQTEQTYWQAAPFRAVAEP
GIQLKVVKSKTGPGEYMRNSLWHTGDTPNQVRLLWKDPRNVGWKDKVSYRWFLQHRPQVG
YIRARFFEGSKLVADTDVIIDTSMRGGRLGVFCFSQENIIWSNLKYRCNDTIPADYKDLS
AQN
Download sequence
Identical sequences G3P626
69293.ENSGACP00000013049 ENSGACP00000013049 ENSGACP00000013049

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]