SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000020649 from Gasterosteus aculeatus 69_1

Domain assignment details

Strong hits

Sequence:  ENSGACP00000020649
Domain Number 1 Region: 715-747
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000177
Family EGF-type module 0.009
 
Domain Number 2 Region: 619-651
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000559
Family EGF-type module 0.019
 
Weak hits

Sequence:  ENSGACP00000020649
Domain Number - Region: 651-683
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000402
Family EGF-type module 0.023
 
Domain Number - Region: 445-491
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000615
Family EGF-type module 0.027
 
Domain Number - Region: 778-812
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000629
Family EGF-type module 0.031
 
Domain Number - Region: 479-514
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000829
Family EGF-type module 0.013
 
Domain Number - Region: 683-714
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000967
Family EGF-type module 0.016
 
Domain Number - Region: 747-776
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0011
Family EGF-type module 0.018
 
Domain Number - Region: 587-617
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00121
Family EGF-type module 0.017
 
Domain Number - Region: 356-444
Classification Level Classification E-value
Superfamily Cadherin-like 0.00785
Family Cadherin 0.058
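
The hits above are listed by E-value rather than by position, so their arrangement along the sequence is easy to miss. A minimal sketch, assuming the regions and superfamily E-values are transcribed by hand from this page (the tuple layout and variable names are illustrative and not part of any SUPERFAMILY API), sorts the hits along the sequence and flags neighbouring regions that share residues:

```python
# Hits transcribed from the listing above: (start, end, superfamily, e_value).
# The tuple layout is an illustrative assumption, not a SUPERFAMILY format.
hits = [
    (715, 747, "EGF/Laminin", 1.77e-5),    # strong hit, domain 1
    (619, 651, "EGF/Laminin", 5.59e-5),    # strong hit, domain 2
    (651, 683, "EGF/Laminin", 4.02e-4),
    (445, 491, "EGF/Laminin", 6.15e-4),
    (778, 812, "EGF/Laminin", 6.29e-4),
    (479, 514, "EGF/Laminin", 8.29e-4),
    (683, 714, "EGF/Laminin", 9.67e-4),
    (747, 776, "EGF/Laminin", 1.1e-3),
    (587, 617, "EGF/Laminin", 1.21e-3),
    (356, 444, "Cadherin-like", 7.85e-3),
]

# Sort by start coordinate (tuples compare on their first element first).
ordered = sorted(hits)

# Adjacent regions overlap when the next start is not past the previous end
# (coordinates are treated as 1-based and inclusive).
overlaps = [(a, b) for a, b in zip(ordered, ordered[1:]) if b[0] <= a[1]]

for start, end, superfamily, e_value in ordered:
    print(f"{start:>4}-{end:<4} {superfamily:<14} {e_value:g}")

for a, b in overlaps:
    shared = a[1] - b[0] + 1
    print(f"{a[0]}-{a[1]} and {b[0]}-{b[1]} share {shared} residue(s)")
```

Sorted this way, the single Cadherin-like hit (356-444) precedes a run of EGF/Laminin hits spanning 445-812.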
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSGACP00000020649, Gene: ENSGACG00000015657, Transcript: ENSGACT00000020688
Sequence length: 844
Comment: pep:novel group:BROADS1:groupI:27978021:27986072:-1 gene:ENSGACG00000015657 transcript:ENSGACT00000020688 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
SAGRPRRAASFEFQPVLAARGLSRADLESFAYFFPEDHPAEARPEVRPQWPTPSGMTSAK
ALQVCRAALADSTVGAVCRGLLGRRLEEAVDLCMLDLQLKDDLGWEEALLPHLENECERR
LLENRTHRALELAAPLGKTGEVVRALRCPNFCNSNGECTEGGCQCYPTHSFYDCSLAISQ
PAELTDLENGGLCDIRAFSCRSVRVFGLGFVDTPDLSCHANRLKQVNGVWVEGEKQRTKA
NFLSSRALDCAVPSLSNAAVNTEDFLMVDKPYARWEIQVTNDGSQHSQAKVLTIYDGVCQ
VCQASRSGLCKLKERTCNIDGMCFAEGVSSPSSPCLLCDPETSKFSWSVNQVNEPPAFHR
PHGDLRTFAGENFVFQLAASDPEGSALLFQLEEGPKGAVLSPAGLLIWRVPSLHETEGPQ
TFRFTLSDECNAQSTFAVEIQVVNCGCQHGGTCVTDVSFPAGSGKHLCVCPEGRWGDLCN
ERADPCRSAPCEAGTCTDTGRGFSCDCPAGLQGRSPGVKVNPSTTNTGTLKNTSGVSDTA
AGSGRPASGTASSVCRHVCGRNMECAAPNTCRCKTGYSGSDCHTAICQPDCANGGVCVAP
GVCRCPTGFHGETCQEALCRSPCENGGSCVGPQTCSCPYGFVGPRCETMVCSRHCHNGGR
CASPDECACPSGWSGPSCETALCSPVCLNGGSCARPDVCECPRGFYGVRCQSGVCSPPCK
NGGVCLRSSACSCLQGYTGRRCQISVCEPPCVNGGRCVGPDVCDCPSGWRGKSCDKPSCL
QKCANGGECVGAGACHCAPGWHGMLCQIPVCEQTCPPGGRCVRPNVCACRSGSLCSRRVR
TAKH
Identical sequences: G3PSR3, ENSGACP00000020649, 69293.ENSGACP00000020649
