SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000002355 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000002355
Domain Number 1 Region: 1118-1340
Classification Level Classification E-value
Superfamily Kelch motif 3.92e-34
Family Kelch motif 0.0038
Further Details:      
 
Domain Number 2 Region: 1035-1105
Classification Level Classification E-value
Superfamily Kelch motif 0.000000772
Family Kelch motif 0.035
Further Details:      
 
Domain Number 3 Region: 607-642
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000117
Family EGF-type module 0.014
Further Details:      
 
Domain Number 4 Region: 742-791
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000195
Family Laminin-type module 0.005
Further Details:      
 
Domain Number 5 Region: 831-935
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.00000329
Family Spermadhesin, CUB domain 0.0049
Further Details:      
 
Domain Number 6 Region: 1731-1781
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000117
Family Laminin-type module 0.015
Further Details:      
 
Domain Number 7 Region: 1652-1699
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000542
Family EGF-like domain of nidogen-1 0.081
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000002355
Domain Number - Region: 1824-1860
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00148
Family Laminin-type module 0.029
Further Details:      
 
Domain Number - Region: 714-744
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0123
Family Laminin-type module 0.034
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000002355   Gene: ENSGACG00000001806   Transcript: ENSGACT00000002362
Sequence length 2329
Comment pep:known_by_projection scaffold:BROADS1:scaffold_122:271493:283730:1 gene:ENSGACG00000001806 transcript:ENSGACT00000002362 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LSLAGGNVHIHYQEEKCYDEEIFFYHLGCHQWVSAGERWSLRRCPYFRLDSRASSDTGRY
SHVAAAMEGRVLLVAGGYSGVARGDLVAYKVPLFVSSDQGDRVSDGDAVCAEALDESMCL
KNPECSWDQQCSSSLPRPAFAAGAGFSVRCGVMRLSGGHMCSLCADPLPLGLAERSACRV
DQISGAYGWWGERPRFLTSLHSCRTENYVPGLHLLTFQHPRNDSQPDKVSILRSTTISLS
PTTEMDVALQFRGFVHPLWGAPPPAPAPTETVSMWARIQGLHFEAQMASGPNSRQLEAVG
RWAAPQERASMLLARTDGSRLFSNLTRGNHYLVQAEGYLNNSGSGQTSEMALIWNRTLPG
KSEISFLFLEPYRSGSCSAYTSCLACLSDQSCGWCPSLSRCLLRESPEPCPEGGKAEGQR
HLLLAPQHCTLCEEYRDCSACTQDPFCEWQINSSKKGDYQCSRRGRLDGSIRDPAGCPKI
CNQRKTCGECLSNSSQCAWCESAQACFYFAAYLTKYPYGECRDWYDSVHSVPQCKQCSAL
STCTDCLRTFQCGWCGDYNNPTIGKCLRGDWAGVDDPSAYNCSVAVAEARAANISGNAVW
SYPTCPDVEECRLGLHNCHPFATCINTPTSYECHCERGYTGDGTLHCNQTCYNECREGQC
SGSPRFECECSLGWTSDPATLVLSGVGCDVDCGLPLSFSLGFRAPGILPSLSSDWTVGPH
CEHCRPGSFGSALAGGGGCVQCECNGHGDPARGYCHNQTGQCYCTHNTQGAHCEACLAGY
YGDPRNNGTCYRQCQGRSVLLSSSSSSSAAVPLSSSLGWRSGTEGRGGLSHCLWVLSVTE
KLAPCPPRQPCPPVALTLHPDSHTHCKSSYVYVFDGLPRFLSNGVVHSDHNLIGAFCGTT
RTQPITVEATSGVISVYFEANVSSTKPQGFNASFWVRRCLQSADEGGEGSPACPGGALCQ
GGLCQCPKGYGGPHCDRPICPQDCGIAEGRGACNTSLGVCVCSPGWAGSDCSVQRDASSL
VWETLLDTQLTANQAHRFLHRMGHSLVSGPQGTLWMYGGLSLSEGILGNVYRYAVSERRW
TQMLTSSVEEGSTPGPRYHHAAALLTSRESGSGGHGASHDFMLVTGGVTQNGVAMDTWSL
NLSSLVWREHKSSLVPPLAGHTLTVRRDSSVLLVGGYSPDNGFNHHLLEFSPRTGNWTTV
SHTGTPPTGLYGHSAVYHEQTDAIYVFGGYRFHVETVEPSGELYSLYYPNLTWSLLVPSQ
GNKPVSRFFHAAAMIKDTMVIVGGRTEAEDYSNSVSLYQINCNTWIQPVSVVGDPVNRSV
SLAVTSWGDRLYLSGGFNGVTLGRLLTLSVPADPCALLPTPEACNVTTGSCVWCRGSCAS
SDAAERMACFTGQSSCSPTPRQPDQCRRLKTCSECLARHPKTFSAPSQPALQCKWCTNCP
EGACISSSVSCTSEHDCRINQREIFLSSNCTETSCEASDCPKCTASGKCMWTRQFKRTGE
TRRILSVNPTYDWTCFSYALLNVSPMQVESSPPMPCPPPCHSLQNCSLCLGSRGSDGGWQ
HCVWSMALQQCMSPSFVPLRCEAGQCGRLLSGGDSCSPQCPQLTQCSQCIARPQCGWCGT
QVGNGAGRCLQGGLDGVSEGVCPLVNSSWSFLHCPPEDECANGHHHCNSTQDCHDQPQGY
HCTCKQGYILSSVSGQCEPVCAQGCVNGTCVSPGVCQCHFGFVGENCSAQCSCNKHSNCA
GVSKPDVCLECHNNTVGRHCEKCKPLFVGSAKGGGTCRSCREFCRGNSAVCLSGDYIKKA
LENPGQYHLEPSSIPGLMSEGPTEENAVCVNCQNNSVGDKCESCLSGFFLLQGKCEKCQC
NGHADTCNEHDGTGCPCQNNTETSSCLSSPQSDRKDCYRQQCAKCKDSFNGTPVNGRQCY
RQFNVDTECCFDPTSQANCFHDPAIRNLPRGRTVFFSAQPKFTNVDIRVTIDVTFGEVEV
YVSNSHDIFIVDVDRHSGVHAIRIEEEAVTQKESPPSPIKVWANASSGLGGPVLSHTPLQ
LQGKVPGAEREITWERAEGLITYITVWKPQTVLIVRGVRDRVVITFPHEVHSLKSSRFYI
ALRGVGTDQRQGESQGLLFLRQDQAHIDLFVFFSVFFSCFFLFLSVCVLLWKVKQFLDFR
REQRRHIQEMTKMASRPFAKLTIYFEPEEPQLIYLPSPPPPPPPHPGWRGHRWPAPAPAL
LEHPPLPQRHFWPPGVPPSPPLMGFSYSTFKVGPITLEPTDDGMAGVATVLFQLPGGVLA
PNRACLGSALVTLRQNLQEYCGHGNGGGGHPGAGRKGLLGHQHLTTMAM
Download sequence
Identical sequences G3NAL0
ENSGACP00000002355 69293.ENSGACP00000002355 ENSGACP00000002355

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]