SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000017593 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000017593
Domain Number 1 Region: 3135-3314
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.68e-29
Family Laminin G-like module 0.0026
 
Domain Number 2 Region: 2969-3118
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.08e-21
Family Laminin G-like module 0.0073
 
Domain Number 3 Region: 2742-2875
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.13e-17
Family Laminin G-like module 0.0064
 
Domain Number 4 Region: 2579-2739
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000000000489
Family Laminin G-like module 0.012
 
Domain Number 5 Region: 2361-2565
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000000000457
Family Trypanosoma sialidase, C-terminal domain 0.092
 
Domain Number 6 Region: 1325-1376
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000248
Family Laminin-type module 0.01
 
Domain Number 7 Region: 1235-1283
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000335
Family Laminin-type module 0.0074
 
Domain Number 8 Region: 254-316
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000213
Family Laminin-type module 0.03
 
Domain Number 9 Region: 1724-1762
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000865
Family Laminin-type module 0.038
 
Domain Number 10 Region: 314-370
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000001
Family Laminin-type module 0.015
 
Domain Number 11 Region: 641-680
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000109
Family Laminin-type module 0.015
 
Domain Number 12 Region: 494-532
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000176
Family Laminin-type module 0.012
 
Domain Number 13 Region: 1657-1706
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000018
Family Laminin-type module 0.003
 
Domain Number 14 Region: 1374-1422
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000019
Family Laminin-type module 0.012
 
Domain Number 15 Region: 449-496
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000642
Family Laminin-type module 0.033
 
Domain Number 16 Region: 588-638
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000753
Family Laminin-type module 0.01
 
Domain Number 17 Region: 1806-2095
Classification Level Classification E-value
Superfamily Tropomyosin 0.0000353
Family Tropomyosin 0.00051
 
Domain Number 18 Region: 384-427
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000474
Family Laminin-type module 0.021
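
The strong hits above appear in order of increasing superfamily E-value, not in order of position, so the layout of domains along the protein is easier to read after sorting by region start. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the tuples are transcribed by hand from the strong hits listed above, and the helper names (architecture, collapse) are hypothetical.

```python
# Superfamily names as they appear in the listing above.
CONA = "Concanavalin A-like lectins/glucanases"
EGF = "EGF/Laminin"
TROPO = "Tropomyosin"

# (domain number, region start, region end, superfamily), transcribed by hand.
strong_hits = [
    (1, 3135, 3314, CONA), (2, 2969, 3118, CONA), (3, 2742, 2875, CONA),
    (4, 2579, 2739, CONA), (5, 2361, 2565, CONA), (6, 1325, 1376, EGF),
    (7, 1235, 1283, EGF), (8, 254, 316, EGF), (9, 1724, 1762, EGF),
    (10, 314, 370, EGF), (11, 641, 680, EGF), (12, 494, 532, EGF),
    (13, 1657, 1706, EGF), (14, 1374, 1422, EGF), (15, 449, 496, EGF),
    (16, 588, 638, EGF), (17, 1806, 2095, TROPO), (18, 384, 427, EGF),
]


def architecture(hits):
    """Return superfamily names ordered by region start (N- to C-terminus)."""
    return [name for _, _, _, name in sorted(hits, key=lambda hit: hit[1])]


def collapse(names):
    """Collapse consecutive repeats into (name, count) pairs."""
    runs = []
    for name in names:
        if runs and runs[-1][0] == name:
            runs[-1] = (name, runs[-1][1] + 1)
        else:
            runs.append((name, 1))
    return runs


for name, count in collapse(architecture(strong_hits)):
    print(f"{count:2d} x {name}")
```

Read from N- to C-terminus, the 18 strong hits group into 12 EGF/Laminin domains, one Tropomyosin region, and five Concanavalin A-like lectins/glucanases domains.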
 
Weak hits

Sequence:  ENSGACP00000017593
Domain Number - Region: 1614-1659
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000229
Family Laminin-type module 0.03
 
Domain Number - Region: 536-590
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000279
Family Laminin-type module 0.02
 
Domain Number - Region: 1287-1327
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0113
Family Laminin-type module 0.044
 
Domain Number - Region: 52-139
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0914
Family N-terminal domain of xrcc1 0.073
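
The page does not state how hits are divided into strong and weak. The values listed are consistent with a superfamily E-value cutoff of about 1e-4: the weakest strong hit is 0.0000474 and the strongest weak hit is 0.000229. The sketch below assumes that cutoff; the constant and the function name are illustrative, not taken from the server.

```python
# Assumed cutoff: not stated on this page, only consistent with its values.
STRONG_CUTOFF = 1e-4


def classify(superfamily_evalue, cutoff=STRONG_CUTOFF):
    """Label a hit 'strong' if its superfamily E-value is below the cutoff."""
    return "strong" if superfamily_evalue < cutoff else "weak"


# Region -> superfamily E-value, for a few hits transcribed from the listing.
examples = {
    "3135-3314": 1.68e-29,
    "384-427": 0.0000474,
    "1614-1659": 0.000229,
    "52-139": 0.0914,
}

for region, evalue in examples.items():
    print(region, evalue, classify(evalue))
```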
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000017593   Gene: ENSGACG00000013300   Transcript: ENSGACT00000017627
Sequence length 3319
Comment pep:novel group:BROADS1:groupIII:1111559:1155419:-1 gene:ENSGACG00000013300 transcript:ENSGACT00000017627 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
GFSLNPPYFNLADVSSISATATCGQDEAGTPRRELYCKLVGGPNNGPPTQNIQGQFCDYC
NSLDPQKAHPVTHAVDGTERWWQSPPLSRGAGYNEVNVTLDLGQLFHVAYVLLKFANSPR
PDLWVLERSVDNGRTFNPWQYFAHSKRECIETFGKQPNARIVHDDDQLCTTEYSRIVPLE
NGEIVVSLITGRPGSKNFTYSPVLQDFTKATNIRLRFLRTSTLLGHLISKAQRDPTVTRR
YYYSIKDISIGGRCVCHGHAQVCARGRYQDNPNRLQCECQHNTCGESCDRCCPGFHQQPW
RVAAPDTPNECYPCQCFSHASDCYYDPEVEKRGASLDTFGRYDGGGVCINCQHNTAGVNC
ERCLEGFYRPYGIPPESPAGCIPCRCDERTTAGCEMESGRCICKTQFAGENCDRCADGFL
YYPQCIRYPVYPTTTQSPAGPFVEDCAACVCDYRGTADRVCDAAGRCLCRQGVEGERCDR
CPSGRHSFPDCRVCQCSQGGSYGSVCNPVSGQCLCLPGLVGQQCDRCASGLSFPGCSGPI
NVCNPGGTETSLQGACRCVATTEGTLCDSCKPLYWKLATDNSGGCVECRCELKGTLSAVG
ECEQKGGQCHCKPNACGHACETCKDGYFLLQKKDYFGCQGCQCDLGGAIDTACDEMSGQC
RCQKNVLGLKCTDPAPSYYFPTLHQLRFEVEDGTTPNARPVRFGYNPQEFPDFSWRGYAV
MSPAQSEVRVTVNVERKDERQHLFRVVLRFTNPTSAGVSGSIAATNNRGAAGSDQSTEVI
FPRSPSPSFLTVPGEGFAEPFTLTPGTWVVHIRAEGVLLDYLVLLPRDYYEAPLLQEKIS
RPCTYLPTANNCLLYKHVDMDRFSSALASQGKLTSHGGRRRRLARVRRLTPDHPQMAALY
GRQSQLQVGLRVPRPGPYALVLEYPSEVDAVQNVNIVIRDPSGDQIPARANIYSCALSFL
CRSVAVDGSDRVAAFQLSHKTEILLQTSTASLLLYKVYAVPAEEFSIDYVDPKVLCVSTD
GRFTEDSRYCVLRQFDKPTSAEILDAARDGQLSPAPAVSRQREEDEDWSDGILLKFPQTE
ISFTPEVPLPGTYVVVVHYHQREHTSFPVEVLVDAERQWKGWMNASFCPAVSGCREVVVA
DGRIAFHFDHSSGQRPSVSLIVPHENTLILDYVLLVPDSSYTPDLLKEKLLDKSADFIQQ
CRGDGFYIDRRTSPQFCRDSARSLVAAYNGGALPCNCDESGSTETTCEPVGGQCPCRRHV
IGRRCTKCTTGYYGFPHCRPCECGRRLCDEVTGRCICPPQTVKPSCDVCQGQTFSYHPLA
GCEGCACSPSGVETNAALDCDLVTGQCSCKPRVGGRRCDRCAAGYYRFPDCVPCNCNRGG
VTSDICDPDTGRCLCKRNVAGVKCDACREGSFFFDRSNRHGCTGCFCFGATDRCQSSSKR
RGKFVEMKVWRLERADQEEVPSVLNTASETVVADVQELPPTVQTLYWVAPSPYLGDRVSS
YGGFLTYQSKSFGIPSEGMTLMDRRPDVVLTGQDMTLIHLAPQIPHPDKLYQGRVQLLEG
NWRHAVTNRPVSREELMMVLARLVGLRIRALYFTQSQRLSLGEVGLEGLSNTGTGGPGNA
VEDCSCPPQNTGDSCEKCAPGYYRDRSGPFLGRCVPCECNGLSDECEERTGRCLGCQYNA
AGDRCERCKEGYYGNAAQRTCRVCPCPFRSPSNSFAIGCKEVFGDFECVCRAGYTGDKCE
SCAPGYYGDPSTRGGSCRPCKCNGNGNYCDHRTGVCKNTLEPEDTNTEGPCQECDNCAQT
LLNDLEKLDDELRRIKTQLDNASASATSQDSLKKLEKAVADTKMLVNRYSSAINAQKSRA
NQLEEDVSNLTDDISTLREKADKSAAQADKAVADVANTQKRAKVLDSEIEKMLEKIQALL
DQLKDAGTGGDVLPNENLASLLEEAERLVKEMKDRNFTPQKTAAEQERDKAEKLLDFIKA
NVSKQYDQNEAAAEKLRGLLKNYEAKVKDLEKALKEAGDLLKKANTQNGLSAQALEDLKK
RIKGLEMERDTVKDQMTMAEKELQKAEDLAEMLSDSKTEYEQLAAQMDGAKTDLTKKVNE
ITKAASKKDLVEAAEEHAKNLAKLAKELENAVANASGQTEVRNAKDAIDAYRNITDAINA
AEAAAKEAKSASDSALNNLKKEQLTVRAKDLKDTSEDLLKDAQEAETDLQVKDSADDIND
VKNRLNDAGKKKTALEKDLNYAQNQLDNINTDDISATINEAKRKAALANNSATVTMDRLN
AINGELKNIKATPVDSNLNNVLNDVDETVKNLLKTIPSLDAKLSEMENLTSEFPPMSNIS
ENIKKIKDLIEQARDAANRIGIPMMFKGNSHVELRAPKNLEDLKAYTGLSLSLQRPQGRG
DGRRRRRQANNGDMFVLYLGNRDSSKNYIGMVLRNEVLFGVYKLNGVEYEMETGVITKSV
SVPAKFDQVDLRRIYQDAKMTLAKFSNSKLSFAPITAERQGVENKNLLDLSPSDVVFYVG
GYPDSFTPPASLKYPKYEGCIEFSSVNDKVISLYNFQKAVGINPEPPCKRYVPPTDSEFY
QGTGYGKVLIDGTIPALIINMFISSRSSNGLLLFIQSEDNYITVTIEKGIVFIRSNLLET
PATNNLETFPTSDYEQLNIIFLRSNDIIVRISNTDLAKANVAYNFGEFKEWYIGGAPRDV
RERYNITMQPFKGCVKNLKQNSAVISVSEPVGISKGCPKDSLVVRKADFSLGSSLSGDLT
GFSLANDVAVSLGFKSTQNEGIILQDKQTANGIQLALESGYVTLTFNEQTWKSSKQYQDG
QWHYLTATRRNGRVDLLIDDEDAGQMQSGSSSVPNTGGSMILGKNNFKGCVSNLYTRRPA
QLYQAEDLNNFKASGDVLLGVCTADTVAQLMLDRGSMKFNNVIPVEITQFIMIQTDDVGK
QHINETKPACASPATIQKAYRMGGPVSSLSYSLPLQLSFARPHFSLDVRTRASEGLLFFA
ATRGGRSHLALYISKGRIRLSVAKEKEIFNREKYNDGKWHSVIFSLEKKKFRLVVDGIRA
QDGQLTNAEWTSMQHFVSPVYLGSAPESLHRELKSKALPRQSVSGCIRNFGKDGAPMANP
TTNYGAGPCFEGQTQRGAYFAGNGSYVILNDSFILGSNFELLFNIRPRSPTGLLLHVGNS
IWNPRGPAMGHYLTVYMLRGEVVAQANNGQGEFKVSVKPKASLCDETFHKISVIKRKNVV
QLHVDTMDHYKIGPPSPAITLTKDSLYVGGIPEVSMQQKLPVTSSFVGCIQDMRINGDSV
SFDRPPGVFGPVNLKECPG
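
The stated sequence length can be checked against the listing, and the region coordinates can be used to cut a domain out of the sequence. This is a small sketch under two assumptions that the page does not state: that every line of the listing except the last holds 60 residues, and that region coordinates are 1-based and inclusive. The function names are hypothetical.

```python
LINE_WIDTH = 60  # assumed residues per full line in the listing above


def wrapped_length(full_lines, last_line_length, width=LINE_WIDTH):
    """Total residues in a wrapped listing with one shorter final line."""
    return full_lines * width + last_line_length


def extract_region(sequence, start, end):
    """Return residues start..end, treating coordinates as 1-based inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("region lies outside the sequence")
    return sequence[start - 1:end]


# First and last lines of the listing, copied verbatim.
first_line = "GFSLNPPYFNLADVSSISATATCGQDEAGTPRRELYCKLVGGPNNGPPTQNIQGQFCDYC"
last_line = "SFDRPPGVFGPVNLKECPG"

# 55 full lines plus the final line of 19 residues.
total = wrapped_length(55, len(last_line))
print(total)

# Length of the region reported for Domain Number 1 (3135-3314).
print(3314 - 3135 + 1)

# Slicing demonstrated on the first line only, to keep the example short.
print(extract_region(first_line, 1, 10))
```

With 55 full lines and a final line of 19 residues, the total is 3319, which matches the stated sequence length.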
Identical sequences G3PJ14
ENSGACP00000017593 ENSGACP00000017593 69293.ENSGACP00000017593
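
The Comment field in the Protein sequence section above looks like an Ensembl-style FASTA description: space-separated tokens, each a key followed by a colon and a value. The sketch below splits it into fields. Reading the group token as assembly, sequence region name, start, end, and strand is an assumption based on that convention, not something stated on this page, and the function names are hypothetical.

```python
# Comment text copied from the Protein sequence section above.
comment = (
    "pep:novel group:BROADS1:groupIII:1111559:1155419:-1 "
    "gene:ENSGACG00000013300 transcript:ENSGACT00000017627 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(text):
    """Split 'key:value' tokens on the first colon of each token."""
    fields = {}
    for token in text.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Interpret 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(comment)
location = parse_location(fields["group"])
span = location["end"] - location["start"] + 1  # inclusive genomic span

print(fields["gene"], location["name"], location["strand"], span)
```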
