SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000001375 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000001375
Domain Number 1 Region: 2060-2216
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.32e-28
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00097
Further Details:      
 
Domain Number 2 Region: 2508-2565
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000127
Family TSP-1 type 1 repeat 0.00025
Further Details:      
 
Domain Number 3 Region: 4173-4230
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000157
Family TSP-1 type 1 repeat 0.00039
Further Details:      
 
Domain Number 4 Region: 2797-2854
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000249
Family TSP-1 type 1 repeat 0.00024
Further Details:      
 
Domain Number 5 Region: 3928-3980
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000017
Family TSP-1 type 1 repeat 0.00044
Further Details:      
 
Domain Number 6 Region: 3153-3210
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000131
Family TSP-1 type 1 repeat 0.00052
Further Details:      
 
Domain Number 7 Region: 3733-3784
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000327
Family TSP-1 type 1 repeat 0.00071
Further Details:      
 
Domain Number 8 Region: 3210-3267
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000772
Family TSP-1 type 1 repeat 0.00047
Further Details:      
 
Domain Number 9 Region: 2954-3012
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000112
Family TSP-1 type 1 repeat 0.00078
Further Details:      
 
Domain Number 10 Region: 2675-2724
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000353
Family TSP-1 type 1 repeat 0.0012
Further Details:      
 
Domain Number 11 Region: 3867-3926
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000131
Family TSP-1 type 1 repeat 0.002
Further Details:      
 
Domain Number 12 Region: 3608-3652
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000209
Family TSP-1 type 1 repeat 0.00055
Further Details:      
 
Domain Number 13 Region: 2732-2797
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000785
Family TSP-1 type 1 repeat 0.0015
Further Details:      
 
Domain Number 14 Region: 2455-2504
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000126
Family TSP-1 type 1 repeat 0.0027
Further Details:      
 
Domain Number 15 Region: 2415-2454
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000249
Family LDL receptor-like module 0.0013
Further Details:      
 
Domain Number 16 Region: 1551-1591
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000262
Family LDL receptor-like module 0.00079
Further Details:      
 
Domain Number 17 Region: 1487-1526
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000288
Family LDL receptor-like module 0.0016
Further Details:      
 
Domain Number 18 Region: 1369-1406
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000353
Family LDL receptor-like module 0.00096
Further Details:      
 
Domain Number 19 Region: 2361-2399
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000419
Family LDL receptor-like module 0.0016
Further Details:      
 
Domain Number 20 Region: 1411-1448
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000668
Family LDL receptor-like module 0.0014
Further Details:      
 
Domain Number 21 Region: 4080-4134
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000916
Family TSP-1 type 1 repeat 0.0018
Further Details:      
 
Domain Number 22 Region: 1269-1332
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000111
Family ATI-like 0.023
Further Details:      
 
Domain Number 23 Region: 3435-3484
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000157
Family TSP-1 type 1 repeat 0.0008
Further Details:      
 
Domain Number 24 Region: 1802-1860
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000018
Family BSTI 0.056
Further Details:      
 
Domain Number 25 Region: 2910-2974
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000000199
Family VWC domain 0.05
Further Details:      
 
Domain Number 26 Region: 3365-3411
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000301
Family TSP-1 type 1 repeat 0.00076
Further Details:      
 
Domain Number 27 Region: 4684-4729
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000034
Family TSP-1 type 1 repeat 0.0011
Further Details:      
 
Domain Number 28 Region: 808-874
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000719
Family ATI-like 0.022
Further Details:      
 
Domain Number 29 Region: 4736-4793
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000229
Family BSTI 0.057
Further Details:      
 
Domain Number 30 Region: 1447-1483
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000275
Family LDL receptor-like module 0.0015
Further Details:      
 
Domain Number 31 Region: 3018-3058
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000275
Family TSP-1 type 1 repeat 0.0026
Further Details:      
 
Domain Number 32 Region: 4533-4580
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000353
Family TSP-1 type 1 repeat 0.0026
Further Details:      
 
Domain Number 33 Region: 443-509
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000392
Family ATI-like 0.025
Further Details:      
 
Domain Number 34 Region: 4236-4289
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000068
Family TSP-1 type 1 repeat 0.0016
Further Details:      
 
Domain Number 35 Region: 1901-1958
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000772
Family TSP-1 type 1 repeat 0.002
Further Details:      
 
Domain Number 36 Region: 3987-4043
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000867
Family BSTI 0.082
Further Details:      
 
Domain Number 37 Region: 1691-1740
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000106
Family TSP-1 type 1 repeat 0.0024
Further Details:      
 
Domain Number 38 Region: 1740-1800
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000157
Family TSP-1 type 1 repeat 0.00087
Further Details:      
 
Domain Number 39 Region: 864-930
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000022
Family VWC domain 0.028
Further Details:      
 
Domain Number 40 Region: 3065-3116
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000294
Family ATI-like 0.048
Further Details:      
 
Domain Number 41 Region: 4595-4646
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000343
Family BSTI 0.047
Further Details:      
 
Domain Number 42 Region: 2854-2920
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000507
Family BSTI 0.057
Further Details:      
 
Domain Number 43 Region: 966-1033
Classification Level Classification E-value
Superfamily FnI-like domain 0.00000586
Family Fibronectin type I module 0.075
Further Details:      
 
Domain Number 44 Region: 2627-2691
Classification Level Classification E-value
Superfamily FnI-like domain 0.00000617
Family VWC domain 0.023
Further Details:      
 
Domain Number 45 Region: 1647-1676
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000641
Family LDL receptor-like module 0.0032
Further Details:      
 
Domain Number 46 Region: 4847-4908
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000736
Family ATI-like 0.02
Further Details:      
 
Domain Number 47 Region: 2592-2635
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000111
Family ATI-like 0.036
Further Details:      
 
Domain Number 48 Region: 3276-3328
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000116
Family BSTI 0.045
Further Details:      
 
Domain Number 49 Region: 4343-4399
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000155
Family ATI-like 0.015
Further Details:      
 
Domain Number 50 Region: 1593-1627
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000223
Family LDL receptor-like module 0.0031
Further Details:      
 
Domain Number 51 Region: 2223-2248
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000249
Family LDL receptor-like module 0.0025
Further Details:      
 
Domain Number 52 Region: 3806-3853
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000275
Family TSP-1 type 1 repeat 0.0034
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000001375
Domain Number - Region: 3493-3548
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000128
Family BSTI 0.048
Further Details:      
 
Domain Number - Region: 4291-4341
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000209
Family TSP-1 type 1 repeat 0.0032
Further Details:      
 
Domain Number - Region: 1956-2022
Classification Level Classification E-value
Superfamily FnI-like domain 0.000764
Family Fibronectin type I module 0.081
Further Details:      
 
Domain Number - Region: 3574-3624
Classification Level Classification E-value
Superfamily FnI-like domain 0.00199
Family VWC domain 0.031
Further Details:      
 
Domain Number - Region: 4900-4959
Classification Level Classification E-value
Superfamily FnI-like domain 0.00314
Family VWC domain 0.029
Further Details:      
 
Domain Number - Region: 507-548
Classification Level Classification E-value
Superfamily FnI-like domain 0.00889
Family Fibronectin type I module 0.051
Further Details:      
 
Domain Number - Region: 4452-4523
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0157
Family TSP-1 type 1 repeat 0.01
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000001375   Gene: ENSGACG00000001055   Transcript: ENSGACT00000001376
Sequence length 5063
Comment pep:known_by_projection scaffold:BROADS1:scaffold_37:2346135:2424188:1 gene:ENSGACG00000001055 transcript:ENSGACT00000001376 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VHRLLFVAGWWKCSRHWCEHAMEERVERVLSPRLQLEVSCAEVYQYNTQAWRLDLLWCNN
ISNLMCLSTSRPPETENLMVNKTVRTCCEGWGGPRCSEDLSSVGVRGRCFSTWSCEDFPG
VHNSSLMPMEQCCSTLWGLSWKNASDQTCLSCSYTLLPDSQSSPLVRGGLLGSVRAPQSS
ATCMSWGGAHYRTFDRKHFHFQGSCTYLLASSTDGTWAVYITTVCDGRGDCSKALRMMFG
LDLVSIQHKNLTLNSLSVPNGKALFENGVSVHWLGDFVFVESGLGVRVKFDLTNTVYLTV
TGEHLAATRGLCGVYDNSADADDFTTMGGAVSQFAASFGNSWKVPDQQNEGCSDAAELGH
SCDVSGDEVLQGRAESVCRRLLEDPFTHCHQRLEPAAYMDTCLYLYCSLPPKDRDSAVCD
TLASYARECAQLHVIIVWRTNTLCGRVCPRGQVFSDCVSSCSPSCASPQPPGPAAAMGQC
REECVGGCECPRDLYLHQGLCLTRDKCPCFHRRRTYQSGERIQQRCNTCLCRAGQWQCTG
EKCAAQCSLIGALQVTTFDKKRYSLQGGDCSYTAVEDFIDRKLVVSVRCGLCTAGGGGGG
GGLGCLREMSVTALRTTVTITDTGTVMLNGLREPLPVITGDLVVRRVSSSFLLIQAFGAQ
LLWHLDGPLALITLQPGFANKVRGLCGTLTWNQHDDFLTPQGDVENSVSLFASKFTTEHC
APPRGALPDPCTTFTQRRQYAETVCSVIHSPVFQACHDMVEREPFFRLCLSEVCSCTPQR
SCHCTILTAYGRQCAQEGVTVHWRNQTFCAVQCSGGQVYQECGGTCGGSCFDSRSCDAAG
GGTGLRLCVPGCQCPLGLVQDHQGQCVPINMCPCVEGDKTYQPGARIQNNCNTCVCERGL
FNCTQERCEEVTRCPGHLVYSPRSCLPTCSSLDQQQGSGVAHESCREALSACVCPRGTVL
LDDRCVLPEDCPCHHNGRLYFSNETITKDCNTCVCKERRWHCSQSACTGVCVATGDPHYV
TFDGRCYSFLGDCQYVLAEETNGLFSVMAENVPCGSTGVTCTKSVTLSLGNTVIHLLRGK
AVAVNGMPVSLPKSYSGSGLTLERVGLFVSLSSQLGVTLLWDGGMRVYVRLAAHLRGRVR
GLCGNFDGDTENDFTTRQGIAESTAELFGNSWKVSPSCPDVADQDLRDPCAINTHRVPWA
RKRCAVLTQELFSQCHPEVPFQQYYDWCVYDACGCDSGGDCECLCTAIAAYAEECNHRGV
YIRWRSQELCPMQCDNGLVYDPCGPACSHSCPSVQPSPHSQCGALSCVEGCFCPAGTVRH
GDSCVVIIQCPCEWEGSMFPPGTVITQHCQNCSCEVGVWQCRGVACPPAPPPCLDSEYTC
AGGRCIPIQWVCDNEEDCGDGSDEAPDCSVVCDEGEFLCSGGRCILYLHRCDGHDDCGDL
SDERGCVCAAGEFQCPGDQCVPVDTVCDGHRDCPSGTDEAICPSRVTCAPDQFACSDGTC
VATTKVCDGTLDCGGGEDENRTNCYITPSPSPVSWTTPTSEKSVFLLHTRVSPACRSYEL
SCATGGQCVPQAWRCDGETDCMDGSDEQQCTAPCGPVLVPCLSGDQCVDYQQLCDGILHC
RDASDESIDNCDKTVSPELPGSRNTTLICPEFTCTDGSCVPFNMVCNGVVDCPNSSQSPL
GGPTDEQGCRTWSSWGLWTPCSTSCGTGSRSRQRTCPAGDPLSHCKGQESQRQQCFNTTC
PVDGLWLPWVSWANCSSGCGGVQVRHRGCIPPLYGGRDCFQLPGPSNLPTEIKPCPDDGC
ANTTCPTGLVRHSCAPCPVSCAHISSGTSCDAKAPCYSGCWCPEGQVMSHTQQCVSPEEC
VCEAAGVRYWPGQQIKIDCEICVCERGRTQRCQPNPDCSAAVHCGWSSWTEWGECLGPCG
VQSVQWSFRSPNNPIKHGDGRACRGIYRKARRCQTNPCNECEYQGRSHAVGDRWRSDHCQ
VCHCLPNSTVQCSLYCPHAVSGCPQGQSLVHVEGDRCCYCQGKIRAVVPLICTSHLMLRA
LSLFCPTLSYLTPPIYPGGDCWYPLGVQTLPDSSFSASSQQPGHPPQGWSPEPDEYKDLP
QRSPEAQTSNTQSPYLQMDLLKAYNITGVLTQGGGAFGTFVSSFYLQFSRDGRQWYTYKE
LITDARPRAKVFEVRMAGRIPVTRWLGRMVSARYLRIIPVEFRHTFYLRVEILGCRAGEE
LQSEECVSDTKLCDGRADCKDYSDEINCGFPGLQNQTTGGSRFSWYRYLQAPQALQTIRS
QGGLPRMATPGVTGQTSPQKTMTSSTGQPGLHPTTTRHSGKPGLQTTPGKTPLLVLPQHL
IQNRHILALRDATTPYDGGRPRVLCVEGQFACRTFGCINLVQVCDGGRDCLDGSDEEHCE
WTSNTKKDVKSTTPHVPSPCSPKQFSCDSGECVHLDRRCDLQKDCVDGSDERDCVDCIMS
PWTAWSACSVSCGLGSLFRQRDILREALPGGSCGGAQFDSRACFPRACPVDGHWSQWTEW
SLCDTQCGGGVRRRNRTCSAPPPKNGGRDCEGMTLQSQSCNSKPCTEESETQTGCVNGMV
LVTERDCTAGGVQPCPPTCSHLSMTSNCTAACIPGCRCADGLYLQEGRCVNGSQCVCLWD
GQTLQPGETVSKDQCAICVCMDGQVTCDTSLCVATCQWSAWSSWSPCDVTCGLGLQQRYR
SPLNPAGAIRIQPCAGDSSEARRCSGSCFPVLPDGVWGKWTSWSECSKTCFNHVDDVGIR
LRFRSCNHTLTAFNGTVEDSACDGDGEDQEPCNTVHCPVNGGWSAWSSWSQCSSECDSGV
QTRERLCSSPTPQHGGSNCPGPHIQTGDCNSHPCSGVCPEGMVHMTSAECEARGAACPRV
CLDMTAGEVQCVTACYNGCYCSPGLYLLKGRCVPLGRCPCYHQGELYPAGAALPVDACNN
CTCTNGEMLCGAAPCVVDCGWSSWTQWSTCSRTCDVGVRRRYRSGTNPPPASGGRPCKGD
RVGIDSCSIKPCIGVREPWVAWSECSVTCGGGYRTRTRGPIRIHGTAQQFSACNLQPCDP
RDGGVCPPGQQWKQCVRGAVSCADLTMELSRNCTPGCQCPHGTIQQEGACVRESDCLCHF
DGEQYKPGDVVPTDCNNCTCEAGRLVNCSQVTCNVDGQWSEWTPWGQCSTSCGPGLQSRY
RFCSSPQRSGRGLPCLGPHREDQVCITVLCDRDAGWGPWTNWTECTKSCGGGVRSRRREC
NSPSPEGKGNYCEGLGTAFTACNTDHCPVAPCSRVPGTVFSSCGPSCPRSCDDLAHCEWQ
CEPGCYCTGGKVLSDHGTACVEKEECPCMDLSTGHRLQPGETTESLDGCNNCTCQGGRLN
CTRHPCPVSGGWCEWSEWTPCSRTCGAESVSRYRSCGCPEPKSGGALCVGEQETHNGVGA
QIQRQPCPVITFCPVHGSWSPWSAWSVCDACAGSSTRTRKCNSPPARFGGLPCLGESRQI
RGCHDNITVCSGCAGGQEERPCGKPCPRSCSDLHGDTECVDSPACKHTCGCPGDMVLQDG
VCVVREECRCKYHNSSASDSSNASWVWPGGFDWQFANPGDSIISDCRNCSCKAGVLQCKS
VPGCYSVGPWLPWSPWSECSVSCGGGQQSRSRLCSSPPCSGPSRQSKTCNTQVCLGKPGS
LHDFYRSQECPCLVDKDFMGSLQSVSVTPVSSLLLHNISEGVEVQSEGALTHECSTCGCK
HGRWNCSLAHCQVNGGLSPWGSWSPCSLSCGGLGLKTRSKSCTQPAPAHGGRNCQGPRQE
TTYCQAPECPVIVGPTEEPALPDEDAGFSPWSSWSPCTRPCTDVLSPVRKSRHRQCVRPP
CFGSSHQEKACNLPQCPGDEVCVGADCTTRNCSWTEWGAWGSCSRSCGVGQQQRIRTFLS
PRTNGSWCEDILGGHLDHRFCNIKPCRVDGGWSRWSPWSRCDKHCGGGRSIRTRSCSSPP
PKNGGKKCVGEKNHLKPCNTKPCDERGCPPGQVSVPCANECPQRCSDLQQGIECQGNTEC
QPGCRCPKGQLQQDGVCVQQWQCDCVDSLGHVWAAGSSHQADCNNCSCSDGQLVCTNQSC
QATCLWSSWSSWALCSVSCGTGQRTRYRSLVPDTEGADCQFEEVQHKPCDPGPCPPLCVH
DDRELGVGDTWLQGECKQCTCTPEGDYCQDIDCTVDGGWTPWSVWSDCSVTCGRGTQVRT
RACIDPPPRNNGSQCGGPEQETQDCLPAPCLDDLCPWSPWSPCSRSCGAGSVVRRRACLC
EEGGDTACTAEIEGERNREETQLCYKQPCSGCPMSDWSIWSECSCASQRQLRYRVALSPA
IRGQQCTPVETQSRTCSLNPCDDCKAPFVYSACGVPCEKHCALQGRRDQCGGVRECTPGC
YCPQGLLQQNGSCVPAEQCGCIHLQQQPSGHPPTPVTVPLGAMITIGCSSCLCHDGTLQC
DTRDCEGNLVILSEWSEWTPCSPCVPSSSLQLSHSTAGVISASQMLSTQRRFRACLDLYS
GVPVSKEEEESQCPGPLVEERLCPDSNLCRDMCQWSAWSAWTACAEPCSGGVRQRYRRPL
VSPPGPLCRSQQTQSQSCNTGLCPGERCEDRGRTYQESCANQCPRSCTDLWEHVQCLQGA
CHSGCRCPKGQLLQDGLCVPVTECRCGIPSGNGTLEFDPKEEISIDCNTCVCENGTLACT
KLQCPVYEPWSPWSSCSASCGHGQMTRTRLCQDTEGSPSCADTTQRESCDLPSCPECPSG
QVFNDCSGSCPYTCEDLWPHTQCLPGPCTPGCSCSPGQVLYEGSCRPHADCPCSTLSLPP
GHLLWNISTEEMTETWLPPGTAVQHLCNTCVCQGGVFNCTSELCDACPDSEQWGRSTLDE
LTLCERSCWDIYSSSLVNCSRSSEGCVCREGLYRNPGGVCVIAALCPCHDQGIQREDGCQ
SCRCVNGKKLCQLRCPPLHCDEAEVKVEEPGNCCPVCRKPFPDEPVPECRRYVQVKNITK
GDCRLDNVEVSSCRGRCLSSTNVILEEPYLQSVCECCSYRLDPDNPVRFLSLQCESGESE
PVVLPVIHSCECTSCQGGDLSRR
Download sequence
Identical sequences G3N7U1
ENSGACP00000001375 69293.ENSGACP00000001375 ENSGACP00000001375

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]