SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000015453 from Equus caballus 76_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSECAP00000015453
Domain Number 1 Region: 2054-2213
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 6.17e-16
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0035
 
Domain Number 2 Region: 2530-2587
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000275
Family TSP-1 type 1 repeat 0.00048
 
Domain Number 3 Region: 2692-2744
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000000314
Family TSP-1 type 1 repeat 0.00084
 
Domain Number 4 Region: 3991-4048
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000000968
Family TSP-1 type 1 repeat 0.00029
 
Domain Number 5 Region: 4245-4299
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000115
Family TSP-1 type 1 repeat 0.00084
 
Domain Number 6 Region: 2960-3018
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000131
Family TSP-1 type 1 repeat 0.00057
 
Domain Number 7 Region: 3233-3281
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000196
Family TSP-1 type 1 repeat 0.00048
 
Domain Number 8 Region: 3801-3850
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000157
Family TSP-1 type 1 repeat 0.00075
 
Domain Number 9 Region: 2807-2861
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000249
Family TSP-1 type 1 repeat 0.00061
 
Domain Number 10 Region: 3621-3671
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000196
Family TSP-1 type 1 repeat 0.00054
 
Domain Number 11 Region: 1265-1328
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000687
Family ATI-like 0.045
 
Domain Number 12 Region: 3932-3991
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000017
Family TSP-1 type 1 repeat 0.001
 
Domain Number 13 Region: 2440-2476
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000183
Family LDL receptor-like module 0.0012
 
Domain Number 14 Region: 1411-1448
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000038
Family LDL receptor-like module 0.0014
 
Domain Number 15 Region: 2221-2257
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000537
Family LDL receptor-like module 0.0014
 
Domain Number 16 Region: 2477-2530
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000562
Family TSP-1 type 1 repeat 0.0027
 
Domain Number 17 Region: 3164-3224
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000615
Family TSP-1 type 1 repeat 0.0025
 
Domain Number 18 Region: 3067-3124
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000654
Family BSTI 0.042
 
Domain Number 19 Region: 3451-3498
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000012
Family TSP-1 type 1 repeat 0.00088
 
Domain Number 20 Region: 3387-3431
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000157
Family TSP-1 type 1 repeat 0.0009
 
Domain Number 21 Region: 4607-4660
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000157
Family TSP-1 type 1 repeat 0.0014
 
Domain Number 22 Region: 4815-4872
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000229
Family BSTI 0.024
 
Domain Number 23 Region: 1370-1406
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000288
Family LDL receptor-like module 0.0013
 
Domain Number 24 Region: 1799-1858
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000343
Family BSTI 0.076
 
Domain Number 25 Region: 1447-1482
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000038
Family LDL receptor-like module 0.0014
 
Domain Number 26 Region: 1486-1524
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000445
Family LDL receptor-like module 0.0023
 
Domain Number 27 Region: 4975-5042
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000000638
Family VWC domain 0.051
 
Domain Number 28 Region: 816-876
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000687
Family ATI-like 0.064
 
Domain Number 29 Region: 458-520
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000834
Family BSTI 0.082
 
Domain Number 30 Region: 1738-1791
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000863
Family TSP-1 type 1 repeat 0.00094
 
Domain Number 31 Region: 4761-4810
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000105
Family TSP-1 type 1 repeat 0.0012
 
Domain Number 32 Region: 2752-2800
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000118
Family TSP-1 type 1 repeat 0.00076
 
Domain Number 33 Region: 1596-1631
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000209
Family LDL receptor-like module 0.0024
 
Domain Number 34 Region: 4921-4985
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000589
Family ATI-like 0.02
 
Domain Number 35 Region: 1556-1593
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000798
Family LDL receptor-like module 0.0012
 
Domain Number 36 Region: 4146-4201
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000889
Family TSP-1 type 1 repeat 0.0021
 
Domain Number 37 Region: 2377-2416
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000089
Family LDL receptor-like module 0.0019
 
Domain Number 38 Region: 3022-3062
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000106
Family TSP-1 type 1 repeat 0.0027
 
Domain Number 39 Region: 4048-4107
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000142
Family BSTI 0.05
 
Domain Number 40 Region: 4675-4723
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000278
Family ATI-like 0.057
 
Domain Number 41 Region: 2646-2710
Classification Level Classification E-value
Superfamily FnI-like domain 0.00000303
Family VWC domain 0.064
 
Domain Number 42 Region: 3870-3918
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000327
Family TSP-1 type 1 repeat 0.0017
 
Domain Number 43 Region: 4307-4357
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000772
Family TSP-1 type 1 repeat 0.0035
 
Domain Number 44 Region: 4417-4473
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000867
Family ATI-like 0.011
 
Domain Number 45 Region: 3506-3566
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000899
Family ATI-like 0.076
 
Domain Number 46 Region: 3296-3345
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000105
Family BSTI 0.045
 
Domain Number 47 Region: 963-1010
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000146
Family VWC domain 0.057
 
Domain Number 48 Region: 1644-1677
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000196
Family LDL receptor-like module 0.0026
 
Domain Number 49 Region: 866-935
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000303
Family Fibronectin type I module 0.033
 
Domain Number 50 Region: 2861-2926
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000409
Family BSTI 0.049
 
Domain Number 51 Region: 1895-1953
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000589
Family TSP-1 type 1 repeat 0.0014
 
Domain Number 52 Region: 1951-2014
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000921
Family Fibronectin type I module 0.07
 
Weak hits

Sequence:  ENSECAP00000015453
Domain Number - Region: 3769-3818
Classification Level Classification E-value
Superfamily FnI-like domain 0.000126
Family VWC domain 0.093
 
Domain Number - Region: 3675-3732
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000147
Family BSTI 0.03
 
Domain Number - Region: 2624-2655
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000196
Family ATI-like 0.02
 
Domain Number - Region: 4199-4262
Classification Level Classification E-value
Superfamily FnI-like domain 0.000293
Family VWC domain 0.074
 
Domain Number - Region: 4360-4409
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00471
Family TSP-1 type 1 repeat 0.005
 
Domain Number - Region: 4118-4146
Classification Level Classification E-value
Superfamily PMP inhibitors 0.0162
Family PMP inhibitors 0.0027
 
Domain Number - Region: 527-551
Classification Level Classification E-value
Superfamily PMP inhibitors 0.0942
Family PMP inhibitors 0.0039
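The hits above are listed by increasing superfamily E-value, not by position in the protein. A minimal Python sketch of how such records might be parsed and re-ordered along the sequence follows. The names `DomainHit`, `parse_hits` and `ASSUMED_CUTOFF` are hypothetical, and the 1e-4 cutoff is an assumption inferred from this listing, where every strong hit has a superfamily E-value below 0.0001 and every weak hit lies above it. The sample text copies three records from this page.

```python
import re
from typing import NamedTuple


class DomainHit(NamedTuple):
    start: int        # first residue of the region (1-based, as printed)
    end: int          # last residue of the region (inclusive)
    superfamily: str  # superfamily classification
    evalue: float     # superfamily-level E-value


# Three records copied from the listing; weak hits carry "-" as their number.
SAMPLE = """\
Domain Number 1 Region: 2054-2213
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 6.17e-16
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0035
Domain Number 29 Region: 458-520
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000834
Family BSTI 0.082
Domain Number - Region: 527-551
Classification Level Classification E-value
Superfamily PMP inhibitors 0.0942
Family PMP inhibitors 0.0039
"""

REGION_RE = re.compile(r"^Domain Number (?:\d+|-) Region: (\d+)-(\d+)$")
SUPERFAMILY_RE = re.compile(r"^Superfamily (.+) (\S+)$")

# Assumption: inferred from the listing, not taken from server documentation.
ASSUMED_CUTOFF = 1e-4


def parse_hits(text):
    """Collect one DomainHit per 'Domain Number ... Region' record."""
    hits = []
    region = None
    for raw in text.splitlines():
        line = raw.strip()
        m = REGION_RE.match(line)
        if m:
            region = (int(m.group(1)), int(m.group(2)))
            continue
        m = SUPERFAMILY_RE.match(line)
        if m and region is not None:
            hits.append(DomainHit(region[0], region[1], m.group(1), float(m.group(2))))
            region = None
    return hits


# Re-order from E-value rank to N-terminal-to-C-terminal position.
hits = sorted(parse_hits(SAMPLE), key=lambda h: h.start)
for h in hits:
    label = "strong" if h.evalue < ASSUMED_CUTOFF else "weak"
    print(f"{h.start}-{h.end}\t{h.superfamily}\t{h.evalue:.3g}\t{label}")
```

Sorting by `start` gives the order of domains along the protein, which is how a domain architecture is normally read.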
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000015453   Gene: ENSECAG00000016582   Transcript: ENSECAT00000018914
Sequence length 5146
Comment pep:novel chromosome:EquCab2:4:101750309:101803348:1 gene:ENSECAG00000016582 transcript:ENSECAT00000018914 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLLPALLFGTAWALANGRWCEQTETILGEEEVTPRQEELVPCTSLYHYQRLGWRLDLSRS
GPAGLCPIYKPPETRPATWNRTVRACCPGWGGTHCTLVLADASPEGHCFATWQCQLRAGL
ANASAGSLEECCARPWGQSWQDGSSQACLSCSSLRPPGGTPSPALLRPLPGAVAQLWSQH
QRPSATCTTWSGFHYRTFDGRHYHFLGRCTYLLVGAVDSTWAVHLMPREHCPPPGHCQLA
RVMMGPEEVLVQGGNVSVNGQLVSDGESRLLPGLSLQWQGDWLVLSGGLGVVVRLDRSSS
VSVSVDHKLQGQTQGLCGVYNGRPEDDFLEPGGGLGLAATFGNSWRLPDSEPGCLDAVEV
APGCDDPLRGTEAGMQAGQLRAEAQDVCHQLLDGPFRECHAQVPPAEYHEACLFAYCSGA
PAGSGQEGRREAVCATLAHYAQDCAKRRIHVRWRKPGFCERLCPGGQLYSDCASACPPSC
AAAGAGGEGSCGEECVSGCECAPGLFWDGALCVPAARCPCYHRRQRYSPGDTVRQLCNPC
VCQDGRWLCAQAPVAAAHPLLPPGAQNKTFSQSFRIVGQAGLVTGCLQDFTKGQLLIVLE
HGTCDSGSCLHAISVSLGDTHVQLRDSGAVLVDGQDVGLPWDGPGGLHLSRASSTFLVLR
WPGAQVLWGVSDPAAYITLDPRHADQVQGLCGTFTRNQQDDFLTPAGDVETSIAAFASKF
QVGDEGRCPSEDSTLLSLCSTHAQHHIFSEAACAVLRGPAFQECHGLVDREPFHLRCLAA
MCGCAPGRDCLCPVLAAYARRCAQEGASLLWRNQTLCPVLCPGGQEYQECAPVCGRNCGE
PEDCGELGGCVAGCTCPLGLLWDPEGQCVPPSLCPCQLGAHRYAPGSATMRDCNRCVCQE
RGLWNCTARRCTPPRAYCPQELVYVPGACLLTCDSPTANHSCPPGSTGGCVCPSGTVLLD
ERCVPPELCPCRHSGQWYPPNATIQEDCNICVCQGRQWHCTGRRCGGRCQASGAPHYVTF
DGLALTFPGACEYLLVREASGRFTVSAQNLPCGASGLTCTKALTVRLQSTVVHMLRGQAV
TVNGVSVTPPKVYTGPGLCLRQAGLFLLLSTRLGLSLLWDGGTRVLLQLSPEFRGRVAGL
CGDFDGDASNDLRSRQGVLEPTAELAAHSWRLSPLCPEPGDLPHPCTVNTHRASWAHARC
GLMLQPLFARCHVEVPPQQHYEWCVYDACGCDSGGDCECLCSAIAAYADECARHGLHVRW
RSQELCPLQCEGGQVYEACGPTCPPTCHDHDPEPGWHCQAVACVEGCFCPEGTLLHGGFC
LEPTSCPCEWGGSFFPPGTVLQKDCGNCTCQESQWLCGGDGTHCEELVPGCAEGEAPCQE
SGHCVPHEWLCDNQDDCGDGSDEEGCATPGCGEGQMSCSSGLCLPLVQLCDGQDDCGDGT
DEQGCPCPQDSLACADGRCLPPALLCDGHPDCPDAADEESCLGQVNCTLGEVSCVDGTCV
GAIQLCDGIWDCPDGADEGPGHCPLPSLPTPPAGTFPGHPAGSRLNQLPWPAPVLVTPPC
APFEFPCGSGECAPRGRCDGEEDCADGSDERGCGWPCAPHHLPCANGPRCVAPAQLCDGV
PQCPDGSDESSDACGSTQLPPCGGLFPCGLAPQLCLNPERLCDGIPDCPQGEDELGCGWT
PDSPAGGQAGGRPGGGVTPQPTVLSLLFSPGGLLAPGGPNRTGVSCPEYSCPEGLCIVDG
EWTSWSPWSSCSEPCGGTMSRQRRCHPPQNGGRTCAMLPGGPHSTYQTRPCPQDDCPNAT
CSGELVFRPCAPCPLTCDDISGQAVCPPDRPCSSPGCWCPEGQVLDAEGRCVWPRKCPCL
VDGTRYWPGQQVKADCQLCICQDGRPRRCRPDPNCAVNCGWSSWSPWAECLGPCGSQSIQ
WSFRSPNNPRLSGRGRQCWGIHRKARRCQTGPCEGCAHQGQVRRVGERWRAGPCTVCQCL
HNGTARCSPYCPLGSCPQDWVLVEGLGESCCHCAPPGENQTVYPMATPTPTPAPSPQMGP
PLITYVLPPPGDPCYSPLGLAQLPEGSLRALPWQLEHPTWAARLGAPTDGPGHQGWSPRE
DAHSQQRTQPLYLQLDLLRPRTSLVSIIVPGAGSSDLLQFSSDGLHWYNYRDILPGAPLC
PQLFPGNWEDVGPTVRTFSRMVQAQHVRVWPHDAHHSDTHRSVSLRVELLGCEPVSPLEA
SPCPGGGHRCASGECAPRGAPCDGVEDCMDGSDEEGCVPPPGTGSIPSTARTPPLSSTQP
GQLPPQHREGLAETESWHPGQRSPVPPTGKGPARHLPTSEAPRLSPGKSVQTVVTTAPTR
QPEAEALQPGMAAVTVLPSDAMTPTAPAGQSVAPGPFPPVQCSPGQVPCEVVGCVDQEQL
CDEREDCLDGSDERHCGEHGATVPFTVPTTALPGLLASRALCSPSQLSCGSGECLPAERR
CDRHRDCQDGSDEDGCVDCGLAPWSGWSSCSRSCGLGLAFQRRELLWPPLPGGSCPPDRL
RSQPCFMQACPVAGAWAVWEAWGPCSVSCGGGHRSRRRSCVDPPPKNGGAPCPGTSQERA
PCGLQPCTGGTDCRAEPVRMIKDELCQKGLVSNPPPPLLPRTGAECPSPHTSGCRCPPGL
LLHDSHCLPLSQCPCLVGEELKQPGMPFLLDNCSQCTCEKGALLCEPGGCPVPCGWSAWS
SWGPCDRSCGSGVRARFRSPSNPPAASGGAPCEGDRQEVQVCHTECGTETLGWTPWAPWS
SCSRSCLAPGGGPGRRSRTRLCSSPGDTSCPGEATEEEPCSPAMCPGMPPVWGLWSPWST
CSAHCDGGIQTRGRSCSASAPGDSECQGPHSQTKDCNTQPCTAQCPGDMVFRSMEQCHQE
GGPCPRLCLAQGPGVECIGFCAPGCACPPGLFLHNASCLPPSRCPCQLHGQLYAPGEVAQ
LDSCNNCTCISGEMVCTSEPCPVACGWSPWTPWSVCSRSCNVGLRRRFRAGTAPPAAFGG
PACQGPDMEAEFCSLRPCQGPGGEWGPWSPCSVPCGGGYRNRTRGGGPHSLMDFSPCGLQ
PCAGPVPGVCPRGKRWLDCAEGPASCAELSAPQGTNQTCHPGCYCPSGTLLLNNVCVPSQ
DCPCAHGGRLHPPGSAVLRPCENCSCVSGLITNCTSWPCAEGQPTWSPWTPWSECSASCG
PARRHRHRFCTRRPGMVLSSMALLPPPASATPLCPGPEAEEEPCLLPGCDRAGGWGPWGP
WSSCSRSCGGGLRSRTRACDQPPPQGLGDYCEGPRAQGETCQVLPCPVTNCTAIQGAEYS
RCGPPCPRSCDDLVHCVWHCQPGCYCPPGQVLSADQAICVQPGHCSCLDLLTGEWHRPGT
QLARPDGCNYCTCSEGRLTCTDLPCPVPGGWCPWSEWTACTQPCRSQMRTRSRACTCPPP
QHGGAPCPGEAGEAGAQHQRETCPSPTACPVDGAWSPWGPWSPCDTCLGQSHRSRVCSWP
PTEGGRPCPGGHRQSRPCQDNSTRCTDCGGGQELLPCGRPCPRSCQDLSLGMMCQPGSMA
CQPSCGCPPGQLSQDGLCVSPAHCRCQYQPGAMGIPENQSRSAGSGLSSWESLEPGEVVT
GPCDNCTCVAGILQCQEVPGCPGPGVWSSWGPWEDCSVSCGGGEQLRSRRCARPPCLGPA
RQSRTCHTQVCREAGCPAGRLYRECQPSEGCPFSCAHITQQVGCFSDGCEEGCHCPEGTF
QHGSACVQECPCVLTALLLQELGAASADPEARPPILGEGGQPLGPGDELGSGQTLRVGCS
NCSCVHGKLSCSVEDCSRAGGGFSPWGPWGLCSRSCGGLGTRTRSRQCVYPTPAPGGQGC
LGPHQDLEYCPSPDCPGAGGSTVEPATGLPGGWGLWSPWSPCSSSCTDPARPALRSRTRL
CLVNCTSGDTSQERPCNLPSCTELPLCPGPGCVAGNCSWTTWAPWEPCSRSCGVGQQRRL
RAYRPPGPGGHWCPDILTAYQEHRFCNRRACPVPGGWSRWSPWSWCDRSCGGGRSLRSRR
CSSPPPKNGGAPCVGERHHARLCNPMPCEEGCPAGMEVVSCANRCPRRCSDLQEGIVCQD
GQPCQRGCRCPEGSLEQDSGCVPIEHCECTDAQGHSWAPGSQHQEACNNCSCQAGQLSCT
AQPCPPPSHCTWSRWSAWSPCSRSCGPGGQQSRFRSSTSDSWAPECREEQSQSQPCPQPP
CPPLCLHGAHSRTLGDSWQQGECQQCSCTPEGVICEDTKCAGLEPGAWTLWSPWSDCLVS
CGGGNQVRTRVCMALASRLEPRHCLGPDTQTQHCGWQPCPGLQEACSWGPWGPCSRSCGP
GLASRSGSCPCLLAEADPTCNGTFLHLDTQACYAGPCLEECVWSSWSSWTRCSCQVLVQQ
RYRHQGPAPGRAREGPPCTRLDGHFRPCLISNCSEDSCTPPFEFQACGSPCAGLCATHLS
RGLCQDLPPCQPGCYCPEGLLEQAGGCIPPEQCSCQHILGEGAGVTLAPGENLQLGCKEC
VCQHGELRCTSWGCQGLLPLSGWSEWSPCGPCLPLGVLAPASRAALEERWPQDPASLSPT
SAPVLASEQHRHRLCLDPETGRPWAGDPDLCTAPLSQQRLCPDPQACHDLCQWSPWGPWS
PCQVPCSGGFRLRWRKAGVPPGGGCRGPWAQTESCNMGPCPGESCESRDTVPTPDCANQC
PRSCADLWDRVQCLQGPCRPGCRCPPGQLVQDGHCVPISSCRCGLPSPNASWVLAPAEVV
QLDCKTCTCVNGSLLCPRHECPVLGPWSAWSSCSAPCGGGTTKRRRSCEEGPRGAPCQAQ
DTEQRQECNPQPCPECPPGQVLSACALSCPRLCSHLQPGDLCVQEPCQPGCGCPGGQLLH
NGTCVPPTACPCTQLSLPWGLTLSLEEQAQELPPGTVLSWNCTRCVCQGGAFNCSLADCQ
DCPPGEVWQQVAPGELGPCERTCWEPNATESQSNCSAGQAPGCVCQPGHFRSQAGPCVPA
DRCECWRHGHPHPPGSEWQEDCESCRCLGGRSVCTQQCPPLTCAQGEVTMQEPGSCCPTC
RQETLEEQSASCRHLTELRNLTKGPCYLDQVEVSYCSGHCPSSTNVMPEEPYLQSQCDCC
SYRLDPESPVRILNLRCPGGHTEPVVLPVIHSCQCSACQGGDFSKR
Identical sequences: F6R3U6, ENSECAP00000015453
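The Comment field in the protein sequence section packs several identifiers into space-separated `key:value` tokens. The Python sketch below shows one way to unpack it. It assumes the location token is laid out as coordinate system, assembly, sequence name, start, end, strand, in the style of Ensembl FASTA headers. The helper names `parse_comment` and `parse_location` are hypothetical.

```python
# Comment string copied verbatim from the record above.
HEADER = (
    "pep:novel chromosome:EquCab2:4:101750309:101803348:1 "
    "gene:ENSECAG00000016582 transcript:ENSECAT00000018914 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(header):
    """Split each token at its first colon into a key and a value."""
    fields = {}
    for token in header.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Unpack 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(HEADER)
location = parse_location(fields["chromosome"])
# Genomic span covered by the transcript, counting both end coordinates.
span = location["end"] - location["start"] + 1
print(fields["gene"], location["name"], span)
```

Under these assumptions the record maps to chromosome 4 of the EquCab2 assembly on the forward strand, and `span` is the number of genomic bases between the two coordinates, inclusive.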
