SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSECAP00000015286 from Equus caballus 76_2

Domain assignment details

Strong hits

Sequence:  ENSECAP00000015286
Domain Number 1 Region: 2540-2597
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000275
Family TSP-1 type 1 repeat 0.00048
 
Domain Number 2 Region: 2702-2754
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000000314
Family TSP-1 type 1 repeat 0.00084
 
Domain Number 3 Region: 4001-4058
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000000968
Family TSP-1 type 1 repeat 0.00029
 
Domain Number 4 Region: 2970-3028
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000131
Family TSP-1 type 1 repeat 0.00057
 
Domain Number 5 Region: 2066-2227
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.00000000000159
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0039
 
Domain Number 6 Region: 3243-3291
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000196
Family TSP-1 type 1 repeat 0.00048
 
Domain Number 7 Region: 4250-4307
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000314
Family TSP-1 type 1 repeat 0.00078
 
Domain Number 8 Region: 3811-3860
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000157
Family TSP-1 type 1 repeat 0.00075
 
Domain Number 9 Region: 2817-2871
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000249
Family TSP-1 type 1 repeat 0.00061
 
Domain Number 10 Region: 3631-3681
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000196
Family TSP-1 type 1 repeat 0.00054
 
Domain Number 11 Region: 1269-1332
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000687
Family ATI-like 0.045
 
Domain Number 12 Region: 3942-4001
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000017
Family TSP-1 type 1 repeat 0.001
 
Domain Number 13 Region: 2450-2486
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000183
Family LDL receptor-like module 0.0012
 
Domain Number 14 Region: 1415-1452
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000038
Family LDL receptor-like module 0.0014
 
Domain Number 15 Region: 2487-2540
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000562
Family TSP-1 type 1 repeat 0.0027
 
Domain Number 16 Region: 3174-3234
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000615
Family TSP-1 type 1 repeat 0.0025
 
Domain Number 17 Region: 3077-3134
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000654
Family BSTI 0.042
 
Domain Number 18 Region: 2234-2270
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000694
Family LDL receptor-like module 0.0014
 
Domain Number 19 Region: 3461-3508
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000012
Family TSP-1 type 1 repeat 0.00088
 
Domain Number 20 Region: 3397-3441
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000157
Family TSP-1 type 1 repeat 0.0009
 
Domain Number 21 Region: 4615-4668
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000157
Family TSP-1 type 1 repeat 0.0014
 
Domain Number 22 Region: 4823-4880
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000229
Family BSTI 0.024
 
Domain Number 23 Region: 1374-1410
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000288
Family LDL receptor-like module 0.0013
 
Domain Number 24 Region: 1812-1872
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000343
Family BSTI 0.057
 
Domain Number 25 Region: 1451-1486
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000038
Family LDL receptor-like module 0.0014
 
Domain Number 26 Region: 1490-1528
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000445
Family LDL receptor-like module 0.0023
 
Domain Number 27 Region: 4983-5050
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000000649
Family VWC domain 0.051
 
Domain Number 28 Region: 820-880
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000687
Family ATI-like 0.064
 
Domain Number 29 Region: 458-520
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000834
Family BSTI 0.082
 
Domain Number 30 Region: 1751-1804
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000863
Family TSP-1 type 1 repeat 0.00094
 
Domain Number 31 Region: 4769-4818
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000105
Family TSP-1 type 1 repeat 0.0012
 
Domain Number 32 Region: 1600-1639
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000113
Family LDL receptor-like module 0.0018
 
Domain Number 33 Region: 2762-2810
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000118
Family TSP-1 type 1 repeat 0.00076
 
Domain Number 34 Region: 2391-2430
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000393
Family LDL receptor-like module 0.0019
 
Domain Number 35 Region: 4929-4993
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000589
Family ATI-like 0.02
 
Domain Number 36 Region: 4156-4211
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000889
Family TSP-1 type 1 repeat 0.0021
 
Domain Number 37 Region: 3032-3072
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000106
Family TSP-1 type 1 repeat 0.0027
 
Domain Number 38 Region: 4058-4117
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000144
Family BSTI 0.05
 
Domain Number 39 Region: 1561-1597
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000196
Family LDL receptor-like module 0.0012
 
Domain Number 40 Region: 4683-4731
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000278
Family ATI-like 0.057
 
Domain Number 41 Region: 2656-2720
Classification Level Classification E-value
Superfamily FnI-like domain 0.00000303
Family VWC domain 0.064
 
Domain Number 42 Region: 3880-3928
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000034
Family TSP-1 type 1 repeat 0.0017
 
Domain Number 43 Region: 4315-4365
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000772
Family TSP-1 type 1 repeat 0.0035
 
Domain Number 44 Region: 4425-4481
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000867
Family ATI-like 0.011
 
Domain Number 45 Region: 3516-3576
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000899
Family ATI-like 0.076
 
Domain Number 46 Region: 3306-3355
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000105
Family BSTI 0.045
 
Domain Number 47 Region: 967-1014
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000146
Family VWC domain 0.057
 
Domain Number 48 Region: 870-939
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000303
Family Fibronectin type I module 0.033
 
Domain Number 49 Region: 2871-2936
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000409
Family BSTI 0.049
 
Domain Number 50 Region: 1908-1966
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000589
Family TSP-1 type 1 repeat 0.0014
 
Domain Number 51 Region: 1964-2027
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000921
Family Fibronectin type I module 0.07
 
Weak hits

Sequence:  ENSECAP00000015286
Domain Number - Region: 3779-3828
Classification Level Classification E-value
Superfamily FnI-like domain 0.000126
Family VWC domain 0.093
 
Domain Number - Region: 3685-3742
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000147
Family BSTI 0.03
 
Domain Number - Region: 2634-2665
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000196
Family ATI-like 0.02
 
Domain Number - Region: 4368-4417
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00471
Family TSP-1 type 1 repeat 0.005
 
Domain Number - Region: 4128-4156
Classification Level Classification E-value
Superfamily PMP inhibitors 0.0162
Family PMP inhibitors 0.0027
 
Domain Number - Region: 527-551
Classification Level Classification E-value
Superfamily PMP inhibitors 0.034
Family PMP inhibitors 0.0039
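
The hit records above follow a regular pattern: a "Domain Number ... Region: start-end" line, then a Superfamily row and a Family row, each ending in an E-value. As a minimal sketch, assuming the records are available as plain text in exactly this layout, they could be parsed and ordered along the protein as shown below. The names `DomainHit` and `parse_hits` are hypothetical and not part of any SUPERFAMILY tool; the sample text reuses three records from this page.

```python
import re
from dataclasses import dataclass


@dataclass
class DomainHit:
    number: str               # "1", "2", ... for strong hits; "-" for weak hits
    start: int                # first residue of the region, as listed
    end: int                  # last residue of the region, as listed
    superfamily: str
    superfamily_evalue: float
    family: str
    family_evalue: float


# "Domain Number 5 Region: 2066-2227" or "Domain Number - Region: 527-551"
_REGION = re.compile(r"^Domain Number (\S+) Region: (\d+)-(\d+)$")
# "Superfamily <name> <E-value>" or "Family <name> <E-value>"
_LEVEL = re.compile(r"^(Superfamily|Family) (.+) (\S+)$")


def parse_hits(text):
    """Parse hit records in the layout used on this page (an assumed layout)."""
    hits = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        m = _REGION.match(line)
        if m:
            current = {
                "number": m.group(1),
                "start": int(m.group(2)),
                "end": int(m.group(3)),
            }
            continue
        m = _LEVEL.match(line)
        if m and current is not None:
            level = m.group(1).lower()
            current[level] = m.group(2)
            current[level + "_evalue"] = float(m.group(3))
            if level == "family":
                # The Family row closes a record.
                hits.append(DomainHit(**current))
                current = None
    return hits


sample = """\
Domain Number 1 Region: 2540-2597
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000000275
Family TSP-1 type 1 repeat 0.00048

Domain Number 5 Region: 2066-2227
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.00000000000159
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0039

Domain Number - Region: 527-551
Classification Level Classification E-value
Superfamily PMP inhibitors 0.034
Family PMP inhibitors 0.0039
"""

# Order hits by position along the protein rather than by E-value.
hits = sorted(parse_hits(sample), key=lambda h: h.start)
for h in hits:
    print(f"{h.start}-{h.end}\t{h.superfamily}\t{h.superfamily_evalue:.3g}")
```

Sorting by start position rather than by E-value makes it easier to read off the order of domains along the sequence, which the page itself lists by significance.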
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSECAP00000015286, Gene: ENSECAG00000016582, Transcript: ENSECAT00000018724
Sequence length: 5154
Comment: pep:novel chromosome:EquCab2:4:101750309:101803348:1 gene:ENSECAG00000016582 transcript:ENSECAT00000018724 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
MLLPALLFGTAWALANGRWCEQTETILGEEEVTPRQEELVPCTSLYHYQRLGWRLDLSRS
GPAGLCPIYKPPETRPATWNRTVRACCPGWGGTHCTLVLADASPEGHCFATWQCQLRAGL
ANASAGSLEECCARPWGQSWQDGSSQACLSCSSLRPPGGTPSPALLRPLPGAVAQLWSQH
QRPSATCTTWSGFHYRTFDGRHYHFLGRCTYLLVGAVDSTWAVHLMPREHCPPPGHCQLA
RVMMGPEEVLVQGGNVSVNGQLVSDGESRLLPGLSLQWQGDWLVLSGGLGVVVRLDRSSS
VSVSVDHKLQGQTQGLCGVYNGRPEDDFLEPGGGLVLAATFGNSWRLPDSEPGCLDAVEV
APGCDDPLRGTEAGMQAGQLRAEAQDVCHQLLDGPFRECHAQVPPAEYHEACLFAYCSGA
PAGSGQEGRREAVCATLAHYAQDCAKRRIHVRWRKPGFCERLCPGGQLYSDCASACPPSC
AAAGAGGEGSCGEECVSGCECAPGLFWDGALCVPAARCPCYHRRQRYSPGDTVRQLCNPC
VCQDGRWLCAQAPLGPGHTLTHIDTQVCTHDTCSVRRRTDPGLRPIPPHLAQDFTKGQLL
IVLEHGTCDSGSCLHAISVSLGDTHVQLRDSGAVLVDGQDVGLPWDGPGGLHLSRASSTF
LVLRWPGAQVLWGVSDPAAYITLDPRHADQVQGLCGTFTRNQQDDFLTPAGDVETSIAAF
ASKFQVGDEGRCPSEDSTLLSLCSTHAQHHIFSEAACAVLRGPAFQECHGLVDREPFHLR
CLAAMCGCAPGRDCLCPVLAAYARRCAQEGASLLWRNQTLCPVLCPGGQEYQECAPVCGR
NCGEPEDCGELGGCVAGCTCPLGLLWDPEGQCVPPSLCPCQLGAHRYAPGSATMRDCNRC
VCQERGLWNCTARRCTPPRAYCPQELVYVPGACLLTCDSPTANHSCPPGSTGGCVCPSGT
VLLDERCVPPELCPCRHSGQWYPPNATIQEDCNICVCQGRQWHCTGRRCGGRCQASGAPH
YVTFDGLALTFPGACEYLLVREASGRFTVSAQNLPCGASGLTCTKALTVRLQSTVVHMLR
GQAVTVNGVSVTPPKVYTGPGLCLRQAGLFLLLSTRLGLSLLWDGGTRVLLQLSPEFRGR
VAGLCGDFDGDASNDLRSRQGVLEPTAELAAHSWRLSPLCPEPGDLPHPCTVNTHRASWA
HARCGLMLQPLFARCHVEVPPQQHYEWCVYDACGCDSGGDCECLCSAIAAYADECARHGL
HVRWRSQELCPLQCEGGQVYEACGPTCPPTCHDHDPEPGWHCQAVACVEGCFCPEGTLLH
GGFCLEPTSCPCEWGGSFFPPGTVLQKDCGNCTCQESQWLCGGDGTHCEELVPGCAEGEA
PCQESGHCVPHEWLCDNQDDCGDGSDEEGCATPGCGEGQMSCSSGLCLPLVQLCDGQDDC
GDGTDEQGCPCPQDSLACADGRCLPPALLCDGHPDCPDAADEESCLGQVNCTLGEVSCVD
GTCVGAIQLCDGIWDCPDGADEGPGHCPLPSLPTPPAGTFPGHPAGSQETEPTALASTSA
APPCAPFEFPCGSGECAPRGRCDGEEDCADGSDERGCGWPCAPHHLPCANGPRCVAPAQL
CDGVPQCPDGSDESSDACGERSPHRRGMRLRAMDLRPRGSGRRACQRVTVWGRQSLSERK
GLLSLPTFQQEGIRGWATWGFPLCPCKERVGGAGGQEHSRTFLVSAGSTQLPPCGGLFPC
EYSCPEGLCIVDGEWTSWSPWSSCSEPCGGTMSRQRRCHPPQNGGRTCAMLPGGPHSTYQ
TRPCPQDDCPNATCSGELVFRPCAPCPLTCDDISGQAVCPPDRPCSSPGCWCPEGQVLDA
EGRCVWPRKCPCLVDGTRYWPGQQVKADCQLCICQDGRPRRCRPDPNCAVNCGWSSWSPW
AECLGPCGSQSIQWSFRSPNNPRLSGRGRQCWGIHRKARRCQTGPCEGCAHQGQVRRVGE
RWRAGPCTVCQCLHNGTARCSPYCPLGSCPQDWVLVEGLGESCCHCAPPGENQTVYPMAT
PTPTPAPSPQMGPPLITYVLPPPGDPCYSPLGLAQLPEGSLRALPWQLEHPTWAARLGAP
TDGPGHQGWSPREDAHSQQRTQPLYLQLDLLRPRTSLVSIIVPGAGSSDLLQFSSDGLHW
YNYRDILPGVPTPTQDKPREAGGLSPTVRTFSRMVQAQHVRVWPHDAHHSDTHRSVSLRV
ELLGCEPVSPLEASPCPGGGHRCASGECAPRGAPCDGVEDCMDGSDEEGCAPCMASTPES
VIPSTARTPPLSSTQPGQLPPQHREGLAETESWHPGQRSPVPPTGISEESHSAAWLPGPG
KSVQTVVTTAPTRQPEAEALQPGMAAVTVLPSDAMTPTAPAGQSVAPGPFPPVQCSPGQV
PCEVVGCVDQEQLCDEREDCLDGSDERHCASTVPFTVPTTALPGLLASRALCSPSQLSCG
SGECLPAERRCDRHRDCQDGSDEDGCVDCGLAPWSGWSSCSRSCGLGLAFQRRELLWPPL
PGGSCPPDRLRSQPCFMQACPVAGAWAVWEAWGPCSVSCGGGHRSRRRSCVDPPPKNGGA
PCPGTSQERAPCGLQPCTGGTDCRAEPVRMIKDELCQKGLVSNPPPPLLPRTGAECPSPH
TSGCRCPPGLLLHDSHCLPLSQCPCLVGEELKQPGMPFLLDNCSQCTCEKGALLCEPGGC
PVPCGWSAWSSWGPCDRSCGSGVRARFRSPSNPPAASGGAPCEGDRQEVQVCHTECGTET
LGWTPWAPWSSCSRSCLAPGGGPGRRSRTRLCSSPGDTSCPGEATEEEPCSPAMCPGMPP
VWGLWSPWSTCSAHCDGGIQTRGRSCSASAPGDSECQGPHSQTKDCNTQPCTAQCPGDMV
FRSMEQCHQEGGPCPRLCLAQGPGVECIGFCAPGCACPPGLFLHNASCLPPSRCPCQLHG
QLYAPGEVAQLDSCNNCTCISGEMVCTSEPCPVACGWSPWTPWSVCSRSCNVGLRRRFRA
GTAPPAAFGGPACQGPDMEAEFCSLRPCQGPGGEWGPWSPCSVPCGGGYRNRTRGGGPHS
LMDFSPCGLQPCAGPVPGVCPRGKRWLDCAEGPASCAELSAPQGTNQTCHPGCYCPSGTL
LLNNVCVPSQDCPCAHGGRLHPPGSAVLRPCENCSCVSGLITNCTSWPCAEGQPTWSPWT
PWSECSASCGPARRHRHRFCTRRPGMVLSSMALLPPPASATPLCPGPEAEEEPCLLPGCD
RAGGWGPWGPWSSCSRSCGGGLRSRTRACDQPPPQGLGDYCEGPRAQGETCQVLPCPVTN
CTAIQGAEYSRCGPPCPRSCDDLVHCVWHCQPGCYCPPGQVLSADQAICVQPGHCSCLDL
LTGEWHRPGTQLARPDGCNYCTCSEGRLTCTDLPCPVPGGWCPWSEWTACTQPCRSQMRT
RSRACTCPPPQHGGAPCPGEAGEAGAQHQRETCPSPTACPVDGAWSPWGPWSPCDTCLGQ
SHRSRVCSWPPTEGGRPCPGGHRQSRPCQDNSTRCTDCGGGQELLPCGRPCPRSCQDLSL
GMMCQPGSMACQPSCGCPPGQLSQDGLCVSPAHCRCQYQPGAMGIPENQSRSAGSGLSSW
ESLEPGEVVTGPCDNCTCVAGILQCQEVPGCPGPGVWSSWGPWEDCSVSCGGGEQLRSRR
CARPPCLGPARQSRTCHTQVCREAGCPAGRLYRECQPSEGCPFSCAHITQQVGCFSDGCE
EGCHCPEGTFQHGSACVQECPCVLTALLLQELGAASADPEARPPILGEGGQPLGPGDELG
SGQTLRVGCSNCSCVHGKLSCSVEDCSRAGGGFSPWGPWGLCSRSCGGLGTRTRSRQCVY
PTPAPGGQGCLGPHQDLEYCPSPDCPGAGGSTVEPATGLPGGWGLWSPWSPCSSSCTDPA
RPALRSRTRLCLVNCTSGDTSQERPCNLPSCTELPLCPGPGCVAGNCSWTTWAPWEPCSR
SCGVGQQRRLRAYRPPGPGGHWCPDILTAYQEHRFCNRRACPVPGGWSRWSPWSWCDRSC
GGGRSLRSRRCSSPPPKNGGAPCVGERHHARLCNPMPCEEGCPAGMEVVSCANRCPRRCS
DLQEGIVCQDGQPCQRGCRCPEGSLEQDSGCVPIEHCECTDAQGHSWAPGSQHQEACNNC
SCQAGQLSCTAQPCPPPSHCTWSRWSAWSPCSRSCGPGGQQSRFRSSTSDSWAPECREEQ
SQSQPCPQPPCPPLCLHGAHSRTLGDSWQQGECQQCSCTPEGVICEDTKCAEPGAWTLWS
PWSDCLVSCGGGNQVRTRVCMALASRLEPRHCLGPDTQTQHCGWQPCPGLQEACSWGPWG
PCSRSCGPGLASRSGSCPCLLAEADPTCNGTFLHLDTQACYAGPCLEECVWSSWSSWTRC
SCQVLVQQRYRHQGPAPGRAREGPPCTRLDGHFRPCLISNCSEDSCTPPFEFQACGSPCA
GLCATHLSRGLCQDLPPCQPGCYCPEGLLEQAGGCIPPEQCSCQHILGEGAGVTLAPGEN
LQLGCKECVCQHGELRCTSWGCQGLLPLSGWSEWSPCGPCLPLGVLAPASRAALEERWPQ
DPASLSPTSAPVLASEQHRHRLCLDPETGRPWAGDPDLCTAPLSQQRLCPDPQACHDLCQ
WSPWGPWSPCQVPCSGGFRLRWRKAGVPPGGGCRGPWAQTESCNMGPCPGESCESRDTVP
TPDCANQCPRSCADLWDRVQCLQGPCRPGCRCPPGQLVQDGHCVPISSCRCGLPSPNASW
VLAPAEVVQLDCKTCTCVNGSLLCPRHECPVLGPWSAWSSCSAPCGGGTTKRRRSCEEGP
RGAPCQAQDTEQRQECNPQPCPECPPGQVLSACALSCPRLCSHLQPGDLCVQEPCQPGCG
CPGGQLLHNGTCVPPTACPCTQLSLPWGLTLSLEEQAQELPPGTVLSWNCTRCVCQGGAF
NCSLADCQDCPPGEVWQQVAPGELGPCERTCWEPNATESQSNCSAGQAPGCVCQPGHFRS
QAGPCVPADRCECWRHGHPHPPGSEWQEDCESCRCLGGRSVCTQQCPPLTCAQGEVTMQE
PGSCCPTCRQETLEEQSASCRHLTELRNLTKGPCYLDQVEVSYCSGHCPSSTNVMPEEPY
LQSQCDCCSYRLDPESPVRILNLRCPGGHTEPVVLPVIHSCQCSACQGGDFSKR
Identical sequences: F6QEX6, 9796.ENSECAP00000015286, ENSECAP00000015286
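
The Region values in the domain assignments index into the protein sequence listed above, which is wrapped at 60 residues per line. The following is a rough sketch of how a region could be cut out of the wrapped text, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states). The helper names `join_wrapped` and `region` are hypothetical, and the demo uses only the first two sequence lines with illustrative coordinates rather than a real domain from this entry.

```python
def join_wrapped(lines):
    """Join wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def region(seq, start, end):
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"region {start}-{end} outside sequence of length {len(seq)}")
    # Convert to Python's 0-based, end-exclusive slicing.
    return seq[start - 1:end]


# First two lines of the sequence shown on this page (60 residues each).
wrapped = [
    "MLLPALLFGTAWALANGRWCEQTETILGEEEVTPRQEELVPCTSLYHYQRLGWRLDLSRS",
    "GPAGLCPIYKPPETRPATWNRTVRACCPGWGGTHCTLVLADASPEGHCFATWQCQLRAGL",
]
seq = join_wrapped(wrapped)

print(len(seq))              # number of residues in the joined fragment
print(region(seq, 1, 10))    # first ten residues
print(region(seq, 59, 62))   # a span that crosses the line wrap
```

With the full 5154-residue sequence joined the same way, a call such as `region(seq, 2540, 2597)` would return the span reported for Domain Number 1, under the same coordinate assumption.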
