SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000008938 from Equus caballus 69_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000008938
Domain Number 1 Region: 2209-2302
Classification Level Classification E-value
Superfamily C-type lectin-like 1.79e-31
Family Link domain 0.00013
Further Details:      
 
Domain Number 2 Region: 509-647
Classification Level Classification E-value
Superfamily FAS1 domain 2.88e-29
Family FAS1 domain 0.0019
Further Details:      
 
Domain Number 3 Region: 1473-1521,1599-1713
Classification Level Classification E-value
Superfamily FAS1 domain 4.45e-28
Family FAS1 domain 0.0024
Further Details:      
 
Domain Number 4 Region: 1715-1868
Classification Level Classification E-value
Superfamily FAS1 domain 1.44e-25
Family FAS1 domain 0.0054
Further Details:      
 
Domain Number 5 Region: 965-1122
Classification Level Classification E-value
Superfamily FAS1 domain 1.26e-23
Family FAS1 domain 0.0037
Further Details:      
 
Domain Number 6 Region: 385-501
Classification Level Classification E-value
Superfamily FAS1 domain 4.05e-19
Family FAS1 domain 0.0051
Further Details:      
 
Domain Number 7 Region: 2300-2463
Classification Level Classification E-value
Superfamily FAS1 domain 1.7e-17
Family FAS1 domain 0.0097
Further Details:      
 
Domain Number 8 Region: 1130-1255
Classification Level Classification E-value
Superfamily FAS1 domain 0.000000000693
Family FAS1 domain 0.009
Further Details:      
 
Domain Number 9 Region: 902-961
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000287
Family EGF-type module 0.031
Further Details:      
 
Domain Number 10 Region: 1545-1582
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000407
Family Merozoite surface protein 1 (MSP-1) 0.088
Further Details:      
 
Weak hits

Sequence:  ENSECAP00000008938
Domain Number - Region: 1997-2137
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.000188
Family Growth factor receptor domain 0.0085
Further Details:      
 
Domain Number - Region: 230-283
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0012
Family EGF-type module 0.044
Further Details:      
 
Domain Number - Region: 873-911
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00263
Family EGF-type module 0.055
Further Details:      
 
Domain Number - Region: 1371-1498
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00471
Family Growth factor receptor domain 0.02
Further Details:      
 
Domain Number - Region: 2141-2176
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00514
Family EGF-like domain of nidogen-1 0.083
Further Details:      
 
Domain Number - Region: 320-357
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0144
Family EGF-type module 0.036
Further Details:      
 
Domain Number - Region: 198-231
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0321
Family EGF-type module 0.083
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000008938   Gene: ENSECAG00000009860   Transcript: ENSECAT00000011432
Sequence length 2572
Comment pep:known chromosome:EquCab2:16:35200255:35227458:-1 gene:ENSECAG00000009860 transcript:ENSECAT00000011432 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MAGPRGLLLSLLAFCLAGSGFVMGQKVLSRRCDVKTKFVTHMPCTLCPIIKKQTCPPGWL
REFPGKISQDCRYEVQLGDSLVSMSGCSLECWKDVVQKACCPGYWGSQCYECPGGAKTPC
NGHGTCLDGIDRNGTCVCQENFSGSACQDCQDPNRFGPDCQSVCSCVHGVCRHGPLGDGS
CLCFAGYTGPRCDQELPVCQALHCPQNAQCSAEAPTCSCLPGHTQQGSECRAPDPCRPSP
CSPLAQCSVSPRGQAECRCPENYHGDGMVCLPQDPCTANHGGCPSNSTLCLYQRPGKASC
SCKPGLVSINHNASTGCFAYCLPNSCDRSATCQVTPDGKTSCVCKEGEVGDGRACYGHLL
HEVQKASQTSTAFLRLRIALAMLDQGCREVLTTSGPFTVLVPSISSIFPRTMNASIAQQL
CKQHIIAGQHILEELGTQHTHKWWTLAGQEITVTFNRFMRKYTYKYKDQPQQTFTIHKAN
YPAVNGIFHVVTALRWQPPTELPEDPKRTISQILASTEAFSRFETILENCGLPSILDGPG
PFTVFAPSNEAVDRLRDGRLIYLFTAGLSKLQELVRYHIYSHGQLTVEKLISKGRVLTMA
NQVLAVNISEEGRILLGPEGVPLRRVDVLAANGVIHMLEGILLPPTILPILPKHCNEEQH
KIVAGSCVDCQALNTSTCPPNSVMLDIFPEECVYTYDPSGLNVLKKGCARYCNQTILKPG
CCKGFFGPDCAQCPGGFSNPCYGKGNCSDGVRGNGACLCFPDYKGIACHICSNPNKHGDQ
CQEDCGCVHGLCDNRPGSGGVCQSGTCAPGFSGRFCNESTGSCGPTEQAQHCHLHARCVT
QGRVARCLCLDGFEGDGFSCTPSNPCSHPDRGGCSENAECVPGTMGTHHCTCHKGWSGDG
RVCVAIDECELDARGGCHADALCSYVGPGQSRCTCKLGFAGDGYMCSPIDPCRAGNGGCH
DLATCRAVGGGQRVCTCPPGYGGDGLSCYGDIFQELEANAHFSVFYQWIKSAGITLPADS
RVTALVPSESAIRRLSPEDQAFWLQPRMLPQLVRAHFLQGSLSEEELARLGGQDVATLSP
TTRWEIHNISGRVWVQNASVDVADLLATNGVLHVLSQVLLPPKGAMPVGQGLLQRLDSVP
AFRLFRELLQHHGLVPQIEAATAYTIFVPTNHSLEVQGNRSSLDVDTVRHHVVLGEALSV
EALQRGGHRNSLLGPAHWLVFYNHSGQPEVNHVPLEGPLLEAPGGSLFGLSGVLTVGSSR
CLHTHAEALREKCINCTRKFRCTQGFKLEDTPKKSCVYQSGYSFSQGCSYTCAKKIQVPD
CCPGFFGTLCEPCPGGLGGVCSGHGQCQDRLLGSGECRCQEGFHGTACEMCELGRYGPNC
TGVCDCAHGLCQEGLRGDGRCVCSVGWQGLRCDQKITGLQCPKKCDPNANCVQDSATAPA
CICAAGYSGNGIYCSEVDPCAHDHGGCSPHANCTKVAPGQRTCTCQDGYTGDGELCQEVN
SCLVHHGGCHLHAECIPTGPQQVSCSCREGYSGDGIRTCELLDPCSQNNGGCSPYAVCKS
IGDGQRTCTCDAAHTVGDGFSCRARVGLELLRDRHASFFSLHLLEYKELKGNGPFTIFVP
HADLMTNLSQDELARIRTQRQLVFRYHVVGCRQLRSQELLEEGYVTTLSGHPLRFSEREG
SIYINDYARVVSSDQQAVNGILHFIDRVLLPPDALHWEPDAAPIPRRNVTAAAESFGYKI
FSSLVTVAGLLPLLQDGSHRPLTMLWPTDSALRALPRDQQAWLYHEDHRDKLAAILRGHV
IRNVEALASDLPNLGPLRTMHGTPISFSCSRARPGELMVGEDDARIVQRHLPFEGGLAYG
IDQLLEPPGLGARCDRFETRPLHLKICSICGLEPPCPQGSQEQGSPEACWRYYSKFWMSP
PLYSLALRGIWPQPSLWGPPQGLGRGCHRNCVTTTWKPSCCPGHYGSECRACPGGASSPC
SDHGVCMDGMSGSGQCRCRSGFAGTACELCAPGAFGPHCQACRCTSHGHCDEGLGGSGSC
FCDEGWTGSRCEVQLELQPVCAPPCAPEAVCRVGNSCECSLGYEGDGRTCTVADLCQDGH
GGCSEHANCSQVGTVVTCTCLPDYEGDGWSCRARDPCADGHRGGCSEHADCLNTGPNTRR
CVCHAGYVGDGLQCLEEPEPPVDRCLGQPPPCHVDAVCTDLHFQEKRAGVFHLQATSGPY
GLNFSEAEAACGAQGAVLASLPQLSAAQQLGFHLCLVGWLANGSAAHPVVFPAADCGDGQ
VGVVSLGTRGNLSERWDAYCYREQDVACQCRDGFVGDGTSVCNGKLLDVLATTANFSTFY
GMLLGYANATPRGLDFLDFLDDELTYKTLFVPVNEGFVDNMTLSGPDLELHASNTTFLST
NASQGTMLPAHSGLSLVISDVGPDNSTWVPVAPGAVVVSRVIVWDIMAFNGIIHALASPL
LAPPQPHAVVAPEAPSVAVGAGAVVATGALLGLVAGALYLRARGKATGFGFSAFQAEDDA
DDDFSPWQEGTSPTLVSVPNPVFGSHDAFCEPFDDSLLEEDFPDTQRILAVK
Download sequence
Identical sequences F7C1Y3
ENSECAP00000008938 XP_001493277.1.31192 ENSECAP00000008938 9796.ENSECAP00000008938

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]