SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for 9796.ENSECAP00000008938 from STRING v9.0.5

Domain architecture


Domain assignment details

Strong hits

Sequence:  9796.ENSECAP00000008938

Domain  Region               Superfamily         Superfamily E-value  Family                               Family E-value
1       2209-2302            C-type lectin-like  1.79e-31             Link domain                          0.00013
2       509-647              FAS1 domain         2.88e-29             FAS1 domain                          0.0019
3       1473-1521,1599-1713  FAS1 domain         4.45e-28             FAS1 domain                          0.0024
4       1715-1868            FAS1 domain         1.44e-25             FAS1 domain                          0.0054
5       965-1122             FAS1 domain         1.26e-23             FAS1 domain                          0.0037
6       385-501              FAS1 domain         4.05e-19             FAS1 domain                          0.0051
7       2300-2463            FAS1 domain         1.7e-17              FAS1 domain                          0.0097
8       1130-1255            FAS1 domain         0.000000000693       FAS1 domain                          0.009
9       902-961              EGF/Laminin         0.0000287            EGF-type module                      0.031
10      1545-1582            EGF/Laminin         0.0000407            Merozoite surface protein 1 (MSP-1)  0.088
 
Weak hits

Sequence:  9796.ENSECAP00000008938

Domain  Region     Superfamily                    Superfamily E-value  Family                         Family E-value
-       1997-2137  Growth factor receptor domain  0.000188             Growth factor receptor domain  0.0085
-       230-283    EGF/Laminin                    0.0012               EGF-type module                0.044
-       873-911    EGF/Laminin                    0.00263              EGF-type module                0.055
-       1371-1498  Growth factor receptor domain  0.00471              Growth factor receptor domain  0.02
-       2141-2176  EGF/Laminin                    0.00514              EGF-like domain of nidogen-1   0.083
-       320-357    EGF/Laminin                    0.0144               EGF-type module                0.036
-       198-231    EGF/Laminin                    0.0321               EGF-type module                0.083
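
The Region values in the hits above are residue ranges, and one hit (strong hit 3, 1473-1521,1599-1713) is split into two comma-separated segments. A minimal Python sketch of how such strings might be parsed follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function names are illustrative rather than part of any SUPERFAMILY tool.

```python
from typing import List, Tuple

Segment = Tuple[int, int]


def parse_region(region: str) -> List[Segment]:
    """Split a region string such as "1473-1521,1599-1713" into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def overlap(a: str, b: str) -> int:
    """Number of residues shared by two regions, under the same assumption."""
    total = 0
    for s1, e1 in parse_region(a):
        for s2, e2 in parse_region(b):
            total += max(0, min(e1, e2) - max(s1, s2) + 1)
    return total


print(parse_region("1473-1521,1599-1713"))   # [(1473, 1521), (1599, 1713)]
print(region_length("1473-1521,1599-1713"))  # 164
print(overlap("2209-2302", "2300-2463"))     # 3
```

Under that inclusive-coordinate assumption, strong hits 1 and 7 would share residues 2300-2302, so per-domain lengths should not simply be summed to estimate coverage of the 2572-residue sequence.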
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): 9796.ENSECAP00000008938
Sequence length: 2572
Comment: (Equus caballus)
Sequence:
MAGPRGLLLSLLAFCLAGSGFVMGQKVLSRRCDVKTKFVTHMPCTLCPIIKKQTCPPGWL
REFPGKISQDCRYEVQLGDSLVSMSGCSLECWKDVVQKACCPGYWGSQCYECPGGAKTPC
NGHGTCLDGIDRNGTCVCQENFSGSACQDCQDPNRFGPDCQSVCSCVHGVCRHGPLGDGS
CLCFAGYTGPRCDQELPVCQALHCPQNAQCSAEAPTCSCLPGHTQQGSECRAPDPCRPSP
CSPLAQCSVSPRGQAECRCPENYHGDGMVCLPQDPCTANHGGCPSNSTLCLYQRPGKASC
SCKPGLVSINHNASTGCFAYCLPNSCDRSATCQVTPDGKTSCVCKEGEVGDGRACYGHLL
HEVQKASQTSTAFLRLRIALAMLDQGCREVLTTSGPFTVLVPSISSIFPRTMNASIAQQL
CKQHIIAGQHILEELGTQHTHKWWTLAGQEITVTFNRFMRKYTYKYKDQPQQTFTIHKAN
YPAVNGIFHVVTALRWQPPTELPEDPKRTISQILASTEAFSRFETILENCGLPSILDGPG
PFTVFAPSNEAVDRLRDGRLIYLFTAGLSKLQELVRYHIYSHGQLTVEKLISKGRVLTMA
NQVLAVNISEEGRILLGPEGVPLRRVDVLAANGVIHMLEGILLPPTILPILPKHCNEEQH
KIVAGSCVDCQALNTSTCPPNSVMLDIFPEECVYTYDPSGLNVLKKGCARYCNQTILKPG
CCKGFFGPDCAQCPGGFSNPCYGKGNCSDGVRGNGACLCFPDYKGIACHICSNPNKHGDQ
CQEDCGCVHGLCDNRPGSGGVCQSGTCAPGFSGRFCNESTGSCGPTEQAQHCHLHARCVT
QGRVARCLCLDGFEGDGFSCTPSNPCSHPDRGGCSENAECVPGTMGTHHCTCHKGWSGDG
RVCVAIDECELDARGGCHADALCSYVGPGQSRCTCKLGFAGDGYMCSPIDPCRAGNGGCH
DLATCRAVGGGQRVCTCPPGYGGDGLSCYGDIFQELEANAHFSVFYQWIKSAGITLPADS
RVTALVPSESAIRRLSPEDQAFWLQPRMLPQLVRAHFLQGSLSEEELARLGGQDVATLSP
TTRWEIHNISGRVWVQNASVDVADLLATNGVLHVLSQVLLPPKGAMPVGQGLLQRLDSVP
AFRLFRELLQHHGLVPQIEAATAYTIFVPTNHSLEVQGNRSSLDVDTVRHHVVLGEALSV
EALQRGGHRNSLLGPAHWLVFYNHSGQPEVNHVPLEGPLLEAPGGSLFGLSGVLTVGSSR
CLHTHAEALREKCINCTRKFRCTQGFKLEDTPKKSCVYQSGYSFSQGCSYTCAKKIQVPD
CCPGFFGTLCEPCPGGLGGVCSGHGQCQDRLLGSGECRCQEGFHGTACEMCELGRYGPNC
TGVCDCAHGLCQEGLRGDGRCVCSVGWQGLRCDQKITGLQCPKKCDPNANCVQDSATAPA
CICAAGYSGNGIYCSEVDPCAHDHGGCSPHANCTKVAPGQRTCTCQDGYTGDGELCQEVN
SCLVHHGGCHLHAECIPTGPQQVSCSCREGYSGDGIRTCELLDPCSQNNGGCSPYAVCKS
IGDGQRTCTCDAAHTVGDGFSCRARVGLELLRDRHASFFSLHLLEYKELKGNGPFTIFVP
HADLMTNLSQDELARIRTQRQLVFRYHVVGCRQLRSQELLEEGYVTTLSGHPLRFSEREG
SIYINDYARVVSSDQQAVNGILHFIDRVLLPPDALHWEPDAAPIPRRNVTAAAESFGYKI
FSSLVTVAGLLPLLQDGSHRPLTMLWPTDSALRALPRDQQAWLYHEDHRDKLAAILRGHV
IRNVEALASDLPNLGPLRTMHGTPISFSCSRARPGELMVGEDDARIVQRHLPFEGGLAYG
IDQLLEPPGLGARCDRFETRPLHLKICSICGLEPPCPQGSQEQGSPEACWRYYSKFWMSP
PLYSLALRGIWPQPSLWGPPQGLGRGCHRNCVTTTWKPSCCPGHYGSECRACPGGASSPC
SDHGVCMDGMSGSGQCRCRSGFAGTACELCAPGAFGPHCQACRCTSHGHCDEGLGGSGSC
FCDEGWTGSRCEVQLELQPVCAPPCAPEAVCRVGNSCECSLGYEGDGRTCTVADLCQDGH
GGCSEHANCSQVGTVVTCTCLPDYEGDGWSCRARDPCADGHRGGCSEHADCLNTGPNTRR
CVCHAGYVGDGLQCLEEPEPPVDRCLGQPPPCHVDAVCTDLHFQEKRAGVFHLQATSGPY
GLNFSEAEAACGAQGAVLASLPQLSAAQQLGFHLCLVGWLANGSAAHPVVFPAADCGDGQ
VGVVSLGTRGNLSERWDAYCYREQDVACQCRDGFVGDGTSVCNGKLLDVLATTANFSTFY
GMLLGYANATPRGLDFLDFLDDELTYKTLFVPVNEGFVDNMTLSGPDLELHASNTTFLST
NASQGTMLPAHSGLSLVISDVGPDNSTWVPVAPGAVVVSRVIVWDIMAFNGIIHALASPL
LAPPQPHAVVAPEAPSVAVGAGAVVATGALLGLVAGALYLRARGKATGFGFSAFQAEDDA
DDDFSPWQEGTSPTLVSVPNPVFGSHDAFCEPFDDSLLEEDFPDTQRILAVK
Identical sequences F7C1Y3
ENSECAP00000008938 9796.ENSECAP00000008938 ENSECAP00000008938 XP_001493277.1.31192
