SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSECAP00000021907 from Equus caballus 76_2

Domain assignment details

Strong hits

Sequence:  ENSECAP00000021907
Domain Number 1 Region: 143-330
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  2.04e-38
Family                Laminin G-like module                   0.0035
 
Domain Number 2 Region: 33-170
Classification Level  Classification                                                      E-value
Superfamily           Galactose-binding domain-like                                       1.13e-36
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.00063
 
Domain Number 3 Region: 968-1021,1083-1217
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  3.17e-32
Family                Laminin G-like module                   0.004
 
Domain Number 4 Region: 350-514
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  4.65e-31
Family                Laminin G-like module                   0.0064
 
Domain Number 5 Region: 778-960
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  3.96e-28
Family                Laminin G-like module                   0.0085
 
Domain Number 6 Region: 572-629
Classification Level  Classification                     E-value
Superfamily           Fibrinogen C-terminal domain-like  1.41e-10
Family                Fibrinogen C-terminal domain-like  0.0027
 
Domain Number 7 Region: 543-579
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      1.72e-06
Family                EGF-type module  0.014
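
The seven strong hits above appear in order of increasing superfamily E-value, not in order of position along the protein. The following is a minimal sketch of how the listed regions could be parsed and reordered from N-terminus to C-terminus. It assumes the regions are 1-based inclusive residue ranges; the `Domain` class and the helper names are hypothetical and are not part of SUPERFAMILY.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    number: int        # "Domain Number" as listed on this page
    superfamily: str   # superfamily-level classification
    region: str        # region string as listed, e.g. "968-1021,1083-1217"
    evalue: float      # superfamily-level E-value


def parse_region(region):
    """Split "a-b,c-d" into [(a, b), (c, d)]; ends are assumed inclusive."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def first_residue(domain):
    """Start of the first segment, used as the sort key."""
    return parse_region(domain.region)[0][0]


def residues_covered(domain):
    """Total residues across all segments of a (possibly split) domain."""
    return sum(end - start + 1 for start, end in parse_region(domain.region))


# Values copied from the strong hits listed on this page.
DOMAINS = [
    Domain(1, "Concanavalin A-like lectins/glucanases", "143-330", 2.04e-38),
    Domain(2, "Galactose-binding domain-like", "33-170", 1.13e-36),
    Domain(3, "Concanavalin A-like lectins/glucanases", "968-1021,1083-1217", 3.17e-32),
    Domain(4, "Concanavalin A-like lectins/glucanases", "350-514", 4.65e-31),
    Domain(5, "Concanavalin A-like lectins/glucanases", "778-960", 3.96e-28),
    Domain(6, "Fibrinogen C-terminal domain-like", "572-629", 1.41e-10),
    Domain(7, "EGF/Laminin", "543-579", 1.72e-06),
]

# Reorder by position along the sequence rather than by E-value.
ordered = sorted(DOMAINS, key=first_residue)

for d in ordered:
    print(d.number, d.region, residues_covered(d), d.superfamily)
```

For this protein the positional order is domains 2, 1, 4, 7, 6, 5, 3.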
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000021907   Gene: ENSECAG00000024128   Transcript: ENSECAT00000026276
Sequence length 1387
Comment pep:known_by_projection chromosome:EquCab2:11:20356372:20369918:-1 gene:ENSECAG00000024128 transcript:ENSECAT00000026276 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MMRLALFCILLAAISGAWGWGYYGCDEELVGPLYARSLGASSYYGLFTTPRFARLHGISG
WSPRIGDPNPWLQIDLMKKHRIRAVATQGSFNSWDWVTRYMLLYGDRVDSWTPFYQRGHN
ATFFGNVNESAVVRHDLHYHFTARYIRIVPLAWNPRGKIGLRLGLYGCPYKSDVLYFDGD
DAISYRFPRGVSRSLWDVFAFSFKTEEKDGLLLHAEGVQGDYVTLELQGAQLLLHMSLGD
SPIQPRPGHTTVSAGGVLNDQHWHYVRVDRFGRQANLTLDGYVQRFVLNGDFERLNLDTE
MFIGGLVGAAQKNLAYRHNFRGCMENVIFNRVNIADLAASRPGESGRMGGKVAFRCLDPV
PHPINFGGPHNFVQVPGFPRRGRLAVSFRFRTWDLTGLLLFSRLGDGLGHVELMLSEGQV
NVSIAQTGRKKLQFAAGYRLNNGFWHEVNFVAQENHAVISIDDVEGAEVRVSYPLLIRTG
TSYFFGGCPKPASRSGCHSNQTAFHGCMELLKVDGQLVNLTLVEGRRLGYYAEVLFDTCG
ITDRCSPNMCEHDGRCYQSWDDFICYCELTGYKGETCHQPLYKESCEAYRLSGKTSGNFT
IDPDGSGPLKPFVVYCDIRENRAWTVVRHDRLWTTRVTGSSMERPFLGAIQYWNASWEEV
SALANASQHCEQWIEFSCYNSRLLNTAGGYPYSFWIGRNEEQHFYWGGSQPGIQRCACGL
DGSCVDPALHCNCDADQPQWRTDKGLLTFVDHLPVTQVVVGDTNRSTSEAQFFLRPLRCY
GDRNSWNTISFHTGAALRFPPIRANHSLDVSFYFRTSAPSGVFLENMGGPYCQWRRPYVR
VELNTSRDVVFAFDVGNGDENLTVHSDDFEFNDDEWHLVRAEINVKQARLRVDHRPWVLR
PMPLQTYIWLEYDQPLYVGSAELKRRPFVGCLRAMRLNGVTLNLEGRANASEGTSPNCTG
RCAHPRFPCFHGGRCVERYSYYTCDCDLTAFDGPYCNHDIGGFFEPGTWMRYNLQSALRS
AAREFSHMLSRPVPGYEPGYIPGYDTPGYVPGYHGPGYRLPDYPQPGRPVPGYRGPVYNV
TGEEVSFSFSTSSAPAVLLYVSSFVRDYMAVLIKEDGTLQLRYQLGTSPYVYQLTTRPVT
DGQPHSVNITRVYRNLFIQVDYFPLTEQKFSLLVDSQLDSPKALYLGRVMETGVIDPEIQ
RYNTPGFSGCLSGVRFNNVAPLKTHFRTPRPMTAELAEALRVQGELSESNCGAMPRLFSE
VPPELDPWYLPPDFPYYHDDGWVAILLGFLVAFLLLGLVGMLVLFYLQNHRYKGSYHTNE
PKATHDYHPGSKHPLPTSGSAQAPAPTPASTQVPAPAPAPTPAPAPAPAPGPRDQNLPQI
LEESRSE
Identical sequences: F6WIQ1, ENSECAP00000021907
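
The Comment line of the protein record packs several fields into space-separated `key:value` tokens. The following is a minimal sketch of splitting it into named fields. It assumes the `chromosome` value follows the layout assembly:name:start:end:strand, as in Ensembl FASTA headers; the function names are hypothetical.

```python
# Comment string copied from the protein record on this page.
COMMENT = (
    "pep:known_by_projection "
    "chromosome:EquCab2:11:20356372:20369918:-1 "
    "gene:ENSECAG00000024128 transcript:ENSECAT00000026276 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Map each token's key (text before the first ':') to the rest."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Split the location value; field order is an assumption (see above)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])

# Genomic span of the locus, counting both end coordinates.
locus_length = location["end"] - location["start"] + 1

print(fields["gene"], location, locus_length)
```

Under that assumed layout, the locus lies on the reverse strand of chromosome 11 in the EquCab2 assembly.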
