SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000021874 from Equus caballus 76_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSECAP00000021874
Domain Number 1 Region: 143-330
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  2.15e-38
Family                Laminin G-like module                   0.0035
 
Domain Number 2 Region: 33-170
Classification Level  Classification                                                      E-value
Superfamily           Galactose-binding domain-like                                       1.13e-36
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.00063
 
Domain Number 3 Region: 978-1031,1093-1227
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  3.17e-32
Family                Laminin G-like module                   0.004
 
Domain Number 4 Region: 360-524
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  4.33e-31
Family                Laminin G-like module                   0.0064
 
Domain Number 5 Region: 788-970
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  4.07e-28
Family                Laminin G-like module                   0.0085
 
Domain Number 6 Region: 582-639
Classification Level  Classification                     E-value
Superfamily           Fibrinogen C-terminal domain-like  1.41e-10
Family                Fibrinogen C-terminal domain-like  0.0027
 
Domain Number 7 Region: 553-589
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      1.72e-06
Family                EGF-type module  0.014
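
The domains above are numbered by hit strength rather than by position, and some regions are split into several segments (Domain 3) or share residues with a neighbour. The following is a minimal Python sketch, not part of the SUPERFAMILY server, that re-reads the listed regions, orders the domains along the sequence, and reports shared residues. The helper names (parse_region, domain_length, by_position, overlaps) are hypothetical, and treating the region endpoints as 1-based and inclusive is an assumption.

```python
from itertools import combinations

# Superfamily-level hits copied from the listing above:
# (domain number, superfamily, region string)
DOMAINS = [
    (1, "Concanavalin A-like lectins/glucanases", "143-330"),
    (2, "Galactose-binding domain-like", "33-170"),
    (3, "Concanavalin A-like lectins/glucanases", "978-1031,1093-1227"),
    (4, "Concanavalin A-like lectins/glucanases", "360-524"),
    (5, "Concanavalin A-like lectins/glucanases", "788-970"),
    (6, "Fibrinogen C-terminal domain-like", "582-639"),
    (7, "EGF/Laminin", "553-589"),
]


def parse_region(region):
    """Turn a string such as '978-1031,1093-1227' into (start, end) tuples."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def domain_length(region):
    """Residues covered by a region, assuming inclusive endpoints."""
    return sum(end - start + 1 for start, end in parse_region(region))


def by_position(domains):
    """Order domains by the start of their first segment."""
    return sorted(domains, key=lambda d: parse_region(d[2])[0][0])


def shared_residues(region_a, region_b):
    """Count residues present in both regions (inclusive endpoints assumed)."""
    total = 0
    for a_start, a_end in parse_region(region_a):
        for b_start, b_end in parse_region(region_b):
            total += max(0, min(a_end, b_end) - max(a_start, b_start) + 1)
    return total


def overlaps(domains):
    """List (domain_a, domain_b, shared residue count) for overlapping pairs."""
    found = []
    for (num_a, _, reg_a), (num_b, _, reg_b) in combinations(domains, 2):
        n = shared_residues(reg_a, reg_b)
        if n > 0:
            found.append((num_a, num_b, n))
    return found


if __name__ == "__main__":
    for number, superfamily, region in by_position(DOMAINS):
        print(number, region, domain_length(region), superfamily)
    print(overlaps(DOMAINS))
```

Under the inclusive-endpoint assumption, the sketch places the domains in the order 2, 1, 4, 7, 6, 5, 3 along the protein and flags two overlapping pairs: Domains 1 and 2 (residues 143-170) and Domains 6 and 7 (residues 582-589).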
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000021874   Gene: ENSECAG00000024128   Transcript: ENSECAT00000026238
Sequence length 1397
Comment pep:known_by_projection chromosome:EquCab2:11:20356372:20369918:-1 gene:ENSECAG00000024128 transcript:ENSECAT00000026238 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MMRLALFCILLAAISGAWGWGYYGCDEELVGPLYARSLGASSYYGLFTTPRFARLHGISG
WSPRIGDPNPWLQIDLMKKHRIRAVATQGSFNSWDWVTRYMLLYGDRVDSWTPFYQRGHN
ATFFGNVNESAVVRHDLHYHFTARYIRIVPLAWNPRGKIGLRLGLYGCPYKSDVLYFDGD
DAISYRFPRGVSRSLWDVFAFSFKTEEKDGLLLHAEGVQGDYVTLELQGAQLLLHMSLGD
SPIQPRPGHTTVSAGGVLNDQHWHYVRVDRFGRQANLTLDGYVQRFVLNGDFERLNLDTE
MFIGGLVGAAQKNLAYRHNFRGCMENVIFNRVNIADLAVRRHSRITFEASRPGESGRMGG
KVAFRCLDPVPHPINFGGPHNFVQVPGFPRRGRLAVSFRFRTWDLTGLLLFSRLGDGLGH
VELMLSEGQVNVSIAQTGRKKLQFAAGYRLNNGFWHEVNFVAQENHAVISIDDVEGAEVR
VSYPLLIRTGTSYFFGGCPKPASRSGCHSNQTAFHGCMELLKVDGQLVNLTLVEGRRLGY
YAEVLFDTCGITDRCSPNMCEHDGRCYQSWDDFICYCELTGYKGETCHQPLYKESCEAYR
LSGKTSGNFTIDPDGSGPLKPFVVYCDIRENRAWTVVRHDRLWTTRVTGSSMERPFLGAI
QYWNASWEEVSALANASQHCEQWIEFSCYNSRLLNTAGGYPYSFWIGRNEEQHFYWGGSQ
PGIQRCACGLDGSCVDPALHCNCDADQPQWRTDKGLLTFVDHLPVTQVVVGDTNRSTSEA
QFFLRPLRCYGDRNSWNTISFHTGAALRFPPIRANHSLDVSFYFRTSAPSGVFLENMGGP
YCQWRRPYVRVELNTSRDVVFAFDVGNGDENLTVHSDDFEFNDDEWHLVRAEINVKQARL
RVDHRPWVLRPMPLQTYIWLEYDQPLYVGSAELKRRPFVGCLRAMRLNGVTLNLEGRANA
SEGTSPNCTGRCAHPRFPCFHGGRCVERYSYYTCDCDLTAFDGPYCNHDIGGFFEPGTWM
RYNLQSALRSAAREFSHMLSRPVPGYEPGYIPGYDTPGYVPGYHGPGYRLPDYPQPGRPV
PGYRGPVYNVTGEEVSFSFSTSSAPAVLLYVSSFVRDYMAVLIKEDGTLQLRYQLGTSPY
VYQLTTRPVTDGQPHSVNITRVYRNLFIQVDYFPLTEQKFSLLVDSQLDSPKALYLGRVM
ETGVIDPEIQRYNTPGFSGCLSGVRFNNVAPLKTHFRTPRPMTAELAEALRVQGELSESN
CGAMPRLFSEVPPELDPWYLPPDFPYYHDDGWVAILLGFLVAFLLLGLVGMLVLFYLQNH
RYKGSYHTNEPKATHDYHPGSKHPLPTSGSAQAPAPTPASTQVPAPAPAPTPAPAPAPAP
GPRDQNLPQILEESRSE
Identical sequences F6X9P2
ENSECAP00000021874 9796.ENSECAP00000021874 ENSECAP00000021874
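
The Comment line in the protein sequence section packs several fields into space-separated key:value tokens. The following is a minimal Python sketch for unpacking it. The function names are hypothetical, and reading the location token as assembly, sequence region name, start, end, strand (after the leading coordinate-system key) is an assumption based on the usual Ensembl FASTA header layout rather than something this page states.

```python
# Comment line copied from the protein sequence section above.
COMMENT = (
    "pep:known_by_projection chromosome:EquCab2:11:20356372:20369918:-1 "
    "gene:ENSECAG00000024128 transcript:ENSECAT00000026238 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split the comment into a dict, cutting each token at its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Unpack 'assembly:name:start:end:strand' (field order is an assumption)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    location = parse_location(fields["chromosome"])
    # Genomic span in base pairs, assuming inclusive coordinates.
    span = location["end"] - location["start"] + 1
    print(fields["gene"], location, span)
```

Read this way, the gene sits on the reverse strand of chromosome 11 in the EquCab2 assembly and spans 13547 bp, assuming inclusive coordinates.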
