SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000022972 from Equus caballus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000022972
Domain Number 1 Region: 12-152
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.49e-46
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00034
Further Details:      
 
Domain Number 2 Region: 126-312
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.47e-38
Family Laminin G-like module 0.0025
Further Details:      
 
Domain Number 3 Region: 761-935
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.33e-35
Family Laminin G-like module 0.0022
Further Details:      
 
Domain Number 4 Region: 942-1154
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.94e-34
Family Laminin G-like module 0.0048
Further Details:      
 
Domain Number 5 Region: 336-527
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.48e-33
Family Laminin G-like module 0.0031
Further Details:      
 
Domain Number 6 Region: 553-611
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 0.000000000712
Family Fibrinogen C-terminal domain-like 0.003
Further Details:      
 
Domain Number 7 Region: 524-561
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000191
Family EGF-type module 0.012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000022972   Gene: ENSECAG00000008838   Transcript: ENSECAT00000029093
Sequence length 1300
Comment pep:known_by_projection chromosome:EquCab2:4:99376000:100674882:1 gene:ENSECAG00000008838 transcript:ENSECAT00000029093 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
SEKCDEPLVSGLPHGAFSSSSSMSGSYSPGYAKINKRGGAGGWSPSDSDHYQWLQVDFGN
RKQISAIATQGRYSSSDWVTQYRMLYSDTGRNWKPYHQDGNIWAFPGNVNSDSVVRHDLQ
HPVVARYVRMVPLDWNGEGRIGLRIEVYGCSYWADVINFDGHVVLPYRFRNKKMKTLKDV
IALKFKTTESEGVILHGEGQQGDYITLELKKAKLVFSLNLGSNQLGPIYGHTSVMTGSLL
DDHHWHSVVIERQGRSINLTLDRSMQHFRTNGEFDYLDLDYEITFGGIPFSGKPSSSSRK
NFKGCMESINYNGINITDLARRKKLEPSNVGNLSFSCVEPYTVPVFFNATSYLEVPGRLN
QDLFSVSFQFRTWSPNGLLLFSRFADNLGNVEIDLTESKVGVHINVTQTKMSQIDISSGS
GLNDGQWHEVRFLAKENFAILTIDGDEASAVRTNSPLQVKTGEKYFFGGFLNQMNNSSHS
VLQPSFQGCMQLIQVDDQLVNLYEVAQRRPGSFANVSIDMCAIIDRCVPNHCEHGGKCSQ
TWDSFKCTCDETGYSGATCHNSIYEPSCEAYKHLGQTSNYYWIDPDGSGPLGPLKVYCNM
TEDKVWTIVSHDLHMQTTVVGYNPEKYSVTQLIYSASMDQISALTKSPQRAPLPSPSYHL
NVPFCFLVDGSPYTWWVGKANEKHYYWGGSAPGIQKCACGIERNCTDPKYYCNCDADYKQ
WRKDAGFLTYKDHLPVSQVVVGDTDRQGSEAKLSVGPLRCQGDRNYWNAASFPNPSSYLH
FSTFQGETSADISFYFKTLIPRGVFLENLGNTDFIKLELKSATEVSFSFDVGNGPVEIVV
RSPSPLNDDQWHRVTAERNVKQASLQVDRLPQQIRKAPTEGHTRLELYSQLFVGGAGGQQ
GFLGCIRSLRMNGVTLDLEERAKVTSGFKSGCSGHCTSYGTNCENGGKCIEKYHGYSCDC
SNTAYDGTFCNKDVGAYFEEGMWLRYNFQASGISAKESGSRSENSPDQQNSPQDLAHEEI
RFSFSTTKAPCILLYVSSLTTDFLAVLVKPTGNLQIRYNLGGTREPYNIDVDHRNMANGQ
PHSVNITRHEKTIILKLDHYPSVSYHLPSSSDTLFNSPKSLFLGKVIETGKIDQEIHKYN
TPGFTGCLSRVQFNQIAPLKAALRPTNASAHVHIQGELVESNCGASPLTLSPMSSATDPW
HLDHLDSASADFPYNPGQGQAIRNGVNRNSAIIGGVIAVVIFTILCTLVFLIRYMFRHKG
TYHTNEAKGAESAESADAAIMNNDPNFTETIDESKKEWLI
Download sequence
Identical sequences F6T0D2
ENSECAP00000022972 9796.ENSECAP00000022972 ENSECAP00000022972

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]