SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSCAFP00000029720 from Canis familiaris 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSCAFP00000029720
Domain Number 1 Region: 38-179
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.24e-42
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00034
Further Details:      
 
Domain Number 2 Region: 153-339
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.32e-37
Family Laminin G-like module 0.006
Further Details:      
 
Domain Number 3 Region: 725-751,786-939
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1e-36
Family Laminin G-like module 0.0019
Further Details:      
 
Domain Number 4 Region: 338-521
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.03e-36
Family Laminin G-like module 0.0052
Further Details:      
 
Domain Number 5 Region: 970-1174
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.39e-29
Family Laminin G-like module 0.018
Further Details:      
 
Domain Number 6 Region: 580-638
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 0.000000000141
Family Fibrinogen C-terminal domain-like 0.0033
Further Details:      
 
Domain Number 7 Region: 552-588
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000479
Family EGF-type module 0.014
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSCAFP00000029720   Gene: ENSCAFG00000020059   Transcript: ENSCAFT00000031919
Sequence length 1309
Comment pep:known_by_projection chromosome:CanFam3.1:5:74509885:74749100:-1 gene:ENSCAFG00000020059 transcript:ENSCAFT00000031919 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGSITGAVLKLLLLLSTQNWNRVIAGNSYDCDEPLVSTLSQASFSSSSELSSSHGAGFAR
LNRRDGAGGWSPLVSNKYQWLQIDLGERMETTAVATQGGYGSSNWVTSYLLMFSDSGRNW
KQYRQEDSIWGFSGNANADSVVYYRLQPSIKARFLRFIPLEWNPRGRIGMRIEVFGCAYR
SEVVDLDGKSSLLYRFDQNSLSPVKDVISLKFKTMQSDGILLHREGQNGDHITLELRRGR
LFLLINSGEAKPPSTHTLINLTLGSLLDDQHWHSVLIQHVGKQVNFTVDEHRHRFHAQGE
FGYLDLDHEISFGGIPAPGKSVSFPHKNFHGCLENLYYNGVDIIDLAKRQKPEIIAMGNV
SFSCSQSQSVPVTFLSPRSYLALPGFSGEDELSASFQFRTWNKAGLLLFSELQLVSGGLL
LFLNDGKLKLNLYQPGRLPSDITAGVVGLNDGQWHSISLSARRNHLSMVVDGQVASAASS
LGPEQLYSGGTYYFGGCPDNSFGSKCKNPLDGFQGCMRLISISNRIVDLISVQQGSLGNF
SDLQIDSCGITDRCLPNYCEHGGECSQSWNTFHCNCANTGYTGATCHNSIYEQSCESYKH
RGNTSGFYYIDSDGSGPLEPFLLYCNMTETAWTVIQHNGSDLTRVRNTNPENPYAGFFEY
VASMEQLQATINLAEHCEQELTYYCKKSRLVNKQDGSPLSWWVGRTNETQTYWGGSLPDP
QKCTCGLEGNCIDSQYYCNCDADRNEWTNDTGFLSYKEHLPVRKIVVTDTGRPHSEAAYK
LGPLLCRGDRSFWNSASFNTEASYLHFPTFHGELSADVSFFFKTTASSGVFLENLGITDF
IRIELRSPAVVTFSFDVGNGPFEISVQSPTHFNDNQWHHVRVERNMKEASLRVDQLLPKT
QPAPADGHVLLQLNSQLFVGGTATRQRGFLGCIRSLQLNGMSLDLEERAKVTPGVEPGCR
GHCSSYGKLCQNGGKCREKLSGFSCDCTFSAYTGPFCSKEISAYFGSGSSLIYNFQENYS
LSKNSSSHAASFHGDMKLSREMIKFSFRTTRAPSLLLYVSSFYKEYLSVIIAKNGSLQIR
YKLNRYQEPDVINFDFKNMADGQLHHVDINREEGVVFVEIDENAKRQVHLSSGTEFSAIR
SLVLGRILEHGDLDQETALAGAQGFLGCLSAVQLSHVAPLKAALQPRHPARITVSGQVTE
SSCVAQAGTDATSRERTHSFADHSGTRNDREPLANAIRSDSAVIGGLIAVVIFILLCIAA
IAIRIYQQKRLYKRNEAKRSENVDSAEAVLKSELNIQSAVNENQKEYFF
Download sequence
Identical sequences F1PAM8
XP_536775.2.84170 ENSCAFP00000029720 ENSCAFP00000029720

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]