SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000019033 from Equus caballus 69_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000019033
Domain Number 1 Region: 1566-1682,1762-1870,1913-1974,2131-2161
Classification Level Classification E-value
Superfamily Sialidases 0.00000000111
Family Sialidases (neuraminidases) 0.02
Further Details:      
 
Domain Number 2 Region: 472-525,669-744,827-856
Classification Level Classification E-value
Superfamily Sialidases 0.000000745
Family Sialidases (neuraminidases) 0.034
Further Details:      
 
Domain Number 3 Region: 2414-2502,2600-2625,2659-2711
Classification Level Classification E-value
Superfamily Sialidases 0.00000471
Family Sialidases (neuraminidases) 0.047
Further Details:      
 
Weak hits

Sequence:  ENSECAP00000019033
Domain Number - Region: 2369-2394
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000512
Family Integrin beta EGF-like domains 0.026
Further Details:      
 
Domain Number - Region: 2465-2550,2658-2692,2743-2988
Classification Level Classification E-value
Superfamily Sialidases 0.000902
Family Sialidases (neuraminidases) 0.026
Further Details:      
 
Domain Number - Region: 563-588
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0013
Family EGF-type module 0.023
Further Details:      
 
Domain Number - Region: 921-947
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00293
Family EGF-type module 0.085
Further Details:      
 
Domain Number - Region: 913-946,1297-1328,2006-2046
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00392
Family Growth factor receptor domain 0.009
Further Details:      
 
Domain Number - Region: 1411-1452,1568-1621
Classification Level Classification E-value
Superfamily Sialidases 0.00432
Family Sialidases (neuraminidases) 0.045
Further Details:      
 
Domain Number - Region: 1033-1087,1206-1268
Classification Level Classification E-value
Superfamily Sialidases 0.00869
Family Sialidases (neuraminidases) 0.088
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000019033   Gene: ENSECAG00000020468   Transcript: ENSECAT00000023003
Sequence length 3347
Comment pep:known chromosome:EquCab2:4:4129555:4518186:-1 gene:ENSECAG00000020468 transcript:ENSECAT00000023003 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSDHQFGNQFMCSVVASHVSHLPTTNLSFVWIAPPAGTGCVNFMATATHRGQIIFKDALA
QQLCEQGAPTEATLHPHLAEIHSDSIILRDDFDSYHQLELNPNIWVECNNCETGEQCGAI
MHGNAVTFCEPYGPRELITTSLNTTTASVLQFSIGSGSCRFSYSDPSITVSYAKNNSADW
TLLEKIRAPSNVSTIIHILYLPEDAKGENVQFQWKQENLRVGEVYEACWALDNILVINSA
HRQVILEDSLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSSE
DIQEQWSEEFESQPTGWDILGAVIGTECGTIESGLSMVFLRDGERKLCTPYMDTTGYGNL
RFYFVMGGICDSGDSHENDIILYAKIEGRKEHIALDTLSYSSYKVPSLVSVVINPELQTP
ATKFCLRQKNHQGHNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFS
TNHGRSWSLIHTECLPEICAGPHLPHSTIYSSENYSGWNRITIPLPNAALTRDTRIRWRQ
TGPILGNMWAIDNVYIGPSCLKFCSGRGQCTQHGCKCDPGFSGPACEMASQTFPMFISES
FGSSRLSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTL
RLGSKSVLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLNYHEPRIISVELPDDARQ
FGIQFRWWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPCC
GHDWTLCFTGDSKLASSMRYVETQSMQTGASYMIQFSLVMGCGKKYTPHMDNQVKLEYST
NHGLTWHLVQEECLPSMPSCQEFTSASIYHASEFTQWRRVTVLLPQKTWSSATRFRWSQS
YYTAQDEWALDNIYIGQQCPNMCGGHGSCDRGLCRCDQGYQGTDCQPEAALPSTIMSDFE
NQNGWESDWQEVIGGEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWADFVQFYI
QIGGESAACNRPDSREEGVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPC
TRFRWWQPVFSGEDYDQWAIDDIIILSEKQKQIIPAVNPTLPQNFYEKPAFDYPMNQMSV
WLMLANEGMVKNETFCSATPSAMVFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCATQFS
SAAPVLLQYSHDAGMSWFLVREGCYPASAGKGCEGNARELSEPTVYHTGDFEEWTRITIV
IPRSLASSKTRFRWIQESSSQKSLPPFGLDGVYISEPCPSYCSGHGDCVSGVCFCDLGYT
AAQGTCVSNVPNHSEMFDRFEGKLSPLWYKITGGQVGTGCGTLNDGRSLYFNGPGKREAR
TVPLDTRNIRLVQFYIQIGSKTSGIACIKPRARNEGLVVQYSNDNGIIWHLLRELDFMSF
LEPQIISIDLPREAKTPATAFRWWQPQHGHSAQWALDDVLIGMNDSSQTGFQDKFDGSID
LQANWYRIQGGQVDIDCLSMDTALVFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSKP
FSSSHSVQLQYSLNNGKDWHLVTEECVPPTIGCLHYTASSIYTSERFQNWKRITVYLPLS
TMSPRTRFRWIQANYTVGADSWAIDNVVLASGCPWMCSGRGICDAGRCVCDRGFGGAFCV
PVVPLPSILKDDFNGNLHPDIWPEVYGAERGNLNGETIKSGTSLIFKGEGLRMLISRDLD
CANTLYVQFSLRFIAKGTPERSHSILLQFSINGGITWHLMDEFYFPQTTSILFINVPLPS
TAQTNATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPVMLLDTFDFGPREDNWFFYPG
GNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVNENTIIQFEINVGCSTDSSS
ADPVRLEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTTQGWRREVVHFG
RLHLCGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCNGHGSCINGTKCICDPGYS
GPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMT
RDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRY
IALEIPLKARAASTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLDSRKW
LLHPGGTKMPVCGSTGDALVFIEKASTRYVVTTDVAVNEDSFLQIDFAASCSVTDSCYGI
WEDFTKELSLKWELVVGEFVLDRVMNKRAQVQRVTLKEGQNKYISIDLLLEAKSKSQATR
FRWHQPAPFDKQQTWAIDNVYIGDGCIDMCSGHGRCIHGNCVCDEQWGGLYCDEPEASLP
PQLKDNFNRAPSSQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFI
QFYFMYGCLITPNNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIA
TRFRWWQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRI
AFDMFMEDKTAVNEHWLFHDDCTVERFCDSPDGVMICGSHDGREVYAVTHDLTPAEDWIM
QFKISVGCKVSEKVAQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGTVSQPSVFFPTKG
WKRITYPLPESLVGNPVRFRFYQKYSDVQWAIDNFYLGPGCLDNCRGHGDCLKEQCICDP
GYSGPNCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVR
QAITQDLDLRGAKFLQYWGRIGSENNMTSCHRPVCRKEGVLLDYSADGGTRITWTLLHEM
DYQKYISVRHDYILLPEDALTNTTRLRWWQPFVISNGLVVSGVERAQWALDNILIGGAEI
NPSQLVDTFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTDNALSSRELI
IQPGYMMQFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQ
FHEATIYNAVNSSSWKRVTIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPK
LCSGHGSCTTGAVCICDESFRGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGS
GCGQLAPYAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTAQTDSCNSDLSGPH
AVDKAVLLQYSINNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGT
GHDQWALDHVEVVLVITRKQNYMMNFSRQHGLRQFYNRRRRSLRRYP
Download sequence
Identical sequences F6XIH6
ENSECAP00000019033 ENSECAP00000019033 9796.ENSECAP00000019033

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]