SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000019062 from Equus caballus 69_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000019062
Domain Number 1 Region: 2414-2502,2600-2625,2659-2710
Classification Level Classification E-value
Superfamily Sialidases 0.00000000135
Family Sialidases (neuraminidases) 0.047
Further Details:      
 
Domain Number 2 Region: 1565-1682,1762-1870,1913-1974,2131-2162
Classification Level Classification E-value
Superfamily Sialidases 0.00000000366
Family Sialidases (neuraminidases) 0.02
Further Details:      
 
Domain Number 3 Region: 472-525,669-744,827-856
Classification Level Classification E-value
Superfamily Sialidases 0.00000027
Family Sialidases (neuraminidases) 0.034
Further Details:      
 
Domain Number 4 Region: 1041-1057,1201-1246,1405-1515,1567-1599
Classification Level Classification E-value
Superfamily Sialidases 0.0000859
Family Sialidases (neuraminidases) 0.037
Further Details:      
 
Weak hits

Sequence:  ENSECAP00000019062
Domain Number - Region: 2369-2394
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000512
Family Integrin beta EGF-like domains 0.026
Further Details:      
 
Domain Number - Region: 2465-2550,2658-2692,2743-2988
Classification Level Classification E-value
Superfamily Sialidases 0.000902
Family Sialidases (neuraminidases) 0.026
Further Details:      
 
Domain Number - Region: 563-588
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0013
Family EGF-type module 0.023
Further Details:      
 
Domain Number - Region: 921-947
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00195
Family EGF-type module 0.085
Further Details:      
 
Domain Number - Region: 2017-2048
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00577
Family EGF-type module 0.049
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000019062   Gene: ENSECAG00000020468   Transcript: ENSECAT00000023034
Sequence length 3347
Comment pep:known chromosome:EquCab2:4:4129555:4518186:-1 gene:ENSECAG00000020468 transcript:ENSECAT00000023034 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSDHQFGNQFMCSVVASHVSHLPTTNLSFVWIAPPAGTGCVNFMATATHRGQIIFKDALA
QQLCEQGAPTEATLHPHLAEIHSDSIILRDDFDSYHQLELNPNIWVECNNCETGEQCGAI
MHGNAVTFCEPYGPRELITTSLNTTTASVLQFSIGSGSCRFSYSDPSITVSYAKNNSADW
TLLEKIRAPSNVSTIIHILYLPEDAKGENVQFQWKQENLRVGEVYEACWALDNILVINSA
HRQVILEDSLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSSE
DIQEQWSEEFESQPTGWDILGAVIGTECGTIESGLSMVFLRDGERKLCTPYMDTTGYGNL
RFYFVMGGICDSGDSHENDIILYAKIEGRKEHIALDTLSYSSYKVPSLVSVVINPELQTP
ATKFCLRQKNHQGHNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFS
TNHGRSWSLIHTECLPEICAGPHLPHSTIYSSENYSGWNRITIPLPNAALTRDTRIRWRQ
TGPILGNMWAIDNVYIGPSCLKFCSGRGQCTQHGCKCDPGFSGPACEMASQTFPMFISES
FGSSRLSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTL
RLGSKSVLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLNYHEPRIISVELPDDARQ
FGIQFRWWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPCC
GHDWTLCFTGDSKLASSMRYVETQSMQTGASYMIQFSLVMGCGKKYTPHMDNQVKLEYST
NHGLTWHLVQEECLPSMPSCQEFTSASIYHASEFTQWRRVTVLLPQKTWSSATRFRWSQS
YYTAQDEWALDNIYIGQQCPNMCGGHGSCDRGLCRCDQGYQGTDCQPEAALPSTIMSDFE
NQNGWESDWQEVIGGEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWADFVQFYI
QIGGESAACNRPDSREEGVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPC
TRFRWWQPVFSGEDYDQWAIDDIIILSEKQKQIIPAVNPTLPQNFYEKPAFDYPMNQMSV
WLMLANEGMVKNETFCSATPSAMVFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCATQFS
SAAPVLLQYSHDAGMSWFLVREGCYPASAGKGCEGNARELSEPTVYHTGDFEEWTRITIV
IPRSLASSKTRFRWIQESSSQKSLPPFGLDGVYISEPCPSYCSGHGDCVSGVCFCDLGYT
AAQGTCVSNVPNHSEMFDRFEGKLSPLWYKITGGQVGTGCGTLNDGRSLYFNGPGKREAR
TVPLDTRNIRLVQFYIQIGSKTSGIACIKPRARNEGLVVQYSNDNGIIWHLLRELDFMSF
LEPQIISIDLPREAKTPATAFRWWQPQHGHSAQWALDDVLIGMNDSSQTGFQDKFDGSID
LQANWYRIQGGQVDIDCLSMDTALVFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSKP
FSSSHSVQLQYSLNNGKDWHLVTEECVPPTIGCLHYTASSIYTSERFQNWKRITVYLPLS
TMSPRTRFRWIQANYTVGADSWAIDNVVLASGCPWMCSGRGICDAGRCVCDRGFGGAFCV
PVVPLPSILKDDFNGNLHPDIWPEVYGAERGNLNGETIKSGTSLIFKGEGLRMLISRDLD
CANTLYVQFSLRFIAKGTPERSHSILLQFSINGGITWHLMDEFYFPQTTSILFINVPLPS
TAQTNATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPVMLLDTFDFGPREDNWFFYPG
GNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVNENTIIQFEINVGCSTDSSS
ADPVRLEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTTQGWRREVVHFG
RLHLCGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCNGHGSCINGTKCICDPGYS
GPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMT
RDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRY
IALEIPLKARAASTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLDSRKW
LLHPGGTKMPVCGSTGDALVFIEKASTRYVVTTDVAVNEDSFLQIDFAASCSVTDSCYGI
WEDFTKELSLKWELVAQEFVLDRVMNKRAQVQRVTLKEGQNKYITPTKCQAPYAESQATR
FRWHQPAPFDKQQTWAIDNVYIGDGCIDMCSGHGRCIHGNCVCDEQWGGLYCDEPEASLP
PQLKDNFNRAPSSQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFI
QFYFMYGCLITPNNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIA
TRFRWWQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRI
AFDMFMEDKTAVNEHWLFHDDCTVERFCDSPDGVMICGSHDGREVYAVTHDLTPAEDWIM
QFKISVGCKVSEKVAQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGTVSQPSVFFPTKG
WKRITYPLPESLVGNPVRFRFYQKYSDVQWAIDNFYLGPGCLDNCRGHGDCLKEQCICDP
GYSGPNCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVR
QAITQDLDLRGAKFLQYWGRIGSENNMTSCHRPVCRKEGVLLDYSADGGTRITWTLLHEM
DYQKYISVRHDYILLPEDALTNTTRLRWWQPFVISNGLVVSGVERAQWALDNILIGGAEI
NPSQLVDTFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTDNALSSRELI
IQPGYMMQFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQ
FHEATIYNAVNSSSWKRVTIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPK
LCSGHGSCTTGAVCICDESFRGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGS
GCGQLAPYAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTAQTDSCNSDLSGPH
AVDKAVLLQYSINNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGT
GHDQWALDHVEVVLIPTRKQNYMMNFSRQHGLRQFYNRRRRSLRRYP
Download sequence
Identical sequences F6YD04
ENSECAP00000019062 ENSECAP00000019062

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]