SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000012991 from Equus caballus 69_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSECAP00000012991
Domain  Region     Level        Classification                                                      E-value
1       2171-2326  Superfamily  Galactose-binding domain-like                                       1.21e-56
                   Family       Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.000000093
2       2019-2171  Superfamily  Galactose-binding domain-like                                       2.2e-51
                   Family       Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.000023
3       20-204     Superfamily  Cupredoxins                                                         1.29e-45
                   Family       Multidomain cupredoxins                                             0.0000152
4       398-585    Superfamily  Cupredoxins                                                         2.72e-44
                   Family       Multidomain cupredoxins                                             0.0000494
5       1873-2017  Superfamily  Cupredoxins                                                         3.22e-41
                   Family       Multidomain cupredoxins                                             0.0000681
6       589-731    Superfamily  Cupredoxins                                                         9.21e-38
                   Family       Multidomain cupredoxins                                             0.00011
7       1691-1864  Superfamily  Cupredoxins                                                         8.42e-36
                   Family       Multidomain cupredoxins                                             0.000083
8       213-349    Superfamily  Cupredoxins                                                         3.18e-33
                   Family       Multidomain cupredoxins                                             0.00037
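
The strong hits above appear in order of increasing superfamily E-value, not in order of position. As a rough sketch (not part of the SUPERFAMILY server), the following Python orders them along the sequence and counts the residues they cover. The tuple layout, the function names, and the reading of each region as 1-based and inclusive are assumptions made for illustration.

```python
from itertools import groupby

# (domain number, start, end, superfamily), copied from the strong hits above.
# Assumption: regions are 1-based and inclusive at both ends.
DOMAINS = [
    (1, 2171, 2326, "Galactose-binding domain-like"),
    (2, 2019, 2171, "Galactose-binding domain-like"),
    (3, 20, 204, "Cupredoxins"),
    (4, 398, 585, "Cupredoxins"),
    (5, 1873, 2017, "Cupredoxins"),
    (6, 589, 731, "Cupredoxins"),
    (7, 1691, 1864, "Cupredoxins"),
    (8, 213, 349, "Cupredoxins"),
]
SEQUENCE_LENGTH = 2331  # "Sequence length" reported for this protein


def architecture(domains):
    """Return superfamily names ordered by start position (N- to C-terminus)."""
    ordered = sorted(domains, key=lambda d: d[1])
    return [name for _, _, _, name in ordered]


def collapse(names):
    """Collapse consecutive repeats into (name, count) pairs."""
    return [(name, len(list(group))) for name, group in groupby(names)]


def covered_residues(domains):
    """Count distinct residue positions covered by at least one domain."""
    covered = set()
    for _, start, end, _ in domains:
        covered.update(range(start, end + 1))
    return len(covered)


if __name__ == "__main__":
    print(collapse(architecture(DOMAINS)))
    print(covered_residues(DOMAINS), "of", SEQUENCE_LENGTH, "residues covered")
```

Under these assumptions the hits resolve to six Cupredoxins domains followed by two Galactose-binding domain-like domains, covering 1280 of the 2331 residues. Domains 2 and 1 share position 2171, which is why the covered count is one less than the sum of the region lengths.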
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000012991   Gene: ENSECAG00000015044   Transcript: ENSECAT00000016104
Sequence length 2331
Comment pep:known chromosome:EquCab2:X:123272626:123414956:-1 gene:ENSECAG00000015044 transcript:ENSECAT00000016104 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MQIELSTCFFLCLLPFSFSATRRYYLGTVELPWDYMQSELLSELHVDTRFSPRVPRSFPF
NPSVMYKKTVFVEFTDHLFNIAKPRPPWMGLLGPTIRAEVYDTVVITLKNMASHPVSLHA
VGVSYWKASEGAEYEDQTSQSEKEDDKVIPGESYTYVWQILKENGPMASDPPCLTYSYLS
HVDLVKDLNSGLIGALLICREGNLAKERTQTLHEFVLLFAVFDEGKSWHSETNESLTQAM
DPASAQARPEMHTVNGYVNRSLPGLIGCHKKSVYWHVIGMGTTPEVHSIFLEGHTFLVRN
HRQASLEISPISFLTAQTLLMDLGQFLLFCQISSHQHDGMEAYVKVDSCPEEPQLRMKNS
EEAEDYDDDLYDSDMDVVRFDGDNSPPFIQIRSVAKKHPKTWVHYIAAEEEDWDYAPSVL
TPNDRSYKSLYLNNGPQQIGKKYKKARFIAYTDETFKTREVIQYESGILGPLLYGEVGDT
LLIIFKNQASRPYNIYPHGITDVSPLHSGRLPKGVKHLKDMPILPGEIFKYKWTVAVEDG
PTKSDPRCLTRYYSSFVNLERDLASGLIGPLLICYKESVDQRGNQMMSDKRNVILFSVFD
ENRSWYLTENMQRFLPNADGVQPQDPEFQVSNIMHSINGFVFDSLQLSVCLHEVAYWYIL
SVGAQTDFLSVFFSGYTFKHKMVYEDTLTLFPFSGETVFMSMENPGLWVLGCHNSDFRNR
GMTALLKVSSCNRNTGDYYEDIYEDIPTSLLNEKNIIEPRSFSQNSRHPSTRQRRVKANT
TPENDVEKIYLRSGERTQLLKVQSVSSSDLLMLLGQNPTPHALSLSDLQEVTYEANDHLL
GTIERNKGPSEVAYLTPELHHSGDRVFSPEPELQLRLNENLGTTISVELKKLDFKISSSS
NDLMISPTIPSDKLAAGIEKTGSLGRPNMPVQFSSQLDTTVFGKNSPHLIEPAVPLGLSE
GDNDSKLIEAALMNRQESSLEENVLSMESERLFKEEGIHGPVSLTKDNALFKVNFSFVKT
NKAPINTTTTRKTHIDGPTLLIENRTSVWQDIILESNSGFPEVTSLIHDETFMDKNTTAL
GLNHVSNKTTASKNMEMIRQKKEDSAPLGEENPDISFFKMLFLPDSANLIKRPLCKNSLS
SGQRPSPKQLISLGSEKSVKDQNFLSEKNKVVVGEDESTKDTGLKEMIFPNSKSIFLTNM
ANVQGNDTHNQEKNSQEEIERKEKLIQKNVVLPQVYTVTGTKNFLKNLFLLSAKQNVEGL
DEGTYTPILQDTRTLIESANRAMTHMAHFSKVREEANLEVFGNQTKEMVEKHPSTTRMAP
NPHQQNVITHHGKRALKQFRLPREETKLERGLILNDTSTQWSKNMKYLIQGTLTQIEYNE
KDKRAITQSLSSDGSVRSHGITQTNGSALPIAKVSVFSSIRPRDLTKIPSQDNSSHLLAS
ACSYTFREKSSGIQESSHFLQGARKNNLSLAFLTLEMIRGQGKISPLGSLSEASGKVELL
PKVHVHQEDSFPMKTSDGSPGHLDLMEEIFLQKAQGPVQLNKVNRPGTIAFLKRAAESSE
KTPSKLLGPLATEIPREEWNSQEKSQKTKAFKAKDTISSLDPCENSHSIAARNKGQDKPQ
GEATWAKQEETGKLCFQNLPVLKHHQRQITLTTVQPEEDKIDYDDTFSTETKREDFDIYG
EEENQDPRSFQKRTRHYFIAAVERLWDYGMSRSPHALRNRAQSGDVPQFKKVVFQEFTDG
SFTQPLYRGELNEHLGLLGPYIRAEVEDNIMVTFKNQASRPYSFYSSLISYEDDQRQGAE
PRKKFVKPNETEVYFWKVQHHMAPTKDEFDCKAWAYFSDVDLDKDVHSGLVGPLLICHAN
TLNPAHGRQVTVQEFALFFTIFDETKSWYFTENMERNCRAPCNIQTEDPTFKENYRFHAI
NGYVMDTLPGLVMAQDQKIRWYLLSMGSNENIHSIHFSGHVFTVRKKEEYKMAVYNLYPG
VFETVEMLPSKAGIWRIECLIGEHLQAGMSTLFLVYSKKCQTPLGMASGHIRDFQITASG
QYGQWAPKLARLHYSGSINAWSTKDPFSWIKVDLLAPMIIHSIMTQGARQKFSSLYISQF
IIMYSLDGKKWQSYRGNSTGTLMVFFGNVDSSGIKHNIFNPPIIARYIRLHPTHYSIRST
LRMELMGCDLNSCSMPLGMENKAIADAQITASSYLNNMFATWSPSQARLHLQGRTNAWRP
RVNNPKEWLQVDFQKTMKVTGITTQGVKSLLTSMYVKEFLISSSQDGHNWTLFLQNGKVK
VFQGNKDSFTPVVNSLDPPLLTRYLRIHPQSWGHQIALRLEVLGCEAQQLY
Identical sequences F7CVG2
ENSECAP00000012991 9796.ENSECAP00000012991 ENSECAP00000012991
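
The Comment field above packs several annotations into space-separated key:value tokens. The following Python is a minimal sketch of how such a line might be split into fields. The function names and the interpretation of the chromosome value as assembly, name, start, end, and strand are assumptions based on the layout of this one record, not a documented parser.

```python
# Comment line copied from the record above.
COMMENT = (
    "pep:known chromosome:EquCab2:X:123272626:123414956:-1 "
    "gene:ENSECAG00000015044 transcript:ENSECAT00000016104 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split each whitespace-separated token on its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(location):
    """Assumed layout: assembly:name:start:end:strand."""
    assembly, name, start, end, strand = location.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])
# Genomic span of the locus, treating start and end as inclusive (assumption).
span = location["end"] - location["start"] + 1

if __name__ == "__main__":
    print(fields["gene"], fields["transcript"])
    print(location, span)
```

Read this way, the record places the gene on chromosome X of the EquCab2 assembly, on the reverse strand (-1), spanning 142331 positions.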
