SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000033539 from Mus musculus 69_38

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSMUSP00000033539
Domain Number 1 Region: 2159-2314
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.42e-55
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00000017
Further Details:      
 
Domain Number 2 Region: 2007-2159
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 6.29e-50
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00002
Further Details:      
 
Domain Number 3 Region: 20-208
Classification Level Classification E-value
Superfamily Cupredoxins 2.09e-47
Family Multidomain cupredoxins 0.0000177
Further Details:      
 
Domain Number 4 Region: 398-584
Classification Level Classification E-value
Superfamily Cupredoxins 1.13e-43
Family Multidomain cupredoxins 0.0000657
Further Details:      
 
Domain Number 5 Region: 1861-2005
Classification Level Classification E-value
Superfamily Cupredoxins 1.07e-39
Family Multidomain cupredoxins 0.0000642
Further Details:      
 
Domain Number 6 Region: 588-730
Classification Level Classification E-value
Superfamily Cupredoxins 1.42e-38
Family Multidomain cupredoxins 0.00012
Further Details:      
 
Domain Number 7 Region: 1681-1852
Classification Level Classification E-value
Superfamily Cupredoxins 1.13e-36
Family Multidomain cupredoxins 0.000071
Further Details:      
 
Domain Number 8 Region: 214-349
Classification Level Classification E-value
Superfamily Cupredoxins 5.95e-32
Family Multidomain cupredoxins 0.0004
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMUSP00000033539   Gene: ENSMUSG00000031196   Transcript: ENSMUST00000033539
Sequence length 2319
Comment pep:known chromosome:GRCm38:X:75172715:75382316:-1 gene:ENSMUSG00000031196 transcript:ENSMUST00000033539 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MQIALFACFFLSLFNFCSSAIRRYYLGAVELSWNYIQSDLLSVLHTDSRFLPRMSTSFPF
NTSIMYKKTVFVEYKDQLFNIAKPRPPWMGLLGPTIWTEVHDTVVITLKNMASHPVSLHA
VGVSYWKASEGDEYEDQTSQMEKEDDKVFPGESHTYVWQVLKENGPMASDPPCLTYSYMS
HVDLVKDLNSGLIGALLVCKEGSLSKERTQMLYQFVLLFAVFDEGKSWHSETNDSYTQSM
DSASARDWPKMHTVNGYVNRSLPGLIGCHRKSVYWHVIGMGTTPEIHSIFLEGHTFFVRN
HRQASLEISPITFLTAQTLLIDLGQFLLFCHISSHKHDGMEAYVKVDSCPEESQWQKKNN
NEEMEDYDDDLYSEMDMFTLDYDSSPFIQIRSVAKKYPKTWIHYISAEEEDWDYAPSVPT
SDNGSYKSQYLSNGPHRIGRKYKKVRFIAYTDETFKTRETIQHESGLLGPLLYGEVGDTL
LIIFKNQASRPYNIYPHGITDVSPLHARRLPRGIKHVKDLPIHPGEIFKYKWTVTVEDGP
TKSDPRCLTRYYSSFINPERDLASGLIGPLLICYKESVDQRGNQMMSDKRNVILFSIFDE
NQSWYITENMQRFLPNAAKTQPQDPGFQASNIMHSINGYVFDSLELTVCLHEVAYWHILS
VGAQTDFLSIFFSGYTFKHKMVYEDTLTLFPFSGETVFMSMENPGLWVLGCHNSDFRKRG
MTALLKVSSCDKSTSDYYEEIYEDIPTQLVNENNVIDPRSFFQNTNHPNTRKKKFKDSTI
PKNDMEKIEPQFEEIAEMLKVQSVSVSDMLMLLGQSHPTPHGLFLSDGQEAIYEAIHDDH
SPNAIDSNEGPSKVTQLRPESHHSEKIVFTPQPGLQLRSNKSLETTIEVKWKKLGLQVSS
LPSNLMTTTILSDNLKATFEKTDSSGFPDMPVHSSSKLSTTAFGKKAYSLVGSHVPLNVS
EENSDSNILDSTLMYSQESLPRDNILSMENDRLLREKRFHGIALLTKDNTLFKDNVSLMK
TNKTYNHSTTNEKLHTESPTSIENSTTDLQDAILKVNSEIQEVTALIHDGTLLGKNSTYL
RLNHMLNRTTSTKNKDIFHRKDEDPIPQDEENTIMPFSKMLFLSESSNWFKKTNGNNSLN
SEQEHSPKQLVYLMFKKYVKNQSFLSEKNKVTVEQDGFTKNIGLKDMAFPHNMSIFLTTL
SNVHENGRHNQEKNIQEEIEKEALIEEKVVLPQVHEATGSKNFLKDILILGTRQNISLYE
VHVPVLQNITSINNSTNTVQIHMEHFFKRRKDKETNSEGLVNKTREMVKNYPSQKNITTQ
RSKRALGQFRLSTQWLKTINCSTQCIIKQIDHSKEMKKFITKSSLSDSSVIKSTTQTNSS
DSHIVKTSAFPPIDLKRSPFQNKFSHVQASSYIYDFKTKSSRIQESNNFLKETKINNPSL
AILPWNMFIDQGKFTSPGKSNTNSVTYKKRENIIFLKPTLPEESGKIELLPQVSIQEEEI
LPTETSHGSPGHLNLMKEVFLQKIQGPTKWNKAKRHGESIKGKTESSKNTRSKLLNHHAW
DYHYAAQIPKDMWKSKEKSPEIISIKQEDTILSLRPHGNSHSIGANEKQNWPQRETTWVK
QGQTQRTCSQIPPVLKRHQRELSAFQSEQEATDYDDAITIETIEDFDIYSEDIKQGPRSF
QQKTRHYFIAAVERLWDYGMSTSHVLRNRYQSDNVPQFKKVVFQEFTDGSFSQPLYRGEL
NEHLGLLGPYIRAEVEDNIMVTFKNQASRPYSFYSSLISYKEDQRGEEPRRNFVKPNETK
IYFWKVQHHMAPTEDEFDCKAWAYFSDVDLERDMHSGLIGPLLICHANTLNPAHGRQVSV
QEFALLFTIFDETKSWYFTENVKRNCKTPCNFQMEDPTLKENYRFHAINGYVMDTLPGLV
MAQDQRIRWYLLSMGNNENIQSIHFSGHVFTVRKKEEYKMAVYNLYPGVFETLEMIPSRA
GIWRVECLIGEHLQAGMSTLFLVYSKQCQIPLGMASGSIRDFQITASGHYGQWAPNLARL
HYSGSINAWSTKEPFSWIKVDLLAPMIVHGIKTQGARQKFSSLYISQFIIMYSLDGKKWL
SYQGNSTGTLMVFFGNVDSSGIKHNSFNPPIIARYIRLHPTHSSIRSTLRMELMGCDLNS
CSIPLGMESKVISDTQITASSYFTNMFATWSPSQARLHLQGRTNAWRPQVNDPKQWLQVD
LQKTMKVTGIITQGVKSLFTSMFVKEFLISSSQDGHHWTQILYNGKVKVFQGNQDSSTPM
MNSLDPPLLTRYLRIHPQIWEHQIALRLEILGCEAQQQY
Download sequence
Identical sequences Q06194
ENSMUSP00000033539 NP_032003.2.92730 ENSMUSP00000033539 10090.ENSMUSP00000033539 ENSMUSP00000033539

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]