SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000083204 from Mus musculus 69_38

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSMUSP00000083204
Domain Number 1 Region: 2024-2182
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 9.92e-57
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0000000798
 
Domain Number 2 Region: 1863-2023
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 4.25e-54
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.000000442
 
Domain Number 3 Region: 1540-1720
Classification Level Classification E-value
Superfamily Cupredoxins 3.14e-50
Family Multidomain cupredoxins 0.0000196
 
Domain Number 4 Region: 348-533
Classification Level Classification E-value
Superfamily Cupredoxins 2.18e-47
Family Multidomain cupredoxins 0.000057
 
Domain Number 5 Region: 31-203
Classification Level Classification E-value
Superfamily Cupredoxins 2.25e-43
Family Multidomain cupredoxins 0.000000129
 
Domain Number 6 Region: 539-663
Classification Level Classification E-value
Superfamily Cupredoxins 4.58e-36
Family Multidomain cupredoxins 0.00026
 
Domain Number 7 Region: 1727-1863
Classification Level Classification E-value
Superfamily Cupredoxins 1.12e-33
Family Multidomain cupredoxins 0.00000204
 
Domain Number 8 Region: 206-328
Classification Level Classification E-value
Superfamily Cupredoxins 4.07e-32
Family Multidomain cupredoxins 0.00000199
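
The strong hits above are listed by ascending superfamily E-value, not by position in the sequence. A minimal Python sketch, using only the regions and superfamily names from the listing, orders the hits from N- to C-terminus and reports stretches that no strong hit covers. The names `HITS`, `by_position` and `unassigned_regions` are illustrative and are not part of any SUPERFAMILY tool.

```python
# Strong hits copied from the listing above:
# (domain number, first residue, last residue, superfamily).
HITS = [
    (1, 2024, 2182, "Galactose-binding domain-like"),
    (2, 1863, 2023, "Galactose-binding domain-like"),
    (3, 1540, 1720, "Cupredoxins"),
    (4, 348, 533, "Cupredoxins"),
    (5, 31, 203, "Cupredoxins"),
    (6, 539, 663, "Cupredoxins"),
    (7, 1727, 1863, "Cupredoxins"),
    (8, 206, 328, "Cupredoxins"),
]


def by_position(hits):
    """Return the hits ordered by first residue (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def unassigned_regions(hits, length):
    """Return (first, last) residue ranges not covered by any hit."""
    gaps = []
    next_free = 1  # first residue not yet covered by a hit
    for _, first, last, _ in by_position(hits):
        if first > next_free:
            gaps.append((next_free, first - 1))
        next_free = max(next_free, last + 1)
    if next_free <= length:
        gaps.append((next_free, length))
    return gaps


if __name__ == "__main__":
    for number, first, last, superfamily in by_position(HITS):
        print(f"{first:>4}-{last:<4}  domain {number}  {superfamily}")
    # 2183 is the sequence length reported for this protein.
    print(unassigned_regions(HITS, 2183))
```

Read in sequence order, the architecture is four Cupredoxins domains (5, 8, 4, 6), two more Cupredoxins domains (3, 7), then two Galactose-binding domain-like domains (2, 1). With the reported sequence length of 2183 residues, the longest stretch without a strong hit is residues 664-1539, between domains 6 and 3.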
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMUSP00000083204   Gene: ENSMUSG00000026579   Transcript: ENSMUST00000086040
Sequence length 2183
Comment pep:known chromosome:GRCm38:1:164151838:164220277:1 gene:ENSMUSG00000026579 transcript:ENSMUST00000086040 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLLVCPCFFLLVVLGTRWAGWGSHQAEAAQLRQFYVAAQGILWNYHPEPTDPSLNSIPSF
KKIVYREYEQYFKKEKPRSSNSGLLGPTLYAEVGDVIKVHFRNKADKPLSIHPQGIKYSK
FSEGASYADHTFPAERKDDAVAPGEEYTYEWIVSEDSGPTPDDPPCLTHIYYSYENLTQD
FNSGLIGPLLICKKGTLTEDGTQKMFDKQHVLLFAVFDESKSRSQSPSLMYTINGFVNKT
MPDITVCAHDHVSWHLIGMSSGPELFSIHFNGQVLEQNQHKVSTVTLVSATSTTANMTMS
PEGRWIVSSLIPKHYQAGMQAYIDIKNCPKKTRSPKTLTREQRRYMKRWEYFIAAEEVIW
NYAPVIPANMDKIYRSQHLDNFSNQIGKHYKKVIYRQYEEETFTKRTDNPSIKQSGILGP
VIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDGINSSSTSGSHTTIRPVQPGETFT
YKWNILEFDEPTENDAQCLTRPYYSDVDVTRDIASGLIGLLLICKSRSLDQRGVQRVADI
EQQAVFAVFDENKSWYIEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESISTLGF
CFDDTVQWHFCSVGTHDDILTIHFTGHSFIYGRRHEDTLTLFPMRGESVTVTMDNVGTWM
LTTMNSNPKRRNLRLRFRDVKCNRDYDNEDSYEIYEPPAPTSMTTRRIHDSLENEFGIDN
EDDDYQYLLASSLGIRSFKNSSLNPEENEFNLTALALENSSEFISPSTDRVVDSNSSRIL
SKIINNNLKDFQRTLPGSGATVAGTLLRNLIGLDENFVLNSSTEHRSSSYHENDMENPQS
NITMVYLLPLGPKGSGNREQDKPKTIKTGRPHMMKHRFSWMKAPAGKTGRHSNPKNSYSG
MKSEEDIPSELIPLKQKITSKFLNRRWRVASEKGSYEIIAANGEDTDVDKLTNSPQNQNI
TVPRGESTSHTNTTRKPSDLPTFSGVGHKSPHVRQEEENSGFQKRQLFIRTRKKKKNKKL
ALHSPLSPRGFDPLRGHNHSPFPDRRLLNHSLLLHKSNETALSPDLNQTSPSMSTDRSLP
DYNQYSKNDTEQMSSSLDLYQSVPAEEHSPTFPAQDPDQTHSTTDPSYRSSPPELSQGLD
YDLSHDFYPDDIGLTSFFPDQSQKSSFSSDDDQAIPSSDLSLFTISPELDQTIIYPDLDQ
LLLSPEDNQKTSSPDLGQVPLSPDDNQKTSSPDLGQVSLSPDDNQKTSSPDLGQVPLSLD
DNQKTTSPDLGQVPLSPDDNQMITSPDLGQVPLSSDNQKTSSPDLGQVPLFPEDNQNYFL
DLSQVPLSSDQNQETSSTDLLTLSPDFGQTVLSPDLDQLPLPSDNSQVTVSPDLSLLTLS
PDFNEIILAPDLGQVTLSPDLIQTNPALNHGHKASSADPDQASYPPDSGQASSLPELNRT
LPHPDLTHIPPPSPSPTLNNTSLSRKFNPLVVVGLSRVDGDDVEIVPSEEPERIDEDYAE
DDFVTYNDPYRTDTRTDVNSSRNPDTIAAWYLRGHGGHKKFYYIAAEEITWNYAEFAQSE
MDHEDTGHTPKDTTYKKVVFRKYLDSTFTSRDPRAEYEEHLGILGPVIRAEVDDVIQVRF
KNLASRPYSLHAHGLSYEKSSEGKTYEDESPEWFQEDDAVQPNSSYTYVWHATKRSGPEN
PGSACRAWAYYSAVNVERDIHSGLIGPLLICRKGTLHMERNLPMDMREFVLLFMVFDEKK
SWYYEKSKGSRRIESPEEKNAHKFYAINGMIYNLPGLRMYEQEWVRLHLLNMGGSRDIHV
VHFHGQTLLDNRTKQHQLGVWPLLPGSFKTLEMKASKPGWWLLDTEVGENQVAGMQTPFL
IIDKECKMPMGLSTGVISDSQIKASEYLTYWEPRLARLNNAGSYNAWSIEKTALDFPIKP
WIQVDMQKEVVVTGIQTQGAKHYLKSCFTTEFQVAYSSDQTNWQIFRGKSGKSVMYFTGN
SDGSTIKENRLDPPIVARYIRIHPTKSYNRPTLRLELQGCEVNGCSTPLGLEDGRIQDKQ
ITASSFKKSWWGDYWEPSLARLNAQGRVNAWQAKANNNKQWLQVDLLKIKKVTAIVTQGC
KSLSSEMYVKSYSIQYSDQGVAWKPYRQKSSMVDKIFEGNSNTKGHMKNFFNPPIISRFI
RIIPKTWNQSIALRLELFGCDIY
Identical sequences O88783
ENSMUSP00000083204 NP_032002.1.92730 10090.ENSMUSP00000083204
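
The Comment line in the protein record above packs several fields into one string. A minimal sketch of splitting it, assuming that fields are separated by spaces, that each field is a `key:value` pair, and that the chromosome value reads as assembly, chromosome name, start, end and strand. That reading of the chromosome value is an assumption about the header layout, and `parse_comment` is a hypothetical helper name.

```python
def parse_comment(comment):
    """Split the comment line into a dict of its colon-separated fields."""
    fields = {}
    for token in comment.split():
        # Split on the first colon only; the value may contain more colons.
        key, _, value = token.partition(":")
        fields[key] = value
    # Assumed layout of the chromosome value: assembly:name:start:end:strand.
    assembly, name, start, end, strand = fields["chromosome"].split(":")
    fields["chromosome"] = {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    return fields


# Comment line copied from the record above.
COMMENT = (
    "pep:known chromosome:GRCm38:1:164151838:164220277:1 "
    "gene:ENSMUSG00000026579 transcript:ENSMUST00000086040 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

if __name__ == "__main__":
    info = parse_comment(COMMENT)
    print(info["gene"], info["transcript"], info["chromosome"])
```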
