SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSP00000356771 from Homo sapiens 76_38

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSP00000356771
Domain Number 1 Region: 2065-2223
Classification Level  Classification                                                      E-value
Superfamily           Galactose-binding domain-like                                       9.22e-56
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.0000000327
 
Domain Number 2 Region: 1904-2064
Classification Level  Classification                                                      E-value
Superfamily           Galactose-binding domain-like                                       2.63e-53
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.000000174
 
Domain Number 3 Region: 1578-1760
Classification Level  Classification           E-value
Superfamily           Cupredoxins              6.37e-50
Family                Multidomain cupredoxins  0.0000264
 
Domain Number 4 Region: 349-534
Classification Level  Classification           E-value
Superfamily           Cupredoxins              6.75e-46
Family                Multidomain cupredoxins  0.0000573
 
Domain Number 5 Region: 31-201
Classification Level  Classification           E-value
Superfamily           Cupredoxins              1.93e-45
Family                Multidomain cupredoxins  0.000000125
 
Domain Number 6 Region: 541-667
Classification Level  Classification           E-value
Superfamily           Cupredoxins              1.92e-36
Family                Multidomain cupredoxins  0.00024
 
Domain Number 7 Region: 1767-1904
Classification Level  Classification           E-value
Superfamily           Cupredoxins              5.09e-32
Family                Multidomain cupredoxins  0.00000165
 
Domain Number 8 Region: 207-329
Classification Level  Classification           E-value
Superfamily           Cupredoxins              5.16e-32
Family                Multidomain cupredoxins  0.00000138
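
The eight strong hits above appear in order of superfamily E-value rather than position, so the layout along the protein is easier to read once the regions are sorted. The following is a minimal Python sketch, not part of SUPERFAMILY itself. The region coordinates and the sequence length (2224) are copied from this page; the names DOMAINS, merge_regions and unassigned_gaps are hypothetical helpers introduced for illustration, and the coordinates are assumed to be 1-based and inclusive.

```python
from typing import List, Tuple

SEQ_LEN = 2224  # "Sequence length" reported on this page

# (domain number, start, end, superfamily), copied from the strong hits
DOMAINS = [
    (1, 2065, 2223, "Galactose-binding domain-like"),
    (2, 1904, 2064, "Galactose-binding domain-like"),
    (3, 1578, 1760, "Cupredoxins"),
    (4, 349, 534, "Cupredoxins"),
    (5, 31, 201, "Cupredoxins"),
    (6, 541, 667, "Cupredoxins"),
    (7, 1767, 1904, "Cupredoxins"),
    (8, 207, 329, "Cupredoxins"),
]

Region = Tuple[int, int]


def merge_regions(regions: List[Region]) -> List[Region]:
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged: List[Region] = []
    for start, end in sorted(regions):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def unassigned_gaps(merged: List[Region], seq_len: int) -> List[Region]:
    """Return the inclusive intervals not covered by any merged region."""
    gaps: List[Region] = []
    pos = 1
    for start, end in merged:
        if start > pos:
            gaps.append((pos, start - 1))
        pos = end + 1
    if pos <= seq_len:
        gaps.append((pos, seq_len))
    return gaps


# Domains in N- to C-terminal order
by_position = sorted(DOMAINS, key=lambda d: d[1])
merged = merge_regions([(start, end) for _, start, end, _ in DOMAINS])
covered = sum(end - start + 1 for start, end in merged)
gaps = unassigned_gaps(merged, SEQ_LEN)
longest_gap = max(gaps, key=lambda g: g[1] - g[0])

for number, start, end, superfamily in by_position:
    print(f"{start:>4}-{end:<4}  domain {number}  {superfamily}")
print(f"covered {covered} of {SEQ_LEN} residues; longest gap {longest_gap}")
```

With the regions listed on this page, the merged intervals cover 1247 of the 2224 residues, and the longest stretch without a strong hit is 668-1577. Domains 7 and 2 share residue 1904, which is why the sketch merges intervals before counting.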
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSP00000356771   Gene: ENSG00000198734   Transcript: ENST00000367797
Sequence length 2224
Comment pep:known chromosome:GRCh38:1:169514166:169586588:-1 gene:ENSG00000198734 transcript:ENST00000367797 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFPGCPRLWVLVVLGTSWVGWGSQGTEAAQLRQFYVAAQGISWSYRPEPTNSSLNLSVTS
FKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYS
KLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIE
DFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNG
TMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTV
GPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVI
WDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILG
PIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGET
YTYKWNILEFDEPTENDAQCLTRPYYSDVDIMRDIASGLIGLLLICKSRSLDRRGIQRAA
DIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESITTL
GFCFDDTVQWHFCSVGTQNEILTIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGT
WMLTSMNSSPRSKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEES
DADYDYQNRLAAALGIRSFRNSSLNQEEEEFNLTALALENGTEFVSSNTDIIVGSNYSSP
SNISKFTVNNLAEPQKAPSHQQATTAGSPLRHLIGKNSVLNSSTAEHSSPYSEDPIEDPL
QPDVTGIRLLSLGAGEFKSQEHAKHKGPKVERDQAAKHRFSWMKLLAHKVGRHLSQDTGS
PSGMRPWEDLPSQDTGSPSRMRPWKDPPSDLLLLKQSNSSKILVGRWHLASEKGSYEIIQ
DTDEDTAVNNWLISPQNASRAWGESTPLANKPGKQSGHPKFPRVRHKSLQVRQDGGKSRL
KKSQFLIKTRKKKKEKHTHHAPLSPRTFHPLRSEAYNTFSERRLKHSLVLHKSNETSLPT
DLNQTLPSMDFGWIASLPDHNQNSSNDTGQASCPPGLYQTVPPEEHYQTFPIQDPDQMHS
TSDPSHRSSSPELSEMLEYDRSHKSFPTDISQMSPSSEHEVWQTVISPDLSQVTLSPELS
QTNLSPDLSHTTLSPELIQRNLSPALGQMPISPDLSHTTLSPDLSHTTLSLDLSQTNLSP
ELSQTNLSPALGQMPLSPDLSHTTLSLDFSQTNLSPELSHMTLSPELSQTNLSPALGQMP
ISPDLSHTTLSLDFSQTNLSPELSQTNLSPALGQMPLSPDPSHTTLSLDLSQTNLSPELS
QTNLSPDLSEMPLFADLSQIPLTPDLDQMTLSPDLGETDLSPNFGQMSLSPDLSQVTLSP
DISDTTLLPDLSQISPPPDLDQIFYPSESSQSLLLQEFNESFPYPDLGQMPSPSSPTLND
TFLSKEFNPLVIVGLSKDGTDYIEIIPKEEVQSSEDDYAEIDYVPYDDPYKTDVRTNINS
SRDPDNIAAWYLRSNNGNRRNYYIAAEEISWDYSEFVQRETDIEDSDDIPEDTTYKKVVF
RKYLDSTFTKRDPRGEYEEHLGILGPIIRAEVDDVIQVRFKNLASRPYSLHAHGLSYEKS
SEGKTYEDDSPEWFKEDNAVQPNSSYTYVWHATERSGPESPGSACRAWAYYSAVNPEKDI
HSGLIGPLLICQKGILHKDSNMPMDMREFVLLFMTFDEKKSWYYEKKSRSSWRLTSSEMK
KSHEFHAINGMIYSLPGLKMYEQEWVRLHLLNIGGSQDIHVVHFHGQTLLENGNKQHQLG
VWPLLPGSFKTLEMKASKPGWWLLNTEVGENQRAGMQTPFLIMDRDCRMPMGLSTGIISD
SQIKASEFLGYWEPRLARLNNGGSYNAWSVEKLAAEFASKPWIQVDMQKEVIITGIQTQG
AKHYLKSCYTTEFYVAYSSNQINWQIFKGNSTRNVMYFNGNSDASTIKENQFDPPIVARY
IRISPTRAYNRPTLRLELQGCEVNGCSTPLGMENGKIENKQITASSFKKSWWGDYWEPFR
ARLNAQGRVNAWQAKANNNKQWLEIDLLKIKKITAIITQGCKSLSSEMYVKSYTIHYSEQ
GVEWKPYRLKSSMVDKIFEGNTNTKGHVKNFFNPPIISRFIRVIPKTWNQSIALRLELFG
CDIY
Identical sequences P12259
ENSP00000356771 NP_000121.2.87134 NP_000121.2.92137 gi|105990535|ref|NP_000121.2|
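
The Comment field above packs several values into space-separated key:value tokens. As a rough sketch of how it could be split up, the Python below parses that string. The function names parse_comment and parse_location are hypothetical, and the reading of the chromosome token as assembly, sequence name, start, end and strand is an assumption based on how the token looks, not something this page states.

```python
from typing import Dict, Union

# Comment string copied verbatim from the record above
COMMENT = (
    "pep:known chromosome:GRCh38:1:169514166:169586588:-1 "
    "gene:ENSG00000198734 transcript:ENST00000367797 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> Dict[str, str]:
    """Split the comment into a dict keyed by the text before the first colon."""
    fields: Dict[str, str] = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> Dict[str, Union[str, int]]:
    """Interpret 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])
# Length of the genomic interval, assuming inclusive coordinates
span = location["end"] - location["start"] + 1

print(fields["gene"], fields["transcript"], location, span)
```

Splitting on the first colon only keeps the multi-part chromosome value intact for the second parsing step.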
