SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSDNOP00000014175 from Dasypus novemcinctus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSDNOP00000014175
Domain Number 1 Region: 1868-2026
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 2.69e-56
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0000000911
Further Details:      
 
Domain Number 2 Region: 1708-1867
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 7.8e-52
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.000000538
Further Details:      
 
Domain Number 3 Region: 1383-1561
Classification Level Classification E-value
Superfamily Cupredoxins 3.45e-50
Family Multidomain cupredoxins 0.0000268
Further Details:      
 
Domain Number 4 Region: 29-204
Classification Level Classification E-value
Superfamily Cupredoxins 5.87e-45
Family Multidomain cupredoxins 0.000000211
Further Details:      
 
Domain Number 5 Region: 351-532
Classification Level Classification E-value
Superfamily Cupredoxins 1.47e-44
Family Multidomain cupredoxins 0.0000472
Further Details:      
 
Domain Number 6 Region: 539-662
Classification Level Classification E-value
Superfamily Cupredoxins 2.75e-36
Family Multidomain cupredoxins 0.00019
Further Details:      
 
Domain Number 7 Region: 1570-1707
Classification Level Classification E-value
Superfamily Cupredoxins 2.38e-32
Family Multidomain cupredoxins 0.00000102
Further Details:      
 
Domain Number 8 Region: 208-330
Classification Level Classification E-value
Superfamily Cupredoxins 4.13e-32
Family Multidomain cupredoxins 0.00000148
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSDNOP00000014175   Gene: ENSDNOG00000018263   Transcript: ENSDNOT00000018264
Sequence length 2027
Comment pep:known_by_projection scaffold:Dasnov3.0:JH569145.1:996032:1080205:1 gene:ENSDNOG00000018263 transcript:ENSDNOT00000018264 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFPGCPRLWVLVVLGTSWAGGGSQEADAAQLRQFYVAAQVISWNYHPQPPDPSLNLFTTS
FKKIVYREYEAYFKKEKPRSSISAGLLGPTLYAEVGDTMKVHFKNKADKPLSIHPQGIKY
SKFSEGASYADHTFPEERLDDAVAPGEEYTYEWIITEDSGPMDDDPPCLTHIYYSYVNLV
EDFNSGLIGPLLICKKGTLIEDGTQKMFDKQLVLMFAVFDESKSWNQSSSLMYTVNGYVN
GTLPDITVCAYDHISWHLIGMSSGPELFSIHFNGQVLEQNNHKISAITLVSASSTTANMT
VSPEGRWSISSLISKHFQAGMQAYIDIKNCPKKTRNPKTLTRDQRRHIKRWEYFIAAEEV
IWDYAPVIPANMDKKYRSLHLDNFSNQIGKHYKKVVYKQYEDESFTKRVDNNQKDGILGP
IIRAQVRDTLKIVFKNKASHAYSIYPHGVTFSPHEDEVNSSSTSDNNTMIRAVQPGETYT
YKWNILESDEPTENDAQCLTRPYYSNVDITRDIASGLIGILLICKSRSLDKRGIQRAADI
EQQAVFAVFDENKSWYIEDNINKFCESPDKVNRDDPKFYESNIMSTINGYVPESIPTLGF
CFDDTVQWHFCSVGTQNDMLIIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGTWM
LTTIRSSPRSQNLRLRFRDVKCIRDYDDEDSYQIIYEHKTSSTMDTRKMHDSSEDKSEMD
DTDSDYQDTLASLLGIRSFSNSSMNQEKDEFNLTALALENNSEFIHPSTDGAVGSNPSSP
VNMSRHIGSDLFGPQKTLPHPGTTIAGPFLGHSTGLGKNSSLNPSTTEYSTSYSEDPIEE
PLQPDVTGISLLHLGAKGFKSKEHNNHKGSKAGRHQAAKHKFPQKEIPAHKTGRHLSKDN
ISSSRMRPWEDTPSDLLLLRQKNPFKILNGKWHLVSEKGSYEIIQETNEDMTDNKLLNNS
QNASRAWEEINPLTNKHGEQSGHPKFSGVKQKFPQVRKFGENGDLKTLFIKTRKRKKEQK
RTHHIPVSPGGFHSLRGESSTTFSDRKLDHSLLLQKSNETSLPTNLNQTVPSVNLGQIDS
LIDHNQNKNDISQTSIPPDLYQAMPPKEHYQTFPIQDPGQMYSTTDPSHRSSPPEHNQMP
DYDLSHQPFPTDISQTIPSLELEVRQPTIFPDLSQMALSPDLSQTTLSPDLSQMTLSPEP
SQVALSPELSQITLYPHQSQTLSSEFSQTNNSPDHRQMTLSPDLRRATFPPDLRQTLPPP
NLNQTSYPSETSQSLPLPELSHTFLSPDLGQMPSSSLTPTLNDTFKPGHFNPQVVIGLTG
YNGDDIEITPKEEVQSKEEESVEIDYVAYDNPYQTDTRTNIKYSRDPDHIAAWYLRSNSG
NRKYYYIAAEEISWDYSKFSKSEMDNEDTDNAPEGTVYKKVVFRKYLDSTFTKHDPRGEY
EEHLGIVGPVIRAEVGDVIQVRFKNLAPRPYSLHAHGLSYEKSSEGKTYEDDSPEWFKED
NAVQPNSSYTYVWHATERSAPEDPGSACRAWAYYSAVNPEKDIHTGLIGPLLICRKGTLN
KESNMPVDMREFVLLFMIFDEKKSWYYKRKPKKSWTRASSEVKKSHEFYAINGLIYSLPG
LMMYEQEWVRLHLLNMGGSRDIHVVHFHGQTLLENGTQQHQLGTWPLLPGSFKTLEMKAS
KPGWWLLDTEVGENQRAGMQTPFLIIDKGCKMPMGLSTGVISDLQIKASQHLANWEPRLA
RLNNGGSYNAWSVEKLSIESGFKPWIQVDFQREVIFTGIQTQGAKQYLTSHYTTEFYVAY
SSDQMNWQVFKGNSTKNLMYFEGNSDASTIKENQFNPPIMARYIRIYPTKSYNRPTLRME
LQGCEVNGCSTPLGMESQKIENKQITASSFKKSWWGDYWEASNARLNAQGRVNAWQAKAN
NNKQWLQIDLLKIKRITAIVTQGCKSLSSEMYVKSFTIYYSDQGVEWKPYRQKYSMASKI
FEGNSNTKGHVKNFFNPPIISRFIRIIPKTWNHSIALRLELFGCDIY
Download sequence
Identical sequences ENSDNOP00000014175

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]