SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSSCP00000013628 from Sus scrofa 76_10.2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSSCP00000013628
Domain Number 1 Region: 1971-2126
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 2.55e-55
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.000000303
Further Details:      
 
Domain Number 2 Region: 1819-1971
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 4.56e-53
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0000181
Further Details:      
 
Domain Number 3 Region: 20-207
Classification Level Classification E-value
Superfamily Cupredoxins 7.56e-46
Family Multidomain cupredoxins 0.0000164
Further Details:      
 
Domain Number 4 Region: 397-584
Classification Level Classification E-value
Superfamily Cupredoxins 7.98e-46
Family Multidomain cupredoxins 0.0000494
Further Details:      
 
Domain Number 5 Region: 1673-1817
Classification Level Classification E-value
Superfamily Cupredoxins 1.14e-40
Family Multidomain cupredoxins 0.0000706
Further Details:      
 
Domain Number 6 Region: 588-730
Classification Level Classification E-value
Superfamily Cupredoxins 2.89e-38
Family Multidomain cupredoxins 0.0000955
Further Details:      
 
Domain Number 7 Region: 1491-1664
Classification Level Classification E-value
Superfamily Cupredoxins 1.29e-37
Family Multidomain cupredoxins 0.0000861
Further Details:      
 
Domain Number 8 Region: 213-349
Classification Level Classification E-value
Superfamily Cupredoxins 9.91e-32
Family Multidomain cupredoxins 0.00047
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSSCP00000013628   Gene: ENSSSCG00000012818   Transcript: ENSSSCT00000014012
Sequence length 2131
Comment pep:known_by_projection chromosome:Sscrofa10.2:X:143132258:143216275:1 gene:ENSSSCG00000012818 transcript:ENSSSCT00000014012 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MQLELSTCVFLCLLPLGFSAIRRYYLGAVELSWDYRQSELLRELHVDTRFPATAPGALPL
GPSVLYKKTVFVEFTDQLFSVARPRPPWMGLLGPTIQAEVYDTVVVTLKNMASHPVSLHA
VGVSFWKSSEGAEYEDHTSQREKEDDKVLPGKSQTYVWQVLKENGPTASDPPCLTYSYLS
HVDLVKDLNSGLIGALLVCREGSLTRERTQNLHEFVLLFAVFDEGKSWHSARNDSWTRAM
DPAPARAQPAMHTVNGYVNRSLPGLIGCHKKSVYWHVIGMGTSPEVHSIFLEGHTFLVRH
HRQASLEISPLTFLTAQTFLMDLGQFLLFCHISSHHHGGMEAHVRVESCAEEPQLRRKAD
EEEDYDDNLYDSDMDVVRLDGDDVSPFIQIRSVAKKHPKTWVHYISAEEEDWDYAPAVPS
PSDRSYKSLYLNSGPQRIGRKYKKARFVAYTDVTFKTRKAIPYESGILGPLLYGEVGDTL
LIIFKNKASRPYNIYPHGITDVSALHPGRLLKGWKHLKDMPILPGETFKYKWTVTVEDGP
TKSDPRCLTRYYSSSINLEKDLASGLIGPLLICYKESVDQRGNQMMSDKRNVILFSVFDE
NQSWYLAENIQRFLPNPDGLQPQDPEFQASNIMHSINGYVFDSLQLSVCLHEVAYWYILS
VGAQTDFLSVFFSGYTFKHKMVYEDTLTLFPFSGETVFMSMENPGLWVLGCHNSDLRNRG
MTALLKVYSCDRDIGDYYDNTYEDIPGFLLSGKNVIEPRSFAQNSRPPSASQKQFQTITS
PEDDVELDPQSGERTQALEELSVPSGDGSMLLGQNPAPHGSSSSDLQEARNEADDYLPGA
RERNTAPSAAARLRPELHHSAERVLTPEPEKELKKLDSKMSSSSDLLKTSPTIPSDTLSA
ETERTHSLGPPHPQVNFRSQLGAIVLGKNSSHFIGAGVPLGSSEEDHESSLGENVSPVES
DGIFEEERAHGPASLTKDDVLFKVNISLVKTNKARVYLKTNRKIHIDDAALLTENRASAT
FMDKNTTASGLNHVSNWIKGPLGKNPLSSERRPSPELLTSSGSGKSVKGQSSGQGRIRVE
EDELSKGKEMMLPNSELTFLTNSADVQGNDTHSQGKKSREEMERREKLVQEKVDLPQVYT
ATGTKNFLRNIFHQSTEPSVEGFDGGSHAPVPQDSRSLNDSAERAGTHIAHFSAIREEAP
LEAPGNRTGPGPRSAVPARVKRDLKQIRLPLEEIKPERGVVLNATSTRWSESSPILQGAK
RNNLSLPFLTLEMAGGQGKISALGKSAAGPLASGKLEKAVLSSAGLSEASGKAEFLPKVR
VHREDLLPQKTSNVSRAHGDLGQEIFLQKTRGPVNLNKVNRPGRTPSKLLGPPMPKEWES
LEKSPKSTALRTKDIISLPLDRHESNHSIAAKNEGQAETQREAAWTKQGGPGRLCAPKPP
VLRRHQRDISLPTFQPEEDKMDYDDIFSTETKGEDFDIYGEDENQDPRSFQKRTRHYFIA
AVEQLWDYGMSESPRALRNRAQNGEVPRFKKVVFREFADGSFTQPSYRGELNKHLGLLGP
YIRAEVEDNIMVTFKNQASRPYSFYSSLISYPDDQEQGAEPRHNFVQPNETRTYFWKVQH
HMAPTEDEFDCKAWAYFSDVDLEKDVHSGLIGPLLICRANTLNAAHGRQVTVQEFALFFT
IFDETKSWYFTENVERNCRAPCHLQMEDPTLKENYRFHAINGYVMDTLPGLVMAQNQRIR
WYLLSMGSNENIHSIHFSGHVFSVRKKEEYKMAVYNLYPGVFETVEMLPSKVGIWRIECL
IGEHLQAGMSTTFLVYSKECQAPLGMASGRIRDFQITASGQYGQWAPKLARLHYSGSINA
WSTKDPHSWIKVDLLAPMIIHGIMTQGARQKFSSLYISQFIIMYSLDGRNWQSYRGNSTG
TLMVFFGNVDASGIKHNIFNPPIVARYIRLHPTHYSIRSTLRMELMGCDLNSCSMPLGMQ
NKAISDSQITASSHLSNIFATWSPSQARLHLQGRTNAWRPRVSSAEEWLQVDLQKTVKVT
GITTQGVKSLLSSMYVKEFLVSSSQDGRRWTLFLQDGHTKVFQGNQDSSTPVVNALDPPL
FTRYLRIHPTSWAQHIALRLELLGCEAQQHV
Download sequence
Identical sequences ENSSSCP00000013628

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]