SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPPYP00000020350 from Pongo abelii 76_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSPPYP00000020350
Domain Number 1 Region: 43-183
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 5.95e-46
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00029
 
Domain Number 2 Region: 927-1139
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.5e-36
Family Laminin G-like module 0.0048
 
Domain Number 3 Region: 156-344
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.9e-34
Family Laminin G-like module 0.0029
 
Domain Number 4 Region: 368-559
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.76e-33
Family Laminin G-like module 0.0038
 
Domain Number 5 Region: 810-920
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.92e-23
Family Laminin G-like module 0.0026
 
Domain Number 6 Region: 585-643
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 5.44e-10
Family Fibrinogen C-terminal domain-like 0.003
 
Domain Number 7 Region: 556-593
Classification Level Classification E-value
Superfamily EGF/Laminin 1.13e-06
Family EGF-type module 0.012
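The strong hits above are listed by E-value, not by position along the sequence. As a rough illustration, the sketch below sorts the seven regions by start coordinate and reports where neighbouring regions share residues. The tuple layout and the function names are assumptions made for this example, and the SUPERFAMILY server may resolve overlaps differently when it draws the domain architecture.

```python
# Domain regions copied from the assignment list above:
# (domain number, start, end, superfamily). The tuple layout is an
# illustrative choice, not a SUPERFAMILY data format.
DOMAINS = [
    (1, 43, 183, "Galactose-binding domain-like"),
    (2, 927, 1139, "Concanavalin A-like lectins/glucanases"),
    (3, 156, 344, "Concanavalin A-like lectins/glucanases"),
    (4, 368, 559, "Concanavalin A-like lectins/glucanases"),
    (5, 810, 920, "Concanavalin A-like lectins/glucanases"),
    (6, 585, 643, "Fibrinogen C-terminal domain-like"),
    (7, 556, 593, "EGF/Laminin"),
]


def order_by_position(domains):
    """Return the domains sorted by start coordinate (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d[1])


def adjacent_overlaps(ordered):
    """List (left number, right number, shared residues) for neighbours that overlap."""
    overlaps = []
    for left, right in zip(ordered, ordered[1:]):
        # Coordinates are inclusive, so add 1 when counting shared residues.
        shared = min(left[2], right[2]) - right[1] + 1
        if shared > 0:
            overlaps.append((left[0], right[0], shared))
    return overlaps


ordered = order_by_position(DOMAINS)
order = [d[0] for d in ordered]
overlaps = adjacent_overlaps(ordered)

for number, start, end, superfamily in ordered:
    print(f"{start:>4}-{end:<4}  domain {number}  {superfamily}")
for left, right, shared in overlaps:
    print(f"domains {left} and {right} share {shared} residue(s)")
```

Sorting by start coordinate is enough here because each region is a single contiguous range. A region with several segments would need a different representation.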
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPPYP00000020350   Gene: ENSPPYG00000018144   Transcript: ENSPPYT00000021151
Sequence length 1285
Comment pep:known_by_projection chromosome:PPYG2:7:143657656:146113081:1 gene:ENSPPYG00000018144 transcript:ENSPPYT00000021151 gene_biotype:protein_coding transcript_biotype:protein_coding
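The comment line packs several fields into space-separated `key:value` tokens. The sketch below shows one way to split it, assuming the location token follows the layout `assembly:name:start:end:strand` as it appears in this record. The function names are hypothetical and not part of any SUPERFAMILY or Ensembl tool.

```python
# Comment line copied from the record above.
COMMENT = (
    "pep:known_by_projection chromosome:PPYG2:7:143657656:146113081:1 "
    "gene:ENSPPYG00000018144 transcript:ENSPPYT00000021151 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split space-separated key:value tokens, keeping everything after the first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Split a location value, assuming the layout assembly:name:start:end:strand."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])
# Inclusive coordinates, so the genomic span is end - start + 1.
span = location["end"] - location["start"] + 1

print(fields["gene"], fields["transcript"], fields["gene_biotype"])
print(location, span)
```

Using `partition` on the first colon keeps the location value intact, so it can be split in a second step.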
Sequence
MLAAPRAGCGAALLLWIVSSCLCRAWTAPSTSQKCDEPLVSGLPHGAFSSSSSISGSYSP
GYAKINKRGGAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGRYSSSDWVTQYRMLYSDT
GRNWKPYHQDGNIWAFPGNINSDGVVRHELQHPVIARYVRVVPLDWNGEGRTGLRIEVYG
CSYWADVINFDGHVVLPYRFRNKKMKTLKDVIALKLYKTSESEGVILHGEGQQGDYITLE
LKKAKLVLSLNLGSNQLGPIYGHTSVMTGSLLDDHHWHSVIIERQGRSINLTLDRSMQHF
RTNGEFDYLDLDYEITFGGIPFSGKPSSSSRKNFKGCMESINYNGINITDLARRKKLEPS
NVGNLSFSCVEPYTVPVFFNATSYLEVPGRLNQDLFSVSFQFRTWNPNGLLVFSHFADNL
GNVEIDLTESKVGVHINITQTKMSQIDISSGSGLNDGQWHEVRFLAKENFAILTIDGDEA
SAVRPPRPLQVKTGEKYFFGGFLNQMNNSSHSVLQPSFQGCMQLIQVDDQLVNLYEVAQR
KPGSFANVSIDMCAIIDRCVPNHCEHGGKCSQTWDSFKCTCDETGYTGATCHNSIYEPSC
EAYKHLGQTSNYYWIDPDGSGPLGPLKVYCNMTEDKVWTIVSHDLQMQTTVVSYNPEKHS
VIQLVYSASMDQISAITDSAEYCEQYISYFCKMSRLLNTPDGSPYTWWVGKANEKHYYWG
GSGPGIQKCACGIERNCTDPKYYCNCDADYKQWRKDAGFLSYKDHLPVSQVVVGDTDRQG
SEAKLSVGPLRCQGDRNYWNAASFPNATEVSFSFDVGNGPVEIVVRSPTPLNDDQWHRVT
AERNVKQASLQVDRLPQQIRKAPTEGHTRLELYSQLFVGGAGGQQGFLGCIRSLRMNGVT
LDLEERAKVTSGFISGCSGHCTSYGTNCENGGKCLERYHGYSCDCSNTAYDGTFCNKDVG
AFFEEGMWLRYNFQAPATNARDSSSRVENAPDQQNSHPDLAQEEIRFSFSTTKAPCILLY
ISSFTTDFLAVLVKPTGSLQIRYNLGGTREPYNIDTDHRNMANGQPHSVNITRHEKTIIL
KLDHYPSVSYHLPSSSDTLFNSPKSLFLGKVIETGKIDQEIHKYNTPGFTGCLSRVQFNQ
IAPLKAALRQTNASAHVHIQGELVESNCGASPLTLSPMSSATDPWHLDHLDSASADFPYN
PGQGQAIRNGVNRNSAIIGGVIAVVIFTILCTLVFLIRYMFRHKGTYHTNEAKGAESAES
ADAAIMNNDPNFTETIDESKKEWLI
Identical sequences H2PNY4
9600.ENSPPYP00000020350 ENSPPYP00000020350 ENSPPYP00000020350
