SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPSIP00000002341 from Pelodiscus sinensis 69_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSPSIP00000002341
Domain Number 1 Region: 668-919
Classification Level Classification E-value
Superfamily YWTD domain 2.35e-45
Family YWTD domain 0.0000113
Further Details:      
 
Domain Number 2 Region: 257-542
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 1.57e-34
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.013
Further Details:      
 
Domain Number 3 Region: 1465-1652
Classification Level Classification E-value
Superfamily Fibronectin type III 7.76e-24
Family Fibronectin type III 0.0019
Further Details:      
 
Domain Number 4 Region: 1842-2011
Classification Level Classification E-value
Superfamily Fibronectin type III 6.91e-16
Family Fibronectin type III 0.0059
Further Details:      
 
Domain Number 5 Region: 2-182
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 0.0000000000000068
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.01
Further Details:      
 
Domain Number 6 Region: 1323-1362
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000000707
Family LDL receptor-like module 0.00082
Further Details:      
 
Domain Number 7 Region: 1103-1141
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000353
Family LDL receptor-like module 0.00066
Further Details:      
 
Domain Number 8 Region: 1060-1101
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000419
Family LDL receptor-like module 0.00068
Further Details:      
 
Domain Number 9 Region: 1230-1267
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000484
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 10 Region: 1277-1311
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000327
Family LDL receptor-like module 0.00095
Further Details:      
 
Domain Number 11 Region: 984-1021
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000017
Family LDL receptor-like module 0.001
Further Details:      
 
Domain Number 12 Region: 1146-1179
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000458
Family LDL receptor-like module 0.002
Further Details:      
 
Domain Number 13 Region: 1630-1746
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00000000546
Family Fibronectin type III 0.0045
Further Details:      
 
Domain Number 14 Region: 1377-1414
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000223
Family LDL receptor-like module 0.0016
Further Details:      
 
Domain Number 15 Region: 1023-1062
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000524
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 16 Region: 1419-1455
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000223
Family LDL receptor-like module 0.0017
Further Details:      
 
Domain Number 17 Region: 1182-1218
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000327
Family LDL receptor-like module 0.0025
Further Details:      
 
Weak hits

Sequence:  ENSPSIP00000002341
Domain Number - Region: 1744-1839
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00102
Family Fibronectin type III 0.0037
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPSIP00000002341   Gene: ENSPSIG00000002308   Transcript: ENSPSIT00000002349
Sequence length 2124
Comment pep:novel scaffold:PelSin_1.0:JH211939.1:150530:310501:1 gene:ENSPSIG00000002308 transcript:ENSPSIT00000002349 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
QVSLNDSHNQMVVHWAGEKSNVIVALARDSLGLLRPKKSDVYVSNDYGKTFKKISQKFSF
GAGNTSDVAISQFYHSPADNKRYIFVDAYTQYLWITMDFCNTVQGFSIPFRAADLLLHSR
IPDLVLGFPWEWPNSQLWQSDDFGQTWIMIQEHVKSFSCPRGIDPYDKLNTIYIERHEPT
GSSTIIRSTDFFQTRENKEIILEEVEDFQLRDKYLFATKTVHLLGSQQQTSVQLWVSFNR
KPMRAAQFVTRHPVKEYYIVDASEEQVFACVSHSNNRTNLYISDAEGLQFSLSLENVLYY
SPGGAGSDTLVRYFANEPFADFHRVEGVRGVYIATLINGSFSEENMRSVITFDKGGTWEF
LQAPAYTGYGEKIDCEFSKGCSLHLAQRLSQLLNFQSRRMPILSKESAPGLIIATGSVGK
NMASKTNVYVSSSAGARWREVLSGPHYYTWGDHGGILVAITQDTETDQLKYSTNEGETWK
TFTFSEKPVFVYGLLTEPGEKSTVFTIFGSYKENGHSWLILQINTTDALGVPCTENDYKL
WSPSDERGNECLLGHKTVFKRRTPHATCFNGEDFDRPVMVSNCSCTREDFECDFGFKLSD
DLSLEVCVPDPEFAGKPYSPPVPCPVGSTYRKTRGYRKISGDTCAGGDVESRLEGERVPC
PLAEENEFILYATRYSIHRYDLSSGISEELPLTGLRGAVALDFDYDHNCLYWADVTLDII
QRLCLKDSSGQEIIISTGLETVEALAFEPLSQLLYWVNAGIPKIEVANPDGDLRLTVLNA
SILERPRALTLVPKEGLMFWTDWGDSRPGIYRSDMDGSSASCIVSEGVRWPNGISVDDHW
IYWTEAYMDRIERIDFNGMQRSVILDSLPHPYAIAVFKNEIYWNDWSQLSIFRASKYSGS
RMEILVGRLNGIMDMKIFYRGKTTGRNACITRPCSLLCLPKSNNSRSCKCPEGVSSSVLP
SGEVKCDCPHGYVMKNTTCIKEDNTCLPNQYRCFNGNCINSIWQCDNDNDCGDMSDEKNC
PTTVCDSETQFRCQGSGTCIPLSYKCDLEDDCGDNSDESHCEAHQCRSDEFSCRSGMCIR
LSWMCDGDNDCRDWSDEANCTAVYHTCEASSFQCHNGHCIPQRWACDGDADCQDGSDEDP
VKCEKKCNGFQCPNGTCIPNSKHCDGLHDCSDGSDEQHCEPLCTRYMDFVCKNRQQCLFH
SMVCDGIVQCRDGSDEDANYAGCSQDLEFHRTCDQFNFQCQNGVCISLVWKCDGMDDCGD
YSDEANCENPTEAPNCSRYYQFQCQNGHCIPKRWKCDEENDCGDWSDEKDCEGSPILPFT
TSAPATCLPNHFRCNSGTCIMNSWVCDGYQDCTDGSDEDACPTLFNVTATSTTSLGRCSR
FEFECQQLKKCVPNWKRCDGVRDCQDGTDEINCPTHSTLSCPNGYKCEDGEACIKTTERC
DGFLDCSDSSDERNCTDDTIVYKVQNLQWMADFSGDVTVTWARPKKMSSASCVYNIYYRM
VGESIWKTLETHSNKTNSVLKVLKPDCTYQVKVQVQCLSKVYNTNDFITLRTPEGLPDPP
LHLQLSLKEEVEGVVIGRWAPPVSAHGLIREYIVEYSRNGSKEWSSLRASKNYTEIENLQ
INKLYTLRKKVAAVTSRGVGNWSDSKSITTMKGKVIPPPTIHIESCNENSISFTLKVDTD
IKVTGYIVNIFWTFDTHRQEKRTLFLDGEKSAQRVGNLTAHTPYEISAWAKTALGDSPLS
FAHVVTSGTRPPSPSLKAKAINQTAVECTWTGPRNVVYGVFYATSFLELYRSPQNTSTTL
HNITVIVNKDEQYLFLVRVMSPYQGPPSDYIVVKMIPDNRLPPRHLHSVLTRKTFAVIKW
ESPYDSPDQDMLYVVAVKDLIKKTDKIYKVKTRNSTVEYTIKKLEPGGKYHILVQLGNMS
KESSMKITTVSLSAPDALKILTENDHILLFWKSLALKESNFNENRGYEIHMFDSTMNITA
YLGNTTENFFKISNLKIGHNYTFTVQARCLYSGQMCGEPATLLYDELGAVIGEDASASKT
GKSTDVATIVVPVLFLLLVTMGIGFVVLYMRHRRLQNSFTAFANSHYSSRLGSAIFSSGD
DLGDEDDEAPMITGFSDDVPMVIA
Download sequence
Identical sequences K7F2T1
ENSPSIP00000002341 ENSPSIP00000002341

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]