SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSDORP00000001993 from Dipodomys ordii 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSDORP00000001993
Domain Number 1 Region: 665-916
Classification Level  Classification  E-value
Superfamily           YWTD domain     6.02e-46
Family                YWTD domain     0.0000118
 
Domain Number 2 Region: 258-539
Classification Level  Classification                                           E-value
Superfamily           Oligoxyloglucan reducing end-specific cellobiohydrolase  4.18e-31
Family                Oligoxyloglucan reducing end-specific cellobiohydrolase  0.014
 
Domain Number 3 Region: 1555-1736
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  1.2e-19
Family                Fibronectin type III  0.0029
 
Domain Number 4 Region: 2-187
Classification Level  Classification                                           E-value
Superfamily           Oligoxyloglucan reducing end-specific cellobiohydrolase  1.75e-16
Family                Oligoxyloglucan reducing end-specific cellobiohydrolase  0.0098
 
Domain Number 5 Region: 1321-1359
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.0000000000117
Family                LDL receptor-like module  0.00088
 
Domain Number 6 Region: 1100-1139
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.000000000072
Family                LDL receptor-like module  0.00067
 
Domain Number 7 Region: 1228-1264
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.0000000000995
Family                LDL receptor-like module  0.0011
 
Domain Number 8 Region: 1057-1098
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.000000000262
Family                LDL receptor-like module  0.00076
 
Domain Number 9 Region: 1273-1308
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.00000000249
Family                LDL receptor-like module  0.001
 
Domain Number 10 Region: 1143-1176
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.00000000511
Family                LDL receptor-like module  0.0015
 
Domain Number 11 Region: 1928-2011
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  0.0000000243
Family                Fibronectin type III  0.0055
 
Domain Number 12 Region: 1025-1058
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.000000262
Family                LDL receptor-like module  0.0013
 
Domain Number 13 Region: 1180-1215
Classification Level  Classification            E-value
Superfamily           LDL receptor-like module  0.00000196
Family                LDL receptor-like module  0.0023
 
Domain Number 14 Region: 1749-1863
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  0.0000841
Family                Fibronectin type III  0.0079
 
Weak hits

Sequence:  ENSDORP00000001993
Domain Number - Region: 592-606,921-978
Classification Level  Classification                 E-value
Superfamily           Growth factor receptor domain  0.000235
Family                Growth factor receptor domain  0.01
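
The strong hits above are listed in order of increasing superfamily E-value, not by position along the protein. As an illustration only, the following Python sketch re-sorts them by start coordinate to read off the N-to-C order of domains and collapses consecutive repeats of the same superfamily. The names STRONG_HITS, n_to_c_order and collapse_repeats are hypothetical helpers introduced for this example and are not part of SUPERFAMILY; the coordinates and superfamily names are copied from the strong hits listed above.

```python
from itertools import groupby

# (domain number, start, end, superfamily), copied from the strong hits
# on this page. The weak hit is deliberately left out.
STRONG_HITS = [
    (1, 665, 916, "YWTD domain"),
    (2, 258, 539, "Oligoxyloglucan reducing end-specific cellobiohydrolase"),
    (3, 1555, 1736, "Fibronectin type III"),
    (4, 2, 187, "Oligoxyloglucan reducing end-specific cellobiohydrolase"),
    (5, 1321, 1359, "LDL receptor-like module"),
    (6, 1100, 1139, "LDL receptor-like module"),
    (7, 1228, 1264, "LDL receptor-like module"),
    (8, 1057, 1098, "LDL receptor-like module"),
    (9, 1273, 1308, "LDL receptor-like module"),
    (10, 1143, 1176, "LDL receptor-like module"),
    (11, 1928, 2011, "Fibronectin type III"),
    (12, 1025, 1058, "LDL receptor-like module"),
    (13, 1180, 1215, "LDL receptor-like module"),
    (14, 1749, 1863, "Fibronectin type III"),
]


def n_to_c_order(hits):
    """Return the hits sorted by start coordinate (N-terminus first)."""
    return sorted(hits, key=lambda hit: hit[1])


def collapse_repeats(hits):
    """Collapse consecutive hits of the same superfamily into (name, count)."""
    names = [hit[3] for hit in n_to_c_order(hits)]
    return [(name, len(list(group))) for name, group in groupby(names)]


if __name__ == "__main__":
    for name, count in collapse_repeats(STRONG_HITS):
        print(f"{count} x {name}")
```

Sorting on the start coordinate alone is enough here because the listed regions are essentially disjoint (domains 12 and 8 share only residues 1057-1058). A discontinuous region such as the weak hit (592-606,921-978) would need a different representation.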
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSDORP00000001993   Gene: ENSDORG00000002124   Transcript: ENSDORT00000002128
Sequence length 2119
Comment pep:known_by_projection genescaffold:dipOrd1:GeneScaffold_3139:12:98965:1 gene:ENSDORG00000002124 transcript:ENSDORT00000002128 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VSLNDSHNQMVVHWAGEKSNVIVALARDSLALARPKSSDVYVSYDYGKSFNKISEKLNFG
EGNNTAAVIAQFYHSPADNKRYIFADAYAQYLWITFDFCSTIHGFSIPFRAADLLLHSKA
SNLVMGFDRSHPNKQLWKSDDFGQTWIMIQEHVKSFAWGIDPYDKPNTIYIERHEPAGYS
TVLRSTDFFQSRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXEYYIADASEDQVFVCVSHSNNRTNLYISEAEGLKFSLSLENVLYYSPG
GAGSDSLVRYFANEPFADFHRVEGLQGVYIATLINGSMNEENLRSVITFDKGGTWEFLQA
PAFTGYGEKIDCELSQGCSLHLAQRLSQLLNLQLRRMPILSKESAPGLIIATGSVGKNLA
SKTNVYISSSAGARWREALPGPHYYTWGDHGGIIMAIAQGMETNELKYSTNEGETWKTFI
FSEQPVFVYGLLTEPGEKSTVFTIFGSNKENVHSWLILQVNATDALGVPCTENDYKLWSP
SDERGNECLLGHKTVFKRRTPHATCFNGEDFDRPVVLSNCSCTREDYECDFGFKLSEDLS
LEVCVPDPEFSGRSYSPPVPCPVGSTYRRTRGYRKISGDTCSGGDVEARLEGELVPCPLA
EENEFILYAMRKSIYRYDLASGATEQLPLSGLRAAVALDFDYEHNCLYWSDLALDSIQRL
CLNGSTGQEVIINSGLETVEALAFEPLSQLLYWVDAGFKKIEVANPDGDFRLTIINSSVL
DRPRALVLVPQEGVLFWTDWGDLKPGIYRSNMDGSAVRRLVSEDVKWPNGISVDEQWIYW
TDAYLDCIERATFSGQQRSLILDNLPHPYAIAVFKNEIYWDDWSQLSIFRASKYSGSQIE
TLGSQLTGLMDIKIFYKGKNTGSNACVPRPCSLLCLPKANNSKSCRCPEGVASSVLPSGV
LMCECPQGYQLKNNTCTKEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTT
VCDVDTQFRCQESGTCIPLSYKCDLEDDCGDNSDERHCEMHQCRSEEYNCSSGMCIRSSW
VCDGDNDCRDWSDEANCTAMYHTCEASNFQCHNGHCVPQRWVCDGDADCQDGSDEDPANC
EKKCNGFHCPNGTCIPSSKHCDGLRDCSDGSDEQHCEPLCTRFMDFVCKNRQQCLFHSMV
CDGVIQCRDGSDEDPAFAGCSQDPEFHKVCDEFGFQCQNGVCISLIWKCDGMDDCGDYSD
EANCENPTEAPSCSRYFQFRCENGHCIPNRWKCDREDDCGDWSDERDCGDLHVPPSPTPG
PSTCLPNYFRCSHGACVMDTWVCDGYRDCADGSDEEACPTVXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPDAPG
NLQLSVHSEAEGVIVCHWAPPVHPHGFIREYIVEYSRTGSKMWASQRAGSNSTEIKDLLP
HTLYTVRXXXXXXXXXXXXXXXXXXXXXXXXXIPPPDIHINSYSENSLSFTLTMGGDTKV
TGYVVNLFWAFDTHKQEKKTLNFPGSSTSHKVGNLTAHTSYEISAWAKTDVGDSPLAFEH
VTTRGVRPPAPSLKAKAINQTAVECVWTGPRNVVYGIFYATSFLDLYRNPKGLTTSLHNK
TVIVGRDEQYLFLVRVLVPYQGPSSDYVVVKMIPDSRLPPRHLHLIHTGKTSAIIKWESP
YDSPDQDLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXLLSAPDALKIITENDHVLLFWKSLALKEKQFNESRGYEIHMFDSAVNLTAYLG
NTTDNFFKISNLKMGHNYTFTVQARCLVGSQLCGEPAVLLYDELGSGGAMAAVQDARPTD
VAAVVVPILFLILLSLGVGFAILYTKHRRLQSSFTAFANSHYSSRLGSAIFSSGDDLGED
DEDAPMITGFSDDVPMVIA
Identical sequences ENSDORP00000001993
