SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSECAP00000017252 from Equus caballus 69_2


Domain assignment details

Strong hits

Sequence:  ENSECAP00000017252
Domain Number 1 Region: 2-188,334-540
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 3.14e-48
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.0079
 
Domain Number 2 Region: 666-919
Classification Level Classification E-value
Superfamily YWTD domain 6.02e-46
Family YWTD domain 0.00000949
 
Domain Number 3 Region: 1465-1650
Classification Level Classification E-value
Superfamily Fibronectin type III 3.99e-29
Family Fibronectin type III 0.0016
 
Domain Number 4 Region: 1840-2013
Classification Level Classification E-value
Superfamily Fibronectin type III 6.23e-17
Family Fibronectin type III 0.0054
 
Domain Number 5 Region: 1322-1359
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000563
Family LDL receptor-like module 0.0011
 
Domain Number 6 Region: 1229-1267
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000576
Family LDL receptor-like module 0.00097
 
Domain Number 7 Region: 1101-1139
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000681
Family LDL receptor-like module 0.00071
 
Domain Number 8 Region: 1058-1099
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000209
Family LDL receptor-like module 0.00076
 
Domain Number 9 Region: 981-1019
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000681
Family LDL receptor-like module 0.001
 
Domain Number 10 Region: 1144-1177
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000223
Family LDL receptor-like module 0.0018
 
Domain Number 11 Region: 1274-1309
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000602
Family LDL receptor-like module 0.001
 
Domain Number 12 Region: 1655-1784
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000000157
Family Fibronectin type III 0.007
 
Domain Number 13 Region: 1376-1413
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000017
Family LDL receptor-like module 0.0015
 
Domain Number 14 Region: 1416-1455
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000393
Family LDL receptor-like module 0.0011
 
Domain Number 15 Region: 1022-1061
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000183
Family LDL receptor-like module 0.0012
 
Domain Number 16 Region: 1181-1216
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000275
Family LDL receptor-like module 0.0023
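The strong hits above are listed by increasing superfamily E-value, not by position along the protein. As a rough illustration, the following Python sketch re-orders them by the start of each region to read off the domain architecture from N- to C-terminus. The names `STRONG_HITS`, `parse_region`, `order_by_position` and `collapse_architecture` are hypothetical helpers introduced here for illustration and are not part of SUPERFAMILY; the region strings and superfamily names are copied from the listing above.

```python
from itertools import groupby
from typing import List, Tuple

# (domain number, region, superfamily), copied from the strong hits above.
Hit = Tuple[int, str, str]

STRONG_HITS: List[Hit] = [
    (1, "2-188,334-540", "Oligoxyloglucan reducing end-specific cellobiohydrolase"),
    (2, "666-919", "YWTD domain"),
    (3, "1465-1650", "Fibronectin type III"),
    (4, "1840-2013", "Fibronectin type III"),
    (5, "1322-1359", "LDL receptor-like module"),
    (6, "1229-1267", "LDL receptor-like module"),
    (7, "1101-1139", "LDL receptor-like module"),
    (8, "1058-1099", "LDL receptor-like module"),
    (9, "981-1019", "LDL receptor-like module"),
    (10, "1144-1177", "LDL receptor-like module"),
    (11, "1274-1309", "LDL receptor-like module"),
    (12, "1655-1784", "Fibronectin type III"),
    (13, "1376-1413", "LDL receptor-like module"),
    (14, "1416-1455", "LDL receptor-like module"),
    (15, "1022-1061", "LDL receptor-like module"),
    (16, "1181-1216", "LDL receptor-like module"),
]


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Turn a region string such as '2-188,334-540' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def order_by_position(hits: List[Hit]) -> List[Hit]:
    """Sort hits by the start of their first segment."""
    return sorted(hits, key=lambda hit: parse_region(hit[1])[0][0])


def collapse_architecture(hits: List[Hit]) -> List[Tuple[str, int]]:
    """Collapse consecutive hits of the same superfamily into (name, count)."""
    ordered = order_by_position(hits)
    return [(name, len(list(group)))
            for name, group in groupby(ordered, key=lambda hit: hit[2])]


if __name__ == "__main__":
    for name, count in collapse_architecture(STRONG_HITS):
        print(f"{count:2d} x {name}")
```

Read this way, the assignments form one cellobiohydrolase-like domain, one YWTD domain, a run of eleven LDL receptor-like modules and three fibronectin type III domains. Sorting on the first segment only is a simplification that is adequate here because just one domain (number 1) is discontinuous.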
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Protein sequence

External link(s) Protein: ENSECAP00000017252   Gene: ENSECAG00000019083   Transcript: ENSECAT00000020979
Sequence length 2120
Comment pep:known chromosome:EquCab2:7:28934178:29071598:1 gene:ENSECAG00000019083 transcript:ENSECAT00000020979 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
QVSLNDSHNQMVVHWAGEKSNVIVALARDSLALVRPKSSDVYVSYDYGKSFKKISEKLNF
GVGNNSEAVISQFYHSPADNKRYIFADAYAQYLWITFDFCNTIQGFPIPFRAGDLLLHSK
AADLLLGFDRSHPNKQLWKSDDFGQTWVLIQEHVKSFSWGIDPYDEPTTIYIERHEPFGF
STVFRSTDFFQSLENQEVILEEVKDFQLRDKYMFATRVLYLLNSPEPSSVQLWVSFDRKP
MQAAQFVTRHPINEYYIADASEDQVFVCVSHSNNRTNLYISEAEGLKFSLSLENVLYYSP
GGAGSDTLVRYFANEPFADFHRVEGLQGVYIATLINGSMSEENMRSVITFDKGGTWEFLQ
APAFTGYGEKINCELSQGCSLHLAQRLSQLLNLQLRRMPILSKESAPGLIIATGSVGKNL
ASKTNVYISSSAGARWREALPGPHYYTWGDHGGIIMAIAQGMETNELKYSTNEGETWKTF
IFSEKPVFVYGLLTEPGEKSTVFTIFGSNKENVHSWLILQVNATDALGVPCTENDYKLWS
PSDERGNECLLGHKTVFKRRTPHATCFNGEDFDRPVVVSNCSCTREDYECDFGFKMSEDL
LLEVCVPDPEFPGRAYSPPVPCPEGSTYRRTRGYRKISGDTCSGGDVEARLEGELVPCPL
AEENEFMLYALRKSIYRYDLASGATEQLPLTGLRAAVALDFDYERNCLYWSDLALDIIQR
LCLNGSTGQEVIINSGLETVEALAFDPLSQLLYWVDAGFKKIEVANPDGDFRLTIVNSSV
LDRPRALVLMSQKGLMFWTDWGDLKPGIYRSNMDGSAVHRLVSEDVKWPNGIAVDDQWIY
WTDAYLDCIERITFSGQQRSIILDNLPHPYAIAVFKNEIYWDDWSQLSIFRASKYSGSQM
AILASQLTGLMDMKIFYKGKTTGSNACVPQPCSLLCLPKANNSKSCRCPEGVASSVLPSG
DLMCECPHGYQLRNNTCVKEENTCLRNQYRCSNGKCINSIWWCDFDNDCGDMSDERNCPT
TICDLDTQFRCQESGSCIPLSYKCDLEDDCGDNSDESHCEMHQCRSDEYNCSSGMCIRSS
WVCDGDNDCRDWSDEANCTAIYHTCEASNFQCHNGHCIPQRWACDGDTDCQDGSDEDPVN
CEKKCNGFRCPNGTCIPSSKHCDGLRDCSDGSDEQHCEPLCTRFMDFVCKNRQQCLFHSM
VCDGIVQCRDGSDEDPEFAGCSHDPEFRKVCDEFSFQCLNGVCISLIWKCDGMDDCGDYS
DEANCEYPTEAPNCSRYFQFQCENGHCVPNRWKCDRENDCGDWSDERDCGDSYLLPSPTP
EPSTCLPNYYRCSNGACVMDSWVCDGYRDCADGSDEEVCPSPANVTPASTPTQFGRCDRF
EFECHQPKKCIPNWKRCDGHQDCQDGQDEANCPTHSTLTCMSSEFKCEDGEACIVLSERC
DGFLDCSDESDEKACSDELTVYKVQNLQWTADFSGDVTLTWTRPKKMPSASCVYNVYYRV
VGESMWKTLETHSNKTNMILKVLKPDTTYQVKVQVQCLSKVHNTNDFVTLRTPEGLPDAP
QNLQLSLHREVEGVIVGQWTPPAHTHGLIREYIVEYSRSGSKMWASQRAASNFTEIKNLL
VSAQYTVRVAAVTSRGIGNWSDSKSITTIKGKVIPPPDIHIDSYSENSLSFTLSMDNDIK
VNGYVVNLFWAFDSHKQEKRTLNFQGSMLSHKVGNLTAHTAYEISAWAKTDLGDSPLAFE
HVTTKGVRPPAPSLKAKAINQTAVECTWTGPRNVVYGIFYATSFLDLYRNPKSLTTSLHN
KTVIVSRDEQYLFLVRVVVPYEGPSSDYVVVKMIPDSRLPPRHLHAVHITKTSAVLKWES
PYDSPDQDLLYAIAVKDLIRKSDRSYKVKSCNSTVEYTLNKLEPGGKYHIIVQLGNMSKD
SNIKITTVSLSAPDALKIITENDHVLLFWKSLALKEKYFNESRGYEIHMFDSAMNITAYL
GNTTDNFFKISNLKLGHNYTFTVQARCLFGSQICGEPAVLLYDDLGSGGDASAFQAARST
DVAAVVVPILFLILLSLGVGFAILYTKHRRLQSSFTAFANSHYSSRLGSAIFSSGDDLGE
DDEDAPMITGFSDDVPMVIA
Identical sequences: F7CH72, ENSECAP00000017252
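
The Comment field in this section is a space-separated list of `key:value` tokens. The sketch below splits it into a dictionary and unpacks the `chromosome` token. Interpreting that token as assembly, chromosome name, start, end and strand is an assumption based on the usual Ensembl FASTA header layout, not something stated on this page; `parse_comment` and `parse_location` are hypothetical helper names.

```python
from typing import Dict, Union

# Comment line copied verbatim from the record above.
COMMENT = (
    "pep:known chromosome:EquCab2:7:28934178:29071598:1 "
    "gene:ENSECAG00000019083 transcript:ENSECAT00000020979 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> Dict[str, str]:
    """Split each whitespace-separated token on its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> Dict[str, Union[str, int]]:
    """Unpack 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    location = parse_location(fields["chromosome"])
    # Inclusive length of the genomic interval, under the assumed field order.
    span = location["end"] - location["start"] + 1
    print(fields["gene"], location["name"], span)
```

Splitting on the first colon only keeps the multi-part `chromosome` value intact so that it can be unpacked in a separate step.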
