SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMICP00000010275 from Microcebus murinus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSMICP00000010275
Domain Number 1 Region: 782-1011
Classification Level Classification E-value
Superfamily YWTD domain 5.1e-43
Family YWTD domain 0.0000101
Further Details:      
 
Domain Number 2 Region: 1558-1742
Classification Level Classification E-value
Superfamily Fibronectin type III 9.53e-23
Family Fibronectin type III 0.0028
Further Details:      
 
Domain Number 3 Region: 95-282
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 1.54e-17
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.0092
Further Details:      
 
Domain Number 4 Region: 1932-2105
Classification Level Classification E-value
Superfamily Fibronectin type III 1.7e-16
Family Fibronectin type III 0.0056
Further Details:      
 
Domain Number 5 Region: 1415-1453
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000314
Family LDL receptor-like module 0.00092
Further Details:      
 
Domain Number 6 Region: 1152-1193
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000183
Family LDL receptor-like module 0.00077
Further Details:      
 
Domain Number 7 Region: 1322-1358
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000209
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 8 Region: 1075-1113
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000903
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 9 Region: 1196-1230
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000127
Family LDL receptor-like module 0.00077
Further Details:      
 
Domain Number 10 Region: 1369-1402
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000262
Family LDL receptor-like module 0.00095
Further Details:      
 
Domain Number 11 Region: 534-634
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 0.00000000523
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.039
Further Details:      
 
Domain Number 12 Region: 1469-1506
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000249
Family LDL receptor-like module 0.0015
Further Details:      
 
Domain Number 13 Region: 1509-1548
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000367
Family LDL receptor-like module 0.0012
Further Details:      
 
Domain Number 14 Region: 1747-1875
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000000165
Family Fibronectin type III 0.0067
Further Details:      
 
Domain Number 15 Region: 1120-1153
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000301
Family LDL receptor-like module 0.0012
Further Details:      
 
Domain Number 16 Region: 1273-1309
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000144
Family LDL receptor-like module 0.0026
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMICP00000010275   Gene: ENSMICG00000011249   Transcript: ENSMICT00000011283
Sequence length 2212
Comment pep:novel genescaffold:micMur1:GeneScaffold_2072:1331006:1504429:1 gene:ENSMICG00000011249 transcript:ENSMICT00000011283 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MATRSSRRESRLPFLFTLVALLPPWAVCEVWTQRLHGGRAPLPQDRGFLVVQGDPRELRL
WARGDARGASRADEKPLRRRRSAALQPEPIKVYGQVSLNDSHNQMVVHWAGEKSNVIVAL
ARDSLALARPKSSDVYVSYDYGKSFKKISEKLNFGVGNSSEAVIAQFYHSPADNKRYIFA
DAYAQYLWITFDFCNTVQGFSIPFRAADLLLHSKASNLLLGFDRSHPNKQLWKSDDFGQT
WILIQEHVKSFSWGIDPYDKPNTIYIERHEPSGYSTVLRSTDFFQSRENQEVILEEVRDF
QLRDKYMFATKVVHLLGGQQQLSVQLWVSFGRKPMRPAQFVTRHPINEYYIADASEDQVF
VCVSHSNNRTNLYISEAEGLKFSLSLENVLYYSPGGAGSDTLVXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALPGPHYY
TWGDHGGIIMAIAQGTETNELKYSTNEGETWKAFVFSEKPLFVYGLLTEPGEKSTVFTIF
GSNKENVHSWLILQVNATDALGVPCTENDYKLWSPSDERGNECLLGHKTVFKRRTPHATC
FNGEDFDRPVVVSNCSCTREDYECDFGFKMSEDLSLEVCVPDPEFAGKSYSPPVPCPVGS
TYRRTRGYRKISGDTCSGGDVEARLEGELVPCPLAXXXXXXXXXXXXXXXXXXXXXXATE
QLPVSGLRAAVALEFDYERNCLYWSDLALDIIQRLCLNGSTGQEVIISSGLETVEALAFE
PLSQLLYWVDAGFKRIEVANPDGDFRLTIVNSSVLDRPRALVLVPQDGVMFWTDWGDVKP
GIYRSDMDGSAIRRLVSEDVKWPNGISVDGQWIYWTDAYLDCIERISFSGQRRSVILDNL
PHPYAIAVFKNEIYWDDWTQLSIFRASKYSGAQLEILASQVTGLMDMKIFYKGKNTGSNA
CVPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNTCLR
NQYRCSNGNCINSIWWCDFDNDCGDMSDERNCPTTICDADTQFRCQESGTCIPLSYKCDL
EDDCGDNSDESHCEMHQCRSDEYNCSSGMCIRASWVCDGDNDCRDWSDEANCTAIYHTCE
ASNFQCRNGHCIPQRWACDGDTDCQDGSDDPVNCXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXPLCTRFMDFVCKNRQQCLFHSMVCDGIIQCRDGSDEDAAFAGCSHDPEF
HKECDEFGFQCQNGVCISLIWKCDGMDDCGDYSDEANCENPTEAPNCSRYFQFRCENGHC
IPNRWKCDRENDCGDWSDEKDCGDSHILPSPTPGPSTCLPNYYRCSSGACVMDTWVCDGY
RDCADGSDEEACPSLANATSASTPTQLGRCNRFEFECHQPKKCLPNWKRCDGHRDCQDGR
DEANCPTHSTLTCTSREFKCEDGEACILLSERCDGFLDCSDESDERACSDELTVYKVQNL
QWTADFSGDVTLTWMRPKKMPSASCVYNVYYRVVGESMWKTLETHSNKTNTVLKVLKPDT
TYQVKVQVQCLNKAHNTNDFVTLRTPEGLPDAPRNLQLSLHREAEGVIVGHWTPPIHTHG
LIREYIVEYSRSSKMWASQRAASNXXXXXXXXXXXXYTLGVAAVTSRGIGNWSDSKCITT
IKGKAIPPPDIHIDSYGENSLSFTLSMEGDIKVNGYVVNLFWAFDTHKQEKRTLNFRGSI
LSHKVGNLTAHTSYEISAWAKTDLGDSPLAFEHVMTRGVRPPAPSLKAKAVNQTAVECTW
TGPRNVVYGIFYATSFLDLYRNPKSLTTSLHNKTVIVSRDEQYLFLVRVVVPYQGPSSDY
VVVKMIPDSRLPPRHLHAVHTGKTSAVIKWESPYDSPDQDLLYAIAVKDLIRKTDRSYKV
KSCNSTVEYTLNKLEPGGKYHVIVQLGNMSKDSSIKITTVSLSAPDALKIITENDHVLLF
WKSLALKEKHFNESRGYEIHMFDSAMNITAYLGNTTDNFFKISNLKMGHNYTFTVQARCL
FGSQICGEPAVLLFDELGSGGDASAIQAARSTDVAAVVVPILFLILLSLGVGFAVLYTKH
RRLQSSFTAFANSHYSSRLGSAIFSSGDDLGEDDEDAPMITGFSDDVPMVIA
Download sequence
Identical sequences ENSMICP00000010275 ENSMICP00000010275

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]