SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org.

Domain assignment for ENSSARP00000012054 from Sorex araneus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSSARP00000012054
Domain Number 1 Region: 44-469,522-774
Classification Level  Classification                                        E-value
Superfamily           P-loop containing nucleoside triphosphate hydrolases  5.31e-224
Family                Motor proteins                                        9.6e-16
 
Domain Number 2 Region: 834-958
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  3.01e-28
Family                Myosin rod fragments  0.00000131
 
Domain Number 3 Region: 1223-1340
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  7.32e-28
Family                Myosin rod fragments  0.0049
 
Domain Number 4 Region: 956-1066
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  1.02e-24
Family                Myosin rod fragments  0.0013
 
Domain Number 5 Region: 1418-1540
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  0.00000000000000314
Family                Myosin rod fragments  0.0053
 
Weak hits

Sequence:  ENSSARP00000012054
Domain Number - Region: 1700-1922
Classification Level  Classification  E-value
Superfamily           Tropomyosin     0.000144
Family                Tropomyosin     0.0022
 
Domain Number - Region: 1554-1648
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  0.00785
Family                Myosin rod fragments  0.01
 
Domain Number - Region: 1340-1417
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  0.0209
Family                Myosin rod fragments  0.0094
 
Domain Number - Region: 1064-1171
Classification Level  Classification        E-value
Superfamily           Myosin rod fragments  0.0288
Family                Myosin rod fragments  0.0043
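
The region strings in the strong hits can be read programmatically. A region such as 44-469,522-774 lists two comma-separated segments, and the domain numbers do not follow sequence order (domain 4 starts before domain 3). The following is a minimal sketch, not part of SUPERFAMILY: the helper names parse_region and region_length are illustrative, and treating both endpoints of a segment as inclusive is an assumption.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Parse a region string such as "44-469,522-774" into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments: list[tuple[int, int]]) -> int:
    """Count residues covered, assuming both endpoints are inclusive."""
    return sum(end - start + 1 for start, end in segments)


# Region strings copied from the strong hits for ENSSARP00000012054.
strong_hits = {
    1: "44-469,522-774",
    2: "834-958",
    3: "1223-1340",
    4: "956-1066",
    5: "1418-1540",
}

parsed = {number: parse_region(text) for number, text in strong_hits.items()}

# Order domains by where they start in the sequence, not by domain number.
by_position = sorted(parsed, key=lambda number: parsed[number][0][0])

for number in by_position:
    print(number, parsed[number], region_length(parsed[number]))
```

Sorting by the first segment's start gives a left-to-right view of the architecture along the 1924-residue sequence.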
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000012054   Gene: ENSSARG00000013298   Transcript: ENSSART00000013349
Sequence length 1924
Comment pep:known_by_projection genescaffold:COMMON_SHREW1:GeneScaffold_3620:84933:112995:-1 gene:ENSSARG00000013298 transcript:ENSSART00000013349 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSSDAEMAVFGEAAPYLRSFEERVQTSFDSSVFVDPKESFVKATVQSRELGKVTVKTEGG
TTVTVKDDQVYPMNPPKYDKIEDMAMMTHLHEPAVLYNLKERYAAWMIYTYSGLFCVTVN
PYKWLPVYNAEVVAAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTV
NTKRVIQYFATIAVTGDKKKEEAPSGKMQGTLEDQIISANPLLEAFGNAKTVRNDNSSRF
GKFIRIHFGTTGKLASADIETYRLEKSRVTFQLKAERSYHIFYQIMSNKKPDLIEMLLIT
TNPYDYAFVSQGEITVPSIDDQEELMATDSAIDILGFTSDERVSIYKLTGAVMHYGNMKF
KQKQREEQAEPDGTEVADKAAYLQSLNSADLLKALCYPRVKVGNEYVTKGQTVQQVYNSV
GALAKAMYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPMGIFSILEEECMFPKAT
DTSFKNKLYEQHLGKSNNFQKPKPAKGKAEAHFSLVHYAGTVDYNIAGWLDKNKDPLNET
VLGLYQKSPMKTLAYLFSGAAAAAEAGGGKKGAKKKGSSFQTVSALFRENLNKLMTNLRS
THPHFVRCLIPNETKTPGAMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADFKQRYKV
LNASAIPEGQFIDSKKASEKLLGSIDIDHTQYKFGHTKVFFKAGLLGLLEEMRDEKLAQL
ITRTQAMCRGYLARVEYQKMVERRESIFCIQYNVRAFMNVKHWPWMKLYFKIKPLLKSAE
TEKEMANMKEEFEKAKENLAKAEAKRKELEEKMVALMQEKNDLQLQVQSEADSLADAEER
CDQLIKTKIQLEAKIKEVTERAEDEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKV
EKEKHATENKVKNLTEEMAGLDETIAKLTKEKKALQEAHQQTLDDLQAEEDKVNTLTKAK
IKLEQQVDDLEGSLEQEKKIRMDLERAKRKLEGDLKLAQESTMDVENDKQQLDEKLKKKE
MSLQSKEDEQAGMQLQKKIKLARIEELEEEIEAERASRAKAEKQRSDLSRELEEISERLE
EAGGATSAQIEMNKKREAEFQKMRRDLEEATLQHEATAATLRKKHADSVAELGEQIDNLQ
RVKQKLEKEKSEMKMEIDDLASNMETVSKAKGNLEKMCRTLEDQVSELKTKEEEQQRLIN
DLTAQRGRLQTESGEYSRQLDEKDSLVSQLSRGKQAFTQQIEELKRQLEEEIKAKSALAH
ALQSSRHDCDLLREQYEEEQEAKAELQRAMSKANSEVAQWRTKYETDAIQRTEELEEAKK
KLAQRLQDAEEHVEAVNAKCASLEKTKQRLQNEVEDLMIDVERTNAACAALDKKQRNFDK
VLAEWKQKYEETHAELEASQKESRSLSTELFKIKNAYEESLDHLETLKRENKNLQQEISD
LTEQIAEGGKRIHELEKIKKQIEQEKSELQSALEEAEASLEHEEGKILRIQLELNQVKSE
IDRKIAEKDEEIDQLKRNHIRVVESMQSTLDAEIRSRNDAIRIKKKMEGDLNEMEIQLNH
SNRMAAEALRNYRNTQGILKDTQIHLDDALRGQEDLKEQLAMVERRANLLQAEIEELRAT
LEQTERSRKIAEQELLDASERVQLLHTQNTSLINTKKKLETDISQIQGEMEDIVQEARNA
EEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNLEQTVKDLQHRLDEAEQLALKGGKKQ
IQKLEARVRELEGEVENEQKRNVETVKALRKHERRVKELTYQTEEDRKNVLRLQDLVDKL
QAKVKAYKRQAEEAEEQSNVNLAKFRKIQHELEEAEERADIAESQVNKLRVKSREVHTKI
ISEE
Identical sequences ENSSARP00000012054 ENSSARP00000012054
