SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSARP00000000740 from Sorex araneus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSARP00000000740
Domain Number 1 Region: 57-838
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 3.9e-284
Family Motor proteins 2.76e-17
Further Details:      
 
Domain Number 2 Region: 1235-1353
Classification Level Classification E-value
Superfamily Myosin rod fragments 7.46e-29
Family Myosin rod fragments 0.0022
Further Details:      
 
Domain Number 3 Region: 976-1072
Classification Level Classification E-value
Superfamily Myosin rod fragments 1.02e-19
Family Myosin rod fragments 0.0012
Further Details:      
 
Domain Number 4 Region: 1430-1552
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.0000000000000654
Family Myosin rod fragments 0.006
Further Details:      
 
Domain Number 5 Region: 847-893
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.000000000222
Family Myosin rod fragments 0.00035
Further Details:      
 
Domain Number 6 Region: 1619-1839
Classification Level Classification E-value
Superfamily Tropomyosin 0.000000032
Family Tropomyosin 0.00082
Further Details:      
 
Weak hits

Sequence:  ENSSARP00000000740
Domain Number - Region: 1069-1183
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.00183
Family Myosin rod fragments 0.0071
Further Details:      
 
Domain Number - Region: 1808-1912
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.00392
Family Myosin rod fragments 0.0091
Further Details:      
 
Domain Number - Region: 1353-1433
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.0262
Family Myosin rod fragments 0.01
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000000740   Gene: ENSSARG00000000789   Transcript: ENSSART00000000805
Sequence length 1940
Comment pep:known_by_projection genescaffold:COMMON_SHREW1:GeneScaffold_1654:22873:47781:-1 gene:ENSSARG00000000789 transcript:ENSSART00000000805 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSSDSEMEVYGIAAPFLRKSEKERIEAQNQPFDAKTYCFVVDAKEEYVKGKIKNTQDGKV
TVETEDNRTLVVKPEDVYAMNPPKFDRIEDMAMLTHLNEPAVLYNLKERYTSWMIYTYSG
LFCVTVNPYKWLPVYNPEVVDGYRGKKRQEAPPHIFSISDNAYQFMLTXXXXXXXXXXGE
SGAGKTVNTKRVIQYFATIAATGDLAKKKDSKMKGTLEDQIISANPLLEAFGNAKTVRND
NSSRFGKFIRIHFGTTGKLASADIETYLLEKSRVTFQLKAERSYHIFYQILSNKKPELIE
LLLITTNPYDYPFISQGEILVASIDDAEELLATDSAIDILGFTPEEKSGLYKLTGAVMHY
GNMKFKQKQREEQAEPDGTEVADKTAYLMGLNSSDLLKALCFPRVKVGNEYVTKGQTVDQ
VHHAVNALSKSVYEKLFLWMVTRINQQLDTKLPRQHFIGVLDIAGFEIFEYNSLEQLCIN
FTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLAACIELIEKPMGIFSILEEECM
FPKASDTSFKNKLYDQHLGKSNNFQKPKVVKGKAEAHFSLVHYAGTVDYSVSGWLEKNKD
PLNETVVGLYQKSSNRLLAHLYATFATTDADSGKKKVAKKKGSSFQTVSALFRENLNKLM
SNLRTTHPHFVRCIIPNESKTPGAMEHSLVLHQLRCNGVLEGIRICRKGFPNRILYGDFK
QRYRVLNASAIPEGQFIDSKKACEKLLASIDIDHTQYKFGHTKVFFKAGLLGVLEEMRDD
RLAKLITRTQAVCRGFLMRVEFQKMVQRRESIFCIQYNIRSFMNVKHWPWMKLYFKIKPL
LKSAETEKEMATMKEEFQKTRDELAKSEAKRKELEEKLVTLVQEKNDLQLQVQAXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXVKNLSEELAGLDETIEKLTREKKALQEAHQQTLDDLQAEEDKVNS
LSKIKSKLEQQVDDLESSLEQEKKLRVDLERNKRKLEGDLKLAQESILDLENDKQQLDER
LKKKDFEYSQLQSKMEDEQTLGLQFQKKIKELQARIEELEEEIEAERATRAKTEKQRSDY
ARELEELTERLEEAGGVTSTQIELNKKREAEFLKLRRDLEEVTLQHEATVAALRKKHADS
VAELGEQIDNLQRVKQKLEKEKSEFKLEIDDLASNMESVSKSKANLEKICRTLEDQLSEA
RGKNEEIQRSMSELATQKSRLQTEAGELSRQLEERESIVSQLSRSKQAFTQQIEELKRQL
EEESKAKNALAHALQSSRHDCDLLREQYEEEQEAKAELQRAMSKANSEVAQWRTKYETDA
IQRTEELEEAKKKLAQRLQDSEEQVEAVNAKCASLEKTKQRLQGEVEDLMVDVERANSLA
AALDKKQRNFDKVLAEWKTKCEESQAELEASLKESRTLSTELFKLKNAYEEALDQLETVK
RENKNLEQEIADLTEQITENGKTIHELEKSRKQMELEKADVQLALEEAEAALEHEEAKIL
RIQLELTQVKSEIDRKIAEKDEEIEQLKRNYQRMVETMQSTLDAEVRSRNEAIRIKKKME
GDLNEIEIQLSHANRQAAETLKHLRSVQGQLKDTQLHLDDALRGQEDLKEQLAIVERRAN
LLQAEVEELRASLEQTERARKLAEQELLDSNERVQLLHAQNTSLIHTKKKLETDLTQLQN
EVEDASRDARNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNMEQTVKDLQHRLDE
AEQLALKGGKKQIQKLEMRIRELEFELEGEQKKNTESVKGLRKYERRIKELTYQSEEDRK
NMLRLQDLVDKLQVKVKSYKRQAEEADEQANAHLTKFRKAQHELEEAEERADIAESQVNK
LRAKTRDFTSSRMVIHESEE
Download sequence
Identical sequences ENSSARP00000000740 ENSSARP00000000740

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]