SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for A0A1S4H334 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  A0A1S4H334
Domain Number 1 Region: 58-837
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 2.21e-282
Family Motor proteins 0.0000000000000248
Further Details:      
 
Domain Number 2 Region: 1234-1350
Classification Level Classification E-value
Superfamily Myosin rod fragments 6.8e-24
Family Myosin rod fragments 0.0024
Further Details:      
 
Domain Number 3 Region: 955-1072
Classification Level Classification E-value
Superfamily Myosin rod fragments 2.49e-22
Family Myosin rod fragments 0.0031
Further Details:      
 
Domain Number 4 Region: 838-963
Classification Level Classification E-value
Superfamily Myosin rod fragments 4.84e-22
Family Myosin rod fragments 0.00025
Further Details:      
 
Domain Number 5 Region: 1430-1551
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.000000000314
Family Myosin rod fragments 0.0091
Further Details:      
 
Domain Number 6 Region: 1629-1744
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.00000000602
Family Myosin rod fragments 0.0019
Further Details:      
 
Domain Number 7 Region: 1066-1182
Classification Level Classification E-value
Superfamily Myosin rod fragments 0.0000123
Family Myosin rod fragments 0.0035
Further Details:      
 
Domain Number 8 Region: 1771-1929
Classification Level Classification E-value
Superfamily Tropomyosin 0.0000582
Family Tropomyosin 0.0022
Further Details:      
 
Weak hits

Sequence:  A0A1S4H334
Domain Number - Region: 1201-1243
Classification Level Classification E-value
Superfamily EB1 dimerisation domain-like 0.085
Family EB1 dimerisation domain-like 0.0054
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) A0A1S4H334
Sequence length 1961
Comment (tr|A0A1S4H334|A0A1S4H334_ANOGA) Myosin heavy chain {ECO:0000313|VectorBase:AGAP010147-PF} OX=7165 OS=Anopheles gambiae (African malaria mosquito). GN= OC=Culicidae; Anophelinae; Anopheles.
Sequence
MPKPPVQVGEDPDPTEFLFVSLEQKRIDQSKPYDSKKACWVPEEKEGYVLGEIKATKGEL
VTVALPGGEEKNFKKEQLSQVNPPKFEKVEDMADLTYLNEAAVLHNLRQRYYSKLIYTYS
GLFCVVINPYKRYPLYTNRCAKMYRGKRRNEVPPHLFAVSDGAYVNMLTNHENQSMLITG
ESGAGKTENTKKVIAYFATIGASGKKDENAEKKGSLEDQVVQTNPVLEAFGNAKTVRNDN
SSRFGKFIRIHFTGSGKLAGADIETYLLEKARVISQQTLERSYHIFYQIMSGSVKGLKEK
CFLSNDVYDYMIIAQGKTTIPNVDDGEEMGLTDEAFNVLGFTQEEKDNIYRITSAVMHMG
RMQFKQKGREEQAEADGTEDGDRVAKLLGVGTDDLYKNLLKPRIKVGNEFVTKGQNKDQV
TNSVGALCKGIFDRLFKWLVKKCNETLDTKQKRAQFIGVLDIAGFEIFDFNGFEQLCINF
TNEKLQQFFNHHMFVLEQEEYKKEGINWAFIDFGMDLLACVELIEKPMGILSILEEESMF
PKATDQTFAEKLMTNHLGKSAPFMKPRPPKPGIPAGHFAIGHYAGVVSYNITGWLEKNKD
PLNDTVVDQFKKGSNALMVEIFADHPGQSADPAAAKGGRGKKGAGFATVSSSYKEQLNNL
MTTLKSTQPHFVRCIIPNEMKTAGVVDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDF
KLRYKILCPQLIKEPCSPEKVTQIVLTHIQLPEEQFRMGKTKVFFRAGVLGQMEEFRDER
LSKIMSWMQAWCRGYLSRKEFKKMQEQRVSLEIVQRNLRKYLKLRTWAWWKLWQKVKPLL
NVSRVEDQIAKLEEKATKAQEAYEKEEKLRKELEALNSKLLAEKTALLDSLSGEKGALQE
YQEKAAKLTAQKNDLENQLRDTQERLAQEEDARNQLFQTKKKLEQEIGSQKKDAEDLELQ
IQKIEQDKASKDHQIRNLNDEIAHQDELINKLNKEKKMQGEVNQKTAEELQAAEDKVNHL
NKVKAKLEQTLDELEDSLEREKKLRGDVEKAKRKVEGDLKLTQEAVADLERNKKELEQTV
LRKDKEISALSAKLEDEQSLVGKLQKQIKELQARIEELEEEVEAERQARAKAEKQRADLA
RELEELGERLEEAGGATSAQIELNKKREAELAKLRRDLEEANIQHEGTLANLRKKHNDAV
AEMAEQVDQLNKLKTKAEKERTQYFAELNDARIGCDQLSNEKAAQEKIAKQLQHTLNEVQ
SKLDETNRTLNDFDASKKKLSIENSDLLRQLEDAESQVSQLSKIKISLTQQLEDTKRLAD
EEARERATLLGKFRNLEHDLDNLREQVEEEAEGKGDIQRQLSKANAEAQLWRSKYESEGV
ARAEELEEAKRKLQARLAEAEETIESLNQKCIALEKTKQRLATEVEDLQLEVDRASSIAN
AAEKKQKAFDKIIGEWKLKVDDLAAELDASQKECRNYSTELFRLKGAYEEGQEQLEAVRR
ENKNLADEVKDLLDQIGEGGRNIHEIEKSRKRLEAEKDELQAALEEAEAALEQEENKVLR
AQLELSQVRQEIDRRIQEKEEEFENTRKNHQRALDSMQASLEAEAKGKAEALRMKKKLEA
DINELEIALDHANKANAEAQKNIKRYQQQLKDVQSALEEEQRARDDAREQLGISERRANA
LQNELEESRTLLEQADRGRRQAEQELSDAHEQLNEVSAQNASIAAAKRKLESELQTLHSD
LDELLNEAKNSEEKAKKAMVDAARLADELRAEQDHAQTQEKLRKALEQQIKELQVRLDEA
ESNALKGGKKAIQKLEQRVRELESELDSEQRRHADAQKNLRKSERRIKELTFQSEEDRKN
HERMQDLVDKLQQKIKTYKRQIEEAEEIAALNLAKFRKAQQELEEAEERADIAEQAATKF
RTKGGRAGSVQRGASPAPQRQPSAMPALAGLNLPTFDDHGF
Download sequence
Identical sequences A0A1S4H334
AGAP010147-PF|hypothetical

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]