SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for A0A1I8M203 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  A0A1I8M203
Domain Number 1 Region: 2851-3151
Classification Level | Classification | E-value
Superfamily | BEACH domain | 2.88e-127
Family | BEACH domain | 0.000000000196
 
Domain Number 2 Region: 3273-3532
Classification Level | Classification | E-value
Superfamily | WD40 repeat-like | 5.23e-34
Family | WD40-repeat | 0.0027
 
Domain Number 3 Region: 2740-2846
Classification Level | Classification | E-value
Superfamily | PH domain-like | 1.6e-25
Family | PreBEACH PH-like domain | 0.000021
 
Domain Number 4 Region: 35-217
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 1.34e-23
Family | Clostridium neurotoxins, the second last domain | 0.011
 
Domain Number 5 Region: 315-427,514-638
Classification Level | Classification | E-value
Superfamily | ARM repeat | 0.00000023
Family | Armadillo repeat | 0.057
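
The strong hits above are numbered in order of superfamily E-value, not by position along the protein, and Domain Number 5 spans two segments. As a minimal sketch (not part of the SUPERFAMILY server), the following Python parses the Region strings and recovers the N-to-C order of the domains. The names `parse_region`, `covered_residues`, and `n_to_c_order` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
from typing import Dict, List, Tuple

# Superfamily names and Region strings copied from the strong-hits table.
DOMAINS: Dict[str, str] = {
    "BEACH domain": "2851-3151",
    "WD40 repeat-like": "3273-3532",
    "PH domain-like": "2740-2846",
    "Concanavalin A-like lectins/glucanases": "35-217",
    "ARM repeat": "315-427,514-638",
}


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split 'a-b,c-d' into [(a, b), (c, d)].

    Assumption: coordinates are 1-based and inclusive.
    """
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(region: str) -> int:
    """Count residues covered by all segments of a region."""
    return sum(end - start + 1 for start, end in parse_region(region))


def n_to_c_order(domains: Dict[str, str]) -> List[str]:
    """Sort domain names by the start of their first segment."""
    return sorted(domains, key=lambda name: parse_region(domains[name])[0][0])


if __name__ == "__main__":
    for name in n_to_c_order(DOMAINS):
        print(f"{DOMAINS[name]:>18}  {covered_residues(DOMAINS[name]):>4} aa  {name}")
```

Under these assumptions the N-to-C order is Concanavalin A-like lectins/glucanases, ARM repeat, PH domain-like, BEACH domain, WD40 repeat-like.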
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) A0A1I8M203
Sequence length 3535
Comment (tr|A0A1I8M203|A0A1I8M203_MUSDO) Uncharacterized protein {ECO:0000313|VectorBase:MDOA000431-PG} KW=Complete proteome; Reference proteome OX=7370 OS=Musca domestica (House fly). GN=101897857 OC=Muscoidea; Muscidae; Musca.
Sequence
MLGVLASYSITVKELKLLFGTMKAVNGKWPRHSAKLLNVLRQMPHRNGPDVFFSFPGRKG
SAMVLPPLAKWPYENGFTFTTWFRLDPINSVNIEREKPYLYCFKTSKGVGYTAHFVGNCL
VLTSMKVKGKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCLVNGQLASSTEMAWFV
STNDPFDKCYIGATPELDEERVFCGQMSAIYLFSEALTTQQICAMHRLGPGYKSQFRFDN
ECYLNLPDNHKRVLYDGKLSNAIVFMYNPVATDGQLCLQSAPKGNVSYFVHTPHALMLQD
VKAVVTHSIHCTLNSIGGIQVLFPLFSQLDMAHEGISDIKRDPTLCSKLLGFICELVETS
QTVQQHMIQNRGFLVISFMLQRSSREHLTLEVLGSFLNLTKYLVTCLSANSDLLLKQLFC
FSFLTWQLLDHVLFNPALWIYTPANVQARLYSYLATEFLSDTQIYSNVRRVSTVLQTVHT
LKYYYWVVNPRAKSGIVPKGLDGPRPAQKDILAIRAYILLFLKQLIMIGQGVKEDELQSI
LNYLTTMHEDENLHDVLQMLISLMSEHPSSMVPAFDVKHGVRTIFKLLAAESQLIRLQAL
KLLGFFLSRSTYKRKYDVMSPHNLYTLLAERLLLYEESLSMPTYNVLYEIMTEHISQHIM
YNRHPEPESHYRLENPMMLKVVATLIRQSKQTESLIEVKKLFLSDMTLLCNSNRENRRTV
LQMSVWQEWLIAMAYIHPKNTEEQKISDMVYSLFRMLLHHAIKYEYGGWRVWVDTLAIVH
SKVSYEEFKLQFAQMYEHYERQRTDQITDPALRQARPISTISGWEREEMQQQHGSGNVVH
NATSVASLEDVPPVEEEVEEIDMEEESTQQDEEIEEPVEESKCACASDETNATAVAAPTC
EEKSTEVETEEETKEAASGDAAAIKSSISNISDVYNEHIKSEVLVVATAQCNGNASSTST
NTSANTTPAKTASLPPATAAAAGGPEALKKTLNIDDLEELELENAKAATSVEDAEAHVQE
VLKNSEKVLQECKLVADEMQEASSVIKDEEIELAVNEVVQGVLNNEKKQIKEPSMETSTK
AAEETKAELEESDKVSLLNTKNLLNNNLTEAEAEATTASTIDNNNVNNNNAETSSPTGAQ
EDNTTTQKTTDNGEGHDEKETKKTEQKQETNQENLLNNNDIQTEADATPETTTTNDHSKN
PETEEKKEEYNTKLPAEETSAKPNDLVELEVSSSLSTTVTTGETSEEISSLSPDTTISSP
SMMDDAPMLSINDEPATTPVEENKTEAAVVKDIVDELIDKVIEKSVVEEGETNNNDVDDK
EKEAEEFINETAQEIIDDVVKTAVEEATTTETKPLDESNLAEKKSTSPSPLPESTAEKEN
VEAIVQTVVDDLVEQTVSTIIDNATQEAETEIVEEISEQVAELQLDDAIVEQTEEETQTS
AADDLETVPPQLLLQEKIDDVRIEDDEYLEEPVSQHEQQTQTQQTEDVGSDAVEESQDQG
EQDVHQHQQQATQQQQHSASTQVENQHFDNAGKQQQQPATHAQQQQQQQQSQHQQQQQQQ
RSKSGSTRPMFSPGPTRPPFRIPEFKWSYIHQRLLSDVLFSLETDIQVWRSHSTKSVLDF
VNSSENAIFVVNTVHLISQLADNLIIACGGLLPLLASATSPNSELDVLEPTQGMPLEVAV
SFLQRLVNMADVLIFATSLNFGELEAEKNMSSGGILRQCLRLVCTCAVRNCLECKERTRY
NVGAMARDVPGAAHLQALIRGAQASPKNIVESITGQLSPVKDPEKLLQDMDVNRLRAVIY
RDVEETKQAQFLSLAIVYFISVLMVSKYRDILEPPAEPQIQRQSPIMQRSAGEPSGRPLF
PQWSHHVYPQFLPGSHHNHVTTANMQQQQQQTNMQQQQQQQHVQQHHHHQQQQQQQQQLY
YHQQMPSPPPHQHQQQQQSMQQTSHLQHQHHHLTHQQQQQQQQQQYYHHQAMNNAAVAAA
AASYATHCPSPPLANQSTSPSTSSTTNTSQPASTSSLSSLASQPHHQQQRHGQKHQHYQP
QQQQQQHMTAGAGANGGGHYNMMNGSQVLNGKISTPQQQQQYSNPSAAASAVDMNGAGYH
QHPHAHMAQPPQYMRTPSTSSSAMMMTNGLNGMTSAHQNGIVDYHSQHGGAGVVGGGGGM
MLNGGGSVGNQIANGNNPSLMHNNGNAGNPVAGNNHSSMIVGGGAMNNGRNMRNGLGGHV
LGSAGGVVGGGGVGMHGANVMASSSSAYKQNNGINNNYRYNGRSATTGTGGRTIQDGDYE
IIVVDENNPSVLADNDSHSSGPPSIKSDSECNSLNMNSTENEVPEVESSSEIMVDDNKPI
NSNDESWTDVNLNEDASVQATATGIIMPGGGSAGVDGKMRGDDSGMHSLHASHMQHSQHG
SERGDKPDSEISVVRVPDGYANSGNASGNSGNVAGVQRGSRPDDLPMKPPLVGQLPMTTP
SREASLTQKLEVALGPVCPLLREIMVDFAHYLSKTLVGSHGQELLMEGKGLTTFKNSHSV
VELVMLLCSQEWQNSLQKHAGLAFIELINEGRLLSHAMKDHIVRVANEAEFILNRMRADD
VLKHADFESQCAQTLLERREEERMCDHLITAARRRDNVIASRLLEKVRNIMCNRHGAWGD
SSGSVQKQTYWKLDAWEDDARRRKRMVQNPRGSSHPQATLKAALENGGPEDAILQTRDEF
HTQIAVSRAHQATQHTADLLDDAELLIEDRELDLDLTGPVNISTKAKLIAPGLVAPGTVS
ITSTEMFFEVDEDHPDFQKIEPEVLKYCDHLHGKWYFSEVRAIFSRRYLLQNVALEIFLA
SRTSILFAFPDQHTVKKVIKALPRVGVGIKYGIPQTRRASMMSPRQLMRNSNMTQKWQRR
EISNFEYLMFLNTIAGRTYNDLNQYPIFPWVLTNYETKDLDLSLPSNYRDLSKPIGALNP
SRRAYFEERYESWESDTIPPFHYGTHYSTASFTLNWLVRVEPFTTMFLALQGGKFDYPDR
LFSSVNLSWKNCQRDTSDVKELIPEWYFLPEMFYNSSGYRLGHREDGALVNDVELPPWAK
TPEEFVRINRMALESEFVSCQLHQWIDLIFGYKQRGPEAVRATNVFYYLTYEGSVDLDAI
ADPVMREAVENQIRNFGQTPSQLLMEPHPPRSSAMHLSPMMFSAMPDDLCQILKFYQNSP
IIHISANTYPQLSLPSVVTVTAGHQFAVNRWNCNYTASVQSPSYADSNQQQGAVKPLAID
PVLTAAQNTTHNNPMNRRHLGDNFSQMLKIRSNCFVVTVDSRFLIACGFWDNSFRVFNTE
SAKIVQIVFGHFGVVTCLARSECNITSDCYIASGSADCTVLLWHWNARTQSIVGEGDVPT
PRATLTGHEQAVTSVVISAELGLVVSGSTNGPVLIHTTFGDLLRSLDAPMDFHSPELIAM
SREGFIVVNYDKGNVAAYTINGKELRHETHNDNLQCMLLSRDGEYLMTAGDRGIVEVWRT
FSLAPLYAFPACNAGIRSLALTHDQKYLLAGLSTGSIVVFHIDFNRWHHEYQQRY
Identical sequences A0A1I8M203
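
The sequence listing above is wrapped at 60 residues per line (58 full lines plus a final line of 55, matching the stated length of 3535). As a hedged sketch, the Python below maps a residue position to its line and column in such a listing and slices out a region from the wrapped lines. The helper names `locate` and `extract` are hypothetical, and positions are assumed to be 1-based and inclusive.

```python
from typing import Iterable, Tuple

LINE_WIDTH = 60  # residues per line in the listing


def locate(position: int, line_width: int = LINE_WIDTH) -> Tuple[int, int]:
    """Return the (line, column) of a residue, both 1-based.

    Assumption: `position` is a 1-based residue index.
    """
    if position < 1:
        raise ValueError("position must be >= 1")
    line, column = divmod(position - 1, line_width)
    return line + 1, column + 1


def extract(sequence_lines: Iterable[str], start: int, end: int) -> str:
    """Join wrapped lines and return residues start..end (1-based, inclusive)."""
    sequence = "".join(line.strip() for line in sequence_lines)
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("region lies outside the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Start of the BEACH domain region (2851-3151) in the wrapped listing.
    print(locate(2851))
    # Toy two-line listing, used only to illustrate the slicing.
    print(extract(["ABCDE", "FGHIJ"], 4, 7))
```

With this layout, residue 2851 falls on the 48th sequence line at column 31, and the last residue (3535) falls on the 59th line at column 55.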
