SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSGMOP00000020089 from Gadus morhua 69_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGMOP00000020089
Domain Number 1 Region: 2924-3234
Classification Level  Classification  E-value
Superfamily           BEACH domain    3.79e-109
Family                BEACH domain    0.000000114
 
Domain Number 2 Region: 3352-3599
Classification Level  Classification    E-value
Superfamily           WD40 repeat-like  1.31e-32
Family                WD40-repeat       0.0063
 
Domain Number 3 Region: 2825-2928
Classification Level  Classification           E-value
Superfamily           PH domain-like           2.09e-22
Family                PreBEACH PH-like domain  0.012
 
Weak hits

Sequence:  ENSGMOP00000020089
Domain Number - Region: 251-269,401-456,2100-2430,2509-2526,2602-2667
Classification Level  Classification    E-value
Superfamily           ARM repeat        0.000403
Family                Armadillo repeat  0.057
 
Domain Number - Region: 236-295,324-458,733-833,904-949
Classification Level  Classification                      E-value
Superfamily           ARM repeat                          0.00151
Family                Clathrin heavy-chain linker domain  0.094
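
The Region fields above can be read programmatically. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the function names (parse_region, region_length, overlap) are hypothetical, and it assumes that region coordinates are 1-based and inclusive and that a comma separates the segments of a discontinuous domain. Under those assumptions it computes how many residues each assignment covers and how many residues two assignments share.

```python
def parse_region(region):
    """Turn a region string such as '251-269,401-456' into a list of
    (start, end) tuples. Assumes 1-based, inclusive coordinates."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments):
    """Total number of residues covered by a list of segments."""
    return sum(end - start + 1 for start, end in segments)


def overlap(a, b):
    """Number of residues shared between two segment lists."""
    shared = 0
    for s1, e1 in a:
        for s2, e2 in b:
            shared += max(0, min(e1, e2) - max(s1, s2) + 1)
    return shared


# Region strings copied from the strong hits listed above.
strong_hits = {
    "BEACH domain": "2924-3234",
    "WD40 repeat-like": "3352-3599",
    "PH domain-like": "2825-2928",
}

for name, region in strong_hits.items():
    print(name, region_length(parse_region(region)))

# Residues assigned to both the PH domain-like and BEACH domain hits.
print(overlap(parse_region(strong_hits["PH domain-like"]),
              parse_region(strong_hits["BEACH domain"])))
```

The same functions handle the multi-segment weak hits, for example parse_region("251-269,401-456,2100-2430,2509-2526,2602-2667").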
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGMOP00000020089   Gene: ENSGMOG00000018634   Transcript: ENSGMOT00000020586
Sequence length 3614
Comment pep:novel genescaffold:gadMor1:GeneScaffold_612:591963:662232:1 gene:ENSGMOG00000018634 transcript:ENSGMOT00000020586 gene_biotype:protein_coding transcript_biotype:protein_coding
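
The Comment line packs several fields into one string. As an illustration only, the Python sketch below splits it into key/value pairs. It assumes that whitespace separates the fields, that the first colon in each field separates key from value, and that the genescaffold value follows the layout assembly:name:start:end:strand; that layout is inferred from the string itself and is not confirmed by this page. The function names are hypothetical.

```python
def parse_comment(comment):
    """Split a header comment into a dict, using the text before the
    first ':' of each whitespace-separated field as the key."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Split a location value, assumed to be laid out as
    assembly:name:start:end:strand."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


comment = (
    "pep:novel genescaffold:gadMor1:GeneScaffold_612:591963:662232:1 "
    "gene:ENSGMOG00000018634 transcript:ENSGMOT00000020586 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

fields = parse_comment(comment)
location = parse_location(fields["genescaffold"])

# Span of the gene on the scaffold, treating both ends as inclusive.
span = location["end"] - location["start"] + 1
print(fields["gene"], location["name"], span)
```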
Sequence
MSGASNTLAREFLTDVHQLCSAVAQRAESREEDEEESHMVALGEYLVRGRGFLLLSTLDS
IIDQELTCREELLTLLLSLLPLVWKIPVQEDKAPDFILPSLLEVFLSRESRSPPLRAGQK
ARPDAQSSGRRSHTGSASWKSRRSRRIAQRYSVKEARQSQLSTSDSDANADTKAPGPGQG
GTRGRRTHGSAQRTPHHSQAPPIPVSVATTTETMTASYPHPWGPGSHSLDAETLTDPAAM
LIFNRMENSPFDLCHVLLSLLEKVCKFDMSINHNPGLAVSVVPTLTEFLTEFGDCCGPGG
GGGGGGGGAGAEELAGGWTEEPVALVQRMLLRTVLHLMSVDVGQSEALPDSLRRSLTDLL
RATLKIRSCLDRQADPFAPRPKKTLQEVQDDFSFSRYRHRALLLPELLEGVLQVLLGCLQ
ASASNPFFFSQALELIHEFIQHRGLELFEVTVLRLEGLARARDSEVGGEAAERVRGLVGG
VLKIISAVKKAKSEQLHQSVCARRRHRRCEYSHFLHHHRDLSGLPVSAFKQAARRNPFEE
EEAEGAAVRYPERCCCLAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXPGLRTYQSHVLNVLGRLILDQLGGGQPSEKAKLASCNICTLDSSQLPGLEE
TLQQGEPFAASLGPGHRSQGILPSGGDAEDMLWKWGALEAYQALVFGEDRQLSQQIAGHV
CQLTLRGNVLVQWQLYTHIFNPVMQRGVELVHHAQQLGVSAVCAQMCSYHSQCLPVEVLL
VYLQTLPALLKSXXXXXLFISCNGLSQITELIHLDQTRSWALKVFETLILRAGGQPADGA
LQELESDVGAQERDSVLGPTEDPGSQRARTAGARRRGRARARPARRTRGPRRCRRXXXXX
XXRQLEEEWPLQSIRLLEALLAICLHSSSPALQRTEPEMSFQLQSVEETLCEVRDQLSRS
GVVNSDLAVPLFDSLLRVALARVSSCPDGPEEKPDRVSLPWGLQVPVESVAPAGDLCEEV
EEAQGCHGGKAAGEEEEGYDADSESNPDDMAKQEEGAEAESAAVRELSAVADGARGGALL
FPEICSMELQLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGVKTILSGFHKVL
TQTDPSYKDCQVVLVELLVAMVSQRITADELALLIRLFLEKTAPVDILLKGILQIVEANM
DLEPLHFLSFPIVLGAPTTAGASPGPRQNGSAGGGGGKGVGLLWKGKLSPRRDGDAEPRT
GHVRSSPWHAAPLHLPLVGQNCWPHMASGFSASMWLRLTEAEAEEKGKEAGKAPAPEAQR
GSSPAGQRALDEGLVHVLSMGSKALMLQVWADFCTGSLTFRICIDPNDEIKAGLLAQAES
GPGLLVPGCWQHLGLTYSQQPEGKKNIQGRMVVWVCGIRKSDVSLDYTLPRKSSLSSDSN
KTFCMLGHSLLSSEEPLRQGVRWNLGTVLLFNGSRIGSEEAFYLYASGPDLTSIMPCKYG
KPSGTFSKYVTQEGLKCDHVRELLMKSKDVDTSALVESLAVVYTPSSPRVYTIYEPVIRL
KGQAKTVVTQRPFSSKEVQSSTLEAPALRALLPIEPQGLQNVLHKIGGTATFVFLFAQSV
ELSDCEQTQALALQVLLSLAKYNQHRIHEMDCCHGYSMIHQVLIKAKCIVGYHMLKTLLD
GCCSGPVLTLGEDGQFRLDLESAAVVQDIQLLSEVLLDWKIWAKAECGVWETLLAALEIL
IRVHHPQQVFNIRQFLKAQVVHRFLLTCQVLQENRDQYLTAIPQEVCLSFVKIIQEVLGS
PPDLDLLKLVYNFLLAVHPPTNTYVCHTPTSFYFSLHIDGKLYREKVQSIMYLRHSHSGG
KSASSSVFSLSPTVFPDLHPDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTTEDCLVLI
CCGLYDLLRGLLLLLPDLMLEEVMDKLIQPEALIVLVNHPSPLIQQGVMKLLDAYFTRAS
KEQKEKFLKNHGFSLLANQLYLHQGSQGLLECFLEMLFGRPVSLEEGLDLEDMEGISPFR
KRCIIPMLGLIENSLYENALLHNLLCMLLQLLNACPKLADILLDHGLLYVLFNTLSTLNG
MESGIPLNDYKLLVCDIQQLLVAVTIHSCSSSGSQYFRIIEDLISLLGFMQTSKMRRTQE
MAVALQFRVLQSAIEFIRTTANQDPQKLSNSMNLPSSPHQAIHPKRKSISGECRRRFSMV
QPDLLLIRMRSVASDELSQMMHRRMSQENPIRASETEFVQRLQRLVVLAVNRLIYHDMSQ
DFYQLLNFPETPDHLLSTPEPADPGHDEGCSSSSSVPASPVPFPLPPASKKSFQRDILRL
MMEGIKLSLXXXXXXXXXXQQWRRILWSSRDTFRIQIGRLLVHTLSPAQPLADRKEALDF
VFDPRHGDILRESLSPGLEHGPKLVLSLHELLHDHKEGLSREEQNAAAVFMTSLKLCGHR
CIPPSAPHKPDILKAIKEEKLKYEADEKTSRLAWEKKMTNTQKSLIQRLDGKSRDISKIA
ADITQSVSLRQGMERKKVILHIRSLYKTNLSASRRWQELVQQHSHDRAVWYDPASYPTSW
QLDPTEGPNRERRRLQRCYLTIPNKYLLKDRRKAEDAAKQPLSFLFEDNTHSSFSSTVKD
KATSEPIRFTRRCISVAPSVETAGELLLGKSGMYFVEDNATDAHDSQSPHSELEAASFSW
TYEEIKEVHKRWWQLRDNAVEIFLTNGRTLLLAFDTTKFRDDVYHNILTSDLPNLLEHGN
ISALTQLWGSGQISNFEYLTHLNKHAGRSFNDLMQYPVFPFILRDYISETLDLQEPGIYR
NLNKPIAVQSKEKEDRYVDNYKYLEEEYKKGIREDDPMPPVQPYHYGSHYSNSGTVLHFM
VRMPPFTKMFLAYQDQSFDIPDRTFHSMNTTWRLSSFESMTDVKELIPEFFYLPEFLVNR
EGFDFGVRQNSERVNHVNLPPWARNDPRLFVLIHRQALESDQVSHTLCQWIDLVFGLKQK
GRAAVHAINVFHPATYFGMDVSAVEDPVQRRALETMIKTYGQTPRQLFSGTHVGRAGPRL
LMDGELPNAVGLLVQLAFRXXXXXXXXXXXXSPLPWIKGLKWGEYVGSPSAPDPVVCFSQ
PHGERFGSLLALPTRAICGLSRKFCLMMIYSKEQGVRSMHSTDIQWSAILSWGYADNMLR
LKSKQSEPPINFIQSSQLHQVTSCAWVPDGCQLFTGSKCGVITAYSNRFTSTTPSEMEVE
SQVHLYGHTGEVTGLFVCKPYSILISVSQDGTCILWDLNRLCYVQSLTGHKSPVTAVSGS
ETTGDIATVCDSVGGGSDLRLWTVNGDLIGHVHCREIICSVAFSNQPEGVSVNVIAGGLE
NGVVRLWSTWDLKPVREITFPKSSKPIISLTYSCDGHHLYTANSEGTVMAWCRRDQQRMK
LPMFYSFLSSYAAG
Identical sequences ENSGMOP00000020089 ENSGMOP00000020089
