SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSONIP00000021833 from Oreochromis niloticus 69_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSONIP00000021833
Domain Number 1 Region: 2651-2702
Classification Level Classification E-value
Superfamily UBA-like 0.0000000395
Family TS-N domain 0.061
Further Details:      
 
Weak hits

Sequence:  ENSONIP00000021833
Domain Number - Region: 3644-3771
Classification Level Classification E-value
Superfamily Ricin B-like lectins 0.00288
Family Ricin B-like 0.027
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSONIP00000021833   Gene: ENSONIG00000017312   Transcript: ENSONIT00000021852
Sequence length 4402
Comment pep:novel scaffold:Orenil1.0:GL831582.1:170570:299430:-1 gene:ENSONIG00000017312 transcript:ENSONIT00000021852 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLEGLVAWVLNTYLGKYVSNLNTDQLSIALLKGAVELENLPLRKDALREFDLPFEVKAVG
FIGKITLQIPFYRPHSDPWVISMSQLNLIVGPAQLQEYDGERDKDEERERKKRLLKALED
KFKNECEQTGESYWYSVTASLVTRIVENIELKIQHVHLRFEDDFSNPEKPFSFGVCINNI
CAQNPSRELVQKLFRQKQLEIEDFSVYWDTECEMLGKLPINQIQDAMSQCMQSRDHQYIF
EPVCASVLLRRNTSKEPLRSRHTPRIEGQVQLESMSLRLSQVQYQQIMAFLKELDRRERE
MFYRKWRPKVPISGNCRQWWMFAIQANLSDIKEQRRRGSWDFALQRARDSLTYTSLYFRR
LKGVLSPQEESELERVEDEQTFEELQILREVVHEWFRKQEVIAENAREPSSEPQTCSSTT
SVPKSGSSGMIQYLQSWFPGWGGWYGESQDPDRPQELLQSPSSWNILETEELFDPLEDSH
TLNTFTRRDHLFARLDFLLEKGGVTLFHQDKCGDTLHESGVIQLEFSGVKVMVESLPRSE
SSLLSVKLGSLFLRDLTTQGTIFPVLVSPKTVSSDGRASHKNIEKLITFQSPTFPRCNAE
CVSCPVFEMIYERNPVRCKFERRLEVNTSPLNIIYNPQAIKEVIEFFYKGRIHTSDPGFG
YQSELELRVAEAARRQYNKLKMQTKAEIRQTIDQLLVGEFIENSKRWTMKLDISAPQVIF
PDDFQSGDPMLVVVDLGRILLTNSQDDNKSNSMATQPERENISDDEYQTPLATPPESPPP
ELDVPLKAQVKYPEFTSFRSLEGTQAYRQKLYEKYSLSFKDLQIMVGRYKDNWKHLQESE
VGPTHVVEKFNVLLQLEQRLRYTSDPQLPGAVLSGTLPDLKIHMNLEKMTALRSCLARLN
SPAVKPSDPLTLRHEKIFQREESSWKLQGSAKNLTQSVMTLEQHTREVLVESRLLLAEFN
INYMQLGVESDGRYISVLKVFGTNAHFVKRPYDAEVALTVHGLLLVDTLQTYGSDFDLLV
ASHKHLSFDIPTGSLRESQPSSPVSANGSSPPHQQSCTEQSSHLPSDGLSPFSSLFKDQE
ALIKLEYQFVSSDCPSMNLESSLQVTSMQVNNLDIILNPETMVELLKFLQKSFPKEEGTW
TSPVQHDKAQQHTEQDCGEEQDGIYQSTYDQNKELTVEIHRLNLLLLRTESTSTVLGTEK
KGLKIATASITGTKVNVSMGSRLDINGSLGCMQLVDLTQEGGRSQFVVSIGSVEDCSPSL
DGLTFLPDTRERSPAEALNFHLMEKSQGECSLKLQMASLHYNHSAKFLKELSLSANEVED
NFRSMLKRAATKVSTVLANKTAEYSGMVSLFETPSRKIRSLSQSWVYPFEDEEDVPMEEP
ESTLDTFLVKLTLNISIESPVVSIPRKPGHPELLVGHLGSITIQNFVAGQDSEEERLQVL
VKDICLYSLKMTNLVVKRLGKGVNAGSVSPTHDRSNTQDGPQFTRHDFFESLNKGKAFHI
LKDTTIQFTLEKVPVDRDAQFTLLTTEESFKSTGLLRIEGKFVNPVKVFLCKPVYEQVLQ
TLDNLSLTEEQHVTPSQPPTPPPPTPSCTKPHRFPDPQGGIFSHMSLSSPLPLQSNKPTM
DSSSFTQLKVTLHVAELQVHLSADLTQGSQGLVSLRFQDLEGDFTKDHPHLLEIQLALRS
LLMEDLLEQNPESKYKHLMVSRGAPKPSTFSPKEYLSQSCPSASNALYPVMPRSLPAHME
EAQNVFQLYQRHPSNPSSSSRKSKRGPDCPSTPPPSPSHHTPSPNPPPDFDESLVHISVQ
LVDQNHPEFRTHYGSVGRSVDVDFNCLDVLITLQTWVVILDFFGVGSTANNHAVKVPVTP
QPAPGEPLYEPDSGEKDKREPVNTKVDLKVHSLSLVLNKKLNELAKASVSKLSAHLEMFD
GDMALQGTLGSLSLSDLTPHGDLYRERFTTQGGEALIFNILKYGQPDPDLERECDIKVSL
QMASVHYVHTQRFQAEVAAFIQHFTQLQDVLGRQRAAVEGQLVRDQPQRASRVLLDIEAG
APVILIPESSWSPRLIVVNLGQLRVKNRFLPAGAYGTFSLRDKETRRVSTCSSTSCTGLG
RPERPQTVDTEKAPEKDGPRSLGQHLCSYCTIPKKTMNANAFVQNTMNLAVCGVSLSPLE
NHVCLLDCIALDLQEMDIFAAERLPSQPLDSGGKPLEYSDLVFPSYCVRRTGNNLLKDCC
RLKLKVERNLDKELSHVVPDMSIQGSLSSVHCSLDVDHYHLIRGLLENNLGEPIEEFLRP
YNLQDPSTYTVLSGDVYTNFSFLLDMMDVSLELLYNPQNSEHKCSLARFDFLKSKLLFES
FSNGSKSVNLVSHSLLAYDTRYTGLNKRAGEDDGTRHNVFDCILQASKTGANRASLQLEL
HYRSTRESSSFTVVLNNLRVFLIFDWVQLVGDFLQKPTEKIPSDARHHQRWPSNTSTDSG
SSTMASTIGTVMPKTVKSGVVTKRSTVSVTQERCLEIKINVTGTEFVVVEDSSCLDTNAI
ILKGTTVLTYKPRLLDRPFSGSLAGIEVFSCRLGSEQETALSIIDPVNIQLELCGSPTYQ
SSSGLLDAFNVEDIPPLLEIQFPALDIRLSYNDIQLFLAITKSLPTASAPLPHVSDTTAP
PTKGASPAPKDIFRQKTESLIEGQLTHLEDLGFRKEDCKTALIHCKGQLDQAATWLLENA
ENIAGHPRARANSGTSSHSAPLSGVEVKADSICICFIDDCLDCDIPLAELTFSRLYVLQR
IGSIQEGNASFTLSGDYYNRELSGWEPFIEPWPCILSWQQQAAGRLHPPRLKMGIRAKQR
LDINITSVLLEQYTTTKSSWIADYCNEEDQAPQTSPPMPWIGSSVDPPSFGQSAPLAHLR
TRSTASLTCLEQQIHSRDVKLSKKRQPFVPYALRNHTGCTMWFATMTTTPTRVALSHSSS
ADSISDVHSSGSDDSHNVSQWREVLPGEEIPFEFEAHEKLRHRHTHELKLHQLLVRVCGW
EQVKPVSVDKVGIFFRYAAPDRSSSSNTVGSPISRTNIIHPHVYFSALPPVRVVFSITME
GSARKVVTVRSALMVKNRLDVPMEVRLDSPSAPDKPVVLPPILPGQALAVPLHLTSWRLQ
SRPKGLGLFFCKVPIHWTTVERPGEISSSKRECQSADFDDSLKHSFRFCVVIKKENYPDQ
QPANKYVSGSTKQIYRQPGHTVYLLPTMVLANLLPCDISYYIKGMSIKGSLKPGKEAVLH
AADTSQNMELGVLLENFPVCKELLIPPGTQNYVVRMRLYDTNKRLLCLTIRIILRAQGAL
KILISAPYWLINKTGLPLIFRQDNTKTDAAGQFEEHELARSLSPLLFCYTDKEQPAMCTM
RIGKGIHPDGIPGWCQGFSLDGGSGVRAVKVIQHGNRPGLIYNIGISVRKGKGRYRDTHI
VTFAPRYLLDNRSTHKLAFAQREFARGKGTANPDGYISTLPGSSVVFHWPRNDYDQLLCV
RFMDIPNCTWSGGFEVNKPKSFHVNMSYLKKLPFLFKCSAASKGKIQKEFCFSDNKDQLE
KRLDNLLQVPIQFWQHGVVDMQLHTEVKPGAVLDYACDEPTLPPCLILTVKGAGSSEVTA
DMNFFREYNKLYYENFIYIAATHTFSQTVERRLVGKKCLVSCAELVLDVDTKTQRVILRK
KEPGKRSQLWRMTGTGMLCHEGSSPPQSKPAQPRPLDSSLVLDIAGLAAVSDNSFEPLML
RRPDSRRSTTQTWYFSSGMLTCGLPRLVVQVKGGVAGLYDGAEVVLGPDSGLLEPSLEQQ
FINQKMRPGSGVLSVQVLPDGPTRVLQISDFNQRRMMRSSPSTEQDRGKDDVKKREAEQE
LEVLVNLEEGLGLSLVNKVPEELVFTTLSGIDVHFTRTAANEVLELSIHNIQVDNQLLGT
THPVMLCVTPSSSESSVSDSGPAMQVNSVKVPSSLMLTDLYKHLMVTARRFTVIIEEKLL
LKLLSFFGYGQTDAELEKLDENLRDKPNEDTGPPKRYYFENLKISLPQVKLSVFTSHKLP
AELKALKGTLGFPLVRFEDAVINMYPFTRVHPYETQEIIINDILKHFREELISQAAQILG
SVDFLGNPMGLLNDVSEGMTELFKHGNVGGLIRSVTHGVSNSAAKFAGTLSDGLGKTMDN
RHQNEREYIRYHGATSGEHLVAGIHGLAHGIIGGMTSIITSTVEGVKTEGGVGGFFSGLG
KGLVGTVTKPVAGALDFASETAQTVRDMASLSNHRLTVQRVRKPRCCKGPQGLLPRYSAT
QADGQEQLFHLTDNIHSEFFIAVEPIDTYCVLISSKVVYFLKPGDFVDREAIFLEVKYDD
LYHCLVSKDHGKVYVQLTKKAENTSSGVAIPGPSHQKPMVHVKSESLAIKISQEINYAKS
LYYEQQLMLPSTENEDCLLLES
Download sequence
Identical sequences I3KL38
ENSONIP00000021833 ENSONIP00000021833

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]