SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSONIP00000003524 from Oreochromis niloticus 69_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSONIP00000003524
Domain Number 1 Region: 1702-2067
Classification Level    Classification                                E-value
Superfamily             Cysteine proteinases                          7.06e-91
Family                  Ubiquitin carboxyl-terminal hydrolase, UCH    0.0000522
 
Domain Number 2 Region: 932-1023
Classification Level    Classification       E-value
Superfamily             Ubiquitin-like       0.00000000852
Family                  Ubiquitin-related    0.013
 
Domain Number 3 Region: 3-45
Classification Level    Classification    E-value
Superfamily             UBA-like          0.00000488
Family                  UBA domain        0.021
 
Domain Number 4 Region: 2219-2369,2419-2549
Classification Level    Classification                 E-value
Superfamily             ARM repeat                     0.00000896
Family                  Exportin HEAT-like repeat      0.045
 
Weak hits

Sequence:  ENSONIP00000003524
Domain Number - Region: 305-636
Classification Level    Classification                  E-value
Superfamily             ARM repeat                      0.00015
Family                  Diap1 N-terminal region-like    0.077
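
The Region values above can span several segments (for example 2219-2369,2419-2549). The following is a minimal sketch of how such strings might be parsed and used to slice a protein sequence. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names parse_region, region_length and extract_domain are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
def parse_region(region):
    """Parse a region string such as '2219-2369,2419-2549' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start < 1 or end < start:
            raise ValueError(f"invalid segment: {part!r}")
        segments.append((start, end))
    return segments


def region_length(region):
    """Total number of residues covered, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def extract_domain(sequence, region):
    """Concatenate the residues of every segment (assumes 1-based, inclusive coordinates)."""
    return "".join(sequence[start - 1:end] for start, end in parse_region(region))


# Region strings copied from the strong and weak hits listed above.
regions = {
    "Cysteine proteinases": "1702-2067",
    "Ubiquitin-like": "932-1023",
    "UBA-like": "3-45",
    "ARM repeat (strong)": "2219-2369,2419-2549",
    "ARM repeat (weak)": "305-636",
}

for name, region in regions.items():
    print(f"{name}: {region_length(region)} residues in {len(parse_region(region))} segment(s)")
```

To pull out an actual domain, pass the full protein sequence from the Protein sequence section as the first argument of extract_domain together with one of the region strings.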
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSONIP00000003524   Gene: ENSONIG00000002812   Transcript: ENSONIT00000003525
Sequence length 2632
Comment pep:novel scaffold:Orenil1.0:GL831140.1:7433480:7463332:1 gene:ENSONIG00000002812 transcript:ENSONIT00000003525 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
METEEEQHITTLLCMGFPDPDVIRKALRLAKNDINEAVALLTNESPGLGYGYEPMESGPN
PGLGSSGEGENSGRTGTGGFDPPPAYHDVVDSERSNDENGNCSGGSMEFPTTNLYELESR
VFTDHWSIPYKREESLGKCLIASSCLARHGLADADENCKRFMDRCMPEAFKKLLTSSAVH
KWGTEIHEGIYNMLMLLVELVAERVKQDPVPVNLMGVLTMALNPDNEYHFKNRMKACQRN
WAEVFGDEANMFAVSPSNTYQKEPHGWLVDLVNRFGELGGFTAIQTKLNTEEIEIAYVSA
LVQPLGVCAEYLNSSLVQPMLDPVIHKMITYVQNLEEKDLKDKRLVSIPDLLSAIKLLCM
RFQRELVTVVDDLRLDTLLRMLKTPHFSTKMNSLKEVTKLIEESTVSKSVKNAIDTDKLL
DWLVENSVLSIALEGNIDQAQYCERIKGIIELLGSKLSLDELSKIWRIQAGQSSTVIENI
HTIIAAAAVKFSFDQLTHLFVLIQKSWEVESDRVRQKLLSLIGRIGREARSETTTGKVLE
VLWELAHLPTLPTSLVQQALEEHLGILSDAYAVKELVKRSYIIKCIEDIKKASQQSSPQA
VWVVPALRQLHEITRSFIKQTYQKQDKSIIQDLKKNFEIVKLITGSLVCCHRLAVTAAGN
SGLSGSTLVDGRYTYQEYLDSHLRFLAFFLQEASLYLVWNRAKELWECLVSGPDVCELDR
EMCFEWFTKGQHDLESDVQQQLFKEKILKLEPYEITMNGFNLFKTFFENVNLCDHRLKRQ
GTQLCVERLDLAGMDFIWRIAMETPDEEIANEAIQLIITYSYTNLNPKMKKDSVSLHKKF
IADCYKRLEAASSALGGPTLTHAVTKATKMLTATAMPTVATSVQSPSRSTKLVIIERLLL
LAERYVITIEDMYSVPRTILPHGASFNGHPVTLHITYESTKDTFTLETHSNETVGSIRWK
ISEHLSCPVDNVQIFANDSVLTMNRDHKLLSQLGFSDEQSLTVKSSGTGTPSGSSESSAS
ASSSSSSAVFNSAYALEQEKSLPGVVMALVCNVFEMLYQLANLDESRITLRVRKLLLLIP
TDPEVQDALDNFVPKESSVWSHQKTLFTLGQGTGSRSPSMSSKQQHQPSAASILESLFRS
SAPGMSTFRVLYNLEVLSSKLMPTSDDEMAKTSSKSFCENFLKAGGLSLVVNVMQRDSIP
SEVDYETRQGVYSICLQLARFLLVGQSMPAVLDDDVIRDGDALSSRPFRNAGRTGRQLSL
CGTPEKSSYRQMSLSERSSIRVEEIIPAARVAIQTMEVGDFTSTVACFMRLTWAAAAGRL
DLVGSPQPIRETHGSLLPQGVRTRVSSTGSNCSSSSEGETTPTALHAGICVRQQSVSIKD
AIIAREALSLLVTCLQLRCQQLCKTPAGVQTHYKITVSLLLCIAIRYSVLCPTGSKSCVW
FLSPASFYNLPSVNDFIIDILLGSPSGEIRRVACDQLYTLSQTDTSAFPEVQKPNLFLLS
VILTAQLPLWSPTSVMRGVNQRLLSQCTEYFDLRCQLLDDLTTSEMEVLKVSAATMLEDE
ISWLDNFEPSWSSEMETSEADNILLAGHLRLIKTLLSLCGNEKEHLGPSLIQQLLDDFLF
RASRIIINSSNPTTSPAPSHDFHPKCSTASSRLAAYEVLVMLADSSLSNLQLITKELLSM
HHQSDPSLCKEFDYLPPVESRSVSGFVGLKNGGATCYMNAVFQQLYMQPGLPEAFLSIED
DTDQPEESVFYQVQSLFGHLMESKLQYYVPENFWKIFKMWNKELYVREQQDAYEFFTSLV
DQLDEHLKKMGREQIFKNTFQGIFSDQKICKDCPHRYEREETFMALNLGVTSCQSLEISL
DQFVRGEVLEGSNAYYCEKCKEKRTTVKRTCIKSLPSVLCIHLMRFGFDWESGRSIKYDE
QIRFPWVLNMEPYTVSGMARQDCSGEGGEGRGDGTSGGSPRKKVTISENYELVGVVVHSG
QAHAGHYYSFIKDRRGGARGRWYKFNDNVVEEFDMNDETLEYECFGGEYRPKVYDQSNPY
PDVRRRYWNAYMLFYQKISDQNSPVLPKKSRVSIMRQEAEDLSLSAPSSPDVSPQSSPRP
PRANNDRLTLLTRLVRKGEKKGLFVEKMPVSIYQIVRDENLKFMRNRDVYNSDYFNFTLS
LASVNATKLKHPGYQPMAKESLQLAVHFLFHTYLHTKKKLRVDTEEWMATVEVLLSKSSE
ACQWMVQYLVGPEGREIVKVCLLECSVREVRVVVASILEKTLESALQFGDPGLDSLTDAL
LSLLDKDVPENVKNCAQYFNLFSNFAQRGCGPCQLLLKHSAYRRMLIFLLGPNRQNNQNR
RWSPAQAREFLHLHSTLALITLHSDLSPQRTQAPGGLKLRLSSVLSSTPLLPLHADILAS
LFTPEGQPYLLEVMFAMRELSGPLSLLIEMVTYSSFCNEPFSLGVLQLLKTQLETAPPHE
LKNIFLMLQELLVVEDPLQSQRLKYAFESEKGLLALMQQSNNVDSRRCYQCVKFLVTLAQ
KCPQAKDYFKDLSGTWSWAVQWLQKKMTEHYWTPQSNVSNETSTNKTFQRTISAQDTLAY
ATALLNEKEQSGSSNGSDGSPANENADRSLRQGSESPMMLGDSKSDLEDVDP
Identical sequences I3J3T4 ENSONIP00000003524
