SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for LOC_Os04g48094.1|13104.m04892|protein from Oryza sativa ssp. japonica 5.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  LOC_Os04g48094.1|13104.m04892|protein
Domain Number 1 Region: 724-1153
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.89e-175
Family Reverse transcriptase 0.0000137
Further Details:      
 
Domain Number 2 Region: 2180-2518,2617-2693
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 6.14e-116
Family Reverse transcriptase 0.0021
Further Details:      
 
Domain Number 3 Region: 1325-1484
Classification Level Classification E-value
Superfamily Ribonuclease H-like 9.74e-45
Family Retroviral integrase, catalytic domain 0.011
Further Details:      
 
Domain Number 4 Region: 3033-3188
Classification Level Classification E-value
Superfamily Ribonuclease H-like 2.15e-40
Family Retroviral integrase, catalytic domain 0.007
Further Details:      
 
Domain Number 5 Region: 2742-2849
Classification Level Classification E-value
Superfamily Ribonuclease H-like 1.18e-17
Family Ribonuclease H 0.009
Further Details:      
 
Domain Number 6 Region: 566-671
Classification Level Classification E-value
Superfamily Acid proteases 0.00000000000000229
Family Retroviral protease (retropepsin) 0.029
Further Details:      
 
Domain Number 7 Region: 510-551
Classification Level Classification E-value
Superfamily Retrovirus zinc finger-like domains 0.00000000174
Family Retrovirus zinc finger-like domains 0.004
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) LOC_Os04g48094.1|13104.m04892|protein
Sequence length 3320
Comment retrotransposon protein, putative, Ty3-gypsy subclass
Sequence
MGSKMAGHPPNHRGEGMIDFVAELARMTLVVGYPNAPEYTTIPQLSGELPHRVRLEVHGY
VGTCLANMVVEASGGTADHACQEAAYLMMARLRERHNYIFHDTAYRFHPRRANGDDVSSF
RPTAGENDTTFGHMCAVMRGLDRMHSDLHKATKALNDGKLMRIVALKDEIARLKRENAQL
KGLPAPGGVRIPNKARKTTTATGCLFGMVYTRNGSRATGEGSNGEERADGVHPSSDSGNG
PPPLPENPTLAQVMAHQTQMMAAMMQQMQQQHQQMHQRMMQHVEQQHQQFGPPPPQSKLP
EFLREKVAFATHQLQGPASAWWDNHMATRPPGTEVTWAEFCRSFRKAQVPDGVVAQKKRE
FRALHQGNRTVTEYLHEFNRLARYAPEDVRTDAEKQEKFMAGLDDELTNQLISGDYADFE
RLVDKAIRQEDQRNKMDRKRKAAQFRAPQGSHQRPRFAPGQQGGPTTMIVRQHRPFNPSN
FPQGASGSQNHHGGQSNRGAASRPPMAPAQSGQPAQAKKEIGAKPGSCFNCGELGHFADK
CPKPRSAGPRFIQARVNHASAEEAQAAPEVVLGTFPVNSIPATVLFDSGATHSFISKKFV
GMHGFIREELSTPMRVHTPGNSSTSVQFSPSITIEIQRSPFLANLILLESKDLDVILGMD
WLTKFKGVIDCANRTVTLTNEKGETVVYKSPDSPKQGVSLNQIEAEIPVDTVERNLRKLE
DIPIVSEYPEVFPEDLTTMPPKREIEFRIDLAPGTAPIYKRPYRMAANELAEVKKQVDEQ
LQKGYIRPSTSPWGAPVIFVEKKDKTKRMCVDYRALNEVTIKNKYPLTRIDDLFDQLKGA
KVFSKIDLRSGYHQLRIREEDIPKTAFTTRYGLYECTVMSFGLTNAPAFFMNLMNKVFME
FLDKFVEVFIDDILIYSKSEEEHEQHLRLVLEKLREHQLYAKFSKCDFWLTEVKFLGHVI
TAQGVAVDPSNVESVTKWTPPKTVSQIRSFLGLAGYYRRFIENFSRIARPMTQLLKKDEK
FKWTAECDKSFEELKKKLVSAPVLILPDPMKDFQVYCDASRHGLGCVLMQEGRVVAYASR
QLRPHEGNYPTHDLELATVVHALKIWRHYLIGNRCEVYTDHKSLKYIFTQPDLNLRQRRW
LELIKDYDMSIHYHPGKANVVADALSRKSYCTALCIEGMCEELRQEFEHLNVGIVELGFV
AALEARPTLVDQVRAAQANDSEIAELKKNMRVGKARDFHEDEHGTIWLGERLCVPDDKEL
KDLILTEAHQTQYSIHPGSTKMYQDLKERFWWVSMRREIAEFVALCDVCQRVKAEHQRPA
GLLQPLQIPEWKWEEIGMDFITGLPRTSSGHDSIWVVVDRLTKVAHFIPVHTTYTGKRLA
ELYLSRIMCLHGVPKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQ
ILEDMLRACALDFGGAWDKSLPYAEFSYNNSYQASLQMAPFEALYGRKCRTPLFWDQTGK
RQLFGTEVLAEAEEKVRIIRERLRIAQSRQKSYADNRRKELTFEAGDYVYLRVTPLRGVH
RFQTKGKLAPRFVGPYKILERRGEVAYQLELPSNMIGIRDVFHVSQLKKCLRVPEEQADS
EHIDIQGDLTYVEKPWVVLSLALLFPTSEFEFGSDEGVTTIIEKYDGSVNPAEFLQIYTT
GIEAAGGDDRVMANFFPMALKGQARGWLMNLPPASVHSWEDLCQQFTTNFQGIYPRPDEE
ADLHAVQRRDDESLRSYIQRFCQVRNTIPCIPAHAVIYAFRGGVRHNRMLEKIASKEPQT
TAQLFELADRVARKEEAWTWNSSGSGVAAPAAPGSAARSGRRDRRRKKGSARSDDESHVL
AVEGASRAPRKGRPASNKKKEAGAPSRERPTGKWCTVHNTSLHDLADCRAVKSLAERTRK
WEEEKRQERREGKTPAAPAGNRRGEAKQKATAEDIDDGDDDLGFQEPEATVATVDGGACA
HASRRSLKAMKRELLAAAPTHEATWRARWSEVALSFDQTDHPPCVARGGQIAMPLGHIDL
PVTFGGSANFCTERVDFDVADLSLPYNAVLGRPALVKFMAAVHYAYLQMKMPGPGGPITV
HGDLKVTLACMEQRADHLAAAYKPAGGDERLSTSVPAAPRQRMVTCDEVSVKEEDALVSL
LRANTDVFAWRPADMPEVPREVIEHRLAVQLGARPVRQKVRRQAPERQAFIREEVARLLE
AGFIREVIHPEWLANPVVVPKANGKLRMCIDYTDLNKACPKDPYPLPRIDQIVDSTAGCD
LLCFLDAYSSYHQIRMAREDEEKTAFITPIGTYCYTTMPFGLKNAGPTFQRTTRISLGSQ
IGRNVEAYVDDLVVKTRNQETLLSDLAETFKSLRSARIKLNPDKCVFGVPAGKLLGFLVS
ARGIEANPEKIRAIERMHPPSMLRDVQCVTGCMAALSRFISRLGEKALPLFKLLKRSGPF
TWTEEAERAFTQLKAYLSSPPVLVAPEPDEPLLLYLAATPQVVSAALVVERDKDNPHSAH
PHPVSTRPGREQGGEAPEPNGGPRPSTAGAGPPPACPTVPGAPDPQDGPGATAGRLRLSP
SDPEVVGTEAECAPCGLSDEERPGDAAPGEEDRPRRKMQRPVYFVSEALRDAKTRYPQAQ
KMLYAILMASRKLRHYFQAHRVTVVTSYPLGQILHNREGTGRVVRWAIELSEFDLRFEPR
HAIKSQALADFVAEWTPAPEPVSAPKASSGPSQLPHTAYWGAGAGVTLTSPSGDVLRYLV
RLDFRATNNMAEYEGLLAGLRVAAGLGIRRLLVFGDSQLVVNQVCKEYQCSDPQMDAYVR
QVRRMERHFDGIELRHVPRRDNTVADELSRLASSRAQTPPGAFEERLAQPSARPDPLGET
DAPERPPRPVGVQASGPEGSAPSSPRLIAWITEIQAYLADKTLPEDREGSERVRRISKRY
VLVEGTLYRRAANGVLLKCIPREQGVELLADIHEGECGAHSVSRTLVGKAFRQGFYWPTA
LNDAVDLVRRCRACQFHARQTHQPTQALQTIPLSWPFAVWGLDILGPFRRAPGGFEYLYV
AVDKFTKWPEAYPVIKIDKHSALKFIRGITARFGVPNRIITDNGTQFTSELFGDYCEDMG
IKLCFASPAHPRSNGQVERANAEILKGLKTKTFNILKKHGDSWIEELPAVLWANRTTPSR
ATGETPFFLVYGAEAVLPSELTLRSPRVTMYCEADQDQLRRDDLDYLEERRRRAALRAAR
YQQSLRRYHQRHVRARSLCVDDLVLRRVQTRAGLSKLSPMWEGPYRVIGVPQPGSVRLAT
GDGTELPNPWNIEHLRRFYP
Download sequence
Identical sequences LOC_Os04g48094.1|13104.m04892|protein 39947.LOC_Os04g48094.1

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]