SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for 39947.LOC_Os06g18950.1 from STRING v9.0.5

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  39947.LOC_Os06g18950.1
Domain Number 1 Region: 924-1252
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.78e-85
Family Reverse transcriptase 0.001
Further Details:      
 
Domain Number 2 Region: 1545-1699
Classification Level Classification E-value
Superfamily Ribonuclease H-like 3.29e-37
Family Retroviral integrase, catalytic domain 0.0048
Further Details:      
 
Domain Number 3 Region: 1281-1403
Classification Level Classification E-value
Superfamily Ribonuclease H-like 3.54e-24
Family Ribonuclease H 0.0063
Further Details:      
 
Domain Number 4 Region: 790-864
Classification Level Classification E-value
Superfamily Acid proteases 0.00000375
Family Retroviral protease (retropepsin) 0.031
Further Details:      
 
Weak hits

Sequence:  39947.LOC_Os06g18950.1
Domain Number - Region: 542-583
Classification Level Classification E-value
Superfamily Retrovirus capsid dimerization domain-like 0.0311
Family Retrovirus capsid protein C-terminal domain 0.01
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) 39947.LOC_Os06g18950.1
Sequence length 1844
Comment (Oryza sativa)
Sequence
MAEKQPPPPSPSSAKGKPLKLGLTDIIEENVMPINPEKFTPEQKEEFEAMMQQARDQFLN
SFTQTRKGTFVQKYKIKVVADDPRTGSSKDGEGKQAPDGSAQPSIKSATDGDQGDDSQGV
HRIQGDGAQGPQGGNLNQNNEAAQDFFNKFQDRVDYAIHNDLINQSRVLTNTLANMMKSV
ADGSIAEHQAAGPVYLQGSTFPNYRPLITDIHPPTQAVPPIASSAQPTAPASAPVPATPP
SAPGQLINPQLLVREQPQHAGPNVTQLAQDQAVQPVQQTPSRGPALQPIQQTPLRQQAVQ
PIQQQGLLDASAGLATPSGQLMQYAPNQVVPEHLVHHAQPDGTMIPQVVPEHLVRGIQPN
LHNYQGGNLNYQYQPLSPQVQYQQTGSAQPQFAPQHNQFEPIPQQPQESPQQGQSADVIA
DVMREQFGLKPKKTGNLYRQPYPEWFERVPLPNRFKVPDFSKFSGQDTTSTYEHISRFLA
QCGEASVVDALRVRLFRLSLAGSAFTWFSSLPHGSINSWADLEKQFHSYFYSGIHEMKLS
DLTSIKQKHDEPVHEYIQRFREMRNKCYSLSLTDAQLADLAFQGMIAPIREKFSSEDFES
LSHLTQKVALHEQRCAEARKNSRKINHVYPYMYDSDDDENDSEIAAAEWVRSKKVIPCQW
VKNSGKEERQQIQVAIEGGKIKFDDSKRPMKVDGMRLPSIEECPGCNEVAESSSRSYDRG
NRLRQRVPVHQRLGPINHDRGQEDNEDRKTQWCPSADVDEVEEESAKLVLSPEQAVFKKP
EGTENRHLKPLYINGYVNGKPMSKMMVDGGAAVNLMPYATFRKLGRNADDLIKTNVVLKD
FGGNPSETKGVLNVELTVGRFLQLVAWKGLDSRQLLYSLDYALVLNSVARQQDRDCTSRQ
MENPSYYFEGVVEGSNVYTKDTVDDLDDKQGHKILSFMDGNAGYNQIFMAEEDIHKTAFR
CPGAISLFEWVVMTFSLKSAGATYQRAMNYIYHDLIDWLVEVYIDDVVVKSKGIEDHIAD
LRKVFERTRKYGLKMNPTKCAFGVSAGQFLGFLVHERGIEVTRRSINAIKKIKPPGDKTE
LQEMIGKINFVRRFISNLSGRLEPFTPLLRLKPDQQFTWGTEQQKALDNIKDYLSSPPVL
IPPQKGIPFRLYLSAGEKSIGSVLIQELEGKERVVFYLSRRLLDAETRYSPVEKLCLCLY
FLCTRLRHFLLSNECTVICKADVVKYMLSAPILKGRVGKWIFSLTEFDLRYESPKAINGQ
AIADFIVDRRDDSIGSVEIVPWTLFFDGSVCTHGCGIGLVIISPRGACFEFAYTIKPYAT
NNQAEYEVVLKGLQLLKEVEADAIEIMGDSLLVISQLAGEYEYKNDTLIVYNEKCQELMR
EFRLVTLKHVSREQNIEANDLAQGASGYKPMIKDVQIEVAAVTADDWRYEVHQYLQNPSQ
SASRKLRYKALKYTLLDDELYYRTIDGVLLKCLSADQAKVAIGVVHEGICGTHQSAHKMK
WLLRRAGYFWPTMLEDCFRYYKGCQDCQKFRAIQRAPASAMNHIIKPWPFRGWGIDMIGM
INPPSSKGHKFILVATDYFTKWVEAIPLKKVDSGDAIQFVQEHIIYRFGIPQTITTDQGS
IFGSDEFVQFADSRGIKFLNSSPYYAQANGQAEASNKSLIKLIKRKISDYPRQWHTRLAE
ALWSYRMACQGSIQVPPYKLVCGHEAVLPWEVRVGSRRIELQDGLTADEYYNLMADERED
LVQSRLRALAKVTRDKERVARHYNKKVAPKTFSEGDLVWKLILPIGTRDSKFGKWSPNWE
GPFQIHKVVSKGAYMLQGLDGEVCGRALNGKYLKKFYPSVWVNT
Download sequence
Identical sequences LOC_Os06g18950.1|13106.m02022|protein 39947.LOC_Os06g18950.1 LOC_Os06g18950.1|PACid:21928648

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]