SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for 3702.AT5G43800.1-P from STRING v9.0.5

Domain architecture


Domain assignment details

Strong hits

Sequence:  3702.AT5G43800.1-P
Domain Number 1 Region: 799-969
Classification Level    Classification                            E-value
Superfamily             Ribonuclease H-like                       7.33e-44
Family                  Retroviral integrase, catalytic domain    0.0034
 
Domain Number 2 Region: 1162-1385,1415-1596
Classification Level    Classification           E-value
Superfamily             DNA/RNA polymerases      6.67e-21
Family                  Reverse transcriptase    0.028
 
Domain Number 3 Region: 276-324
Classification Level    Classification                         E-value
Superfamily             Retrovirus zinc finger-like domains    0.000000942
Family                  Retrovirus zinc finger-like domains    0.0041
 
Weak hits

Sequence:  3702.AT5G43800.1-P
Domain Number - Region: 536-568
Classification Level    Classification                         E-value
Superfamily             Retrovirus zinc finger-like domains    0.000129
Family                  Retrovirus zinc finger-like domains    0.0048
 
Domain Number - Region: 2315-2431
Classification Level    Classification                               E-value
Superfamily             Nitrous oxide reductase, N-terminal domain   0.0706
Family                  Nitrous oxide reductase, N-terminal domain   0.048
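
The region strings in the assignments above (for example `1162-1385,1415-1596`) can list more than one segment for a single domain. The following is a minimal sketch of how such a string might be parsed and how many residues it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the helper names `parse_region` and `covered_residues` are hypothetical and are not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '1162-1385,1415-1596' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(segments: list[tuple[int, int]]) -> int:
    """Count residues covered by the segments.

    Assumption: coordinates are 1-based and inclusive, so a segment
    start-end spans (end - start + 1) residues.
    """
    return sum(end - start + 1 for start, end in segments)


# Region strings copied from the strong hits listed above.
strong_hits = {
    "Ribonuclease H-like": "799-969",
    "DNA/RNA polymerases": "1162-1385,1415-1596",
    "Retrovirus zinc finger-like domains": "276-324",
}

for name, region in strong_hits.items():
    segments = parse_region(region)
    print(f"{name}: {segments} covers {covered_residues(segments)} residues")
```

Under the inclusive-coordinate assumption, the split DNA/RNA polymerases assignment covers the sum of its two segments and excludes the residues between them (1386-1414).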
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): 3702.AT5G43800.1-P
Sequence length: 2484
Comment: (Arabidopsis thaliana)
Sequence
MDYPKEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGEDV
LKTEDQWTDAEEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSV
KRSRIDMLASQFENLTMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSR
FESKRTAMGTSLDTDTIDFEEVVGMLQAYELEITSGKGGYSKGVALAVSSEKNEIQELKD
SMSMMAKNFSRAMKRVEKRGFARNQGSDRDRDRDRDRNSKRSEIQCHECQGYGHIKAECP
SLKRKDLKCSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDSEEDVKGFVSFVGII
EDDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWLVLSKEKVIWLEEKVK
VQEQIEQLKGELAVANQIKSEMILKYSAKEEKNRELSQDLSDTRKKIHMLNKGTKDLDSI
LAAGRVGKSNFGLGYHGGGSSTKTNFVRSKAAAPTQSQSVFRSKSNSVPARRKYQNQNHY
HSQRTVTGYECYYCGRHGHIQRYCYRYAARLSKLKRQGKLYPHQGRNSKMYVRREDLYCH
VAYTSIAEGVKKPWYFDSGASRHMTGSQANLNNYSSVKESNVMFGGGAKGRIKGKGDLTE
TEKPHLTNVYFVEGLTANLISVSQLCDEGLTVSFNKVKCWATNERNQNTLTGVRTGNNCY
MWEEPKICLRAEKEDPVLWHQRLGHMNARSMSKLVNKEMVRGVPELKHIEKIVCGACNQG
KQIRVQHKRVEGIQTTQVLDLIHMDLMGPMQTESIAGKRYVFVLVDDFSRYAWVRFIREK
SETANSFKILALQLKNEKKMGIKQIRSDRGGEFMNEAFNSFCESQGIFHQYSAPRTPQSN
GVVERKNRTLQEMARAMIHGNGVPEKFWAEAISTACYVINRVYVRLGSDKTPYEIWKGKK
PNLSYFRVFGCVCYIMNDKDQLGKFDSRSEEGFFLGYATNSLAYRVWNKQRGKIEESMNV
VFDDGSMPELQIIVRNRNEPQTSISNNHGEERNDNQFDNGDINKSGEESDEEVPPAQVHR
DHASKDIIGDPSGERVTRGVKQDYRQLAGIKQKHRVMASFACFEEIMFSCFVSIVEPKNV
KEALEDHFWILAMEEELEEFSRHQVWDLVPRPPQVNVIGTKWIFKNKFDEVGNITRNKAR
LVAQGYTQVEGLDFDETFAPVARLECIRFLLGTACGMGFKLHQMDVKCAFLNGIIEEEVY
VEQPKGFENLEFPEYVYKLKKALYGLKQAPRAWYERLTTFLIVQGYTRGSVDKTLFVKND
VHGIIIIQIYVDDIVFGGTSDKLVKTFVKTMTTEFRMSMVGELKYFLGLQINQTDEGITI
SQSTYAQNLVKRFGMCSSKPAPTPMSTTTKLFKDEKGVKVDEKLYRGMIGSLLYLTATRP
DLCLSVGLCARYQSNPKASHLLAVKRIIKYVSGTINYGLNYTRDTSLVLVGYCDADWGGN
LDDRRSTTGGVFFLGSNLISWHSKKQNCVSLSSTQSEYIALGSCCTQLLWMRQMGLDYGM
TFPDPLLVKCDNESAIAISKNPVQHSVTKHIAIRHHFVRELVEEKQITVEHVPTEIQLVD
IFTKPLDLNTFVNLQKSLGIGEVLSSVFVLKQGWLYSEQVFISILHSVINGFQIFCCKKF
TKRKSYSSWFGDAKSVKYVTDTVCRALRKGNAERGRKEAFHFIRSMSLSKLRGTKEKSCH
PDGNKGEILFTYTLERRDVKQKHENKERASIHPATREKYKKKKKEGVQKQIFSLTSRLSA
ERSSKKQDLNTKDQTYISIAMLLHSLSHFLFTRTCHSEIRVKTTLGESTFCLCCVSRLCP
EFFESSYHKLSTVQSPLGQFWTSVARDGYHFADTNGDLFYVIQDGLRKISAPEVIKFDFV
IDLGLLLRGVNFGLMCFIFPDHKTGHFLVLSCLICINQNRRKKLKGKSKILLSPDSSDGK
NSRLPASDKSSGHSSCGENTQRSPACCSICSGYNSNRGKNTTRRTAYSSDDNLRLRQQLR
RRGARSQSCSFSCRRPVKASRGSASSVSRTRLACSGLRSASEDLDGSSNRSRTRPPPKRL
FISRASALPVGSMHPSTSSGERCGGSIHLDTSLATDDSISHKPRGEVQGFQASSLAKVCA
AWSSRDAHDESDRGSLARHCSRDRSVCERSDLGVLCFPYFQGNEVQSLCRQNVVLSSDDQ
DVQISGYVACQGIRESDYVQTCSCVPIWESRHLEGSSHSGHAYSSRWSIFDLLHQLDSFE
ELECHCSCDAVSSEADSFFWKASLSSRKSCSLSWTNSEHPAISKSHLQSVDVSASDNTGK
LEAWSSPKAVSRRSWWDTSKASRRVFVWHVLTWRLKACLYLANNNSNTHRYDFQSFTCLL
SLLVCIEFLINYVIQAGGDYGLVSSSSHSSQGVNADVEAQGLEPEADDDYVPPVDEGWSP
ENSDSFVNSLDVDNEEGYEDTQSS
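
The sequence above is wrapped at 60 residues per line, so a domain region has to be mapped onto the joined string before it can be extracted. The sketch below shows one way to do that, using only the first two lines of the sequence as sample data. It assumes region coordinates are 1-based and inclusive, which this page does not state; `join_wrapped` and `slice_region` are hypothetical helper names.

```python
def join_wrapped(lines: list[str]) -> str:
    """Concatenate wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of the sequence.

    Assumption: coordinates are 1-based and inclusive, so they are shifted
    to Python's 0-based, end-exclusive slicing.
    """
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# First two wrapped lines of the protein sequence shown above (sample data only).
wrapped = [
    "MDYPKEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGEDV",
    "LKTEDQWTDAEEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSV",
]

sequence = join_wrapped(wrapped)
print(len(sequence))                   # number of residues in the sample
print(slice_region(sequence, 56, 65))  # a region spanning the line break
```

With the full 2484-residue sequence joined the same way, a call such as `slice_region(sequence, 799, 969)` would return the residues assigned to the first strong hit.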
Identical sequences 3702.AT5G43800.1-P
