SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for 31234.CRE21846 from STRING v9.0.5

Domain architecture


Domain assignment details

Strong hits

Sequence:  31234.CRE21846
Domain Number 1 Region: 1141-1569
Classification Level  Classification         E-value
Superfamily           DNA/RNA polymerases    5.23e-30
Family                Reverse transcriptase  0.0038
 
Domain Number 2 Region: 1903-2070
Classification Level  Classification                          E-value
Superfamily           Ribonuclease H-like                     2.47e-21
Family                Retroviral integrase, catalytic domain  0.042
 
Domain Number 3 Region: 868-918
Classification Level  Classification                    E-value
Superfamily           Acid proteases                    0.0000788
Family                Retroviral protease (retropepsin) 0.036
 
Weak hits

Sequence:  31234.CRE21846
Domain Number - Region: 826-859
Classification Level  Classification                       E-value
Superfamily           Retrovirus zinc finger-like domains  0.0942
Family                Retrovirus zinc finger-like domains  0.0072
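
The regions listed above can be put into sequence order and measured with a few lines of Python. This is only a minimal sketch: it assumes the region coordinates are 1-based and inclusive, and the names `Domain`, `DOMAINS`, `length`, `in_sequence_order` and `slice_region` are hypothetical helpers introduced for illustration, not part of any SUPERFAMILY tool.

```python
from typing import List, NamedTuple


class Domain(NamedTuple):
    """One superfamily-level assignment copied from the report above."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float
    strong: bool    # True if listed under "Strong hits"


# Values transcribed from the assignment details; nothing is computed here.
DOMAINS = [
    Domain("DNA/RNA polymerases", 1141, 1569, 5.23e-30, True),
    Domain("Ribonuclease H-like", 1903, 2070, 2.47e-21, True),
    Domain("Acid proteases", 868, 918, 7.88e-05, True),
    Domain("Retrovirus zinc finger-like domains", 826, 859, 0.0942, False),
]

SEQUENCE_LENGTH = 2643  # "Sequence length" from the report


def length(domain: Domain) -> int:
    """Number of residues in an inclusive region."""
    return domain.end - domain.start + 1


def in_sequence_order(domains: List[Domain]) -> List[Domain]:
    """Sort assignments from N-terminus to C-terminus."""
    return sorted(domains, key=lambda d: d.start)


def slice_region(sequence: str, domain: Domain) -> str:
    """Extract the residues of a region from a 0-indexed Python string."""
    return sequence[domain.start - 1:domain.end]


if __name__ == "__main__":
    for d in in_sequence_order(DOMAINS):
        label = "strong" if d.strong else "weak"
        print(f"{d.start:>4}-{d.end:<4} {length(d):>4} aa  {label:<6} {d.superfamily}")
    assigned = sum(length(d) for d in DOMAINS)  # regions do not overlap
    print(f"{assigned} of {SEQUENCE_LENGTH} residues fall in an assigned region")
```

Sorted this way, the assignments read zinc finger-like, acid protease, DNA/RNA polymerase, then Ribonuclease H-like along the protein. `slice_region` can be applied to the protein sequence given further down the page to pull out the residues of any one region.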
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) 31234.CRE21846
Sequence length 2643
Comment (Caenorhabditis remanei)
Sequence
MSRTGPPAWIDPSSQPSEDGTPAPDRPRPLSTNFKKAQRHYRNAITRAATAGTKAARESR
LIVEELRVNRDEQGLETIDHLRKEMIVAAAMIETDAHFHDVVNNFQAILSEKERRDLCNG
VTAYLNAKARPVDLPPIKESILELQQVLTDHQFQFTEAVSPAETADISDSIARSPLEDEE
LLERDLLDSSISDADIESVILTPRNEALEVPHREQFHSASSSVRPPANDREAPASDQVDG
GERTGIFPAVPSNIRSNVEHRGTHIIGSRLSRTAPVTAFPETMVSCPVCHEDHDILECNS
PIRPAYCAKNELCVHCTSARHLTHQCPLQIAQASPISRTPKLEAQENGGRFMSAARPIGA
STPLDRADSGFPSPPETEARENNRNISGTVSPPRVRKTTLVEEESDSECREPEKSRKMSS
KSKDLSYYDVDTILTKFSADPMQYKRFMTMFEKQVMQNSRLTDDLRLALLEKKLVGEAKC
YFINVGDARKAVEASLKALRAAFQDDSTGANEALARFQKLTFHETNYKQATRELLECNTL
IMTLEDLGQDVVSPGFVRSLAQKLPRSAFKLVRELYANNAQPSTNDVIELYTDYLKAEAF
YDKFCPATASERTKEIPDEAVLAAEGTFSNPTQSSKVSNQAANTRVTAPVKSNSNTANVN
SNSAQSSKKNDKKKKSKTASQSSGPTQGNRVNQGYSAGAGMGYFGSQATVQQNNQTHQSH
GNAPHTGVPAQNPQGSSSYGIQNGGKQASTPAQKEKKASIPISKGQPGETLEPCYKYGRG
YDERFIAHTFPRDSPTGTKCCFICGPGHSILQCALSSYDVRQFFRQSNSCHNCAQRNHRT
EECASYSTCAYCQESGQISKLPFAALRTTDGHRVLALVDSGASLSVLSHESAERFGLAIL
ATKTLTISGYSRTTTEESNIYQMSFSTDGDPYSMLIAGAPRLPKTRFISPLLSSEDLGFL
RDNKVNTNVISSDQKFNGQFIDMILGNDLLARLLGTSRRLLLPSERFVELTPFAPIVFPP
PRSSLPPPESVRTLIEAFISEGFITALTTPPDSKDPVDRLHSEISQLWNLDNLGIEEPGP
IEGKKTELQDLIAWFEQNVRFDDEGNLLVSLPWNGKQLRLASNRGVAVKRLEQLVISLKK
KNNLLQDYDEIIRKQLDSGIIELVTPEMDDNTDPRYYIPHRVVEKLTSLTTKLRIVLDAS
SKKGGELSLNDCLEAGPSMLVDLFDILIRSRMPDYLVVADIEKAFHQVRMVPEDRDCTRF
LWLKDIAKPPVRSNIAEYRFTRIPFGMTSSPFLLAATINHFLRDMKNPIAERIRENIYVD
NVMLTTNNREEIQSLRIDSREAFNQMNMRLREYITNCPGEMEKFPKDEISSETTIKLLGY
LWDTVNDTYTIKLAQLLETHPTKRQVASRMAETFDPLGNLAPLFVSFKLLMRDLWVDGID
WKHRIPKSLLPRWDAIRKQFSELSITIPRLLRPRGGYKNVQLLVFSDASKDTYACAVYIM
YEYDDREPEIGLLTAKSKIKPSSSKTLTIPRLELLAIEIGTRIAMSVVKAMTSEHPCSVR
FFSDAMVALYWVLRNEQKKCWVSNRVKGIHEVCDSLKSLEIPNTFHHCPTDLNPADIATR
GMGSEELKNCTLWFHGPGFLKEDPSKWPCRLEGNITCPSDFRELISSEIIATKKNTDTTD
STEQSVGQSEFDALTDALKGMCMVTQRKDQYVSFVPYERSNSLSRVVSYTHSTLNCLLKL
FKRHEWKSPIMQEFVKSKSIPDTSLMGLKGRAIARRLVFIEHYKEAASQGQEFPSKLLPV
EGSDGIVRTHRRVPSPVLASDAYKRILVHKKHRLARLVVEETHLKNVHLPATYLVTALRT
RYWILTDKQLADSVCRSCVPCQKVNNKPFAYPFARRIPRFRTTPSVPFQHVGLDYMGPLS
YRLDDGISLGKAYVLVYTCLVTRATHLELIPDGTAETYVQGLKNVFSRRGIPHSVYSDNA
RTFTLGSKIISDDLKRYVPSTSFTNFLATFDINFHYITPLAPWQGGIYERVVGIVKHQMR
KEIGKTTKSFFSLNHVIVRVESMINSRPLTPNPRDINDLPALRPMDFILPTVLIDLPSER
DGLKSNEQFDPTRNSSVTERRTLDHLAGLDEVIERLWDIWSSAYLAYLRENAHPEKRTSL
LKPRVGQLVLIYTDKSPRHNWPLGVIDSLKYSKDGSVRTATVRCRDKFYERAVNQLIPLE
VNPVDDPPASEQVESDQQFLVPPDSPNIATFPVYSQNPNTPTSNNKGIRNKDQKVHRVGE
ELINAGPKVLKSPDLDACRSGAPKDRNFTTADKSGHTSDKDATALESSRRGVGLKSCCFD
PTIDSTDNGSDVVASRSGATRGGVECATPVPDVCRSDTSGKVVNRTAPNLDAGCSGTSTK
EFNQFAPNLGIRQSSSTRRGVGLKSCCFDPTIDSSDTGSDMAASCSDAARRRMKCTIPAP
DVCRSGTSGKGFNRVVPNLDASCSGASTKESSQFAPNLSSRRAKTHRRRIGLKSYCFDPT
IDSSGTSSEVVASRSGTTTEDTKQSDPTLGASRSGASKCKDDQKSSGSEYSQNVKLKRPR
TIDDWAKIRLPIHRVRPYQPRKAKAKLARYVLITQAAEPQTPRSVDSCQVPDQASVPLQT
KMH
Identical sequences: E3MU87 XP_003100266.1.11157 CRE21846 31234.CRE21846
