SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for 31234.CRE06509 from STRING v9.0.5

Domain architecture


Domain assignment details

Strong hits

Sequence:  31234.CRE06509
Domain Number 1 Region: 749-1180
Classification Level  Classification         E-value
Superfamily           DNA/RNA polymerases    5.78e-33
Family                Reverse transcriptase  0.013
 
Domain Number 2 Region: 1480-1652
Classification Level  Classification                         E-value
Superfamily           Ribonuclease H-like                    3.29e-20
Family                Retroviral integrase, catalytic domain 0.018
 
Weak hits

Sequence:  31234.CRE06509
Domain Number - Region: 500-536
Classification Level  Classification                    E-value
Superfamily           Acid proteases                    0.0027
Family                Retroviral protease (retropepsin) 0.047
 
Domain Number - Region: 2819-2847
Classification Level  Classification                       E-value
Superfamily           Retrovirus zinc finger-like domains  0.0366
Family                Retrovirus zinc finger-like domains  0.0063
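
The regions listed above can be turned into domain lengths and a coverage figure with a few lines of Python. This is only a minimal sketch: the `DomainHit` class and the `covered_residues` helper are hypothetical names introduced for illustration, and it assumes that region coordinates are 1-based and inclusive, which this page does not state. The coordinates, E-values and the sequence length of 2899 are copied from this record.

```python
from dataclasses import dataclass

# Value of the "Sequence length" field of this record.
SEQUENCE_LENGTH = 2899


@dataclass(frozen=True)
class DomainHit:
    """One superfamily-level hit (hypothetical container, not a SUPERFAMILY API)."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float   # superfamily-level E-value
    strong: bool    # True if listed under "Strong hits"

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


HITS = [
    DomainHit("DNA/RNA polymerases", 749, 1180, 5.78e-33, True),
    DomainHit("Ribonuclease H-like", 1480, 1652, 3.29e-20, True),
    DomainHit("Acid proteases", 500, 536, 0.0027, False),
    DomainHit("Retrovirus zinc finger-like domains", 2819, 2847, 0.0366, False),
]


def covered_residues(hits):
    """Count distinct residue positions covered by the given hits."""
    positions = set()
    for hit in hits:
        positions.update(range(hit.start, hit.end + 1))
    return len(positions)


if __name__ == "__main__":
    # List hits in sequence order with their lengths.
    for hit in sorted(HITS, key=lambda h: h.start):
        label = "strong" if hit.strong else "weak"
        print(f"{hit.start}-{hit.end}\t{hit.length} aa\t{label}\t{hit.superfamily}")

    strong_hits = [h for h in HITS if h.strong]
    fraction = covered_residues(strong_hits) / SEQUENCE_LENGTH
    print(f"Strong hits cover {fraction:.1%} of the sequence")
```

Using a set of positions rather than summing lengths keeps the count correct if two regions ever overlap, although none of the four regions in this record do.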
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Protein sequence

External link(s) 31234.CRE06509
Sequence length 2899
Comment (Caenorhabditis remanei)
Sequence
MSLQLYFLSTTSQSTALSTFVKKYQNELSKLKNATLPIPKQDVGQIVEVIDLLDSKIQTL
EATTIKLSEQIEKIGDEEDANVKNYEEKLPLLIQLNQDAINLRDSYHAVLKRIRSENVEP
VDNKQNIKEFSQRRPSMEMERQVSNLPPVKLPVFSGKRWEFQKFWSLYEEIIHKAEISNI
LKFTHLLNHLQGGAKELLDQFQITPENYDIAVKLLKNKYADTETTILELNEKVRKDCAKD
SSTREQRLLFERLMVAIKQLERLQEPVDNRMMKELIMEKFNDKIRRATFKKKIASTEDWT
ISKMFTDIEENITLEEDLELLMKGKNEPKEKADNPKKDRNQSNKSDRSEKQKKTRLCLFC
KDSEHHSSKCTKFVSIKDRKDFLNREGRCLNCHSNAHKTAECYSTRPCYFCKGRHSSVLC
TNRGESSSPSSANSSRDSKHNQQAKTKVKTATTNVTHTEEVEEAVQCETQTTERAPTTTS
KAFVPTIQAKARNKVSGEWTTISMMIDTGADRSFIKESVAKTLNLETPDGPTLRLKTFGS
PTAKGPEKYKEASIVVFTEKESAEIDVLLRSCVVGNIPKAEMSREDIRFIHRNKIDVNTD
AFEDEVDPDVLIGMDQMSTIWIGDTITLPSGILLLNTKFGYTTMGRKKTSRDRSKVNSVM
IINSIQEDEPFEYLQKQDVLKCNGDEFAGSSADERKEKDKQILQFFRDTVQKRIEGYFVK
LPLKTNKIATLPDNYRLTLKRLIGIVKTTPLEVKKMIQEIFEDQVKKNILEIVTAQTPKG
EWTHYSPIQPVLTPHKATTKCRVVVDASAHYKGNDSLNDAIEQGPTLLPDILDILIRFRS
GETVILADVEKAFLQVRLNEEDRDLTRILWIKDINLPATPDNVEVYRFTRVLFGLNASPF
LLAATIMLHLENHANSKLASKINENLYVDNLIFTFDGSAREALELYKEFKAIFADANMNL
REFMGNSDEFNDNIPEVDKAQKSDLKVLGIPYDPDKDTIQLECFVSTDEKYSRRTVSRKI
GSDFDPQGLMTPLMLTSKLFQRVLWQDEFAYKWDTPLNKNHESQWKELLDQTEGFIKELP
RNILDQSNKNKIICFCDASQNATAYCFYAHNKYGMNIFLGKSKVKSLKEKWTIPKLELHA
LTMGTQRMLSVVQCLQKGDIGVSEAIILTDSEIALSWIKSTPGKKEVGVLITNRLESIRL
ASQEIAETGVKVRFGHIRSEDNPADLGTRGITKDEFQNSFWWTGPSFCQKDPSVWDTYQT
FEIKESEEDNARINICNSIDDNADTAEIFDSISASSLLRKRRVIAYTMRAIAKFANPLSQ
DVKERLRTTIAELKEVPEGNPPISASEQSAAEIRIIREHQAQISLRKKHSWNELNLELDD
NKIIICRGRLKHMDNAKMARFPILIEPKTQLAKLIIREAHGKWHCNEQQTMTEVRKKFWI
PNLRQQVKSLLSKCVACQRYNKPPFKYPDMVDLPEHRVKETAPFQHTGLDYFGPISYRKE
DNTVASCWGCLFICATTRLVHIQLIVRPDTSCFLKAFQRFVSLCGKPNAIVSDNAPHFIL
ADKILQDIAETTTKNCNFNNEVKKFLGDAKIEWKFITPYAPWQGGMYERMMRSIKQSMFK
GIGRSILSLDDIHTTFTEVAAALNSRPLTYVGQNLDSGFVLRPIDFVYPNIQVNYPMDST
LEMNEDYAPPGEISLAKDEAIAAIKSVAKVVETVWQTWKTTYLAELRSTHKLRMNNKRGK
SETPAVGQVILITDPDLPRNYWKLGEIVKADPSSDGVLREVHLRTSKGNIIKRPINLVVP
LELDGEDTQRKNETNGVSLPEEQVPDAPEIQTKDMVKRYNLRKQKRVNYNEDQHEDRFQV
ASAISTLVNFPWSKFMIMVILSIIIGPTMAISPLECTPTGIRVNVEYESFEMCVQNYCTS
RPRMTWNSNYADVWIPPALKITDHHATAKILMANAVTVYELNCHAVSTCDSIDCVICTTN
VLNPECHPYMALGGFAVILYIIAMIVYCIFKVKISMGAPLIMIYKLLQMIISKCRGFIPR
KSSRRGKINWEIMVTILMFSSMIHSSNACQEVNLLSQGEKICTKEEPKTCKLVTQEHLTI
GSFNKEACLRIEQNGMTTKEIRIRFLEIRMECLKNTITFTKDAQIHVWSAKRCAHTGSCV
ADKCLNITQNSMVPELGEANNHIGNTYCTESCGGLGCSCFFPSPGCLFYRIYGKEKNNGT
LEIFQCAEWRERLVIEMSVTRLNGDRHQKTETMTLPVSFPGSLQDISVTAAWINKPVSPI
LENWLIRNDKSQVALWQPYRLPSIQCKRKEGDETCQLNERCTCEPAEDSMVCTCEEDDLE
TQFNTIQKRLPLREGHWTLKAENESVVATINDEVTIGLVLTLEDNVTTSILISSDKCYIK
AKQIQGCYNCASGGQAEIKCTSSMKEVIGNIVCDKDMFTVPCSPNGKSTNITFFAQYAGF
RKVCSINCGGRYTEYFKITGTLKFTGSMWTSIYRIIEGKTTLMNEIAWPDLSHLAQWYLQ
FMKSMMAIIITVAVIVATTGTLVISVFLIGFKKTAKWTLFFFAIPLILSMDIQEILRAAS
TIKKHAKILEKIENEKAADKGKEIEIHLAPKWGIDPTKSTLDEMEQFIAQLKEEVAEFQK
DLESAKEEEKLAHQKYVTHLDTSKMKKIENLTVKRAEELNKEADELEKQVNMTNAVIGDI
EAMIGFKNDVLKLVEKWTRNATFELHRTGKKPDESHAQFLARTQGGEVPQVEKDPRTEKA
IKTTQRQEKDLKDRTADPMEHKHTLQSVVSKPTKRPAPSREIKTNEIKRQRKIRRITSFG
EDKPNMKCSFCGGGHFSNQCPQHPSIADRKEIVKRDRLCEHCLLVKTKEPCGCKERTCYY
CETTNHHSALCSLPQTIID
Identical sequences E3M183
XP_003109989.1.11157 CRE06509 31234.CRE06509
