SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|521464202|ref|YP_008151279.1| from Sorangium cellulosum So0157-2

Domain assignment details

Strong hits

Sequence:  gi|521464202|ref|YP_008151279.1|
Domain Number 1 Region: 1149-1430
Classification Level Classification E-value
Superfamily Terpenoid cyclases/Protein prenyltransferases 1.88e-57
Family Complement components 0.0023
 
Domain Number 2 Region: 1536-1629
Classification Level Classification E-value
Superfamily Alpha-macroglobulin receptor domain 1.44e-11
Family Alpha-macroglobulin receptor domain 0.0031
 
Weak hits

Sequence:  gi|521464202|ref|YP_008151279.1|
Domain Number - Region: 489-563
Classification Level Classification E-value
Superfamily Invasin/intimin cell-adhesion fragments 0.000216
Family Invasin/intimin cell-adhesion fragments 0.0033
 
Domain Number - Region: 261-349
Classification Level Classification E-value
Superfamily Invasin/intimin cell-adhesion fragments 0.022
Family Invasin/intimin cell-adhesion fragments 0.012
 
Domain Number - Region: 999-1082
Classification Level Classification E-value
Superfamily E set domains 0.0714
Family SVA-like 0.04
 
Domain Number - Region: 177-243
Classification Level Classification E-value
Superfamily Invasin/intimin cell-adhesion fragments 0.0785
Family Invasin/intimin cell-adhesion fragments 0.01
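The strong and weak hits listed above can be put into a small data structure to check how they lie along the 1641-residue sequence. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the `Hit` class and the helper functions are hypothetical names introduced here, and the regions and superfamily E-values are transcribed by hand from the listing above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One domain assignment (hypothetical container, not a SUPERFAMILY API)."""
    start: int        # first residue of the region, 1-based inclusive
    end: int          # last residue of the region, 1-based inclusive
    superfamily: str
    evalue: float     # superfamily-level E-value
    strong: bool      # True if listed under "Strong hits"


# Values transcribed from the listing above.
HITS = [
    Hit(1149, 1430, "Terpenoid cyclases/Protein prenyltransferases", 1.88e-57, True),
    Hit(1536, 1629, "Alpha-macroglobulin receptor domain", 1.44e-11, True),
    Hit(489, 563, "Invasin/intimin cell-adhesion fragments", 2.16e-4, False),
    Hit(261, 349, "Invasin/intimin cell-adhesion fragments", 2.2e-2, False),
    Hit(999, 1082, "E set domains", 7.14e-2, False),
    Hit(177, 243, "Invasin/intimin cell-adhesion fragments", 7.85e-2, False),
]


def by_position(hits):
    """Return the hits ordered by start residue."""
    return sorted(hits, key=lambda h: h.start)


def overlapping_pairs(hits):
    """Return neighbouring hits (in sequence order) whose regions overlap."""
    ordered = by_position(hits)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b.start <= a.end]


def covered_residues(hits):
    """Count residues covered by the regions (assumes they do not overlap)."""
    return sum(h.end - h.start + 1 for h in hits)


if __name__ == "__main__":
    for h in by_position(HITS):
        kind = "strong" if h.strong else "weak"
        print(f"{h.start:>5}-{h.end:<5} {kind:<6} {h.evalue:.3g}  {h.superfamily}")
    print("overlaps:", overlapping_pairs(HITS))
    print("covered residues:", covered_residues(HITS))
```

Sorting by start residue makes the N-terminal to C-terminal order of the assignments explicit. Because the list is sorted, comparing only neighbouring hits is enough to detect any overlap.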
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) gi|521464202|ref|YP_008151279.1|
Sequence length 1641
Comment hypothetical protein SCE1572_24390 [Sorangium cellulosum So0157-2]
Sequence
MRARFDAFRRRWSSPHLLAALALTLAGAGSGAACSSDEEVAPPFGRPDQPRNGTIEVDEA
KLRGVIEEGDLRLQIPVRNLTAGATRGHLRVTLVDLGADEEQGSTEVDVDISSSAAATAE
ARLPAPAVSGQPGLARYLVRIDDGSGKGLLVTRSLLRVIPPYDVQLEGPARVVRGRAASY
RVRATDAFSHQPVPSVEVELAVGEGAAARELRGTTDALGAAVFEVEPGEIGDLPVSARAL
GFGVSAVVDERVTVEGPGPKLLLTTDKPLYKPGQTIHLRALALERGNNAPLARAAVTFEV
EDGKGNKIFKKPIATDAFGVAATTFQIGSIVNVGEFKVRLVSGETTTEKIVTVGQYALPK
FEVAVKTDRPWYSAGDTLTGTIDARYFFGKDVAGGSVTIEAASLDVGQTTFQRIMGTLDA
KGHYGFSLTLPQHLAGLPLEQGDAAVTLTISVADTAGQAVTKAIPVKVAADVASLVLVPE
ARELVPGVPNTLLLFVTDPLGEPIPEIDAVITAPDGARLTARTDAFGQAAVSFTPPVGAA
DGEFSVRATVGEKAVQQKFSFAAQAGGEHLIVRTDRAVYEAGETIEVAIESSEETGSVYV
DWLNDGQAVDMRSLELEGGRARFSMPVDAGLLGNNRVEAYLVDDDGNLVRSGRTVFARGD
SALNIEVDADKPLYAPGETATLRLSVKDDAGNPKVAAVGVQVVDQAVFALVDAQPGLLRT
YFELEDEFAKPSYQIRGPSANLQNLLFSATASDDPEQAEAAQRTTQASLAALAGTSVTGL
QARTWPAVLAKAKTELEPFYERQKGALRPVLAEAAAGAVEDLRELGCDPMQYFCDGQGEY
AALFAGRTAERLRAFDVWGNAYRVSSTWSGFTVLSSGPDEVAQSADDAAIAFEFADLGVD
VPAPNWPEAAEGDGDWANGGASGGGGGGVVGDPGGGGESGPRVRREFPETLYVNPSVITS
PDGTATLSIPLADSITEWRLSALANSADGKLGGVQSGFKVFQDFFVDVSFPATLTRGDEV
EFPIAVYNYLETPQTVDLSLQPGSWYTALGATTAQVSLAPGEVRGVRFPVRVDTVGVNAL
TVTARGTEGADAVARTVRVVPDGKPFAESKSGMVEAGSVTHALSFPDGAVVGSNQLYLEI
YPAFLSQVVSGMDSMLQVPSGCFEQTTSTTWPNVLVTQYMKQTGQITPEIQMKAESLISA
GYQRLLTFEHAGGGFSWFGEQDGRPYLSVTAFGVMEFADMIKVHAVDEAMLARTVAWLAG
QQKTDGTWEGDQTEFFSFHTGTLRNSAFVLWALASAGYEGPEIARGLEVLGRSLQPATDD
LYTLAIAANALAVAAPSGALTDRVLDALDERKTAEGGAIFWDDGGTQTSFYEVGDDAKVT
TTALAAHALLSADAHRSSADGGVKFLTESKDANGNFGSTQATIWSLRTLLLAASKGSEGA
VGSLSVAVDGEAVSTVALREDQADVMTTVDLTHLATTGAHEVTLAFEGEGKLSYNLVSRH
HLPWSAVPSDGGGPLSVSVSYDKTALYVNDIVTASVDVRNTTSSAQSMILVTVGLPPGFQ
VLTEDLQQYIAAGQLSRFEITGKQLILYVKEIPAGGEASFDYRLQATMPVRAEDGGAEVH
PYYQPEQQSFSGAQVLEVRGE
Identical sequences S4Y390
WP_020736801.1.94496 gi|521464202|ref|YP_008151279.1|
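The identifier used throughout this record, gi|521464202|ref|YP_008151279.1|, follows the pipe-delimited style of older NCBI sequence identifiers, alternating a database tag with a value. As a rough sketch, it can be split into its parts as shown below. The function name `parse_pipe_id` is hypothetical, and the parser assumes strict tag/value pairs as in this record. It does not handle identifier types that carry more than one value per tag.

```python
def parse_pipe_id(identifier):
    """Split a pipe-delimited identifier into {tag: value} pairs.

    Assumes strictly alternating tag|value fields, as in the identifier
    shown in this record; other identifier layouts are not handled.
    """
    fields = identifier.strip("|").split("|")
    if len(fields) % 2 != 0:
        raise ValueError(f"expected tag|value pairs, got: {identifier!r}")
    return dict(zip(fields[0::2], fields[1::2]))


if __name__ == "__main__":
    parts = parse_pipe_id("gi|521464202|ref|YP_008151279.1|")
    print(parts["gi"], parts["ref"])
```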
