SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|221054884|ref|XP_002258581.1| from Protozoadb 2010_08

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|221054884|ref|XP_002258581.1|
Domain Number 1 Region: 1813-1854
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000167
Family Merozoite surface protein 1 (MSP-1) 0.012
Further Details:      
 
Weak hits

Sequence:  gi|221054884|ref|XP_002258581.1|
Domain Number - Region: 1770-1808
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000324
Family Merozoite surface protein 1 (MSP-1) 0.011
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) gi|221054884|ref|XP_002258581.1|
Sequence length 1874
Comment hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H]
Sequence
MSTCTAIWSFFLFVDTKMRLLLVLLLILLFQLCETLNWKNLQNVHVTVRECTLSLIALME
EEETIRLGKRNDERMAQIVEDKNEARQTLYKLYFSIRHLFSSLGIRFRKELYLFGVAEGN
AGGGKIGKREKKQVDNTSGKRFTDDKYLRGVKTEMDLKDLLWEMINYYKKKFIQGKPSTS
CAYINQKNSLRKQIEIVRYTHSYIATKLYYINYSKYFRLFLGRGTYMANVIFNPSLDIKD
DVYSGFYDNYAGMIADEQQRGATHGTESKTKDDPQDGQNGRQPICSDYKSISGEKCTQEF
AQSVKTMLHNFEVSLEGYIQSSVVEMKKQMIQVEEQQEGRNYCRDLMKELKDKSYDRLSV
EEIERFEQLAKNYLQKDLDMLVEREKKKMYRRRNFFEKYFFFIIEKMLHVRRKVETSLEA
LSRELRGSTNKPPTGTSEKLTATDKPLIEPLSFRKRGPGQVELIVGAEIKSIFQLLDEYI
SMYSFVYEHLKYYHLNVRDIFSLDYIRENSKDNRVYIAEQIAQQMDELNGSFLKYRNVKY
IYEQFVEKGIRKTNSVSNLRRGESIEGHGNHVPEQGPFSLPNLRKEDLPELWESTWNRRS
YSSLSEEDKEKKEKLNEELIKREDEYLRRLENVVKLLTQYKKLKNKKIKIKHIVNIEKVE
MHPILFNLKLRKNEMVGFYKSILFFGKIFVVKRLVLILKMKITFLSKVTPSAFLLRNRFF
LLLYETHLEIMRSKYQIEMNKNFCRNLNLENLDGMMKQGDLPHLLLKYVIYMFGLNSSFM
SEHLGVDIGPGTGSEYMGDSNSSNGDGDFGEVDPSWGAHDLSVIYEMARNFIKRPLVHTE
IFFPSNKASPSGINNTQDEKVKGAFRQIHSYFSSIFNSDEISGYILKKFERLEEFGSSGC
YHGHYCSMENRVYSKDVIKRSVFYNLNDMEDEFDMINEANVSSVGSCSSSLYAYASSSSS
TSSSFSSSFSFPLKFPFYLLNNALITNSLSEAYKYMLYKTQQNQIFKLYSISKGTNMSLL
ETVFLQLILNFGMTPYNKLKGTMINSFCRKKGRLKQNGALSNAHAGQFQYRKGSLSRNEH
ILYVSNGEHLQGGLTISCSLMEHNHEYYTHDKTPTEWWNSYSSSYNNTQNQERNHRVYTC
DLLNGQRKKISCFEVKHIEYIPNGIPFWSEVTEKKNIKSNYQLYEHFFQGVKGVDRRKRN
FPKDNSNGMSSREDIYCSEDKTGYFIIDQSVVHPSEGMDTDEALIGPMLSKSAASQKEIL
FEKESMEQYNMILSWLHRSQEKKNWNREKVRKIERDISGLRWRAKLYEQNISYVKNKMAQ
MSRPPGEGSHRLSDLQKKYQSEVSNAAQEKYNSLMDIYKDIVEMFRKTEKNLTKLKNQNK
REEEKEETEKEPNVEYYGLSKYAFLRKYQVENLNMYTMYNEQIIKYFQKQNRCCDAYIEE
MKIHLEFLPQVCCSNGSKRDKDNILKNIYISISDLMTDLIRCENNTNRLIRDFKKVKDTL
HIINTVNANLHKKQRTFHLSTKYFYREKKEENIYFFTDKMENIKTYKIYQQLIDRVNEDL
IFVMHTMVNKIDERQQLLQQVEQKVPTLVNIKRILTEDNDVASINVETLFANFFHENMLN
YDKVKILRKILKKRISVYKNVLNNIRYSFEHEPQVSNDSMALFYNFVDYDSEKDTDAAQF
ADALLAYNEGGGIILPEQEEDAKNRNRRPLTKYQEIWRNLNELKGEDGRNVKEKYDDEGD
DDDDDRDIYEDWGEEDWAEDSLKAVADRVKNNCRNRKCPPNSFCFIETFNEECLCFLNYN
MVGGKCILNEENSCTVKNGGCDLKATCELKKNRVNCICPKGTKPIYEGVVCSFSFVSSFS
QILLLLAMMAFVIA
Download sequence
Identical sequences A0A1A7VY15 A0A1Y3DX94 B3L2X7
PKH_072840 XP_002258581.1.91479 gi|193808650|emb|CAQ39353.1| gi|221054884|ref|XP_002258581.1| 5850.PKH_072840

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]