SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for 121224.XP_002423159 from STRING v9.0.5

Domain architecture


Domain assignment details

Strong hits

Sequence:  121224.XP_002423159
Domain Number 1 Region: 1509-1636,1663-1728
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.01e-40
Family Laminin G-like module 0.0006
 
Domain Number 2 Region: 550-663
Classification Level Classification E-value
Superfamily Cadherin-like 1.16e-30
Family Cadherin 0.00076
 
Domain Number 3 Region: 966-1079
Classification Level Classification E-value
Superfamily Cadherin-like 3.71e-30
Family Cadherin 0.00043
 
Domain Number 4 Region: 1752-1935
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.18e-29
Family Laminin G-like module 0.0069
 
Domain Number 5 Region: 875-971
Classification Level Classification E-value
Superfamily Cadherin-like 1.11e-25
Family Cadherin 0.0017
 
Domain Number 6 Region: 656-767
Classification Level Classification E-value
Superfamily Cadherin-like 2.49e-25
Family Cadherin 0.0014
 
Domain Number 7 Region: 445-548
Classification Level Classification E-value
Superfamily Cadherin-like 3.43e-25
Family Cadherin 0.00068
 
Domain Number 8 Region: 767-869
Classification Level Classification E-value
Superfamily Cadherin-like 2.57e-24
Family Cadherin 0.0012
 
Domain Number 9 Region: 1075-1193
Classification Level Classification E-value
Superfamily Cadherin-like 4.43e-22
Family Cadherin 0.0017
 
Domain Number 10 Region: 343-443
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-20
Family Cadherin 0.0016
 
Domain Number 11 Region: 1187-1282
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000343
Family Cadherin 0.015
 
Domain Number 12 Region: 1453-1494
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000698
Family EGF-type module 0.0054
 
Domain Number 13 Region: 2069-2109
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00001
Family Laminin-type module 0.019
 
Domain Number 14 Region: 2545-2840
Classification Level Classification E-value
Superfamily Family A G protein-coupled receptor-like 0.0000375
Family Rhodopsin-like 0.043
 
Weak hits

Sequence:  121224.XP_002423159
Domain Number - Region: 1974-2014
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000171
Family EGF-type module 0.026
 
Domain Number - Region: 1939-1973
Classification Level Classification E-value
Superfamily EGF/Laminin 0.013
Family EGF-type module 0.04
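
The strong hits above are listed by E-value, not by position in the protein. As a rough illustration, the sketch below re-sorts them by the start of each region to recover the N-to-C-terminal domain order. The tuple layout and the function names (`parse_region`, `order_along_sequence`) are hypothetical helpers invented for this example, not part of any SUPERFAMILY tool; the region strings are transcribed from the listing.

```python
# Strong hits transcribed from the listing above:
# (domain number, superfamily, region string). Layout is an assumption of this sketch.
STRONG_HITS = [
    (1, "Concanavalin A-like lectins/glucanases", "1509-1636,1663-1728"),
    (2, "Cadherin-like", "550-663"),
    (3, "Cadherin-like", "966-1079"),
    (4, "Concanavalin A-like lectins/glucanases", "1752-1935"),
    (5, "Cadherin-like", "875-971"),
    (6, "Cadherin-like", "656-767"),
    (7, "Cadherin-like", "445-548"),
    (8, "Cadherin-like", "767-869"),
    (9, "Cadherin-like", "1075-1193"),
    (10, "Cadherin-like", "343-443"),
    (11, "Cadherin-like", "1187-1282"),
    (12, "EGF/Laminin", "1453-1494"),
    (13, "EGF/Laminin", "2069-2109"),
    (14, "Family A G protein-coupled receptor-like", "2545-2840"),
]


def parse_region(region):
    """Split a region string such as '1509-1636,1663-1728' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def order_along_sequence(hits):
    """Sort hits by the start coordinate of their first segment."""
    return sorted(hits, key=lambda hit: parse_region(hit[2])[0][0])


ordered = order_along_sequence(STRONG_HITS)
domain_order = [number for number, _, _ in ordered]
architecture = [superfamily for _, superfamily, _ in ordered]

for number, superfamily, region in ordered:
    print(f"{region:>22}  domain {number:>2}  {superfamily}")
```

Read this way, the strong hits form a run of nine Cadherin-like domains (residues 343-1282), followed by an EGF/Laminin domain, two Concanavalin A-like domains, a second EGF/Laminin domain, and the C-terminal G protein-coupled receptor-like region.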
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) 121224.XP_002423159
Sequence length 3235
Comment (Pediculus humanus corporis)
Sequence
MANSKRIHNTHKKWHFFYPPIHIIFLIIILLSHQILSYVILVPDNISEGSIIFNAALTKL
HKNRIYSLNTHKNGYFVKKLLGVDRTSGEVFIKEKLDCQGIWYPNLFTLYVDSFPEKLPE
KNSKKKNEDFQSLINSIKNMRFENGYENKRHKRNVNGVKYYSMPLRIFIYGKSCNEDEII
NNSPTTINTIDKELLKFSSLTYLHKLCNFRYSEAKNWISESLASFAMPSESEYSKICLKK
SQFVNSISSFLPKTILSVCEIKYVNVDDPRFTVETSAGDLVATHDFCILEPIWKITIFLN
LNCFGSSGFLNSMDHRLKIIYHHEQLNDTEVAHRVRRELKNQSPYFDQALYVASVIEEKA
PGIPVITLRAKDPENSSITYSMSSLIDSRSQGMFDIDETTGMVTTSVKLDRELFDLHYFR
VVAADDSFPPRSGTTTLQINVIDANDHAPVFESDSYEAAVREGVAIGTIVITVRATDQDI
NKNAEIVYSFKDPDEEFNIDGKTGVVTTRKALDREVTANYALTVIASDSPPVGERKTATA
LLNVRVLDDNDNYPQFTERTYTVSVPEDMDASLSPVIATVKATDADEGKNAAIRYVIIGG
NTQAQFLIDSLTGDVILAKPLDYEVLKSYRLVIRAQDGGNPSKSNTTQLLVNVKDVNDNE
PRFYSTLFQESVLESVPVGYSIVKVQAYDSDEGANVALKYSLSERDEFGTPTNELPLTID
SVSGWIQTTKPLDREMTSKYQFQVIVEDGGEPPKSATANVIINVQDVNDNDPVFNPKIYE
VVVSEQDPPGTLVASVTATDPDENSRLHYEITNGNVRGRFSITSQNNRGLVAIAHPLDYR
QDKRYILTVSATDTGGRSDIATVYVNVTDANNYSPVFENAPYTAQVFEDAPVGTTVLVVQ
ASDGDVGQNAQITYSLTSGAEFSVDSFSINPQTGAVVTTKLLDRETVSSYLLTVTARDGG
KNYVEITVKDVNDNSPVFSSATYSGVISEDALVGTSVLQVQATDIDSGLNGRIRYAFNPP
GSSGAVDNSFVIDPTLGVIRTSKNLDRESVPFYSLKVYAIDRGTPSLYSVVNVNIKIEDV
NDSPPVFESEKIVFYIPENSPIGSTVGEVRAKDPDEGVNAIIQYSIIGGEDSSSFSLVTR
PGWDKAEILTTVDLDYESPRKKYEMVIRAASPPLRNDVKVEILVTDVNDNAPVLKDFQII
FNNFRGCFSNGVVGTIPAFDADVSDDLHYHILSGNNANLVMLNESTGKITLSPQLNTNVP
KLASMEVLVSDGINEVKATMSLFVRLITEEMLFNSVTVRLADMTEKAFLSPLLGFFVDAL
AAVIPCPREYIYLFSIQDEVDMESKILKVSFSARRPDVTGEEFYSSQFLEERVYLNRAIL
ARLSTVQILPFNDNLCVKEPCLNYEQCLTVLKFGNASGFISSDSVLFRPIYPVSTFTCQC
PHGFTGSREHYLCDTEVDLCYSNPCQNGATCMRKEGGYSCVCKKGFTGLYCEIDSHSQSC
QSGFCGKGVCSPSSGNDKDYNDFCELRSRGFSKSSFLTFPSLKQRHRLHIKFRFATQSQN
GLLLYNGRYNEKHDFIALEIMNGSVQFSFSLGTNISRTVARVPGGVSDGKWHTVTLFYLN
KTATISLDDCDVKLALKKGSILGEKWACANSTTQILSTKCAIFTETCHRFLDLTGPLQLG
GLPSLPTNFQVQNKDFDGCISDLYIDHKFIDLNSFVADNGTTAGCHHKKDFCSSNPCKNG
GKCKEEWGTFLCECKEGHGGKDCSQSIQSSWRFRGDGILSYNPLLRPIQLPWINSLSVKT
LQKDAFLMSIQVGQNSSATMALKNGYLDYYYNGESINLGHGIINDGLWHHIEVKWMSNDV
WLSLDYGQREITKTFNVKVQGLYVNKILVGGPDESYLSLNSDFGYFDGCIQDIKVGNQQT
SLQRPTVKKNVSEGCSSTATCPEAKCPAHSDCKEYWEHSSCSCHLGWVGLSCSDVCEYDP
CENRGRCVHDSSFSKGYLCSCDSDEYSGEYCETKVDQPCPSSWWGYPVCGPCQCNVESGY
NPECNKTTGECYCKENHFQPSGSKKCLPCECYLAGSFTPKCDTLNGQCECRPGVIGRRCD
SCSNPYAEVTLNGCEVVYDGCPKSFSFGLWWDRTTFGEVATASCPVGSVGKAKRSCNGDT
SGWDEPDLFNCTSDKFVSLHKVLNQLNNQELHINTFVAVKIASDLYEATNMTSVLHGVDV
MVAHQLIEKLINYENTMSGLNLTHSQDKDYIRNLVSSANVLLDPKYSNHWQTVEKLIDEG
PVDLVMDIEKYLSTLTSSQGDIYTSPFEIVKPNMVLGLDVITTSSVFGYEGSDKENQIGE
KEKVILPDTSHFLHSSLELNMASSVSDLETKTSPTVSFPKYNNYLQDSKNFDTHSKVMIP
LHILGIENVKQGDLPDLKASERRAVFGYAFYKEAGNLFPEEYDETVTKRWGIQLKVGSAV
LSFSTLVPYETESEDENKNDGNNFIYKPLSEIRLVSPIRVRLWLDSERQPIKSNPQCVHW
TTVRGKGEWSRSGCHTDLPEVNDDTEPYIVNCTCYHLSTFAVLLDVIDLEYIPQPTFLED
LMTYVGFSVSIFLLIIALLILSCIRGRPTNSNSIHKNIVFCILCGEIIYFAALKFRSLLL
QQEFPCKMIAMFLHYFWLSSFSWTLVDSLHLYRMLTELRDINHGQMRFYYCLGYGLPAII
VGLSVGVRADQYGNFYFCWLSIYESVVWSLVGPITFVVVITMIMLMLSIRAAFTLKNHIL
GYGNLRTLVWVSVIFLPLLGIVWIFLILNVSEPLALLPHALSLAVIIQAVYTLAGFCFVN
ARVRRNLYVSLLKCCGKEIPKDLDLSIDAIGSSSNIASDRPTTYRNAEVSVSTRRNMGIS
TSSTTSRSTAKTSSSPYRSDTQLRGTNTDTSTSNYNSTNELPSFMRGYKSPGKIKEEGRN
AQTDSDSDNSVDGRSLELASSHSSDDDVSSRPHRSTRNGPINTNYMPNICEGGVPSPPTL
NVISQSELFPNLAPLYAPRWSSQIPQSYLPSNMSELRDDETSPQPLPRPDIDPLSEYDPH
KLSHLEPSLYEKTNSYQVNLSTVYENDNKMDNYNMDVPDPEEKVHLGEKYLFPYTAEEDH
CTVPTYSSNNCLHSRGSSRDSPNLRSHRDSPSYSEFRESPSSSYFTKSRDSPTFHKDVHQ
RTNSNNLPQKENQPYTSVGSHYPSSLSLHQRNTPTRNSPLFPKNSHLDTDAQDSE
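
A quick consistency check on the wrapped sequence above: assuming every full line holds 60 residues (an assumption about the page layout, confirmed here only for the first line), 53 full lines plus the final 55-residue line give the stated length of 3235. The helper name `residue_count` is hypothetical.

```python
def residue_count(wrapped_lines):
    """Count residues in a line-wrapped sequence, ignoring surrounding whitespace."""
    return sum(len(line.strip()) for line in wrapped_lines)


# First and last lines of the sequence as printed above.
first_line = "MANSKRIHNTHKKWHFFYPPIHIIFLIIILLSHQILSYVILVPDNISEGSIIFNAALTKL"
last_line = "RTNSNNLPQKENQPYTSVGSHYPSSLSLHQRNTPTRNSPLFPKNSHLDTDAQDSE"

# Layout assumption: 53 full lines of len(first_line) residues, then the last line.
expected_length = 53 * len(first_line) + len(last_line)
print(expected_length)
```

Passing all 54 sequence lines to `residue_count` would give the same check without relying on the fixed-width assumption.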
Identical sequences E0VAL5
121224.XP_002423159 XP_002423159.1.24195 vb|PHUM040650-PA|EEB10421.1|class
