SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for XP_009026391.1.102002 from NCBI 2017_08 genome

Domain assignment details

Strong hits

Sequence: XP_009026391.1.102002

Domain  Region     Superfamily                             Superfamily E-value  Family                 Family E-value
1       1523-1754  Concanavalin A-like lectins/glucanases  1.8e-37              Laminin G-like module  0.0034
2       989-1108   Cadherin-like                           2.43e-28             Cadherin               0.0014
3       1799-2021  Concanavalin A-like lectins/glucanases  1.09e-27             Laminin G-like module  0.0075
4       551-671    Cadherin-like                           1.13e-24             Cadherin               0.0013
5       775-884    Cadherin-like                           1.71e-24             Cadherin               0.0006
6       886-1001   Cadherin-like                           1.14e-21             Cadherin               0.0014
7       77-176     Cadherin-like                           1.2e-19              Cadherin               0.0015
8       172-249    Cadherin-like                           2e-17                Cadherin               0.0019
9       448-563    Cadherin-like                           3.14e-17             Cadherin               0.0026
10      658-782    Cadherin-like                           3.57e-17             Cadherin               0.0017
11      1094-1235  Cadherin-like                           6.57e-16             Cadherin               0.0022
12      332-434    Cadherin-like                           0.0000000357         Cadherin               0.0078
13      1245-1323  Cadherin-like                           0.000000286          Cadherin               0.01
14      2102-2144  EGF/Laminin                             0.00000036           EGF-type module        0.035
 
Weak hits

Sequence: XP_009026391.1.102002

Domain  Region     Superfamily                       Superfamily E-value  Family                            Family E-value
-       10-73      Cadherin-like                     0.000113             Cadherin                          0.0074
-       289-348    Cadherin-like                     0.00236              Cadherin                          0.019
-       2054-2095  beta-sandwich domain of Sec23/24  0.0212               beta-sandwich domain of Sec23/24  0.027
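
The strong hits are listed by superfamily E-value, not by position along the sequence. As a minimal sketch, the Python below re-orders them by region start and collapses consecutive hits from the same superfamily into a compact summary. The regions and superfamily names are transcribed from the strong hits listed above. The names `STRONG_HITS`, `order_by_position` and `collapse_runs` are illustrative and not part of any SUPERFAMILY tool, and the collapsed summary is only one way to read the data, not the server's own architecture representation.

```python
# Strong hits transcribed from this page: (domain number, start, end, superfamily).
STRONG_HITS = [
    (1, 1523, 1754, "Concanavalin A-like lectins/glucanases"),
    (2, 989, 1108, "Cadherin-like"),
    (3, 1799, 2021, "Concanavalin A-like lectins/glucanases"),
    (4, 551, 671, "Cadherin-like"),
    (5, 775, 884, "Cadherin-like"),
    (6, 886, 1001, "Cadherin-like"),
    (7, 77, 176, "Cadherin-like"),
    (8, 172, 249, "Cadherin-like"),
    (9, 448, 563, "Cadherin-like"),
    (10, 658, 782, "Cadherin-like"),
    (11, 1094, 1235, "Cadherin-like"),
    (12, 332, 434, "Cadherin-like"),
    (13, 1245, 1323, "Cadherin-like"),
    (14, 2102, 2144, "EGF/Laminin"),
]


def order_by_position(hits):
    """Return the hits sorted by region start (N-terminus first)."""
    return sorted(hits, key=lambda hit: hit[1])


def collapse_runs(hits):
    """Collapse consecutive hits with the same superfamily into (name, count) pairs."""
    runs = []
    for _number, _start, _end, name in hits:
        if runs and runs[-1][0] == name:
            runs[-1] = (name, runs[-1][1] + 1)
        else:
            runs.append((name, 1))
    return runs


ordered = order_by_position(STRONG_HITS)
architecture = collapse_runs(ordered)

for name, count in architecture:
    print(f"{count} x {name}")
```

Read this way, the strong hits form a run of Cadherin-like domains, followed by two Concanavalin A-like lectins/glucanases domains and a single EGF/Laminin domain. The weak hits are left out of the sketch.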
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): XP_009026391.1.102002
Sequence length: 2502
Comment: hypothetical protein HELRODRAFT_179299 [Helobdella robusta]; AA=GCF_000326865.1; RF=representative genome; TAX=6412; STAX=6412; NAME=Helobdella robusta; AL=Scaffold; RT=Major
Sequence:
MKQRGRLRDGPFSIDVGSGVITLASPLEKTKFRYHLNISAIDNGACCPGTTSRKSHGQLI
IEVKDVKNHAPKFVDCASYNPVIMENENVGTFVVRVKAVDLDAGQNGKVSYSIVKSNDQS
SDKFEINSSTGEIRTSEVFDREAQLGVTDYGITVKAEDEGSQSLAGFCSFRVKIGDKNDN
PPVFSLSTYLTSVEEKFPVGKRVKQVYATDRDIGDNGKIEYYMKSDPSGFFAINQFNGWV
SIAKPMSGVSFLSKLTFVKVIIAIKLIYLTEHCNKSLTSLVFTPNNPLQRDMVKIVIEAR
DKGVPPQSSISHMEIKITQKSNAYPSWDKDYSQIPLTVSENAPENTIITRLRAHSSIPDS
TVNYIIQSADVPEKNGEPRSFYHLVDEKNNEMVLLTYRALDFEAIPHYFITVKAANRATN
SLHIDTRLLINLIDENDEIPQFVGLNENGLFSATVAENMKPGTDVITVSACDKDHNPFYS
KISYRLKKEGTDYDKFTIDKDTGVIRSRATFDREVKQDYYLEVIAEDGAPSQRTNHYPPG
TPNQGIAGVQIRVTDTNDNTPYFKNENYTTRVPENTDPGTVITTVTAEDNDEDHRLFYSI
VDGNLGNAFEVTPDNGEIKVRGQLDYENGPREYRLMYRVFDEKFTNFTLVIVQVLDVNDN
PPKFDNSIYNVTDINEEEPGISKNNEKYLLTVRATDPDVDRKSSIRYSLTGQFADDGTFV
INESSGTISLTRPLDRDAGKGRPVWNFNVLAHDESRDDQPSLTGYAEVRVMPRDINDNAP
VFDRNRLIGRVPEHSKAGVTVLTVITTDVDNGNNGSVTYSLKQVPMKGQQPLFTINTQMG
LISTVIPNALDREVQAEYQVLVQARDRGIPPLMSSATVTILVDDVNDHRPRFSHLLYKAV
LSESEPEGHHVLTVSATDKDSGNNAKLTYSLKEKDRSHFYIDTIETTNSGVIKVFKPLDY
DTMEKPYFNLTVHVRDSNPNHNDTAFVEITITDANDNPPVFNPNLEKATLFENATVGTTL
KKFTAIDIDTGINAEFSYSIDRSTDMEKQFAVDANGLVTVSKPLDMEVMPAHRIQIHAID
KGNPPQTGRGTLMISLLDVNDNPPEFAAIYQPVVYENKPAEQMVIMVSAVDRDSAANGAP
FELWLPCGGACPCQANPTCADFGFTFIPGGGLNGNGAGRIQTLRSFDRELQKEYEIPIVM
RDSGQPSVSGTSTLTVTIGDVNDNEHFPGHKEVLVNKYVGWKGEFDNYPLGNVFAEDEDD
WDLENKTFQLVHHHYHDKLFRVDKNTGQLYTKSPLKETTYNLKVRVHDLVWNKEVVSTVR
IEVHSFADDAVQNMASLRIQGITAEKFISKPEKSKKISNVFAQHEHHSRMELFAMLLASK
LDGKEYRASNVEILSVTNHATMLNVIDVWFAVHGSLYLKPSKLNGLIHVYKQEFESVLEG
SILMSPINECMIEKCLDVTDGCRTLVTISEHEPLIINTNSSSFVGVIAQSKAECSCSSAA
SFLESLGRECQPSSCFNGGTCVQKYSTLECLCPPDFNGPRCQQTHISFSGSGYAWFQPFE
ACRHNHISFQFATTRATGLILYYGPITYTKNNPFLALELRNGYPLLRMDDGKGEVVLSFV
EVDLHIVNKMNKLNDGAWHRVDIIIQNQFVRLIVDRCERGPSIFESESESSSIVLTNSCE
SMANLSGEDRGMMHINGPLSLGGSHPDSSIPEHITDDAFEGCIKHLVINNQLYDLHVGLE
GQGDSFEDGCRKEEAICHGLSSSNPNSSSKSSDKLARQSGKNRCGSEGECLAGIVPSSNV
QCVCKPGWRGFGCSIPTTVLDFGQKGFVELDFKDDMYADMDRKNGFESDLKVMVRTRQTS
ALLFKSYNTQKSEFIALEIIDSRLTYRFNIGSEESKVSLSQLNISDGEWHSIAVERRGIW
VEMIVDRGEGIYCNRSFPMIDKHHVAFKVSQRSIVLGGDVKFPSFGSDPLVSRDFNDGCL
TDVRYNGMWLPMTPQENDESECAEVMSRSQNIEAGCSSEACNQIDICQDGLVCVDLWRRA
ECRCPIGSRPVTIHHQQPLFNEQPFHHQPPFQHHIHHQQEQQLQQQQNLQYSQHLKRQSL
DEQLACEPINECLESTPCLHGQCQDTPEGFYCLCNSGYTGPICAEAVTASFVAMTPEALL
AIVLSFTVVTVIVFVAILVMRRRPHTSYSDVDLCDDVRDNIIRHDEEGIGEEDQQKYDIS
KLKKPVQPVGNGLAKPNNNINNTNKNSKNNSTNHNNINKNNANNKLKVANKHLTSVNNID
LLNNHKVMYEMQTVPILVNDKKPPKSSLVVKGKKELYSSSSNRTTPQSRLPLPSSSFPSS
SSSSPRVHSDDMLVVLKKKLKKIEADPVAPPFETVREYAFEGCGSYAGSLSSLASNCDDD
DDVIVGNDENHMTAGRAVDRRFIISQLTNQRPAFKKLADLYGSNITVYFSDNDANANEDA
FNNASHPLHAICGNVFEIEQGDCNRAVRHLDHFTNTQFVKDV
Identical sequences T1FEI2
XP_009026391.1.102002 jgi|Helro1|179299
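
The Comment field of this record packs a free-text description and several `KEY=value` tags into one semicolon-separated line. As a rough sketch, it can be split into a description and a dictionary of tags as shown below. The function name `parse_comment` is hypothetical. The page does not define what the tags mean, so the code only separates them and does not interpret them.

```python
# Comment text copied from this record.
COMMENT = (
    "hypothetical protein HELRODRAFT_179299 [Helobdella robusta]; "
    "AA=GCF_000326865.1; RF=representative genome; TAX=6412; STAX=6412; "
    "NAME=Helobdella robusta; AL=Scaffold; RT=Major"
)


def parse_comment(comment):
    """Split a comment into (description, tags).

    The first semicolon-separated part is treated as the description;
    every later part of the form KEY=value becomes a dictionary entry.
    Parts without an '=' are ignored.
    """
    description, *fields = [part.strip() for part in comment.split(";")]
    tags = {}
    for field in fields:
        key, separator, value = field.partition("=")
        if separator:
            tags[key.strip()] = value.strip()
    return description, tags


description, tags = parse_comment(COMMENT)
print(description)
for key, value in tags.items():
    print(f"{key}: {value}")
```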
