SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for W6USA6 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  W6USA6
Domain Number 1  Region: 86-343
Classification Level  Classification                  E-value
Superfamily           vWA-like                        1.19e-59
Family                Integrin A (or I) domain        0.00000233

Domain Number 2  Region: 3-43
Classification Level  Classification                  E-value
Superfamily           Plexin repeat                   0.0000241
Family                Plexin repeat                   0.0044
 
Weak hits

Sequence:  W6USA6
Domain Number -  Region: 571-597
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.00134
Family                Integrin beta EGF-like domains  0.015

Domain Number -  Region: 484-514
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.00335
Family                Integrin beta EGF-like domains  0.027

Domain Number -  Region: 356-426
Classification Level  Classification                  E-value
Superfamily           Integrin domains                0.00492
Family                Integrin domains                0.0085

Domain Number -  Region: 513-556
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.0184
Family                Integrin beta EGF-like domains  0.0083

Domain Number -  Region: 440-466
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.0307
Family                Integrin beta EGF-like domains  0.023

Domain Number -  Region: 694-737
Classification Level  Classification                  E-value
Superfamily           Integrin beta tail domain       0.0575
Family                Integrin beta tail domain       0.0069
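
The hits above are listed by E-value rather than by position, so the order of the domains along the protein is not obvious at a glance. The following is a minimal sketch, not part of the SUPERFAMILY server, that transcribes the regions and superfamily-level E-values from the strong and weak hit tables and sorts them from N- to C-terminus. The names `Hit`, `HITS`, `architecture` and `overlapping_neighbours` are hypothetical, chosen for illustration. Combining weak hits with strong hits is an assumption made here for readability; whether the server's own architecture diagram includes weak hits is not stated on this page.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    superfamily: str
    start: int      # first residue of the region (1-based, inclusive)
    end: int        # last residue of the region (inclusive)
    evalue: float   # superfamily-level E-value from the tables above
    strong: bool    # True for "Strong hits", False for "Weak hits"


# Values transcribed by hand from the hit tables for W6USA6.
HITS = [
    Hit("vWA-like", 86, 343, 1.19e-59, True),
    Hit("Plexin repeat", 3, 43, 0.0000241, True),
    Hit("EGF/Laminin", 571, 597, 0.00134, False),
    Hit("EGF/Laminin", 484, 514, 0.00335, False),
    Hit("Integrin domains", 356, 426, 0.00492, False),
    Hit("EGF/Laminin", 513, 556, 0.0184, False),
    Hit("EGF/Laminin", 440, 466, 0.0307, False),
    Hit("Integrin beta tail domain", 694, 737, 0.0575, False),
]


def architecture(hits):
    """Return the hits ordered from N- to C-terminus by start position."""
    return sorted(hits, key=lambda h: h.start)


def overlapping_neighbours(ordered):
    """Return adjacent pairs whose regions share at least one residue."""
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b.start <= a.end]


if __name__ == "__main__":
    ordered = architecture(HITS)
    for h in ordered:
        tag = "strong" if h.strong else "weak"
        print(f"{h.start:>4}-{h.end:<4} {h.superfamily:<26} {h.evalue:.3g} ({tag})")
    for a, b in overlapping_neighbours(ordered):
        print(f"overlap: {a.start}-{a.end} and {b.start}-{b.end}")
```

Sorted this way, the Plexin repeat (3-43) comes first, followed by the vWA-like domain (86-343), the Integrin domains hit, four EGF/Laminin hits and the Integrin beta tail domain. The only adjacent regions that overlap are the EGF/Laminin hits at 484-514 and 513-556, which share residues 513 and 514.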
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) W6USA6
Sequence length 994
Comment (tr|W6USA6|W6USA6_ECHGR) Integrin beta {ECO:0000256|RuleBase:RU000633} KW=Complete proteome; Reference proteome OX=6210 OS=Echinococcus granulosus (Hydatid tapeworm). GN=EGR_03922 OC=Cyclophyllidea; Taeniidae; Echinococcus.
Sequence
MGRGPTCSWCFDSEYDDTAEGPGYRCDERNILLERNCPAGKIESQNSTIVDAGVQGGDAQ
LMPPRQKVSVRPHDSFKIDFTFQSKTDYPVDIYFLVDQSYTMRDDLETVSRLTKDIANSF
TVVTKDLRMGFGAFVDKPVFPFIVPTPEAQVNPCLYGVGNQDLQCDPPFLYKHILSLTSD
FAEFEQKTILSRPSGNLDSPEGGLDALLQVARCPEYIGWRKNARKIVLFATDGGFHLAGD
GRLAGLIKPPPKTCQLTYELDRFNKSLVYLGWHNSVETDYPSVGEVAEVLTERDISVIFA
IDAKVFPLYEKLAAFLPSAAVGTLTDSSTNIVNLLRENYDKIANRAELILSYDTDSLEVE
ILAKCQGETEFTKKTVCKEHPVGGKIEYRVSVTPTRCFSGKKEVILKMVALEEQAMLEVT
SACHCPACGTPTASYPTSPRCQYHGYLLCGACVCAPQYSGEFCECSAESSQQEEIMLQQC
TRPGDDVPCSDRGRCVCGRCKCNLARYMGLYCECDRHGCKRAFDDNQVCGGPQRGECQCD
GTCKCKPGYTGERCDCIDSNANCYDPNNPDGPACSGMGVCDCGQCFCNSGRTGQLCNEVQ
GGESALCTDRGVEQCVLCLRREMLGTDEVRIAEEAEKRPPTAPPVTVSPETVAACHAVCN
TTVIDTTKVQIIDEVGEESKDVGEGGGSGGGSGGGSNLCIIYTEDSCRVMFKYRYSDAVY
IHRKVDLKIVRKSECTKTTGILYIILGVIAGIVLGGLILLLIYKLVITIDDRRELAKFKM
HNENVHWEMAENPIFEPPTTRVMNPTFNDDAALELSQLLASRLSLVHRGHRSYRFIPSKP
VWRWHDDRSPCSDSAIFTIPPEEIPPRIELDDQTLKLLERLSLVNFNEETTKTIVEEAIH
FADCLLTPVVFRGCATRATVEPMISLLGEKDWEPVEVVAEDWTEGGKEGSDPIAVTAAHI
LHHAAVTWENYFVAPPSNQPIHPELDGKAKSLDG
Identical sequences W6USA6
