SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for G3WZZ0 from UniProt 2018_03 genome


Domain assignment details

Strong hits

Sequence:  G3WZZ0
Domain Number 1 Region: 127-366
Classification Level  Classification                  E-value
Superfamily           vWA-like                        1.87e-67
Family                Integrin A (or I) domain        0.00000206

Domain Number 2 Region: 1121-1307
Classification Level  Classification                  E-value
Superfamily           Fibronectin type III            8.83e-35
Family                Fibronectin type III            0.00000618

Domain Number 3 Region: 1496-1698
Classification Level  Classification                  E-value
Superfamily           Fibronectin type III            2.32e-34
Family                Fibronectin type III            0.0011

Domain Number 4 Region: 985-1100
Classification Level  Classification                  E-value
Superfamily           CalX-like                       3.27e-18
Family                CalX-beta domain                0.00087

Domain Number 5 Region: 623-708
Classification Level  Classification                  E-value
Superfamily           Integrin beta tail domain       3.66e-16
Family                Integrin beta tail domain       0.0028

Domain Number 6 Region: 26-76
Classification Level  Classification                  E-value
Superfamily           Plexin repeat                   0.000000000628
Family                Plexin repeat                   0.0019

Domain Number 7 Region: 537-574
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.000000837
Family                Integrin beta EGF-like domains  0.024

Domain Number 8 Region: 578-614
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.0000391
Family                Integrin beta EGF-like domains  0.0051
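
The strong hits above appear in order of increasing superfamily E-value, not by position along the sequence. The following is a minimal sketch, in Python, of how the listed regions could be re-sorted from N-terminus to C-terminus to read off the domain order. The region coordinates and E-values are copied from the listing. The names `Domain`, `architecture`, and `find_overlaps` are hypothetical helpers introduced for illustration and are not part of SUPERFAMILY.

```python
from typing import List, NamedTuple, Tuple


class Domain(NamedTuple):
    """One strong hit from the listing (hypothetical container type)."""
    number: int        # "Domain Number" as listed
    start: int         # first residue of the region (1-based)
    end: int           # last residue of the region (inclusive)
    superfamily: str   # superfamily-level classification
    evalue: float      # superfamily-level E-value


# Values transcribed from the domain assignment details for G3WZZ0.
DOMAINS = [
    Domain(1, 127, 366, "vWA-like", 1.87e-67),
    Domain(2, 1121, 1307, "Fibronectin type III", 8.83e-35),
    Domain(3, 1496, 1698, "Fibronectin type III", 2.32e-34),
    Domain(4, 985, 1100, "CalX-like", 3.27e-18),
    Domain(5, 623, 708, "Integrin beta tail domain", 3.66e-16),
    Domain(6, 26, 76, "Plexin repeat", 6.28e-10),
    Domain(7, 537, 574, "EGF/Laminin", 8.37e-07),
    Domain(8, 578, 614, "EGF/Laminin", 3.91e-05),
]


def architecture(domains: List[Domain]) -> List[Domain]:
    """Return the domains ordered by start position (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d.start)


def find_overlaps(domains: List[Domain]) -> List[Tuple[int, int]]:
    """Return domain-number pairs where neighbouring regions overlap.

    Only adjacent regions in sequence order are compared, which is enough
    to detect whether any overlap exists at all.
    """
    ordered = architecture(domains)
    return [
        (a.number, b.number)
        for a, b in zip(ordered, ordered[1:])
        if b.start <= a.end
    ]


if __name__ == "__main__":
    for d in architecture(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} {d.superfamily} (E={d.evalue:.2e})")
    print("overlapping neighbours:", find_overlaps(DOMAINS))
```

Sorted this way, the regions read: Plexin repeat, vWA-like, two EGF/Laminin domains, Integrin beta tail domain, CalX-like, then two Fibronectin type III domains. None of the listed regions overlap.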
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): G3WZZ0
Sequence length: 1796
Comment: (tr|G3WZZ0|G3WZZ0_SARHA) Integrin beta {ECO:0000256|RuleBase:RU000633} KW=Complete proteome; Reference proteome OX=9305 OS=Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). GN=ITGB4 OC=Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
Sequence:
MPWPGAWAGLLFWAVLCIGLQGNTANRCKKALVKSCTECIRIDKDCAYCTDETFKDRRCN
TREELLAMGCEAGSVVFKESSFHITEYTEIDTTLKKSQVSPQAMHVRLRPGEEKSFEFQV
FEPLESPVDLYILMDFSYSMSDDLANLKQMGRNLARVLSQLTSDYTIGFGKFVDKVSVPQ
TDMRPDKLKEPWPNSDPPFSFKNVISLTKDLEEFRHKLQRERISGNLDAPEGGFDAILQT
AVCTKEIGWRKDSTHLLVFSTESAFHYEADGVNVLAGIMKRNSEECHLDSTGTYTQYNQQ
DYPSVPTLVRVLAHYNIIPIFAVTNHSFSYYEKLHSYFPVSSLGLLHEDSANIVELLEEA
FYRIKSNLDIRALNTPRGLRTEVTSKKYKKTKAGSFQIQRGEVGTYHVQLRAVEQVDGKH
VCLLPPEDQGGEIHLKPSFSDGLRIDTSIICDVCDCELQKELLSTKCSSHGNFTCGHCVC
KEGWSGKSCNCSTGSLSDTKPCIPEGEDKVCSGRGECQCGRCVCYGDGRYEGQFCQHDNF
QCPRTSGFLCNDRGRCSMGQCICEKGWTGKSCECPLSNATCIDSNGGICNGRGRCECGRC
HCDQVSLYTDTTCEISYSAAFLGVCEDLRSCVQCQAWGTGVKKGQKCASCNFKVKMVEEL
KKAEEVLEYCSFRDEDDDCTYSYTVEGNEAIGPNSTVLVHKKKDCPPGTFWWLIPLLIFL
LLLLALLLLLCWKYCACCKACLALLPCCNKGHMVGFKEDHYVLRENLMASDHLDTPMVRS
GNLKGRDMVRWKINNGITSHAANPKELVPFRLSLRLARLCTENLLKPDTRECDQLRQEVE
ENVLNPIMRRAASTGKHQNFSHRQQPNAGKKIRQDHTIVDTVLMAPRSAKLPLLKLTEKQ
VSQGAFHELKVAPGYYTLTGDQDARGMVEFQEGVELVDVRVPLFIRPDNDDDQHLVVEAI
DVPVGTATIGRRLVNITIIKEQASGIVSFDQPEYSFNHMDQVARIPVTRRVMDSGKCQVS
YRTQDNTARANRDYIPVEGDLLFQPGETKKELQVKLLDLQEMDSLLLGHQSRRFHIHLTN
PKYGARLGEQHSATVLIVTAESLDKDFIGQTSSSYSTPAEPGAPQNANAKAVGSRRIHFN
WIPPPGKPPFWVQVKYWIQGDPESEARYLDCKVPSVELTNLYPYCDYEMKVCAYGPSGEG
PYSPLVSCRTHEEVPSEPGRLAFNVVSSTVTQLSWAEPAETNGEITAYEVCYGLVNEENK
PIGPMKKVLVDTPKKRTLLIENLRESQPYRYTVKARNGAGWGPEREAIINLATQPKRPMS
IPIIPDIPIVDAQGGEDYDSFLMYSDDVLRSPASSQRPSVSDDTEHLLNGRLDFVFPGSA
NSLHRMTTTTSNYGTHLSPHPQHRILSTSSTLTQDYHSLTRTEHSATLPRDYSTLASVSS
YGLPPIQEDGGSGARVSVPKDRVRVKVVHSYSGFQDSVILSRPAAHWGPESRLPSGVPDT
PTRLVFSALGPTSLKVSWQEPQCDRALQGYSMEYQLLSGGELHRLHINDPHQTSVVVEDL
LPNHSYIFRVRAQSQEGWGPEREGVITIESQVHPESPLSPLPGSPYTLSTPSAPGPLVFT
ALSPDSLQMSWERPRRPNGDILGYLVTCEMAHGGRCPSTTFQVEGDHLENRLTVPGLSEN
VPYKFKVQAKTTQGFGPEREGIITIESQNGGPFPQLGGHFGLQQEAAQFPREYSSITTST
HSSITKPFLADGLVLGTQQLEAGGSLTRQVTQEYVSRTLTSSGTLTKQLERQFFQT
Identical sequences: G3WZZ0, ENSSHAP00000020995
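
The Comment field of this record packs several annotations into one line: a UniProt-style `db|accession|entry name` identifier, a free-text description, and `TAG=value` fields (KW, OX, OS, GN, OC). The following is a minimal sketch, in Python, of how that line could be split into its parts. The function name `parse_comment` and the fixed tag list are assumptions made for illustration, based only on the tags visible in this record; other records may carry different tags.

```python
import re
from typing import Dict, Tuple

# Comment line copied from this record.
COMMENT = (
    "(tr|G3WZZ0|G3WZZ0_SARHA) Integrin beta "
    "{ECO:0000256|RuleBase:RU000633} "
    "KW=Complete proteome; Reference proteome "
    "OX=9305 "
    "OS=Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). "
    "GN=ITGB4 "
    "OC=Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus."
)

# Tags seen in this record only (an assumption, not an exhaustive list).
TAGS = ("KW", "OX", "OS", "GN", "OC")


def parse_comment(comment: str) -> Tuple[Dict[str, str], str, Dict[str, str]]:
    """Split a comment line into identifier parts, description, and tag fields."""
    head = re.match(r"\((\w+)\|(\w+)\|(\w+)\)\s*", comment)
    if head is None:
        raise ValueError("comment does not start with (db|accession|entry)")
    ident = dict(zip(("db", "accession", "entry_name"), head.groups()))

    rest = comment[head.end():]
    # Splitting on " TAG=" with a capture group yields:
    # [description, tag1, value1, tag2, value2, ...]
    parts = re.split(r"\s(" + "|".join(TAGS) + r")=", rest)
    description = parts[0].strip()
    fields = {tag: value.strip() for tag, value in zip(parts[1::2], parts[2::2])}
    return ident, description, fields


if __name__ == "__main__":
    ident, description, fields = parse_comment(COMMENT)
    print(ident)
    print(description)
    for tag, value in fields.items():
        print(f"{tag}: {value}")
```

For this record the sketch separates the accession (G3WZZ0), the gene name (ITGB4), and the taxonomy identifier (9305) from the rest of the line.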
