SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSHAP00000020995 from Sarcophilus harrisii 76_7.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSHAP00000020995
Domain Number 1 Region: 127-366
Classification Level Classification E-value
Superfamily vWA-like 1.87e-67
Family Integrin A (or I) domain 0.00000206
Further Details:      
 
Domain Number 2 Region: 1121-1307
Classification Level Classification E-value
Superfamily Fibronectin type III 8.83e-35
Family Fibronectin type III 0.00000618
Further Details:      
 
Domain Number 3 Region: 1496-1698
Classification Level Classification E-value
Superfamily Fibronectin type III 2.32e-34
Family Fibronectin type III 0.0011
Further Details:      
 
Domain Number 4 Region: 985-1100
Classification Level Classification E-value
Superfamily CalX-like 3.27e-18
Family CalX-beta domain 0.00087
Further Details:      
 
Domain Number 5 Region: 623-708
Classification Level Classification E-value
Superfamily Integrin beta tail domain 3.66e-16
Family Integrin beta tail domain 0.0028
Further Details:      
 
Domain Number 6 Region: 26-76
Classification Level Classification E-value
Superfamily Plexin repeat 0.000000000628
Family Plexin repeat 0.0019
Further Details:      
 
Domain Number 7 Region: 537-574
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000837
Family Integrin beta EGF-like domains 0.024
Further Details:      
 
Domain Number 8 Region: 578-614
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000391
Family Integrin beta EGF-like domains 0.0051
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSHAP00000020995   Gene: ENSSHAG00000017802   Transcript: ENSSHAT00000021165
Sequence length 1796
Comment pep:known_by_projection scaffold:DEVIL7.0:GL856832.1:2991958:3028782:-1 gene:ENSSHAG00000017802 transcript:ENSSHAT00000021165 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPWPGAWAGLLFWAVLCIGLQGNTANRCKKALVKSCTECIRIDKDCAYCTDETFKDRRCN
TREELLAMGCEAGSVVFKESSFHITEYTEIDTTLKKSQVSPQAMHVRLRPGEEKSFEFQV
FEPLESPVDLYILMDFSYSMSDDLANLKQMGRNLARVLSQLTSDYTIGFGKFVDKVSVPQ
TDMRPDKLKEPWPNSDPPFSFKNVISLTKDLEEFRHKLQRERISGNLDAPEGGFDAILQT
AVCTKEIGWRKDSTHLLVFSTESAFHYEADGVNVLAGIMKRNSEECHLDSTGTYTQYNQQ
DYPSVPTLVRVLAHYNIIPIFAVTNHSFSYYEKLHSYFPVSSLGLLHEDSANIVELLEEA
FYRIKSNLDIRALNTPRGLRTEVTSKKYKKTKAGSFQIQRGEVGTYHVQLRAVEQVDGKH
VCLLPPEDQGGEIHLKPSFSDGLRIDTSIICDVCDCELQKELLSTKCSSHGNFTCGHCVC
KEGWSGKSCNCSTGSLSDTKPCIPEGEDKVCSGRGECQCGRCVCYGDGRYEGQFCQHDNF
QCPRTSGFLCNDRGRCSMGQCICEKGWTGKSCECPLSNATCIDSNGGICNGRGRCECGRC
HCDQVSLYTDTTCEISYSAAFLGVCEDLRSCVQCQAWGTGVKKGQKCASCNFKVKMVEEL
KKAEEVLEYCSFRDEDDDCTYSYTVEGNEAIGPNSTVLVHKKKDCPPGTFWWLIPLLIFL
LLLLALLLLLCWKYCACCKACLALLPCCNKGHMVGFKEDHYVLRENLMASDHLDTPMVRS
GNLKGRDMVRWKINNGITSHAANPKELVPFRLSLRLARLCTENLLKPDTRECDQLRQEVE
ENVLNPIMRRAASTGKHQNFSHRQQPNAGKKIRQDHTIVDTVLMAPRSAKLPLLKLTEKQ
VSQGAFHELKVAPGYYTLTGDQDARGMVEFQEGVELVDVRVPLFIRPDNDDDQHLVVEAI
DVPVGTATIGRRLVNITIIKEQASGIVSFDQPEYSFNHMDQVARIPVTRRVMDSGKCQVS
YRTQDNTARANRDYIPVEGDLLFQPGETKKELQVKLLDLQEMDSLLLGHQSRRFHIHLTN
PKYGARLGEQHSATVLIVTAESLDKDFIGQTSSSYSTPAEPGAPQNANAKAVGSRRIHFN
WIPPPGKPPFWVQVKYWIQGDPESEARYLDCKVPSVELTNLYPYCDYEMKVCAYGPSGEG
PYSPLVSCRTHEEVPSEPGRLAFNVVSSTVTQLSWAEPAETNGEITAYEVCYGLVNEENK
PIGPMKKVLVDTPKKRTLLIENLRESQPYRYTVKARNGAGWGPEREAIINLATQPKRPMS
IPIIPDIPIVDAQGGEDYDSFLMYSDDVLRSPASSQRPSVSDDTEHLLNGRLDFVFPGSA
NSLHRMTTTTSNYGTHLSPHPQHRILSTSSTLTQDYHSLTRTEHSATLPRDYSTLASVSS
YGLPPIQEDGGSGARVSVPKDRVRVKVVHSYSGFQDSVILSRPAAHWGPESRLPSGVPDT
PTRLVFSALGPTSLKVSWQEPQCDRALQGYSMEYQLLSGGELHRLHINDPHQTSVVVEDL
LPNHSYIFRVRAQSQEGWGPEREGVITIESQVHPESPLSPLPGSPYTLSTPSAPGPLVFT
ALSPDSLQMSWERPRRPNGDILGYLVTCEMAHGGRCPSTTFQVEGDHLENRLTVPGLSEN
VPYKFKVQAKTTQGFGPEREGIITIESQNGGPFPQLGGHFGLQQEAAQFPREYSSITTST
HSSITKPFLADGLVLGTQQLEAGGSLTRQVTQEYVSRTLTSSGTLTKQLERQFFQT
Download sequence
Identical sequences G3WZZ0
ENSSHAP00000020995 ENSSHAP00000020995

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]