SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSHAP00000020996 from Sarcophilus harrisii 76_7.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSHAP00000020996
Domain Number 1 Region: 127-366
Classification Level Classification E-value
Superfamily vWA-like 1.78e-67
Family Integrin A (or I) domain 0.00000206
Further Details:      
 
Domain Number 2 Region: 1115-1300
Classification Level Classification E-value
Superfamily Fibronectin type III 9.63e-37
Family Fibronectin type III 0.00000618
Further Details:      
 
Domain Number 3 Region: 1441-1641
Classification Level Classification E-value
Superfamily Fibronectin type III 1.28e-34
Family Fibronectin type III 0.0012
Further Details:      
 
Domain Number 4 Region: 982-1106
Classification Level Classification E-value
Superfamily CalX-like 1.12e-18
Family CalX-beta domain 0.00087
Further Details:      
 
Domain Number 5 Region: 623-708
Classification Level Classification E-value
Superfamily Integrin beta tail domain 3.53e-16
Family Integrin beta tail domain 0.0028
Further Details:      
 
Domain Number 6 Region: 26-76
Classification Level Classification E-value
Superfamily Plexin repeat 0.000000000628
Family Plexin repeat 0.0019
Further Details:      
 
Domain Number 7 Region: 537-574
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000809
Family Integrin beta EGF-like domains 0.024
Further Details:      
 
Domain Number 8 Region: 578-614
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000391
Family Integrin beta EGF-like domains 0.0051
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSHAP00000020996   Gene: ENSSHAG00000017802   Transcript: ENSSHAT00000021166
Sequence length 1739
Comment pep:known_by_projection scaffold:DEVIL7.0:GL856832.1:2991958:3028782:-1 gene:ENSSHAG00000017802 transcript:ENSSHAT00000021166 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPWPGAWAGLLFWAVLCIGLQGNTANRCKKALVKSCTECIRIDKDCAYCTDETFKDRRCN
TREELLAMGCEAGSVVFKESSFHITEYTEIDTTLKKSQVSPQAMHVRLRPGEEKSFEFQV
FEPLESPVDLYILMDFSYSMSDDLANLKQMGRNLARVLSQLTSDYTIGFGKFVDKVSVPQ
TDMRPDKLKEPWPNSDPPFSFKNVISLTKDLEEFRHKLQRERISGNLDAPEGGFDAILQT
AVCTKEIGWRKDSTHLLVFSTESAFHYEADGVNVLAGIMKRNSEECHLDSTGTYTQYNQQ
DYPSVPTLVRVLAHYNIIPIFAVTNHSFSYYEKLHSYFPVSSLGLLHEDSANIVELLEEA
FYRIKSNLDIRALNTPRGLRTEVTSKKYKKTKAGSFQIQRGEVGTYHVQLRAVEQVDGKH
VCLLPPEDQGGEIHLKPSFSDGLRIDTSIICDVCDCELQKELLSTKCSSHGNFTCGHCVC
KEGWSGKSCNCSTGSLSDTKPCIPEGEDKVCSGRGECQCGRCVCYGDGRYEGQFCQHDNF
QCPRTSGFLCNDRGRCSMGQCICEKGWTGKSCECPLSNATCIDSNGGICNGRGRCECGRC
HCDQVSLYTDTTCEISYSAAFLGVCEDLRSCVQCQAWGTGVKKGQKCASCNFKVKMVEEL
KKAEEVLEYCSFRDEDDDCTYSYTVEGNEAIGPNSTVLVHKKKDCPPGTFWWLIPLLIFL
LLLLALLLLLCWKYCACCKACLALLPCCNKGHMVGFKEDHYVLRENLMASDHLDTPMVRS
GNLKGRDMVRWKINNGITSHAANPKELVPFRLSLRLARLCTENLLKPDTRECDQLRQEVE
ENLNEVYRQIPGVHKLQHTKFRQQPNAGKKQDHTIVDTVLMAPRSAKLPLLKLTEKQVSQ
GAFHELKVAPGYYTLTGDQDARGMVEFQEGVELVDVRVPLFIRPDNDDDQHLVVEAIDVP
VGTATIGRRLVNITIIKEQASGIVSFDQPEYSFNHMDQVARIPVTRRVMDSGKCQVSYRT
QDNTARANRDYIPVEGDLLFQPGETKKELQVKLLDLQEMDSLLLGHQSRRFHIHLTNPKY
GARLGEQHSATVLIESLDKDFIGQTSSSYSTPAEPGAPQNANAKAVGSRRIHFNWIPPPG
KPSGYRVKYWIQGDPESEARYLDCKVPSVELTNLYPYCDYEMKVCAYGPSGEGPYSPLVS
CRTHEEVPSEPGRLAFNVVSSTVTQLSWAEPAETNGEITAYEVCYGLVNEENKPIGPMKK
VLVDTPKKRTLLIENLRESQPYRYTVKARNGAGWGPEREAIINLATQPKRPMSIPIIPDI
PIVDAQGGEDYDSFLMYSDDVLRSPASSQRPSVSDDTEHLLNGRLDFVFPGSANSLHRMT
TTTSNYGTHLSPHPQHRILSTSSTLTQDYHSLTRTEHSATLPRDYSTLASVSSYESRLPS
GVPDTPTRLVFSALGPTSLKVSWQEPQCDRALQGYSMEYQLLSGGELHRLHINDPHQTSV
VVEDLLPNHSYIFRVRAQSQEGWGPEREGVITIESQVHPESPLSPLPGSPYTLSTPSAPG
PLVFTALSPDSLQMSWERPRRPNGDILGYLVTCEMAHGGPSTTFQVEGDHLENRLTVPGL
SENVPYKFKVQAKTTQGFGPEREGIITIESQNGGPFPQLGGHFGLQQEAAQFPREYSSIT
TSTHSSITKPFLADGLVLGTQQLEAGGSLTRQVTQEYVSRTLTSSGTLTKQLERQFFQT
Download sequence
Identical sequences G3WZZ1
ENSSHAP00000020996

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]