SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000012862 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSGACP00000012862

Domain  Region           Level        Classification                     E-value
1       893-1110         Superfamily  Fibrinogen C-terminal domain-like  4.06e-74
                         Family       Fibrinogen C-terminal domain-like  0.00000152
2       716-883          Superfamily  Fibronectin type III               3.23e-23
                         Family       Fibronectin type III               0.001
3       626-715          Superfamily  Fibronectin type III               1.14e-19
                         Family       Fibronectin type III               0.00025
4       284-366          Superfamily  Fibronectin type III               5.71e-17
                         Family       Fibronectin type III               0.00074
5       118-159,236-269  Superfamily  Fibronectin type III               0.0000000159
                         Family       Fibronectin type III               0.0042
6       558-642          Superfamily  Fibronectin type III               0.000012
                         Family       Fibronectin type III               0.0072
 
Weak hits

Sequence: ENSGACP00000012862

Domain  Region  Level        Classification                    E-value
-       84-107  Superfamily  EGF/Laminin                       0.00921
                Family       EGF-type module                   0.045
-       22-45   Superfamily  EGF/Laminin                       0.01
                Family       EGF-type module                   0.044
-       53-76   Superfamily  EGF/Laminin                       0.067
                Family       Integrin beta EGF-like domains    0.046
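
The Region values in the strong and weak hits can be handled programmatically. The following is a minimal Python sketch, not part of the SUPERFAMILY server, that parses a region string (including split regions such as 118-159,236-269), computes its length, and counts residues shared by two regions. It assumes the coordinates are 1-based and inclusive; the function names parse_region, region_length and overlap are hypothetical helpers introduced only for illustration.

```python
from typing import List, Tuple

Segment = Tuple[int, int]  # (start, end), assumed 1-based and inclusive


def parse_region(region: str) -> List[Segment]:
    """Parse a region string such as '118-159,236-269' into segments."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments: List[Segment]) -> int:
    """Total number of residues covered by all segments of a region."""
    return sum(end - start + 1 for start, end in segments)


def overlap(a: List[Segment], b: List[Segment]) -> int:
    """Number of residues shared by two regions (inclusive coordinates)."""
    shared = 0
    for s1, e1 in a:
        for s2, e2 in b:
            shared += max(0, min(e1, e2) - max(s1, s2) + 1)
    return shared


# Region strings copied from the strong hits listed above.
strong_hits = {
    1: "893-1110",
    2: "716-883",
    3: "626-715",
    4: "284-366",
    5: "118-159,236-269",
    6: "558-642",
}

if __name__ == "__main__":
    parsed = {n: parse_region(r) for n, r in strong_hits.items()}
    for n, segs in parsed.items():
        print(n, segs, region_length(segs))
    # Report any pair of strong-hit domains whose regions overlap.
    numbers = sorted(parsed)
    for i, a in enumerate(numbers):
        for b in numbers[i + 1:]:
            shared = overlap(parsed[a], parsed[b])
            if shared:
                print(f"domains {a} and {b} share {shared} residues")
```

Under the inclusive-coordinate assumption, this reports that the regions of domains 3 (626-715) and 6 (558-642) share residues 626-642.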
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s)
Protein: ENSGACP00000012862
Gene: ENSGACG00000009746
Transcript: ENSGACT00000012886
Sequence length: 1112
Comment: pep:known_by_projection group:BROADS1:groupX:14996770:15014430:-1 gene:ENSGACG00000009746 transcript:ENSGACT00000012886 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
TCVCFPGFHGADCGQSNCPGNCTDRGRCVGGECACDRGFAGPDCSERTCPNDCGGRGTCV
GGRCACDAGFAGPGCGATGCPGNCANKGRCVKGRCVCPPPFTGPDCGRKVSCQSLIILSA
SDITETSVLLLWTPPGLQYETYYVTFSSQVGPRLLPPPEQSAYELLALLAFSTVSAAIRR
YPTSTSSLEKHAVSLTCLLGAGHHHDNASKPRWRRTKCRTPFSKFLYSPWSEQKETEQQI
SVQVDGGLTTYSQTGLAAGQDYTATVAGEIAGKHILFNIISITPGPTNLRVVKTTSTSAV
VQWERSQGEIDRYHVIVTPTDGAGSSQEVTVPAGQDSAHIRQLEAGRLYDVVVVAEKGAS
RSKPATSQLSLRKKVEINGTIPKDTERSSLYRKPNVSGPFRLNTTRLLSRWRQFGPGPLK
KLPVGQKKKPPTGPLKLKPDVPASGDRTMALTLRDPDVNALTPEACSNTSRRSSSEKPTA
DAGTKEQTDVGRVGQGNDTPVSSEPTGTGRSQQKKCMNKLKSKPEQTRGVDSTDGSSSGD
TREPLDQVGVTNRTSDGFTLTWDSPEKKYKNFVVTSKDVQTKGTESQKEDRVTEFEHLPP
QTKYTVTLLGKGPGLLSRLHKLVISTGPEPPTNVVFSEVTENSLTVSWTKPRTPVSGYKV
TYTQTEEGEPVSVSVDSDDSTLDLSKLTPGSAYEVSVISLLGLDESDPTRDLVATLPDPP
TDLRAFNVTDTTALLLWRPALATVDKYIIVMVDSDSELRISVSGNAAELQLSGLEGSSTY
TVTVTSQRGSTQSSAASTSFTTTGGSGGGESPRDLRADNLTPRTAMLSWKPPSNPVGSYR
LTYQSEHLGLKEVMVDASVTEYNLTRLHPGSKYSVQLQAERGGRFSAAISTDFTTGTLRF
PFPTDCSQELLNGIRTSGEAEVFPQGKQGTPMEVYCDMETDGGGWTVFQRRKDGSVDFFR
GWKDYTRGFGVLSGEFWLGLESIYNLTAMTRMSLRVDLRDKDGAAFAKYSTFELVKRNYK
LIVGGYSGTAGDSLSYHNQRIFSTKDRDLAPFLTRCAMSYRGGWWYKNCHEANLNGAYGT
DRNQQGVIWTAWRGTKFSVPFTEMKMRPAAFS
Identical sequences G3P5I9
ENSGACP00000012862 69293.ENSGACP00000012862 ENSGACP00000012862
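
The Comment field of the protein record is a space-separated list of key:value tokens. The following minimal Python sketch splits it into fields. It assumes that the second token follows the layout coord_system:assembly:name:start:end:strand, which is an interpretation of the string rather than something this page states; parse_comment and parse_location are hypothetical helper names.

```python
# Comment string copied verbatim from the protein record above.
COMMENT = (
    "pep:known_by_projection "
    "group:BROADS1:groupX:14996770:15014430:-1 "
    "gene:ENSGACG00000009746 "
    "transcript:ENSGACT00000012886 "
    "gene_biotype:protein_coding "
    "transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> dict:
    """Split each whitespace-separated token at its first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict:
    """Interpret 'assembly:name:start:end:strand' (assumed layout)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    # 'group' is the key of the location token in this particular record.
    location = parse_location(fields["group"])
    print(fields["gene"], fields["transcript"])
    print(location)
    # Span of the stated interval, assuming inclusive coordinates.
    print(location["end"] - location["start"] + 1)
```

Splitting only at the first colon keeps the location token intact, so it can be interpreted separately from the simple key:value fields.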
