SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSECAP00000016790 from Equus caballus 69_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSECAP00000016790
Domain Number 1 Region: 1126-1346
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 3.79e-86
Family Fibrinogen C-terminal domain-like 0.000000844
 
Domain Number 2 Region: 595-768
Classification Level Classification E-value
Superfamily Fibronectin type III 4.74e-24
Family Fibronectin type III 0.0000105
 
Domain Number 3 Region: 955-1123
Classification Level Classification E-value
Superfamily Fibronectin type III 1.6e-23
Family Fibronectin type III 0.0015
 
Domain Number 4 Region: 778-937
Classification Level Classification E-value
Superfamily Fibronectin type III 4.42e-21
Family Fibronectin type III 0.0011
 
Domain Number 5 Region: 416-597
Classification Level Classification E-value
Superfamily Fibronectin type III 3.99e-20
Family Fibronectin type III 0.0000067
 
Domain Number 6 Region: 326-413
Classification Level Classification E-value
Superfamily Fibronectin type III 4.6e-16
Family Fibronectin type III 0.0011
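
The strong hits above are listed by increasing superfamily E-value rather than by position along the protein. A minimal sketch of how the same rows could be reordered from N- to C-terminus to read off the domain architecture is shown below. The tuple layout and the helper names `architecture` and `region_length` are illustrative assumptions, and the regions are assumed to be 1-based and inclusive.

```python
# Strong hits copied from the listing above:
# (domain number, superfamily, region start, region end).
strong_hits = [
    (1, "Fibrinogen C-terminal domain-like", 1126, 1346),
    (2, "Fibronectin type III", 595, 768),
    (3, "Fibronectin type III", 955, 1123),
    (4, "Fibronectin type III", 778, 937),
    (5, "Fibronectin type III", 416, 597),
    (6, "Fibronectin type III", 326, 413),
]


def architecture(hits):
    """Return superfamily names ordered from N- to C-terminus (by region start)."""
    return [name for _, name, _, _ in sorted(hits, key=lambda h: h[2])]


def region_length(start, end):
    """Length of a residue range, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    for number, name, start, end in sorted(strong_hits, key=lambda h: h[2]):
        length = region_length(start, end)
        print(f"{start:>5}-{end:<5} {length:>4} aa  domain {number}: {name}")
```

Sorted this way, the strong hits read as five Fibronectin type III domains followed by a single Fibrinogen C-terminal domain-like domain.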
 
Weak hits

Sequence:  ENSECAP00000016790
Domain Number - Region: 208-231
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00167
Family Integrin beta EGF-like domains 0.038
 
Domain Number - Region: 239-262
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00251
Family Integrin beta EGF-like domains 0.083
 
Domain Number - Region: 299-324
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00395
Family EGF-type module 0.066
 
Domain Number - Region: 270-294
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00598
Family Integrin beta EGF-like domains 0.075
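
The page separates strong from weak hits but does not state the E-value cutoff it uses. The sketch below partitions all ten hits by their superfamily E-value using an assumed cutoff of 1e-4, which is a hypothetical value chosen only because it is consistent with the numbers listed here. The names `ASSUMED_CUTOFF` and `split_hits` are illustrative.

```python
# (region, superfamily, superfamily E-value) for all ten hits listed above.
hits = [
    ("1126-1346", "Fibrinogen C-terminal domain-like", 3.79e-86),
    ("595-768", "Fibronectin type III", 4.74e-24),
    ("955-1123", "Fibronectin type III", 1.6e-23),
    ("778-937", "Fibronectin type III", 4.42e-21),
    ("416-597", "Fibronectin type III", 3.99e-20),
    ("326-413", "Fibronectin type III", 4.6e-16),
    ("208-231", "EGF/Laminin", 0.00167),
    ("239-262", "EGF/Laminin", 0.00251),
    ("299-324", "EGF/Laminin", 0.00395),
    ("270-294", "EGF/Laminin", 0.00598),
]

# Assumption: the cutoff is not given on the page; 1e-4 is a placeholder
# that happens to reproduce the strong/weak split shown above.
ASSUMED_CUTOFF = 1e-4


def split_hits(all_hits, cutoff=ASSUMED_CUTOFF):
    """Partition hits into (strong, weak) by superfamily E-value."""
    strong = [h for h in all_hits if h[2] <= cutoff]
    weak = [h for h in all_hits if h[2] > cutoff]
    return strong, weak


if __name__ == "__main__":
    strong, weak = split_hits(hits)
    print("strong:", [region for region, _, _ in strong])
    print("weak:  ", [region for region, _, _ in weak])
```

With this assumed cutoff, the six numbered domains fall on the strong side and the four EGF/Laminin regions between residues 208 and 324 fall on the weak side.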
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000016790   Gene: ENSECAG00000018690   Transcript: ENSECAT00000020457
Sequence length 1358
Comment pep:known chromosome:EquCab2:5:10711172:10789781:-1 gene:ENSECAG00000018690 transcript:ENSECAT00000020457 gene_biotype:protein_coding transcript_biotype:protein_coding
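
The comment line packs several `key:value` fields into one string. As a rough sketch, it could be split into a dictionary as shown below. The function name `parse_ensembl_comment` is hypothetical, and reading the `chromosome` field as assembly, sequence region name, start, end and strand is an assumption based on the usual layout of Ensembl FASTA headers.

```python
def parse_ensembl_comment(comment):
    """Split a space-separated 'key:value' comment line into a dict.

    The 'chromosome' value is additionally unpacked into a 'location' dict,
    assuming the order assembly:name:start:end:strand.
    """
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")  # split on the first colon only
        fields[key] = value

    assembly, name, start, end, strand = fields["chromosome"].split(":")
    fields["location"] = {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    return fields


comment = (
    "pep:known chromosome:EquCab2:5:10711172:10789781:-1 "
    "gene:ENSECAG00000018690 transcript:ENSECAT00000020457 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

if __name__ == "__main__":
    parsed = parse_ensembl_comment(comment)
    loc = parsed["location"]
    span = loc["end"] - loc["start"] + 1  # inclusive genomic span in bp
    print(parsed["gene"], loc["name"], loc["strand"], span)
```

Under that reading, the record sits on the reverse strand of chromosome 5 in the EquCab2 assembly and spans 78,610 bp.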
Sequence
MGADGETVVLKNMLIGVNLILLGSLLKPTECQLEVTTERVQRQAVVEEGGGANYNTSGKE
QPVVFNHVYNINVPLDSLCSSRLEASAEQEVSAEDEALAEYTGHTSDHESQVTFTHRINL
PKKACPCASSAQVLQELLSRIEMLEREVSLLRDQCDSNCCQGSAATGQLDYIPHCSGHGN
FSLESCGCICHEGWFGKNCSEPYCPLGCSSRGVCVDGQCICDSEYSGDDCSELRCPADCS
SRGLCVDGECVCEEAYTGEDCSELRCPGDCSGKGRCANGTCLCQEGYVGEDCGQRRCPNA
CSGRGDCQEGLCVCEEGYQGPDCSAVAPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQ
PTALGGLQLQQRVPGDWSGVTITELEPGLTYNISVYAVISNILSLPITAKVATHLSTPQG
LRFKTITETTVEVQWEPFSFSFDGWEISFIPKNNEGGVIAQLPSDVTSFNQTGLKPGEEY
IVNVVALKEQARSLPTSASVSTVIDGPTQILVRDVSDTVAFVEWTPPRAKVDFILLKYGL
VGGEGGKTTFRLQPPLSQYSVQALRPGSRYEVWVSAVRGTNESESTTTQFTTEIDAPKNL
RVGSRTATSLDLEWDNSEAEVQGYKVVYSTLAGEQYHELLVPKSIGPTSRATLTDLVPGT
EYGVGISAVMNSQQSVPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITF
TPSSGIASEVTVPKDMTSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFTGFRPISHL
HFSHVTSSSVNITWSDPSPPADRLILNYNPRDEEEEMMEVSLDATKRHAVLMGLQPATEY
IVNLVAVHGTVTSEPIVGSITTGIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRP
TQVGRLDSSVVPNTVTEFTITKLYPATEYEISLNSVRGREESERICTLVHTAMDNPVDLI
ATNITPTEALLQWKAPVGEVENYVIVLTHFAVAGETILVDGGSEEFQLVDLLPSTHYTVT
LYATNGPLTSGTISTNFSTLLDPPANLTASEVTRQSALISWQPPRADIENYVLTYRSTDG
SRKELIVDAEDTWIRLEGLSESTDYTVLLQAAQDATRSSITSTAFTTGGRIFPHPQDCAQ
HLMNGDTLSGVYTIFLNGELSQKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWAEYRVG
FGNLEDEFWLGLDNIHRITSQGRYELRVDMRDGQEAAFAYYDKFSVEDSRSLYKLRIGGY
NGTAGDSLSYHQGRPFSTEDRDNDVAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQG
INWYHWKGHEFSIPFVEMKMRPYNHRLTAGRKRRSLQF
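
The sequence above is wrapped at 60 residues per line, while the domain regions are given as residue ranges. The sketch below joins wrapped lines and extracts the residues for a region string such as `1126-1346`. It assumes 1-based inclusive coordinates; the helper names `join_wrapped` and `slice_region` are illustrative, and only the first two sequence lines are included to keep the example short.

```python
def join_wrapped(lines):
    """Concatenate wrapped sequence lines into a single string."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence, region):
    """Return the residues for a 'start-end' region (1-based, inclusive)."""
    start, end = (int(part) for part in region.split("-"))
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {region} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# First two lines of the sequence above; pass all lines to work with the
# full 1358-residue protein.
lines = [
    "MGADGETVVLKNMLIGVNLILLGSLLKPTECQLEVTTERVQRQAVVEEGGGANYNTSGKE",
    "QPVVFNHVYNINVPLDSLCSSRLEASAEQEVSAEDEALAEYTGHTSDHESQVTFTHRINL",
]

if __name__ == "__main__":
    sequence = join_wrapped(lines)
    print(len(sequence))
    print(slice_region(sequence, "1-10"))
```

Comparing the length of the joined sequence with the stated sequence length is a quick check that no line was lost when copying the record.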
Identical sequences F7A2I3
9796.ENSECAP00000016790 ENSECAP00000016790 XP_005609748.1.31192 ENSECAP00000016790
