SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSARP00000000224 from Sorex araneus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSARP00000000224
Domain Number 1 Region: 888-1093
Classification Level Classification E-value
Superfamily Fibronectin type III 2.09e-39
Family Fibronectin type III 0.0007
Further Details:      
 
Domain Number 2 Region: 648-785
Classification Level Classification E-value
Superfamily Fibronectin type III 1.97e-24
Family Fibronectin type III 0.0016
Further Details:      
 
Domain Number 3 Region: 468-557
Classification Level Classification E-value
Superfamily Fibronectin type III 1e-22
Family Fibronectin type III 0.0015
Further Details:      
 
Domain Number 4 Region: 1525-1612
Classification Level Classification E-value
Superfamily Immunoglobulin 1.82e-18
Family I set domains 0.012
Further Details:      
 
Domain Number 5 Region: 373-461
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000394
Family I set domains 0.016
Further Details:      
 
Domain Number 6 Region: 1311-1398
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000173
Family I set domains 0.02
Further Details:      
 
Domain Number 7 Region: 1096-1176
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000186
Family I set domains 0.035
Further Details:      
 
Domain Number 8 Region: 1441-1490
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000474
Family I set domains 0.028
Further Details:      
 
Domain Number 9 Region: 235-269
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000606
Family I set domains 0.029
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000000224   Gene: ENSSARG00000000239   Transcript: ENSSART00000000249
Sequence length 1631
Comment pep:novel genescaffold:COMMON_SHREW1:GeneScaffold_1184:143162:351429:-1 gene:ENSSARG00000000239 transcript:ENSSART00000000249 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSLPFYQKIYDHYDSSYRSKDLRTTMSQYQQEKKRSAIYTHGSTAYSSRSSAAHRQESAA
FSQLSAASYQQQDSLQSAVHRRAASTYDYGYSHGLTDSSMMLDYSSSLSPQTKRARKSLL
SGDEKENLPSDYKVPIFSGRQMHVSGVTDTEEERIKEAAAYIAQRNLLAREEGIMASKQS
TVSKQSLSSLYQEEAFEKKSRKAAIREKAENLSLKKTLEETGAFRRKLNEDGLLHAPEFI
IKPRSHTVWEKESVKLHCSVAGWPEPRVTXXXXXXXXXXXXXXXXXXXXXXXXXXTLEIN
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRFRATTSPPSFAVTPYGYSSK
FEIHFDDKFDVSFGREGETMSLGCRVVITPEIKQFQPEIQWYRNGVPVSPSKWVQTQWSG
DRAALTFSHLNKEDEGLYSIRVQMGDYYEQYSAYVFVRDADAEIEGAAPLDVVCLDANKD
YITVSWKQPVVDGGSPILGYFIDKCEVGTESWSQCNDTPVKFARFPVTGLIEGRSYIFRV
RAVNKTGIGLPSRVSEPVAALDPAEKARLKSRPSAPWTGQIIVTEEEPTXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCDAETENWQRVNTELPVKSPHFALFDLVE
GKSYRFRVRCSNSAGVGEPSEATEVTVVGDKLDIPKPPGKIIPSRNTDTSVVVTWEESKD
SKEVVGYYIESSVAGSGKWEPCNNNPVKGPRFTCHGLAAGQSYIFRVRAVNAAGLSECSP
DSEVIQVKAAIGGGVSPDVWPELRDTPVGLTDSGAGLHEVSQPAFKKDALLDSELNKSSL
PSSSPNLGQTEVSNVSETVQEELTPPPQTALMGNSKSEPLKQKKDLAPPSPPCDITCLES
CRDSMVLGWKQPDKTGGAEITGYYVNYREVIDGVPGMWREANIKAVSDAAYKISNLKENM
VYQFQVSAMNLAGVGQPSKVSECFKCEEWTIAVPGPPHSLKYSEVRNTSLVLLWEPPTYS
GRTPVTGYFVDMKEASAKDDQWRGLNEIVLKNKYLKVQNLKEGISYVFRVRAINQAGVGK
PSDLAGPVVAKTLPGTKEVVVTVDDNGVISLNYECDQMAPNSEFVWSKDYVSTNDSPRLE
TESKGNKTKITFKDLGTEDLGIYSCDVTDTDGIASSYLIDEEEMKRLLALSQEHKFPTVP
AKSELAVEILDKGQVRFWMQAEKLSSNAKVNFVFNDQEIFEGPXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYKQLQKEAEFQRQEWIRKQGPHFAEYLS
WEVTGECNVLLKCKVANIKKETNIIWYKDEREISVDEKHDFKDGICTLLITEFSKKDAGI
YEVILKDDRGKDKSRLKLVDEAFNELMSEVCKIIALSATDLKIQSTAEGIRLYSFVTYYL
DDLKVNWSHNGTPIKYTDRVKSGVTGEQIWLQINEPTPNDKGKYVMELFDGKTGHLKSVD
LSGQAYDEAFAEFERLRARVLGGLPDVVTIQEGKALNLTCNVWGDPTPEVAWLKNEKSLE
PDDHCSLAFENGKTAYFTIRGVSTSDSGKYGLVVKNKYGSEISDFTVSVFIPEEEARKAN
SEPQKDNKKSR
Download sequence
Identical sequences ENSSARP00000000224 ENSSARP00000000224

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]