SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for XP_020024094.1.5219 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  XP_020024094.1.5219
Domain Number 1 Region: 945-1160
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 1.31e-78
Family Fibrinogen C-terminal domain-like 0.00000284
Further Details:      
 
Domain Number 2 Region: 680-768
Classification Level Classification E-value
Superfamily Fibronectin type III 7.62e-21
Family Fibronectin type III 0.0001
Further Details:      
 
Domain Number 3 Region: 768-931
Classification Level Classification E-value
Superfamily Fibronectin type III 3.39e-20
Family Fibronectin type III 0.00000815
Further Details:      
 
Domain Number 4 Region: 382-552
Classification Level Classification E-value
Superfamily Fibronectin type III 3.84e-18
Family Fibronectin type III 0.0024
Further Details:      
 
Domain Number 5 Region: 287-377
Classification Level Classification E-value
Superfamily Fibronectin type III 1.97e-16
Family Fibronectin type III 0.0015
Further Details:      
 
Domain Number 6 Region: 582-676
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00000000000000117
Family Fibronectin type III 0.00000351
Further Details:      
 
Domain Number 7 Region: 191-275
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000000000000255
Family Fibronectin type III 0.0022
Further Details:      
 
Domain Number 8 Region: 115-189
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00000000394
Family Fibronectin type III 0.0028
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) XP_020024094.1.5219
Sequence length 1169
Comment LOW QUALITY PROTEIN: tenascin-X [Castor canadensis]; AA=GCF_001984765.1; RF=representative genome; TAX=51338; STAX=51338; NAME=Castor canadensis; AL=Scaffold; RT=Major
Sequence
MNLYGFHSGQRVGPVSAIGVTGTNSIFPMLVAVQDPQVTLELGQCHGFSFFPRPKDICWG
PCLTLSVFILAAEEETPSPTETQHRAPETSKRALLEREXPDRILPTGLLSLSELPPRATL
TPFTVQYKDRDGRPQVVPVGGEEREATVGNLEPGRKYKMHLYGIHQGQRVGPVSTMGITA
FLPTEPPVEPRLGELAVAEVTSNTVHLLWTMAQGLFDSFLVQYKDAQGQSKAVPVSRNLH
EVTILGLDPARKYKFLLFGLQNGKRHGPVSIEAKTIPETKPSPRLGELTVTGMTRDSVGL
SWTVPEGEFDSFMVQYKDRDGQPQIVPVAADQREVTVSSLEPNRKYKFLLYGLTGRKRLG
PISVDGTTAPLEKTPQPSPRLGELAVTGVTSDSLQLSWTVAQGLFDSFLVQYRDSDGQVQ
AVLMTADQREVTIEGLEPDRKYKFLLYGLFGGKRLGPVSALGVTATEEDTPAIVTPRPTE
VPRLGLLAVTEATPDSLHLSWIVAQGPFDSFVIQYLDTDGQPQALLVDGDQNRVLISGLE
PSTSYKFFLYGLHEGKRQGPISAEGTTGPAPAGQTSGDPGPRLSQLSVTDVTTSSLRLNW
EAPLGAFDSFLLRFGVPSPSTLEPHLRPLLQRELMVPGMRRSAVLRDLRPGTLYSLTLYG
LRGPHKADSIQGTARTLSPVLESPRDLQFSEIRETSAKVNWVPPPSRVDSFKVSYQLADG
GEPQSVQVDGHSHSQNLQGLIPGTRYEVTVVSVRGFEESEPLTGVLTTVPDGPTQLRALN
LTEGSALLHWKPPHTPVDKYNVRVMAPGAPPLQDSAPGSAVDYPLRDLVLHTNYTATVRG
LRGPNFTSPASITFTTGLKAPRDLEAKDVTARTALLTWTEPQAPPTGYLLSFDTAGGQTQ
EILLPAGVTSHRLLGLFPSTTYNARLQAVWGESLMPPVSTSFTTGGLRIPFPRDCGEEMQ
NGASTSRTTTIFLNGNRERPLDVFCDMETDGGGWLVFQRRMDGQTDFWRDWEEYAHGFGN
ISREFWLGNEALHSLTQSGDYSMRVDLRAGDEAVFAQYDSFRVDSAAENYRLHLEGYHGT
AGDSMSYHSGSVFSARDRDPNNLLISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSW
YHWKGFEFSVPFTEMKLRPRSYRSPAREG
Download sequence
Identical sequences XP_020024094.1.5219

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]