SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSLOCP00000019831 from Lepisosteus oculatus 76

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSLOCP00000019831
Domain Number 1 Region: 144-237
Classification Level Classification E-value
Superfamily Immunoglobulin 2.01e-22
Family I set domains 0.087
Further Details:      
 
Domain Number 2 Region: 521-712
Classification Level Classification E-value
Superfamily Fibronectin type III 2.7e-22
Family Fibronectin type III 0.0027
Further Details:      
 
Domain Number 3 Region: 230-326
Classification Level Classification E-value
Superfamily Immunoglobulin 6.33e-20
Family I set domains 0.015
Further Details:      
 
Domain Number 4 Region: 329-428
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000188
Family I set domains 0.018
Further Details:      
 
Domain Number 5 Region: 413-509
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000111
Family I set domains 0.015
Further Details:      
 
Domain Number 6 Region: 30-142
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000948
Family V set domains (antibody variable domain-like) 0.029
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSLOCP00000019831   Gene: ENSLOCG00000016092   Transcript: ENSLOCT00000019864
Sequence length 2042
Comment pep:known_by_projection chromosome:LepOcu1:LG1:22219960:22259406:-1 gene:ENSLOCG00000016092 transcript:ENSLOCT00000019864 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MRRTHLWIAGVITAAALCFLCPTLGADSVVRSRVGGSAVLGCSLSPPATDNTTPRLFPLH
VVEWVRLGYAVPVLIKFGVYSPRVHPNYRGRVSLERGASLRIDGLQLDDEGWFECRILLL
DRQTDEFQNGTWTFLSITAPPVFFKTPPPILEVLEGEPLILTCGAHGNPPPIITWRKDDT
LIENGDNAQVTNGSLSLFSVSRDKAGIYKCHVSNEEGNLTHSSQLLVKGPPVIVIPPEDT
TLNMSQDAVLRCQAEAYPSNLTYLWWKQGENVFHIDLLKTRVKVLVDGTLLIQSVTPDDA
GNYTCVPTNGLLTPPTASAYLTVKHPAQVLPMPEETYLPVGMEGVITCPVRAEPPMLFVN
WTKDGHLLNLDMFPGWMVNSEGSVFITAANDDAVGMYTCTAYNSYGTMGQSSPTKVVLQD
PPSFRVTPRAEYLQEVGRDLVIPCEAHGDPSPNITWSKVGSAPRSPFRTAQNGSLIMRPL
SKDHQGTWECSARNRVAAVSTRTAVSVLGTSPHAVSAVTLVPGTNRMNVSWEPGFDGGYS
QKFTVWVKQVSRGKHEWTSLPVPPSQSYLLVTGLLPGTGYQFSILPQNKLGSGPFSEIIT
ASTLVPKTKPPPTVKSPPLLTPPRSLSANQSSKGVVLQWLPPLAESTTVTGFILQSRREK
GEWVVLEGFIEANQSEILVQGLLKDCNYELRMLSRDGQMISEPSESVNISTAGMDMYPAR
TRLSELLPEPLVAGVIGGVCFLCVAIILSLVTACAMNRRRERRRRKRREDIPIAFQKGQS
PQAGSLADSPDSVLKLKLCPLLFSHSSSSSDHSSFEKASRSEYQDQRTQLLSNAAPPPRY
TLFESHIGGLSSPTAALESISRGPDGRFIVQPYEETSATFQVKKSLRKDFPQCIGVPGPG
GSPKSNSLCSEMDEKKELTLTVNIPRSRSPNSSPGRVKLMAKNFSKSGCFYTDEEGEELC
SEVDGEMQGEHSSFYSDNLEKRSRDSLKKYRMCIRTAQSQSREEILNLETVKKDKKAEKE
SYLPIDRERQVSQMEHEKGIDSLSKCLKLAKEREEIERELKQYTVSHRVQEREGRSDSRA
VNQSSLRAGGRDTEWETKDGETEPVWKPQEVTFRPKSKLAMGQSHKDSGFRRGCYFGNTS
SPLGQISSPSSFIHWDISPVTSVTSLVPAQSPAENTTPTPMTPMNRTAVRDSGASAGFEC
TGSPVTECTSLSLLSPTSDTFPHISMDVREAKSRYPERLERGDKEAEVKRCQSQSLLEED
WKHQQHDALAEETILAKGPTLISKAEREEQIEHYSCILGIDLETETERETYPDAFTRSPE
ATLRCPSEKGESEEDDGKPESLFTGCTLPYEHGRKRDMARENTNDKNFEKEGIRARSRKS
DKYLFSDSPSNASAIPLIENDTNSDHSDLSVSKMSGFLKPTLKPPSSPTALCNSQKSRAS
PLQTSAILEYLSLPGFIEMSVDEPGEVTEITDLLETSADWKHGDFLKAEPNLDPKDSEPH
VQKSTETDTNTFVDNHSSHGNVDPCETQKTECFSTELETRKSTSKAATSDSLNEKTDFDL
EAGRILSPNATHESLTKKDSTATKSNMQANVIAEQQEHVLEQQRHDFTQKSQVSRTNNIV
SRFCQTPTPFLKKSMSSGLNRTVPQAEKSSTFLKKSVSLGSQKWESYESPCPRNYVSERC
LRDELPAPDIRIKSYSLGRAPAPFSSRRGVFLKGPPAFRPQNKASWETLRTTETALPHRP
RYLPPGMDLQRHKQSSERQSSELQRQAVTFPEMSRWPSDYQHVPGPVQPKSSLSESFRVA
QVIPPGPGMARGDFLQPLDSRRGSARAFLPRGYSWPSPYHVSIEPKGKTAREHEGEGETD
TEMKDFRDVKEARASYASQSSGRGSVGPSSLLRQSLSLTPSLPGSPETTEESERQGAELK
CEEKTITRRNTSVDESYEWDTGDFCIESEILEALRLYSTVEGGSRERDRRRERPRSTIAL
RELQNKGLLSSVSPSDSQSRMPCGSLSEERFNALRREFQEFRQAQEAAQHDPAPPDTDTA
LL
Download sequence
Identical sequences W5NGS2
ENSLOCP00000019831

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]