SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000004659 from Ochotona princeps 76

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSOPRP00000004659
Domain Number 1 Region: 2756-2924
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.95e-21
Family Laminin G-like module 0.01
 
Domain Number 2 Region: 2581-2753
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.5e-19
Family Laminin G-like module 0.0054
 
Domain Number 3 Region: 3153-3326
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.24e-16
Family Laminin G-like module 0.0077
 
Domain Number 4 Region: 2381-2573
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000000000000968
Family Laminin G-like module 0.012
 
Domain Number 5 Region: 1682-1723
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000865
Family Laminin-type module 0.0044
 
Domain Number 6 Region: 1263-1311
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000103
Family Laminin-type module 0.0092
 
Domain Number 7 Region: 488-535
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000586
Family Laminin-type module 0.027
 
Domain Number 8 Region: 353-425
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000112
Family Laminin-type module 0.021
 
Domain Number 9 Region: 296-355
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000159
Family Laminin-type module 0.074
 
Domain Number 10 Region: 533-578
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000019
Family Laminin-type module 0.013
 
Domain Number 11 Region: 628-678
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000335
Family Laminin-type module 0.0086
 
Domain Number 12 Region: 3066-3145
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.00000463
Family Laminin G-like module 0.038
 
Domain Number 13 Region: 1734-1783
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000614
Family Laminin-type module 0.053
 
Domain Number 14 Region: 423-465
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000335
Family Laminin-type module 0.027
 
Weak hits

Sequence:  ENSOPRP00000004659
Domain Number - Region: 681-722
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000239
Family Laminin-type module 0.024
 
Domain Number - Region: 110-221
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.000851
Family APC10-like 0.073
 
Domain Number - Region: 1643-1684
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000865
Family Laminin-type module 0.064
 
Domain Number - Region: 1314-1350
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00117
Family Laminin-type module 0.024
 
Domain Number - Region: 576-630
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00614
Family Laminin-type module 0.011
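
The strong hits above appear in order of increasing superfamily E-value, not in order of position along the protein. As a minimal sketch (not part of the SUPERFAMILY server; the names STRONG_HITS, order_by_position and count_by_superfamily are illustrative assumptions), the hits can be sorted by region start to read off the N- to C-terminal order of the assigned domains. The tuples are transcribed by hand from the strong-hit listing; weak hits are left out.

```python
from collections import Counter

# Strong hits transcribed from the listing above:
# (domain number, region start, region end, superfamily)
STRONG_HITS = [
    (1, 2756, 2924, "Concanavalin A-like lectins/glucanases"),
    (2, 2581, 2753, "Concanavalin A-like lectins/glucanases"),
    (3, 3153, 3326, "Concanavalin A-like lectins/glucanases"),
    (4, 2381, 2573, "Concanavalin A-like lectins/glucanases"),
    (5, 1682, 1723, "EGF/Laminin"),
    (6, 1263, 1311, "EGF/Laminin"),
    (7, 488, 535, "EGF/Laminin"),
    (8, 353, 425, "EGF/Laminin"),
    (9, 296, 355, "EGF/Laminin"),
    (10, 533, 578, "EGF/Laminin"),
    (11, 628, 678, "EGF/Laminin"),
    (12, 3066, 3145, "Concanavalin A-like lectins/glucanases"),
    (13, 1734, 1783, "EGF/Laminin"),
    (14, 423, 465, "EGF/Laminin"),
]


def order_by_position(hits):
    """Return the hits sorted by region start (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def count_by_superfamily(hits):
    """Count how many hits were assigned to each superfamily."""
    return Counter(hit[3] for hit in hits)


# Print one line per domain in sequence order.
for number, start, end, superfamily in order_by_position(STRONG_HITS):
    print(f"{start:>5}-{end:<5} domain {number:>2}  {superfamily}")
```

Sorting on the start coordinate alone is enough here because no two strong hits share a start position; adjacent regions that overlap by a few residues (for example 296-355 and 353-425) are kept as separate domains.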
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000004659   Gene: ENSOPRG00000004960   Transcript: ENSOPRT00000005063
Sequence length 3327
Comment pep:known_by_projection genescaffold:pika:GeneScaffold_3100:585985:811802:1 gene:ENSOPRG00000004960 transcript:ENSOPRT00000005063 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MATAAGVQARAPRPPLLLLLLLVLPAWCATAWDTGATPRRSLDPPYFNLAEAARIWATAT
CGERGQGDGGGRPTPELYCKLVGGPTAQGSGHAIQGQFCDYCNYEDPRKAHPATNAIDGS
ERWWQSPPLSSGTQFNRVNLTLDLGQLFQVAYVLIKFANSPRPDLWVLERSVDFGNTYSP
WQYFAHSKRDCLEQFGKEANTGLTRDDDVLCTTDYSRIVPLENGEVVVSLINGRPGAKNF
TFSPTLREFTKATNIRLRFLRTNTLLGHLISKAQRDPTVTRRYYYSIKDISIGGRCVCNG
HAEVCNVNNSEELFRCECQHHTCGETCDRCCMGYNQKRWQPAAWEQSNECEACNCHGHAD
DCYYDSDVEQQQASLNIQGVYAGGGVCINCQHNTAGINCEKCAKGYYRPFGVPVDAPHGC
IPCSCDPERADGCEQGSGRCHCKANFHGDHCEKCADGYYNFPLCLRNPSFSTPVSNPEDP
VAGDIKGCDCDLEGVLPEICDAQGRCLCRPGVEGPRCDACRSGFFSFPICVACQCSALGS
YPTPCNAETGQCECQLGITGQRCDRCLSGAYDFPHCQGSSGACDPAGTSDSGFDYCQCKL
HVESPTCSICKPLYWNLVQENPHGCSECQCHEAGTASGIGECGQGDGDCYCKLHVTGDAC
DTCEDGYFALDKSNYFGCQGCQCDIGGALTTMCSGPSGACQCRQHVVGKACQRPAKNHYF
PDLHHMKYEIEEGTTPDGGALRFGFDPLMFPEFSWRGYTQMTSVQNEVQMMLNVDKSNLS
LFRIILRYINPGREATSGRISIYPSWGAAQSKEIIFLPSKEPTFVTIPGNGLADPFSMSP
GTWIACIKAEGVLLDYLVLLPRDYYEAPVLQLPVTDPCATAGSPHENCLLYQHLPVTRFP
CTFACEARHFLLDGEPRPLAVRQPTPAHPVMVDLSGRQVELHLRLRVPRVGPYVVVIEYV
TEGDQLVVVDVSMESPGAVVVGQVNIYSCKYSVPCRSAVTDGLGRLAVWELLEDADVQLK
AHMAQLLLHQICIIPIEEFSTEYLRPQVHCIASYGPFGNQSATCISLAHETPPTALTLDV
PGGGPLPHPPWGLSQSAGVTQGLNLKAPQNQVTLRGLVPRRGRYVLVIHFCQVKHPVFPA
QVFVHGGQPWSGSFSASFCPHVLGCRDQVIADGQVEFDLSEPEVAVTVKVPEGKSLVLVR
VLVVPAENYDYQILHKNSVDKSFEFVTHCGRNSFYIDPQMASAFCKSSARSLVAFYHNGA
LPCECHPTGAISTRCSPEGGQCPCRANVIGRQCTRCATGYYGFPHCKPCNCGQRLCEETT
GQCLCPPHTVKPQCEVCETHSFSFHPLVGCESCNCSTRGTVGAASLECDKDSGQCXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFVDMLGWRLETAEGGDVPVSFNPGSNSVV
VDLQELPATVRSAWVAPFYLGQVSSYGGFLTYQAKSFGLPGDMVLLDKRPDVQLTGQHMS
LVHKESSPPRPDRLHQGRVQVVEGNFRHAGSGAPVSREELMMVLSRLDGLHIRGLYFTET
QRLTLGEVGLEEASDSGSGRRAHHVEMCACPPDYAGDSCQGCSSGYYRDDRGGLYPGRCV
PCNCNGHSNRCQDGTGICINCQHNTAGDHCERCKEGYYGNAVVGACGVCPCPHTNSFATG
CVVNGGNVRCSCKPGYTGVRCECAPGYFGNPQKFGGSCQPCNCNSNGQLGSCDPVTGDCI
NQHNELKDSSPGEECDDCDSCVTTLLNDLAAKSEELHLVKSQLGLRVGMVLEQMRHLGTQ
TKELNQLLNYHSTISSRRSDIDGLEKELSNLNREFETLQEKAQVNSRKAQTFYASLDQTT
QRAKELDVKVQNVIRNVNILLKQISGANGEGNYLPSGDFSRELAEAQRMMRELRSRNFGN
QLREAEAEKRAAQLMLSQISNLLATHREDSNGLAKSLWDSLNEYEAKLRDLRAALQEASA
QAKQAANLNQDNEKTLEDIKRHTKEISSLRSDFTKFLSTADASLLQTNIVLQQLEESQKE
YEKVAATLNEARQELSDRVRALSRSASKTSLVVEAENHARSLQELARQLEEIKRNASGDE
LVRRAVDAATAYENILNAIKAAEDAANKATSASESALQTVIKEDLPAKAKTLSSESDQLL
KEAKITQKKLQQEISPALSNLKQTLQTVTSQREGIDTNITALRAELHGIQRGDIDEVISS
TKNTVKKTNDITNEVLDGLHPIQTDVERIKDTYGSARSEDFNKALTDADSSVKKLTRNLP
DLLSKIESINQQLLPLGNISDNVDRIRELIQQARDAANKVTVPMRFNGKSGVEVRLPTDP
GDLKGYTSLSLFLQRPDSRENGGTEDMFVMYLGNKDASRDYIGMALIDNRLTCIYNLGDR
EAELQVDQILTESETQEAVMDRVKFQRIYQFASLNYTKEATSSKPKPPQFYDIESGTSNT
LLNLDPENVVFYVGGYPSDFILPSRLRLPPYKGCIELDDLNENVLSLYNFKKTFNLNTTE
VEPCRRRKEESDKNYFEGTGYARIRTQPYSQILTFAQTIQTTVDRGVVFFAENQDRFISL
NVEDGSFMVRYRLDSGPPKEKGIQGTINNGRDHSIQIIVRKAQKQILISMHSQSIQIEGD
LFDFSTYYLGGIPIAIRERLNISTPAFRGCMKNLKKTTGVIRLNDSVGVTKKCSEDWKLV
RSASFSKGGYLSFSNLGLPLPNHYQTSFGFQTFQPSGILLNHQTRTSSLQVTLEDGHIEV
STKDSSGTVFRSPQTYVDGLLHYVSVISDSAGLRLLVDDQPLRSSQKLQGFANSQQSLRL
GGSSFEGCISNVFVQRLSPSPEVLDLTSRSNKQDVALGSCSLNQPPFLMLLKGPARLNKA
DAFDINQPLHDTPVASLKNMDVWKDARSCPPLPRVQPSHGALRFGDSPTSHLLFKLPQEL
LKAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXVVFGQDGGKVHLVVDGLRARGGSLPGNSTVSVGASVYLGAPLSGKPKSLPQNSFV
GCLKNFQLDLMPLDTPSANAGVTPCLGGSLEKGIYFSQEEGGHVLLANSVSWGPEFKIVF
SIRPRSLTGILIHIGSQPKQHLSVYMEAGQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXTIKQHILHLQLDADNSYTAGQLTFLPSQEPLYIGGIPANLQSQKLPVWKSFFGCLKNM
HINHNPVHVTEASEVRGPVSLNGCPDH
Identical sequences ENSOPRP00000004659 ENSOPRP00000004659
