SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000012161 from Ochotona princeps 69

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSOPRP00000012161
Domain Number 1 Region: 5-135
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.32e-17
Family Clostridium neurotoxins, the second last domain 0.036
Further Details:      
 
Domain Number 2 Region: 227-346,411-478
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 0.00000000000000794
Family Reprolysin-like 0.071
Further Details:      
 
Domain Number 3 Region: 358-519
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 0.00000000000208
Family Reprolysin-like 0.082
Further Details:      
 
Domain Number 4 Region: 1276-1343
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.000000000153
Family Complement control module/SCR domain 0.0055
Further Details:      
 
Domain Number 5 Region: 1207-1279
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000000278
Family Complement control module/SCR domain 0.0031
Further Details:      
 
Domain Number 6 Region: 1145-1211
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.0000000195
Family Complement control module/SCR domain 0.0044
Further Details:      
 
Domain Number 7 Region: 1076-1149
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000361
Family Complement control module/SCR domain 0.0037
Further Details:      
 
Domain Number 8 Region: 530-566,747-800
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000193
Family Fibronectin type III 0.0071
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000012161   Gene: ENSOPRG00000013296   Transcript: ENSOPRT00000013325
Sequence length 1488
Comment pep:novel genescaffold:pika:GeneScaffold_4519:157486:362703:1 gene:ENSOPRG00000013296 transcript:ENSOPRT00000013325 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LYDKCSYTSRDRGWVVGIHTVSDQGNRDPRYFFSLKTDRARKATTINAHRSYFPGQWVFL
AATYNGRLMKLYLNGAQVATSGEQVGGIFSPLTQKCKVLMLGGSTLNHNYRGYVEHFSLW
KVARTQREILSDMESHGFHTPLPQLLLQENWENVKRAWSPMKEGNSPHVELSNAHNFLLD
TSLEPPLCGQTLCDNAEVIANYNQLPRFRLPKVVRYRVVNLYDDDYENPTVSQQQVDFQH
QQLIEAFKQYNISWELKVLEVRNSSLRHRLILANCDISKIGDENCDPECNHTLTGHDGGD
CRHLRPPAFMKKQQNGVCDMDCNYEHFNFDGGECCDPEITDVTKTCFDPDSPHRAYLDVN
ELKNILRLDGSTHLNIFFANSSEEELAGVATWPWDKEALMHLGGIVLNPAFYGIPGHTNT
MIHEIGHSLGLYHTFRGISEIQSCSDPCMETEPSFETGDLCNDTNPAPKHKFCGDPMPGN
DTCGFHSFFNTPYNNFMSYADDDCTDSFTPNQVARMHCYLDLVYQGWQPSSKPAPVALAP
QIVSHTTDSVTLEWFPPIDGHSFERELGSACDLCLEGRILMQYAFNASSPMPCGPSGHWS
PREAEGHPDVEQPCKSSVRTWSPNSAVNPHTVPPACPEPQGCYLELKFLYPLVPESLTIW
VTFVSTDWDSSGAVNDIKLLTVSKKNISLGPQNIFCDIPLTIKLQDVDEEVYGIQIYTLD
EHLEIDAAMLTSVAGCPLCLDCKPLRYKVVRDPPIQEDKASIVHFNRRFTDRDLKHNSVY
QYRIIAISGTEESEPSPAAIYIHGSGYCGDGIIQIDQGEECDDMNKINGDGCSLFCQQEV
SFNCIDQPSRCYFHDGDGVCEEFEQKTSIKDCGVYTPQGFLDQWASNVSVSHQDQQCPGW
VIIGQPAASQVCRTKVIDLSEGISQHAWYPCTISYPYSQLAQTTFWLRAYFSQPMVAAAV
IVHLVTDGTYYGDQKQEIISVQLLDTKDQSHDLXXXXXXXXXXXXXXXXXXXXXXXXXXX
QAVRVCFSSPLVVISGVTLRSFYNSVHVTLSSCQRGETYSPAEQSCVHFMCEATDCPELA
VENASLNCSTNDRYHGAQCTVSCQMGYVLQIQRDDELIKNQVGPSVTVTCTEGKWNKQVA
CEPVDCGLPDHHHVYAASFSCLDGTTFGRRCSFQCRHPAQLKGNNSHLTCMEDGLWSFPE
ALCELMCLAPPPIPNAELQTARCRESKHKVGSFCKYKCKPGYHVPGSSRKSKKRAFKTQC
TQDGSWQVGACVPVTCDPPPPKFHGLYQCTNGFQFNSECRIKCEDSDTSQGHGSNVIHCR
KDGTWSGSFHLCQEMQGQCSAPNQINSNLKLQCSDGYAIGSECAISCLDHNSESVILPIN
VSVRDIPHWLNPTRVERVVCTAGLKWYPHPALIHCVKGCEPFMGDNYCDAINNRAFCNYD
GGDCCASTVKTKKVTPFPMSCDLQGDCACRDPKAQEHSQKDLRGYSHG
Download sequence
Identical sequences ENSOPRP00000012161 ENSOPRP00000012161

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]