SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000010557 from Ochotona princeps 76

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSOPRP00000010557
Domain Number 1 Region: 1499-1725
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.18e-42
Family Laminin G-like module 0.0014
Further Details:      
 
Domain Number 2 Region: 542-654
Classification Level Classification E-value
Superfamily Cadherin-like 3.85e-29
Family Cadherin 0.00056
Further Details:      
 
Domain Number 3 Region: 647-751
Classification Level Classification E-value
Superfamily Cadherin-like 5.14e-29
Family Cadherin 0.00059
Further Details:      
 
Domain Number 4 Region: 957-1063
Classification Level Classification E-value
Superfamily Cadherin-like 9.42e-29
Family Cadherin 0.00062
Further Details:      
 
Domain Number 5 Region: 432-540
Classification Level Classification E-value
Superfamily Cadherin-like 5.28e-26
Family Cadherin 0.001
Further Details:      
 
Domain Number 6 Region: 327-430
Classification Level Classification E-value
Superfamily Cadherin-like 2.14e-25
Family Cadherin 0.00085
Further Details:      
 
Domain Number 7 Region: 1742-1953
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.3e-25
Family Laminin G-like module 0.0072
Further Details:      
 
Domain Number 8 Region: 855-957
Classification Level Classification E-value
Superfamily Cadherin-like 1.3e-24
Family Cadherin 0.0019
Further Details:      
 
Domain Number 9 Region: 1058-1175
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-24
Family Cadherin 0.0011
Further Details:      
 
Domain Number 10 Region: 752-853
Classification Level Classification E-value
Superfamily Cadherin-like 1.17e-18
Family Cadherin 0.0014
Further Details:      
 
Domain Number 11 Region: 3851-3881,3958-4037
Classification Level Classification E-value
Superfamily SpoIIaa-like 0.00000000000275
Family Anti-sigma factor antagonist SpoIIaa 0.013
Further Details:      
 
Domain Number 12 Region: 1438-1478
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000164
Family EGF-type module 0.0082
Further Details:      
 
Domain Number 13 Region: 1166-1268
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000457
Family Cadherin 0.01
Further Details:      
 
Domain Number 14 Region: 1986-2032
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000513
Family EGF-type module 0.02
Further Details:      
 
Domain Number 15 Region: 1950-1984
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000264
Family EGF-type module 0.02
Further Details:      
 
Weak hits

Sequence:  ENSOPRP00000010557
Domain Number - Region: 2080-2119
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000109
Family Laminin-type module 0.0084
Further Details:      
 
Domain Number - Region: 2108-2182
Classification Level Classification E-value
Superfamily Hormone receptor domain 0.00366
Family Hormone receptor domain 0.0047
Further Details:      
 
Domain Number - Region: 1417-1443
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00719
Family EGF-type module 0.071
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000010557   Gene: ENSOPRG00000011490   Transcript: ENSOPRT00000011572
Sequence length 4071
Comment pep:known_by_projection genescaffold:pika:GeneScaffold_1323:102764:145272:-1 gene:ENSOPRG00000011490 transcript:ENSOPRT00000011572 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
EGAQGYQRPKPESRTARTSAWEGIKYLQVKVGRRCDNTLEXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXLSSRARETGQEPGSLLCRPEVSSCRTGPLRRGGVSPEALFLEVPGPGNSLPL
PSDFLVQHGPKPAFSQQNSARSAERVGTTRCCGELWVPVRRGQRERAAPSPAERTDLWPE
FLPWAAGSGPALDSAPRTARTAPLVGSAPPKPRTAPEALPRRMRSRGLLRRHFLQQRPGP
RTPGVPVEAWRTPLGSRARPRRATNRHPQFPQYNYQAQVPENEAAGTAVLRVAAQDPDAG
EAGRLVYSLAALMNSRSLELFSIDPQSGLIRTAAALDRESMERHYLRVTAQDHGSPRLSA
TTMVAVTVVDRNDHTPVFEQTQYRETLRENVEEGYPILQLRATDGDAPPNANLRYRFVGP
PATRTTAAFEIDPRSGLISTSGRVDREHMESYELVVEASDQGQEPGPRSATVRVHITVLD
ENDNAPQFSEKRYVAQVREDVRPHTVVLRVTATDRDKDANGLVHYNIISGNSRGHFAIDS
LTGEIQVVAPLDFEAEREHALRIRAQDAGRPPLSNNTGLVNIQVVDINDHAPIFVSTPFQ
VSVLENAPLGHSVIHIQAVDADHGENARLEYSLTGVAADTPFVINSATGWVSVSGPLDRE
SVEHYFFGVEARDHGSPPLSASASVTVTVLDVNDNRPEFTVKEHHLRLNEDAAVGTSVLS
VTAVDRDANSAISYQITGGNTRNRFAISTQGGVGLVTLALPLDYKQERYFKLVLTASDRA
LHDHCSVHINITDANTHRPVFQSAHYSVSMNEDRPVGSTVVVISASDDDVGENARITYFL
EDNLPQFRIDATSGAITLQAPLDYEDQVTYTLAITARDNGIPQKADTTYVEVMVNDVNDN
APQFVTSHYTGLVSEDAPPFTSVLQISATDRDAHANGRVQYTFQNGEDGDGDFTIEPTSG
IIRTVRRLDREAVSVYELTAYAVDRGVPPLRTPVSIHVTVQDVNDNAPVFPAEEFEVQVK
ENSIVGSVVAQITAVDPDEGPNAHIMYQIVEGNIPELFQMDIFSGELTALIDLDYEARQE
YVIVVQATSAPLVSRATVHVRLVDQNDNSPVLNNFQILFNNYVSNRSDTFPSGVIGRIPA
YDPDVSDHLFYSFERGNELQLLVVNQTSGELRLSRKLDNNRPLVASMLVTVTDGLHSVTA
QCVLRVVIITEELLANSLTVRLENMWQERFLSPLLGHFLEGVAAVLATPAEDVFIFNIQN
DTDVGGTVLNVSFSALAPRGAGAGAAGPWFSSEELQEQLYVRRAALAARSLLDVLPFDDN
VCLREPCENYMKCVSVLRFDSSAPFLASASTLFRPIQPIAGLRCRCPPGFTGDFCETELD
LCYSNPCRNGGACARREGGYTCVCRPRFTGEDCELDTEAGRCVPGVCRNGGTCTDAPHGG
FRCQCPVGGAFEGPRCEVAARSFPPSSFVMFRGLRQRFHLTLSLSFATVQPSGLLFYNGR
LNEKHDFLALELVAGQVRLTYSTGESNTVVSPTVPGGLSDGQWHTVHLRYYNKPRTDALG
SAQGPSKDKVAVLSVDDCDVAVXXXXXXXXXXXXXXXXXXXXXXXXSLDLTGPLLLGGVP
NLPENFPVSHKDFIGCMRDLHIDGRQVDMAAFVANNGTTAGCQAKLHFCDSSPCKNSGFC
SERWGGFSCDCPVGFGGKDCRLTMAHPHHFRGNGTLSWDFGGDVVVSVPWYLGLAFRTRA
TQGVLLSVQAGQHSTLLCQLDRGLLSVTVSQGSGHAAHLLLDQVTVSDGRWHDLRLELQE
EPGGRRGHHVLMISLDFSLFQDTLAVGSELQGLKVKQLHVGGLPPSSKEALPQGLVGCIQ
GVWLGTTPSGSPALLAPSHRVNVEPGCVVTNSCASGPCPAHADCQDLWQTFSCTCQPGYY
GPGCVDACLLNPCQNQGSCRHLPGAPHGYTCDCPSGYFGHRCEHRMDQQCPRGWWGSPTC
GPCNCDVHKGFDPNCNKTNGQCHCKDFYYRPRGSDSCLPCDCYPVGSTSRSCAPHSGQCP
CRPGALGRQCNSCDSPFAEVTASGCRVLYDACPKSLRSGVWWPQTKFGALATVPCPRGAL
GAAVRLCDEDRGWLEPDLFNCTSPAFRELSLLXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSSN
RERRYRYSNFPGQDVDPHHVLPTGSSAENTTAPSVGSPPVPQEPETEPGMSIVILLVYRT
LGGLLPAQFQAERRGARLPENPVMNSPVVSVAVFHGRNFLSGVLESPISLEFRLLQTANR
SKAICVQWDPPGPAEQHGLWTARDCELVHRNGSHARCRCSRTGTFGVLMDASPRERLEGD
LELLAVFTHVVVAVSVAALVLTAAVLLSLRSLKSNVRGIHANVAAALGVAELLFLLGVHR
THNQVQGQGQGTSVLMTTTLSQEGPGTGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPDFCWISMHDPLIWSFGPVVL
VIVMNGTLLLLTARTSCSTGQRDAKKSSVLXXXXXXXXXXXLVSASWLFGLLAVNHSILA
FHYLHAALSGLQGLAVLLLFCILNSDARAAWTPACLGRKAVPEEARPAPGTGPGAYNNTA
LFEESGLIRITLGASTVSSVSSARSGRTQDQDSQRGRGYLRDNVLVRHGSAADHTEHGVQ
AHAGPTDLDVAMFHRDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXERDTEALLPAAQAMELRTRDYCVERPLLNQEQLEELGRWGPAPRTHR
WRTWLQCSRARAHALLLQHLPILAWLPRYPVRDWLLGDLLSGLSVAIMQLPQGAYALLAG
LPPVFGLYSSFYPVIYFLGTSRHISGSLGIPGLVDAGTFAVMSVMVGSVIESLAPDEDLL
AVSNDFTDNETARDAVRVQLASTLGVLVGLFQVGLGLVRFGFVVTYLSEPLVRSYTTAAS
VQVFISQLKYVFGLQLRSRSGPLSLIYTVLEFFWKLPQTVVGTLVTALVAGLALVAIKLL
SEKLRRYLPLPIPGELLTLIGATAISYGVGLEQQFGVDIVGNIPAGLVPPVAPNPQLFAQ
LVGNAFAIAVVGFAIAISLGKIFALRHGYRVDSNQELVALGLSNLVGGLFQCFPVSCSMS
RSLVQESTGGNTQVAGAISSLFLLLIILKLGELFRDLPKAVLAAVIIVNLKGMLLQFKDI
PTLWKTNRIDLLIWLVTFVATILLNLDLGLAVAIVFSLLLMVVRTQLPHYSILGRVPDTD
IYRDVAEYSEAREVPGVKVFRSSATLFFANAEFYGDALKQRCGVDVDYLLSQKKKLLKKR
EMQLKRLKKSKTQPQTQAGLPEWVCPQVKPGDYLDGVAASSQEDAKAPSGSMLRMLGLPQ
PDFHSLVLDLGTLSFVDTVCLKSLKNIFREFREIEVEVYIAACHSPVVAQLEAGNFFDTS
ITKRHVFASVHDAVTFALQHPRQEPGSSALAPHPSELLLCNSLSGTLDQVQ
Download sequence
Identical sequences ENSOPRP00000010557 ENSOPRP00000010557

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]