SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000014231 from Equus caballus 76_2

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSECAP00000014231

Domain  Region     Level        Classification                            E-value
1       1495-1721  Superfamily  Concanavalin A-like lectins/glucanases    2.02e-42
                   Family       Laminin G-like module                     0.0014
2       643-748    Superfamily  Cadherin-like                             6.28e-29
                   Family       Cadherin                                  0.00073
3       538-650    Superfamily  Cadherin-like                             1.3e-28
                   Family       Cadherin                                  0.00045
4       954-1066   Superfamily  Cadherin-like                             3.43e-28
                   Family       Cadherin                                  0.00071
5       321-424    Superfamily  Cadherin-like                             1.1e-25
                   Family       Cadherin                                  0.00096
6       1738-1948  Superfamily  Concanavalin A-like lectins/glucanases    2.37e-25
                   Family       Laminin G-like module                     0.0075
7       851-953    Superfamily  Cadherin-like                             7.71e-25
                   Family       Cadherin                                  0.0014
8       426-536    Superfamily  Cadherin-like                             1.71e-24
                   Family       Cadherin                                  0.00091
9       1054-1171  Superfamily  Cadherin-like                             2.28e-24
                   Family       Cadherin                                  0.0011
10      749-849    Superfamily  Cadherin-like                             6.28e-20
                   Family       Cadherin                                  0.00086
11      1434-1474  Superfamily  EGF/Laminin                               0.000000000209
                   Family       EGF-type module                           0.0082
12      1162-1264  Superfamily  Cadherin-like                             0.000000000628
                   Family       Cadherin                                  0.01
13      1982-2028  Superfamily  EGF/Laminin                               0.000000312
                   Family       EGF-type module                           0.014
14      1946-1980  Superfamily  EGF/Laminin                               0.000018
                   Family       EGF-type module                           0.023
15      2544-2790  Superfamily  Family A G protein-coupled receptor-like  0.0000284
                   Family       Rhodopsin-like                            0.019
16      2104-2183  Superfamily  Hormone receptor domain                   0.0000471
                   Family       Hormone receptor domain                   0.0044
17      2076-2115  Superfamily  EGF/Laminin                               0.0000837
                   Family       Laminin-type module                       0.01
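
The strong hits above are listed by superfamily E-value, not by position in the protein. As a minimal sketch (not part of the SUPERFAMILY output), the Python below reorders them by start residue and collapses consecutive hits of the same superfamily into a linear architecture. The names STRONG_HITS, order_along_sequence and collapse_architecture are hypothetical; the tuples are copied from the strong hits listed above.

```python
from itertools import groupby

# (domain number, start, end, superfamily), copied from the strong hits above.
STRONG_HITS = [
    (1, 1495, 1721, "Concanavalin A-like lectins/glucanases"),
    (2, 643, 748, "Cadherin-like"),
    (3, 538, 650, "Cadherin-like"),
    (4, 954, 1066, "Cadherin-like"),
    (5, 321, 424, "Cadherin-like"),
    (6, 1738, 1948, "Concanavalin A-like lectins/glucanases"),
    (7, 851, 953, "Cadherin-like"),
    (8, 426, 536, "Cadherin-like"),
    (9, 1054, 1171, "Cadherin-like"),
    (10, 749, 849, "Cadherin-like"),
    (11, 1434, 1474, "EGF/Laminin"),
    (12, 1162, 1264, "Cadherin-like"),
    (13, 1982, 2028, "EGF/Laminin"),
    (14, 1946, 1980, "EGF/Laminin"),
    (15, 2544, 2790, "Family A G protein-coupled receptor-like"),
    (16, 2104, 2183, "Hormone receptor domain"),
    (17, 2076, 2115, "EGF/Laminin"),
]


def order_along_sequence(hits):
    """Return the hits sorted by start residue (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def collapse_architecture(hits):
    """Collapse consecutive same-superfamily hits into (name, count) pairs."""
    ordered = order_along_sequence(hits)
    return [
        (name, len(list(group)))
        for name, group in groupby(ordered, key=lambda hit: hit[3])
    ]


if __name__ == "__main__":
    for name, count in collapse_architecture(STRONG_HITS):
        print(f"{name} x{count}")
```

Sorting by start residue alone is a simplification: a few assigned regions overlap by some residues (for example 538-650 and 643-748), which this sketch does not try to resolve.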
 
Weak hits

Sequence: ENSECAP00000014231

Domain  Region     Level        Classification                            E-value
-       1413-1439  Superfamily  EGF/Laminin                               0.00779
                   Family       EGF-type module                           0.071
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s): Protein: ENSECAP00000014231; Gene: ENSECAG00000016006; Transcript: ENSECAT00000017509
Sequence length: 3306
Comment: pep:known_by_projection chromosome:EquCab2:16:38313112:38337325:1 gene:ENSECAG00000016006 transcript:ENSECAT00000017509 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
MARQPLWWSLRGPTTPLLLLLLLSLFPLSREELEDGGGQGWDPRIVAATGRGAQIGGGAL
ALCPETPEIQEDGEPGLGIREPVFVGLRGGRQSTQSGRGPPEQPDPGLRAEYGVQALGSH
GRETGQGTGSLLCWRPEISSCGKTGPLRRDSLSLEALSPGVPGPEDSSPFPSDLLVPPRG
SKSVSSQRNAGRSALQKVGTIRCCGELWASGRRSQGEKRATSREERIVPRTDCPPGAAGS
DPELVSAPLTARTAPASSSAPRKSRTAPEPAPERMRSRGLFRRRFLPQRPGPRPPGVPAE
PGAWKTLPASRVRPRRAANRHPQFPQYNYQALVPENEAAGTVVLRVVAQDPDTGEAGRLV
YSLAALMNSRSLELFSIDPQSGLIRTEAALDRESMERHYLRVTAQDHGSPRLSATTMVAV
TVADRNDHAPVFEQAQYRETLRENVEEGYPILQLRATDGDATPNANLRYRFVGSPAARAA
AAAAFEIDPRSGLISTSGRVDREHMESYELVVEASDQGQEPGPRSATVRVHITVLDENDN
APQFSEKRYVAQVREDVRPHTVVLRVTATDRDKDANGLVHYNIISGNSRGHFAIDSLTGE
IQVVAPLDFEAEREYALRIRAQDAGRPPLSNNTGLASIQVVDINDHTPIFVSTPFQVSVL
ENAPLGHSVIHIQAVDADHGENARLEYSLTGVAPDTPFVINSATGWVSVSGPLDRESVEH
YFFGVEARDHGSPPLSASASVTVTVLDVNDNRPEFTMKEYHLRLNEDAAVGTSVVSVTAV
DRDANSAISYQITGGNTRNRFAISTQGGVGLVTLALPLDYKQERYFKLVLTASDRALHDH
CYVHINITDANTHRPVFQSAHYSVSVNEDWPVGSTVVVISASDDDVGENARITYLLEDNL
PQFRIDADSGAITLQAPLDYEDQVTYTLAITARDNGIPQKADTTYVEVMVNDVNDNAPQF
VASHYTGLVSEDAPPFTSVLQISATDRDAHANGRVQYTFQNGEDGDGDFTIEPTSGIVRT
VRRLDREAVPVYELTAYAVDRGVPPLRTPVSIQVTVQDVNDNAPVFPAEEFEVRVKENSI
VGSVVAQITAVDPDEGPNAHIMYQIVEGNIPELFQMDIFSGELTALIDLDYEARQEYVIV
VQATSAPLVSRATVHVCLVDQNDNSPVLNNFQILFNNYVSNRSDTFPSGIIGRIPAYDPD
VSDHLFYSFERGNELQLLVVNQTSGELRLSRKLDNNRPLVASMLVTVTDGLHSVTAQCVL
RVVIITEELLANSLTVRLENMWQERFLSPLLGHFLEGVAAVLATPAEDVFIFNIQNDTDV
GGTVLNVSFSALAPRGAGTGAAGPWFSSEELQEQLYVRRAALAARSLLDVLPFDDNVCLR
EPCENYMKCVSVLRFDSSAPFLASASTLFRPIQPIAGLRCRCPPGFTGDFCETELDLCYS
NPCRNGGACARREGGYTCVCRPRFTGEDCELDTEAGRCVPGVCRNGGTCTDGPDGGFHCQ
CPAGGAFEGPRCEVAARSFPPSSFVMFRGLRQRFHLTLSISFATVQPSGLLFYNGRLNEK
HDFLALELVAGQVRLTYSTGESNTVVSPTVPGGLSDGQWHTVHLRYYNKPRTDALGGAQG
PSKDKVAVLSVDDCNVAVALQFGAEIGNYSCAAAGMQTSSKKSLDLTGPLLLGGVPNLPE
NFPVSHKDFVGCMRDLHIDGRRVDMAAFVANNGTTAGCQAKLHFCDSGPCKNSGFCSERW
GGFSCDCPVGFGGRDCRLTMAYPHRFHGNGTLSWDFGNDMAVSVPWYLGLAFRTRATKGV
LMQVQAGPHSTLLCQLDRGLLSVTMTRGSGRAAHLLLDQVAVSDGRWHDLRLELQEEPGG
RRGHHVLMVSLDFSLFQDTMAVGSELQGLKVKRLHVGGLPPSSEEESPQGLVGCIQGVWL
GSTPLGSPALLAPSHRVNVEPGCVVTNACASGPCPPHADCRDLWQTFSCTCWPGYYGPGC
VDACLLNPCQNQGSCRHLPGAPHGYICDCVGGYFGHHCEHRMDQQCPRGWWGSPTCGPCN
CDVHKGFDPNCNKTNGQCHCKEFHYRPRDSDSCLPCDCYPVGSTSRSCAPHSGQCPCRPG
ALGRQCNSCDSPFAEVTASGCRVLYDACPKSLRSGVWWPQTKFGVLASVPCPRGALGLRG
AGAAVRLCDEDQGWLEPDLFNCTSPAFRELSLLLDGLELNKTALDTVEAKKLAQRLREVT
SHTDHYFSQDIRVTARLLAHLLTFESHQQGFGLTATQDAHFNENLLWAGSALLAPETGDL
WAALGQRAPGGSPGSAGLVQHLEEYAATLARNMELTYLNPVGLVTPNIMLSIDRMEHPSP
TRGTRRYPRYHSNLFRGQDAWDPHTHVLLPSQAPRPSPSEVLSTSSSSIENSTTSSVAPP
PAPPETEPEPGISIVILLVYRTLGGLLPAQFQAERRGARLPQNPVMNSPVVSVAVFHGRN
FLRGVLESPISLEFRLLQTANRSKAICVQWDPPGPADQHGMWTARDCELVHRNGSHARCR
CSRTGTFGVLMDASPRERLEGDLELLAVFTHVVVAVSVAALLLTAAILLSLRSLKSNMRG
IHANVAAALGVAELLFLLGIHRTQNQLVCTAVAILLHYFFLSTFAWLLVQGLHLYRMQVE
PRNVDRGAMRFYHALGWGVPAVLLGLTVGLDPEGYGNPDFCWISVYEPLIWSFAGPVVLV
VVMNGTMFLLAARTSCSTGQREAKKSSVLTLRSSFLLLLLISASWLFGLLAINHSILAFY
YLHAGLCGLQGLVVLLLFCVLNADARAAWTPACLGRKAVPEEARPAPGTGPGAYNNTALF
EESGLIRITLGASTVSSVSSARSGRTQDQDSQPGRSYLRDNVLVRHGSAADHTDHSLQAH
AGPTDLDVAMFHRDAGADSDSDSDLSLEEERSLSIPSSESEDNGRTRGRFQRPLRRAAQS
ERLLTHPKDVDGNDLLSYWPALGECEAAPCALQTWGSERRLGLDTSKDAANNNQPDLALT
SGDETSLGRAQHQRKGMIPDLGLHLRDPGDCPFQSPNPKLLHRHPESSSCSNLGSFCSST
LLHDPWRLPLLYIPRLREFLPPPYTPCPLFVCLTGILKNRLQYPLMPQTRGPPELSWCRA
ATLGHCAVPAASYDHGNALDFRGTCEWLSTLPLPHSAQDLDHSPTSAPVSQRQLSRDPLL
PSRPLDSLSRRSNSGERLDHGPSRHPSREGLGPPQLLRVREDPASGPSHGPSTEQLDILS
SILASFNSSALSSSVQSSSTPSGPHTTATPSATASALGPSTPRSATSHSISELSPDSEVP
RSEGRS
Identical sequences: F6X224, 9796.ENSECAP00000014231, ENSECAP00000014231
