SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSLACP00000020143 from Latimeria chalumnae 76_1

Domain assignment details

Strong hits

Sequence: ENSLACP00000020143

Domain  Region     Superfamily                             Superfamily E-value  Family                   Family E-value
1       1174-1400  Concanavalin A-like lectins/glucanases  5.98e-41             Laminin G-like module    0.0011
2       221-333    Cadherin-like                           1.71e-29             Cadherin                 0.00063
3       326-430    Cadherin-like                           5.23e-29             Cadherin                 0.00082
4       637-749    Cadherin-like                           1.71e-28             Cadherin                 0.00063
5       9-112      Cadherin-like                           1.07e-26             Cadherin                 0.0017
6       114-227    Cadherin-like                           2.22e-26             Cadherin                 0.0011
7       534-636    Cadherin-like                           3.01e-26             Cadherin                 0.0023
8       737-854    Cadherin-like                           5.85e-25             Cadherin                 0.0017
9       1422-1618  Concanavalin A-like lectins/glucanases  2.37e-24             Laminin G-like module    0.011
10      431-532    Cadherin-like                           7.28e-22             Cadherin                 0.0013
11      1113-1152  EGF/Laminin                             0.0000000000712      EGF-type module          0.0072
12      845-947    Cadherin-like                           0.000000000514       Cadherin                 0.011
13      1751-1790  EGF/Laminin                             0.000000368          Laminin-type module      0.0094
14      1622-1655  EGF/Laminin                             0.0000028            EGF-type module          0.0088
15      1801-1853  Hormone receptor domain                 0.0000314            Hormone receptor domain  0.0078
16      1662-1698  EGF/Laminin                             0.0000469            EGF-type module          0.032
 
Weak hits

Sequence: ENSLACP00000020143

Domain  Region     Superfamily  Superfamily E-value  Family           Family E-value
-       1093-1117  EGF/Laminin  0.00352              EGF-type module  0.048
-       1157-1185  EGF/Laminin  0.0402               EGF-type module  0.022
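
The strong hits are numbered by E-value rank, not by position, so the order of domains along the protein is not obvious from the listing. The following is a minimal sketch, not part of the SUPERFAMILY server, that sorts the strong hits by start coordinate and collapses consecutive repeats of the same superfamily. The names `STRONG_HITS`, `architecture`, and `collapse` are hypothetical and chosen for illustration; the coordinates are copied from the strong hits above.

```python
from itertools import groupby

# (domain number, start, end, superfamily), copied from the strong hits listing.
STRONG_HITS = [
    (1, 1174, 1400, "Concanavalin A-like lectins/glucanases"),
    (2, 221, 333, "Cadherin-like"),
    (3, 326, 430, "Cadherin-like"),
    (4, 637, 749, "Cadherin-like"),
    (5, 9, 112, "Cadherin-like"),
    (6, 114, 227, "Cadherin-like"),
    (7, 534, 636, "Cadherin-like"),
    (8, 737, 854, "Cadherin-like"),
    (9, 1422, 1618, "Concanavalin A-like lectins/glucanases"),
    (10, 431, 532, "Cadherin-like"),
    (11, 1113, 1152, "EGF/Laminin"),
    (12, 845, 947, "Cadherin-like"),
    (13, 1751, 1790, "EGF/Laminin"),
    (14, 1622, 1655, "EGF/Laminin"),
    (15, 1801, 1853, "Hormone receptor domain"),
    (16, 1662, 1698, "EGF/Laminin"),
]


def architecture(hits):
    """Return the hits sorted by start coordinate (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def collapse(hits):
    """Collapse consecutive hits from the same superfamily into (name, count)."""
    ordered = architecture(hits)
    return [
        (name, len(list(run)))
        for name, run in groupby(ordered, key=lambda hit: hit[3])
    ]


if __name__ == "__main__":
    for name, count in collapse(STRONG_HITS):
        print(f"{name} x{count}")
```

Read this way, the strong hits form a run of nine Cadherin-like domains, followed by EGF/Laminin and Concanavalin A-like domains and a single Hormone receptor domain. The weak hits are left out of the sketch.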
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSLACP00000020143, Gene: ENSLACG00000017702, Transcript: ENSLACT00000020281
Sequence length: 2475
Comment: pep:known_by_projection scaffold:LatCha1:JH126690.1:2307560:2405686:1 gene:ENSLACG00000017702 transcript:ENSLACT00000020281 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
RTRRAANRHPQFSQYNYQVQVAENNPPGTPVITMSAQDPDPGEAGRLSYSIAALMNSRSM
DLFTINPQNGLITTNKVLDRESMDLHYFRVTAADHGSPRLSATTMLAITVSDRNDHGPVF
EQAEYRETIRENVEEGYPILQLRATDSDSPSNANIRYRFIEQTAHSVFEIDPRSGLITTS
GQVDREKMEKYSLTVEANDQGKDPGPKSATVKVHISVLDENDNVPQFSEKRYIIQVREDI
RPHSEILRVTATDSDKDNNALIHYNIISGNSRGQFSIDSVTGEIQVVAPLDFEVEREYAL
RVRAQDAGRPPLSNNTGMVSIQVVDVNDNAPIFVSTPFQVTVLENAPLGHSVIHIQAVDA
DSGENSRMEYKLTDTSPDTPFVINSASGWITVSAPLDRESVEHYFFGVEARDHGSLSLSA
SASVTITVLDVNDNRPEFTQKEYFIRLNEDAVVGTSVLSVTAVDRDVNSAVSYQITAGNT
RNRFAISTQSGVGLITLSLPLDYKQERRYALTVTASDRILHDTCHVYINITDANTHRPVF
QSAHYSVSVNEDRPIGSTVVLISASDDDVGENARITYYLEDNLPQFRIDPDSGAITLQAE
LDYEDQMTYTLAITAKDNGIPQKSDTTYVEINVNDVNDNAPQFINAHYQGTVYEDASPFT
SVLQISATDRDSHLNGRVQYTFQNGEDGDGDFTIEPTSGIVRTVRKLDRESVPVYELTAY
AVDRGVPPQRTPVRIHVSVQDVNDNAPIFPADDFEVLVKENSIVGSVVAQVTASDPDEGT
NAQIMYQIVEGNIPEIFQMDIFSGELTALIDLDYETKAEYVIVVQATSAPLVSRATVHIK
LIDQNDNSPVLKNFQIILNNYISTKSNTFPSGVIGKIPAYDPDVSDRLYYTFERGNELNL
MILNQTSGELRLSRKLDNNRPLVASMLVTVTDGIHSVTAQCVLRVIVITEDMLTNSITVR
LENVSQERFLSPLLGRFLDGVATVLSTPKEDIFIFNIQNDTDVSGNILNVSFSALLPGGG
KGQFFTSEDLQEQVYLNRVLLASVAMLEVLPFDDNVCLREPCENYMKCISVLKFDSSAPF
IASESILFRPIHPITGLRCRCPQGFTGDYCETEINLCYSNPCQNGGVCSRREGGYTCVCR
EAFTGDHCEVDRRSGHCIPGVCRNGGTCTNLAEGGFRCDCPLGGFERPYCEVTSRSFPPR
SFVMFRGLRQRFHMTISLSFTTMERNGLLFYNGRFNEKHDFIAVEILDGQVQLKYSTGES
TTQVSPLLRGGVSDGQWHTVRVHYYNKPKIGSTGVAQGPSREKVAILTVDDCDTSIALRF
GNEIGNHTCAAEGVQSSSKKSLDLTGPLLLGGVPNLPENFPVINREFIGCMKDLHVDDKR
IDMAAYIANNGTSAGCSAKRTFCDISPCKNGGSCLVTWETFRCECPLGFGGKDCSHVMHH
PHRFLGNSLMSWDFRNEAKISIPWYLGFMFRTRHKQGVLVQAHAGQYTTIICQLDSGCLS
FTVSRGASHTVKLLLDQVLVNDGKWHDLQLELRDVRSGRETRYIVAISLDFGHYQDTVIV
GNELHGLKVKHLHVGGLMGAGEVQNGLEGCIQGVRIGDTPFGSPLPKPSRTVNVEPGCSI
SNPCDSNPCPPASLCIDEWQSYSCPCKPGYYGENCVDACQLNPCENESVCRRKASSSHGY
TCDCREQHFGEYCEHRIDQQCPRGWWGNRTCGPCSCDVNKGFDPDCNKTNGLCHCKEFHY
HPKGSDTCLSCDCYPIGSFSRSCDHETGQCHCRPGVIGRQCNSCDNPHAEVTHLGFHVIY
DGCPKALDAGVWWPRTKFGLPAAVPCPKGSLGVAIRHCDEERGWMEPDQFNCTSPPFTEL
ATLLEVLERNETELNTADVKKLARRLRSVTDQMDRYFGNDIQVTYHLLSRLLDFESKQRG
FGLTATQDVHFNENLLRAGSAALALENKEFWETLQQTERGSATVMEQLVQYSRTLAQNMK
LTYLNPIGVVTPNVMLSIDQVENHTHIRRRFPRYHSGLFRGQNLWDPHTHVVLPPSALLP
PKIQVRPTEPPILTSAEENDTTVDVGPPKRTLPEPEPAVTIIVLIIYRTLGSLLPARYHT
DRRSLSFQNISILHSTTVLLILFSTQYYSISNSSLAHSTMILLVLFSTQYYSITLCVQWN
HSSLLEPAGGWVARDCDVVFRNVSHVRCQCSKLGTFGVLMDSSQREQLEGDLETLAIVTY
SALSVSLVALLLTFSVLVCLKGLKSNTRSIHFNMVSAIFFSELVFLLGVNQTENQFVCTV
IAILLHYFFMATFAWLFVEGLHIYRMQTEARNINYGAMRFYYAIGWGVPAIITGLAVGLD
PEGYGNPDFCWISVQDKLVWSFAGPVAVVLVLNGVLLLMVVRLVCTPGQKETKKKSVLVT
VRSASILLLMVSATWLFALMAVNNSVLAFHYLYTILCCLQEQGLAVLLLFCILNEEVREA
WKMACLGKKTPAEDA
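
The Comment line for this entry looks like an Ensembl peptide FASTA header, made up of space-separated `key:value` tokens. Here is a minimal sketch for splitting it into named fields, assuming the second token follows the layout `coord_system:assembly:name:start:end:strand`. The function name `parse_ensembl_comment` and the output keys are hypothetical, chosen only for illustration.

```python
COMMENT = (
    "pep:known_by_projection "
    "scaffold:LatCha1:JH126690.1:2307560:2405686:1 "
    "gene:ENSLACG00000017702 transcript:ENSLACT00000020281 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_ensembl_comment(comment):
    """Split an Ensembl-style peptide header comment into a dict of fields."""
    tokens = comment.split()

    # First token: sequence type and annotation status, e.g. "pep:known_by_projection".
    seq_type, status = tokens[0].split(":", 1)
    fields = {"seq_type": seq_type, "status": status}

    # Second token (assumed layout): coord_system:assembly:name:start:end:strand.
    coord_system, assembly, name, start, end, strand = tokens[1].split(":")
    fields.update(
        coord_system=coord_system,
        assembly=assembly,
        seq_region=name,
        start=int(start),
        end=int(end),
        strand=int(strand),
    )

    # Remaining tokens are plain key:value pairs.
    for token in tokens[2:]:
        key, value = token.split(":", 1)
        fields[key] = value
    return fields


if __name__ == "__main__":
    info = parse_ensembl_comment(COMMENT)
    span = info["end"] - info["start"] + 1  # inclusive genomic span in bp
    print(info["seq_region"], info["start"], info["end"], span)
```

The span computed here is the genomic extent of the gene model on the scaffold, introns included, so it is not expected to equal three times the 2475-residue protein length.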
Identical sequences: H3BE22, ENSLACP00000020143
