SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000013430 from Nomascus leucogenys 69_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSNLEP00000013430
Domain Number 1 Region: 1125-1252
Classification Level Classification E-value
Superfamily Cadherin-like 5.71e-30
Family Cadherin 0.001
Further Details:      
 
Domain Number 2 Region: 2973-3084
Classification Level Classification E-value
Superfamily Cadherin-like 3.57e-29
Family Cadherin 0.00063
Further Details:      
 
Domain Number 3 Region: 2448-2556
Classification Level Classification E-value
Superfamily Cadherin-like 3.14e-28
Family Cadherin 0.001
Further Details:      
 
Domain Number 4 Region: 2663-2771
Classification Level Classification E-value
Superfamily Cadherin-like 4.45e-27
Family Cadherin 0.00096
Further Details:      
 
Domain Number 5 Region: 1543-1667
Classification Level Classification E-value
Superfamily Cadherin-like 5e-27
Family Cadherin 0.00088
Further Details:      
 
Domain Number 6 Region: 3420-3602
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 8.64e-27
Family Laminin G-like module 0.0048
Further Details:      
 
Domain Number 7 Region: 3072-3190
Classification Level Classification E-value
Superfamily Cadherin-like 1.57e-25
Family Cadherin 0.0017
Further Details:      
 
Domain Number 8 Region: 1018-1136
Classification Level Classification E-value
Superfamily Cadherin-like 4.84e-25
Family Cadherin 0.00069
Further Details:      
 
Domain Number 9 Region: 814-917
Classification Level Classification E-value
Superfamily Cadherin-like 6.14e-23
Family Cadherin 0.00068
Further Details:      
 
Domain Number 10 Region: 2760-2871
Classification Level Classification E-value
Superfamily Cadherin-like 6.28e-23
Family Cadherin 0.0025
Further Details:      
 
Domain Number 11 Region: 719-812
Classification Level Classification E-value
Superfamily Cadherin-like 1.08e-22
Family Cadherin 0.0022
Further Details:      
 
Domain Number 12 Region: 446-545
Classification Level Classification E-value
Superfamily Cadherin-like 3.57e-22
Family Cadherin 0.001
Further Details:      
 
Domain Number 13 Region: 2558-2669
Classification Level Classification E-value
Superfamily Cadherin-like 2.28e-21
Family Cadherin 0.0044
Further Details:      
 
Domain Number 14 Region: 1752-1872
Classification Level Classification E-value
Superfamily Cadherin-like 3.85e-21
Family Cadherin 0.0013
Further Details:      
 
Domain Number 15 Region: 918-1023
Classification Level Classification E-value
Superfamily Cadherin-like 6.71e-21
Family Cadherin 0.0028
Further Details:      
 
Domain Number 16 Region: 2133-2243
Classification Level Classification E-value
Superfamily Cadherin-like 7.99e-21
Family Cadherin 0.0016
Further Details:      
 
Domain Number 17 Region: 1438-1548
Classification Level Classification E-value
Superfamily Cadherin-like 4.32e-20
Family Cadherin 0.0026
Further Details:      
 
Domain Number 18 Region: 136-226
Classification Level Classification E-value
Superfamily Cadherin-like 7.57e-20
Family Cadherin 0.003
Further Details:      
 
Domain Number 19 Region: 1230-1338
Classification Level Classification E-value
Superfamily Cadherin-like 9.14e-20
Family Cadherin 0.0062
Further Details:      
 
Domain Number 20 Region: 2030-2128
Classification Level Classification E-value
Superfamily Cadherin-like 1.16e-18
Family Cadherin 0.0014
Further Details:      
 
Domain Number 21 Region: 1648-1768
Classification Level Classification E-value
Superfamily Cadherin-like 1.43e-18
Family Cadherin 0.0023
Further Details:      
 
Domain Number 22 Region: 2231-2339
Classification Level Classification E-value
Superfamily Cadherin-like 4.85e-18
Family Cadherin 0.003
Further Details:      
 
Domain Number 23 Region: 2871-2972
Classification Level Classification E-value
Superfamily Cadherin-like 7e-17
Family Cadherin 0.002
Further Details:      
 
Domain Number 24 Region: 2346-2455
Classification Level Classification E-value
Superfamily Cadherin-like 1.34e-16
Family Cadherin 0.0019
Further Details:      
 
Domain Number 25 Region: 557-658
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000124
Family Cadherin 0.0019
Further Details:      
 
Domain Number 26 Region: 1860-1959
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000002
Family Cadherin 0.0032
Further Details:      
 
Domain Number 27 Region: 1975-2037
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000236
Family Cadherin 0.0094
Further Details:      
 
Domain Number 28 Region: 1349-1434
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000144
Family Cadherin 0.0085
Further Details:      
 
Domain Number 29 Region: 3186-3277
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000171
Family Cadherin 0.013
Further Details:      
 
Domain Number 30 Region: 362-458
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000571
Family Cadherin 0.006
Further Details:      
 
Domain Number 31 Region: 32-148
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000754
Family Cadherin 0.0046
Further Details:      
 
Domain Number 32 Region: 3605-3647
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000017
Family EGF-type module 0.0088
Further Details:      
 
Domain Number 33 Region: 3645-3690
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000126
Family EGF-type module 0.034
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000013430   Gene: ENSNLEG00000011039   Transcript: ENSNLET00000014095
Sequence length 4006
Comment pep:novel supercontig:Nleu1.0:GL397305.1:19182916:19246283:-1 gene:ENSNLEG00000011039 transcript:ENSNLET00000014095 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MTIALLGFAIFLLHCATCEKPLEGILSSSAWHFTHSHYNATIYENSSPKTYVESFEKMGI
YLMEPQWAVRYRIISGDVANVFKTEEYVVGNFCFLRIRTKSSNTALLNREVRDSYTLIIQ
ATEKTLELEALTRVVVYILDQNDLKPLFSPPSYRVTISEDMPLKSPICKVTATDADLGQN
AEFYYAFNTRSEMFAIHPTSGVVTVAGKLNVTWRGKHELQVLAVDRMRKISEGNGFGSLA
ALVVHVEPALRKPPAIASVVVTPPDGNDGTTYATVLVDANSSGAEVESVEVVGGDPGKHF
KAIKSYARSNEFSLVSVKDINWMEYLHGFNLSLQARSGSGPYFYSQIRGFHLPPSKLSSL
KFEKAVYRVHLSEFSPPGSRVVMVRVTPAFPNLQYVLKPSSENVGFKLNARTGLITTTKL
MDFHDRAHYQLHIRTSPGQASTTVVIDIVDCNNHAPLFNRSSYDGTLDENIPPGTSVLAV
TATDRDHGENGYVTYSIAGPKALPFSIDPYLGIISTSKPMDYELMKRIYTFRVRASDWGS
PFRREKEVSIFLQLRNLNDNQPMFEEVNCTGSIHQDWPVGKSIMTMSAIDVDELQNLKYE
IVSGNELGYFDLNHFSGVISLKRPFINLTAGQPTSYSLKITASDGKNYASPTTLNITVVK
YPHFEVPVTCDKTGVLTQFTKTILHSIGLQNQESSDEEFTSLSTYQINHYTPQFEDHFPQ
SIDVLESVPINTPLARLAATDPDAGFNGKLVYVIADGNEEGCFDIELETGLLTVAAPLDY
EATNFYILNVTVYDLGTPQKSSWKLLTVNVKDWNDNAPRFPPGGYQLTISEDTEVGTTVA
ELTTKDADSEDNGRVRYTLLSPTEKFSLHPLTGELVVTGHLDRESEPRYILKVEARDQPS
KGHQLFSVTDLIITLEDVNDNSPQCITERNRLKVPEDLPPGTVLTFLDASDPDLGPAGEV
RFVLTDGAHGTFRVDLMTGALILERELDFERRAGYNLSLWASDSGRPLARRTLCHVEVIV
LDVNENLHPPHFASFVHQGQVQENSPSGTQVMVVAARDDDSGLDGELQYFLRAGTGLAAF
SINQDTGMIQTLAPLDREFASYYWLTVLAVDRGSVPLSSVTEVYIEVTDANDNPPQMSQA
VFYPSIQEDAPVGTSVLQLDAWDPDSSSKGKLTFNITSGNHMGFFMIHPVTGLLSTAQQL
DRENKDEHILEVTVLDNGEPSLKSTSRVVVGILDVNDNPPVFSHKLFNVRLPERLSPVSP
GPVYRLVASDLDEGLNGRVTYSIEDSDEEAFSIDPVTGVVSSSSTFTAGEYNILTIKATD
SGQPPLSASVRLHIEWIPRPRPSSIPLAFDETYYSFTVMETDPVNHMVGVISVEGRPGLF
WFNISGGDKDMDFDIEKTTGSIVIARPLDTRRRLSYNLTVEVTDGSHTIATQVHIFMIAN
INHHRPQFLETHYEVRVPQDTVPGVELLRVQAIDQDKGKSLIYTIHGSQDPGSASLFQLD
PSSGVLVTVGKLDLGSGPSQHTLTVMVRDQEIPIKRNFVWVTIRVEDGNLHPPRFTQLHY
EASVPDTIAPGTELLQVRAMDADRGVNAEVHYSLLKGNSEGFFNINALLGIITLAQKFDR
ANHAPHTLTVKAEDQGSPQWHDLATVIIHVYPSDRSAPLFSKSEYFVEIPESIPVGSPIL
LVSAMSSSEVTYELREGNKDGVFSMNSYSGLISTQKNLDHEKISSYQLKIRGSNMAGAFT
DVMVVVDIIDENDNAPMFLKSTFVGQISEAAPLYSMIMDKNNNPFVIHASDSDKEANSLL
VYKILEPEALKFFKIDPSMGTLTIVSEMDYESMPSFQFCVYVHDQGSPILFSPRPAQVII
HVRDVNDSPPRFSEQIYEVAIVGPIHPGMELLMVRASDEDSEVNYSIKTGNADEAVTIHP
VTGSISVLNPAFLGLSRKLTIRASDGLYQDTALVKISLTQVLDKACSLIRMSIGQLSNVV
EIDGSTGEMSTVQELDYEAQQHFHVKVRAMDKGDPPLTGETLVVVNVSDINDNPPEFRQP
QYEANVSELATCGHLVLKVQAIDPDSRDTSRLEYLILSGNQDRHFSINSSSGIISMFNLC
KKHLDSSYSLRVGASDGVFRATVPVYINTTNANKYSPEFQQHLYEAELAENAMVGTKVID
LLAIDKDSGPYGTVDYTIINKLASEKFSINPNGQIATLQKLDRENSTERVIAIKVMARDG
GGRVAFCTVKIILTDENDNPPQFKASEYTVSIQSNVSKDSPVIQVLAYDADEGQNADVTY
SVNPEDLVKDVIEINPVTGVVKVKDSLVALENQTLDFFIKAQDGGPPHWNSLVPVRLQVV
PKKVSLPKFSEPLYTFSVPEDLPEGSDIGIVKAVAAQDPVIYSLVRGTTPESNKDGVFSL
DPDTGVIKVRKPMDHESTKLYQIDVMAHCLQNTDVVSLVSVNIQVGDVNDNRPVFEADPY
KAVLTENMPVGTSVIQVTAIDKDTGRDGQVSYRLSADPGSNVHELFAIDSESGWITTLQE
LDCETCQTYHFHVVAYDHGQTIQLSSQALVQVSITDENDNAPRFASEEYRGSVVENSEPG
ELVATLKTLDADISEQNRQVTCYITEGDPLGQFGISQVGDEWRISSRKTLDREHTAKYLL
RVTASDGKFQASVTVEIFVLDVNDNSPQCSQLLYTGKVHEDVFPGHFILKVSATDLDTDT
NAQITYSLHGPGAHEFKLDPHTGELTTLTALDRERKDVFNLVAKATDGGGRSCQADVTLY
VEDVNDNAPRFFPSHCAVAVFDNTTVKTPVAVVFARDPDQGANAQVVYSLPDSAEGHFSI
DATTGVIRLEKPLQVRPQAPLELTVRASDLGTPIPLSTLGTVTVSVVGLEDYLPVFLNTE
HSVQVPEDAPPGTEVLQLATLTRPGAEKTGYRVVSGNEQGRFRLDARTGILYVNASLDFE
TSPKYFLSIECSRKSSSSLSDVTTVMVNITDVNEHRPQFPQDPYSTRVLENALVGDVILT
VSATDEDGPLNSDITYSLVGGNQLGHFTIHPKKGELQVAKALDREQASSYSLKLRATDSG
QPPLHEDTDIAIQVADVNDNPPRFFQLNYSTTVQENSPIGSKVLQLILSDPDSPENGPPY
SFRITKGNNGSAFRVTPDGWLVTAEGLNRRAQEWYQLQIQASDSGIPPLSSSTSVRVHVT
EQSHYAPSALPLEIFITVGEEEFQGGMVGKIHATDRDPQDTLTYSLAEETLGRHFSVGAP
DGKIIAAQGLPRGHYSFNVTVSDGTFTTTAGVHVYVWHVGQEALQQAMWMGFYQLTPEEL
VSDHWRNLQRFLSHKLDIKRANIHLASLQPAEAVAGVDVLLVFEGHSGTFYEFQELASII
THSAKEMEHSVGVQMRSAMPMVPCQGPTCQGQICRNTVHLDPKVGPTYSTARLSILTPRH
HLQRSCSCNGTATRFSGQSYVRYRAPAARNWHIHFYLKTLQPQAILLFTNETASISLKLA
SGVPQLEYHCLGGFYGNLSSQRHVNDHEWHSILVEEMDASIRLMVDSMGNTSLVVPENCR
GLRPERHLLLGGLVLLHSSSNVSQGFEGCLDAVVVNEEALDLLAPGKTVAGLLETQALTQ
CCLHSDYCSQNPCLNGGKCSWTHGAGYVCKCPPQFSGKHCEQGRENCTFASCLEGGTCIL
SPKGASCNCPHPYTGDRCEMEARGCSEGHCLVTPEIKRGDWGQQELLIITVAVAFIIIST
VGLLFYCRRCKSHKPVAMEDPDLLARSVGVDTQAMPAIELNPLSASSCNNLNQLEPSKAS
VPNELVTFGPNSKQRPVVCSVPPRLPPAAVPSHSDDEPVIKRTWSGEEMVYPGGAMVWPP
TYSRNERWEYPHSEMTQGPLPPSAHRHSTPVVMPEPSGLYGGFPFPLEMENKRAPLPPRY
SNQNLEDLMPSRPPSPRERLVAPCLNEYTAISYYHSQFRQGGGGPCLAEGGYKGVSMRLS
RAGPSYAICEAEGAPLAGQGQPRAPPNYEGSDMVESDYGSCEEVMF
Download sequence
Identical sequences ENSNLEP00000013430 ENSNLEP00000013430

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]