SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000040827 from Pan troglodytes 76_2.1.4

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSPTRP00000040827
Domain Number 1 Region: 3822-4002
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.17e-35
Family Laminin G-like module 0.0028
Further Details:      
 
Domain Number 2 Region: 3125-3236
Classification Level Classification E-value
Superfamily Cadherin-like 7.07e-30
Family Cadherin 0.002
Further Details:      
 
Domain Number 3 Region: 1554-1690
Classification Level Classification E-value
Superfamily Cadherin-like 8.14e-30
Family Cadherin 0.00099
Further Details:      
 
Domain Number 4 Region: 3229-3334
Classification Level Classification E-value
Superfamily Cadherin-like 3.71e-29
Family Cadherin 0.00085
Further Details:      
 
Domain Number 5 Region: 3335-3446
Classification Level Classification E-value
Superfamily Cadherin-like 1.21e-28
Family Cadherin 0.00055
Further Details:      
 
Domain Number 6 Region: 1141-1246
Classification Level Classification E-value
Superfamily Cadherin-like 2.22e-28
Family Cadherin 0.0016
Further Details:      
 
Domain Number 7 Region: 2808-2916
Classification Level Classification E-value
Superfamily Cadherin-like 5.57e-28
Family Cadherin 0.0012
Further Details:      
 
Domain Number 8 Region: 460-579
Classification Level Classification E-value
Superfamily Cadherin-like 7.2e-28
Family Cadherin 0.0012
Further Details:      
 
Domain Number 9 Region: 1029-1147
Classification Level Classification E-value
Superfamily Cadherin-like 1.44e-27
Family Cadherin 0.0014
Further Details:      
 
Domain Number 10 Region: 930-1033
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-26
Family Cadherin 0.0018
Further Details:      
 
Domain Number 11 Region: 146-268
Classification Level Classification E-value
Superfamily Cadherin-like 4e-25
Family Cadherin 0.001
Further Details:      
 
Domain Number 12 Region: 2280-2386
Classification Level Classification E-value
Superfamily Cadherin-like 5.42e-25
Family Cadherin 0.00063
Further Details:      
 
Domain Number 13 Region: 1763-1883
Classification Level Classification E-value
Superfamily Cadherin-like 6.71e-25
Family Cadherin 0.0015
Further Details:      
 
Domain Number 14 Region: 3023-3131
Classification Level Classification E-value
Superfamily Cadherin-like 1.43e-24
Family Cadherin 0.0013
Further Details:      
 
Domain Number 15 Region: 2588-2725
Classification Level Classification E-value
Superfamily Cadherin-like 6.02e-24
Family Cadherin 0.0035
Further Details:      
 
Domain Number 16 Region: 825-928
Classification Level Classification E-value
Superfamily Cadherin-like 6.85e-24
Family Cadherin 0.00067
Further Details:      
 
Domain Number 17 Region: 1454-1558
Classification Level Classification E-value
Superfamily Cadherin-like 5.57e-23
Family Cadherin 0.00081
Further Details:      
 
Domain Number 18 Region: 2072-2180
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-22
Family Cadherin 0.0031
Further Details:      
 
Domain Number 19 Region: 2917-3021
Classification Level Classification E-value
Superfamily Cadherin-like 9.14e-22
Family Cadherin 0.001
Further Details:      
 
Domain Number 20 Region: 719-831
Classification Level Classification E-value
Superfamily Cadherin-like 1.27e-21
Family Cadherin 0.0021
Further Details:      
 
Domain Number 21 Region: 1666-1769
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-21
Family Cadherin 0.0018
Further Details:      
 
Domain Number 22 Region: 2490-2600
Classification Level Classification E-value
Superfamily Cadherin-like 3.01e-21
Family Cadherin 0.0035
Further Details:      
 
Domain Number 23 Region: 3434-3553
Classification Level Classification E-value
Superfamily Cadherin-like 1.01e-20
Family Cadherin 0.0023
Further Details:      
 
Domain Number 24 Region: 2387-2487
Classification Level Classification E-value
Superfamily Cadherin-like 2.28e-20
Family Cadherin 0.002
Further Details:      
 
Domain Number 25 Region: 2180-2287
Classification Level Classification E-value
Superfamily Cadherin-like 4.71e-20
Family Cadherin 0.002
Further Details:      
 
Domain Number 26 Region: 1242-1356
Classification Level Classification E-value
Superfamily Cadherin-like 4.85e-20
Family Cadherin 0.0031
Further Details:      
 
Domain Number 27 Region: 1871-1994
Classification Level Classification E-value
Superfamily Cadherin-like 8.51e-20
Family Cadherin 0.0017
Further Details:      
 
Domain Number 28 Region: 1980-2084
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000143
Family Cadherin 0.0026
Further Details:      
 
Domain Number 29 Region: 1361-1460
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000157
Family Cadherin 0.0072
Further Details:      
 
Domain Number 30 Region: 571-672
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000214
Family Cadherin 0.0021
Further Details:      
 
Domain Number 31 Region: 2705-2814
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000214
Family Cadherin 0.0044
Further Details:      
 
Domain Number 32 Region: 375-472
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000101
Family Cadherin 0.0061
Further Details:      
 
Domain Number 33 Region: 3547-3637
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000314
Family Cadherin 0.0046
Further Details:      
 
Domain Number 34 Region: 4098-4135
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000968
Family EGF-type module 0.0064
Further Details:      
 
Domain Number 35 Region: 4023-4061
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000071
Family EGF-type module 0.0062
Further Details:      
 
Domain Number 36 Region: 4060-4102
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000117
Family EGF-type module 0.018
Further Details:      
 
Domain Number 37 Region: 42-158
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000414
Family Cadherin 0.0059
Further Details:      
 
Domain Number 38 Region: 259-357
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000571
Family Cadherin 0.0057
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000040827   Gene: ENSPTRG00000004168   Transcript: ENSPTRT00000042356
Sequence length 4589
Comment pep:known_by_projection chromosome:CHIMP2.1.4:11:90143545:90696365:1 gene:ENSPTRG00000004168 transcript:ENSPTRT00000042356 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDIIMGHCVGTRPPPCCLILLLFKLLATVSQGLPGTGPLGFHFTHSIYNATVYENSAART
YVNSQSRMGITLIDLSWDIKYRIVSGDEEGFFKAEEVIIADFCFLRIRTKGGNSAILNRE
IQDNYLLIVKGSVRGEDLEAWTKVNIQVLDMNDLRPLFSPTTYSVTIAESTPLRTSVAQV
TATDADIGSNGEFYYYFKNKVDLFSVHPTSGVISLSGRLNYDEKNRYDLEILAVDRGMKL
YGNNGVSSTAKLYVHIERINEHAPTIHVVTHVPFSLEKEPTYAVVTVDDLDDGANGEIES
VSIVAGDPLDQFFLAKEGKWLNEYKIKERKQIDWESFPYGYNLTLQAKDKGSPQKFSALK
AVYIGNPTRDTVPIRFEKEVYDVSISEFSPPGVVVAIVKLSPEPIDVEYKLSPGEDAVYF
KINPRSGLIVTARPLNTVKKEVYKLEVTNKEGDLKAQVTISIEDANDHTPEFQQPLYDAY
VNESVPVGTSVLTVSASDKDKGENGYITYSIASLNLLPFVINQFTGVISTTEELDFESSP
EIYRFIVRASDWGSPYRHESEVNVTIRIGNVNDNSPLFEKVACQGVISYDFPVGGHITAV
SAIDIDELELVKYKIISGNELGFFYLNPDSGVLQLKKSLTNSGIKNGNFALRITATDGEN
LADPMSINISVLHGKVSSKSFSCRETRVAQKLAEKLLIKAKANGKLNLEDGFLDFYSINR
QGPYFDKSFPSDVAVKEDLPVGANILKIKAYDADSGFNGKVLFTISDGNTDSCFNIDMET
GQLKVLMPMDREHTDLYLLNITIYDLGNPQKSSWRLLTINVEDANDNSPVFIQDSYSVNI
LESSGIGTEIIQVEARDKDLGSNGEVTYSVLTDTQQFAINSSTGIVYVADQLDRESTANY
SLKIEARDKAESGQQLFSVVTLKVFLDDVNDCSPAFIPSSYSVKVLEDLPVGTVIAWLET
HDPDLGLGGQVRYSLVNDYNGRFEIDKASGAIRLSKELDYEKQQFYNLTVRAKDKGRPVS
LSSVSFVEVEVVDVNENLHTPYFPDFAVVGSVKENSRIGTSVLQVTARDEDSGRDGEIQY
SIRDGSGLGRFSIDDESGVITAADILDRETMGSYWLTVYATDRGVVPLYSTIEVYIEVED
VNDNAPLTSEPIYYPVVMENSPKDVSVIQIQAEDPDSSSNEKLTYRITSGNPQNFFAINI
KTGLITTTSRKLDREQQAEHFLEVTVTDGGPSPKQSTIWVVVQVLDENDNKPQFPEKVYQ
IKLPERDRKKRGEPIYRAFAFDRDEGPNAEISYSIVDGNDDGKFFIDPKTGMVSSRKQFT
AGSYDILTIKAVDNGRPQKSSTARLHIEWIKKPPPSPIPLTFDEPFYNFTVMESDRVTEI
VGVVSVQPANTPLWFDIVGGNFDSAFDAEKGVGTIVIAKPLDAEQRSIYNMSVEVTDGTN
VAVTQVFIKVLDNNDNGPEFSQPNYDVTISEDVLPDTEILQIEATDRDEKHKLSYTVHSS
IDSISMRKFRIDPSTGVLYTAERLDHEAQDKHILNIMVRDQEFPYRRNLARVIVNVEDAN
DHSPYFTNPLYEASVFESAALGSAVLQVTALDKDKGENAELIYTIEAGNTGNMFKIEPVL
GIITICKEPDMTMMGQFVLSIKVTDQGSPPMSATAIVRISVTMSDNSHPKFIHKDYQAEV
NENVDIGTSVILISAISQSTLIYEVKDGDINGIFTINPYSGVITTRKALDYEHTSSYQLI
IQATNMAGMASNATVNIQIVDENDNAPVFLFSQYSGSLSEAAPINSIVRSLDNSPLVIRA
TDADSNRNALLVYQIVESTAKKFFTVDSSTGAIRTIANLDHETIAHFHFHVHVRDSGSPQ
LTAESPVEVNIEVTDVNDNPPVFTQAVFETVLLLPTYVGVEVLKVSATDPDSEVPPELTY
SLMEGSLDHFLIDSNSGVLTIKNNNLSKDHYMLIVKVSDGKFYSTSMVTIMVKEAMDSGL
HFTQSFYSTSISENNTNITKVAIVNAVGNRLNEPLKYSILNPGNKFKIKSTSGVIQTTGV
PFDREEQELYELVVEASRELDHLRVARVVVRVNIEDINDNSPVFVGLPYYAAVQVDAEPG
TLIYQVTAIDKDKGPNGEVTYVLQDDYGHFEINPNSGNVILKEAFNSDLSNIEYGVTILA
KDGGKPSLSTSVELPITIVNKAMPVFDKPFYTASVNEDIRMNTPILSINATSPEGQGIIY
IIIDGDPFKQFNIDFDTGVLKVVSPLDYEVTSAYKLTIRASDALTGARAEVTVDLLVNDV
NDNPPIFDQPTYNTTLSEASLIGTPVLQVVSIDADSENNKMVHYQIVQDTYNSTDYFHID
SSSGLILTARMLDHELVQHCTLKVRSIDSGFPSLSSEVLVHIYISDVNDNPPVFNQLIYE
SYVSELAPRGHFVTCVQASDADSSDFDRLEYSILSGNDRTSFLMDSKSGVITLSNHRKQR
MEPLYSLNVSVSDGLFTSTAQVHIRVLGANLYSPAFSQSTYVAEVRENVAAGTKVIHVRA
TDGDPGTYGQISYAIINDFAKDRFLIDSNGQVITTERLDRENPLEGDVSIFVRALDGGGR
TTFCTVRVIVVDENDNAPQFMTVEYRASVRADVGRGHFVTQVQAIDPDDGANSRITYSLY
SEASVSVADLLEIDPDNGWMVTKGNFNQLKNTVLSFFVKAVDGGIPVKHSLIPVYIHVLP
PETFLPSFTQSQYSFTIAEDTAIGSTVDTLRILPSQNVRFSTVNGERPENNKGGVFVIEQ
ETGTIKLDKRLDHETSPAFHFKVAATIPLDKVDIVFTVDVDIKVLDLNDNKPVFETSSYD
TVIMEGMPVGTKLTQVRAIDMDWGANGQVTYSLHSDSQPEKVMEAFNIDSNTGWISTLKD
LDHETDPTFTFSVVASDLGEAFSLSSTALVSVRVTDINDNAPVFAQEVYRGNVKESDPPG
EVVAVLSTWDRDTSDVNRQVSYHITGGNPRGRFALGLVQSEWKVYVKRPLDREEQDIYFL
NITATDGLFVTQAMVEVSVSDVNDNSPVCDQVAYTALLPEDIPSNKIILKVSAKDADIGS
NGDIRYSLYGSGNSEFYLDPESGELKTLALLDRERIPVYSLMAKATDGGGRFCQSNIHLI
LEDVNDNPPVFSSDHYNTCVYENTATKALLTRVQAVDPDIGINRKVVYSLADSAGGVFSI
DSSSGIIILEQPLDREQQSSYNISVRATDQSPGQSLSSLTTVTITVLDINDNPPVFERRD
YLVTVPEDTSPGTQVLAVFATSKDIGTNAEITYLIRSGNEQGKFKINPKTGGISVSEVLD
YELCKRFYLVVEAKDGGTPALSAVATVSINLTDVNDNPPKFSQDVYSAVISEDALVGDSV
ILLIAEDVDSQPNGQIHFSIVNGDRDNEFTVDPVLGLVKVKKKLDRERVSGYSLLVQAVD
SGIPAMSSTATVNIDISDVNDNSPVFTPANYTAVIQENKPVGTSILQLVVTDRDSFHNGP
PFSFSILSGNEEEEFVLDPHGILRSAVVFQHTESLEYVLCVQAKDSGKPQQVSHTYIRVR
VIEESTHKPTAIPLEIFIVTMEDDFPGGVIGKIHATDQDMYDVLTFALKSEQKSLFKVNS
HDGKIIALGGLDSGKYVLNVSVSDGRFQVPIDVVVHVEQLVHEMLQNTVTIRFENVSPED
FVGLHMHGFRRTLRNAVLTQKQDSLRIISIQPVTGTNQLDMLFAVEMHSSEFYKPAYLIQ
KLSNARRHLENIMRISAILEKNCSGLDCQEQHCEQGLSLDSHALMTYSTARISFVCPRFY
RNVRCTCNGGLCPGSNDPCMEKPCPGDMQCVGYEASRRPFLCQCPPGKLGECSGHTSLSF
AGNSYIKYRLSENSKEEDFKLALRLRTLQSNGIIMYTRANPCIILKIVDGKLWFQLDCGS
GPGILGISGRAVNDGSWHSVFLELNRNFTSLSLDDSYVERRRAPLYFQTLSTESSIYFGA
LVQADNIRSLTDTRVTQVLSGFQGCLDSVILNNNELPLQNKRSSFAEVVGLTELKLGCVL
YPDACERSPCQHGGSCTGLPSGGYQCTCLSQFTGRNCESEITACFPNPCRNGGSCDPIGN
TFICNCKAGLTGVTCEEDINECEREECENGGSCVNVFGSFLCNCTPGYVGQYCGLRPVVV
PNIQAGHSYVGKEELIGIAVVLFVIFILVVLFIVFRKKVFRKNYSRNNITLVQDPATAAL
LNKSNGIPFRNLRGSGDGRNVYQEVGPPQVPVRPMAYTPCFQSDSRSNLDKIVDGLGGEH
QEMTTFHPESPRILTARRGVVVCSVAPNLPAVSPCRSDCDSIRKNGWDAGTENKGVDDPG
EVTCFAGSNKGSNSEVQSLSSFQSDSGDDNASIVTVIQLVNNVVDTIENEVSVMDQGQNY
NRAYHWDTSDWMPGARLSDIEEVPNYENQDGGSAHQGSTRELESDYYLGGYDIDSEYPPP
HEEEFLSQDQLPPPLPEDFPDQYEALPPSQPVSLASTLSPDCRRRPQFHPSQYLPPHPFP
NETDLVGPPASCEFSTFAVSMNQGTEPTGPADSVSLSLHNSRGTSSSDVSANCGFDDSEV
AMSDYESVGELSLASLHIPFVETQHQTQV
Download sequence
Identical sequences ENSPTRP00000040827 ENSPTRP00000040827

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]