SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSCPOP00000016768 from Cavia porcellus 69_3

Domain assignment details

Strong hits

Sequence: ENSCPOP00000016768

Domain  Region               Classification Level  Classification                       E-value
1       2208-2301            Superfamily           C-type lectin-like                   9.42e-32
                             Family                Link domain                          0.00013
2       508-646              Superfamily           FAS1 domain                          1.22e-28
                             Family                FAS1 domain                          0.0017
3       1471-1519,1597-1711  Superfamily           FAS1 domain                          1.83e-27
                             Family                FAS1 domain                          0.0027
4       1712-1868            Superfamily           FAS1 domain                          4.05e-25
                             Family                FAS1 domain                          0.0044
5       964-1120             Superfamily           FAS1 domain                          1.57e-24
                             Family                FAS1 domain                          0.0032
6       379-500              Superfamily           FAS1 domain                          1.7e-17
                             Family                FAS1 domain                          0.0091
7       2299-2462            Superfamily           FAS1 domain                          9.29e-17
                             Family                FAS1 domain                          0.012
8       1129-1233            Superfamily           FAS1 domain                          0.000000000824
                             Family                FAS1 domain                          0.0068
9       1543-1580            Superfamily           EGF/Laminin                          0.0000488
                             Family                Merozoite surface protein 1 (MSP-1)  0.049
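
The Region values above read as 1-based, inclusive residue ranges, with a comma separating the segments of a discontiguous domain (Domain 3 is the only such case here). The following is a minimal Python sketch under that assumption. The function names `parse_region`, `region_length`, and `overlap` are hypothetical helpers for illustration and are not part of SUPERFAMILY. It shows one way to count the residues a domain covers and the residues two assigned regions share.

```python
def parse_region(region):
    """Parse a region string such as '1471-1519,1597-1711'.

    Assumption: each segment is 'start-end', 1-based and inclusive.
    Returns a list of (start, end) integer tuples.
    """
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total number of residues covered by all segments of a region."""
    return sum(end - start + 1 for start, end in parse_region(region))


def overlap(region_a, region_b):
    """Number of residues shared by two regions.

    Assumes the segments within a single region do not overlap each other.
    """
    shared = 0
    for a_start, a_end in parse_region(region_a):
        for b_start, b_end in parse_region(region_b):
            shared += max(0, min(a_end, b_end) - max(a_start, b_start) + 1)
    return shared
```

Under these assumptions, `region_length("1471-1519,1597-1711")` gives 164 residues for Domain 3, and `overlap("2208-2301", "2299-2462")` gives 3 residues shared between Domains 1 and 7.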
 
Weak hits

Sequence: ENSCPOP00000016768

Domain  Region               Classification Level  Classification                       E-value
-       201-235              Superfamily           EGF/Laminin                          0.00127
                             Family                Merozoite surface protein 1 (MSP-1)  0.067
-       2140-2175            Superfamily           EGF/Laminin                          0.00132
                             Family                EGF-type module                      0.071
-       2009-2137            Superfamily           Growth factor receptor domain        0.00135
                             Family                Growth factor receptor domain        0.016
-       1369-1496            Superfamily           Growth factor receptor domain        0.00146
                             Family                Growth factor receptor domain        0.018
-       1976-2009            Superfamily           EGF/Laminin                          0.0062
                             Family                EGF-type module                      0.023
-       233-286              Superfamily           EGF/Laminin                          0.0077
                             Family                EGF-type module                      0.057
-       906-960              Superfamily           EGF/Laminin                          0.0117
                             Family                EGF-type module                      0.062
-       862-904              Superfamily           EGF/Laminin                          0.0265
                             Family                EGF-type module                      0.045
-       323-360              Superfamily           EGF/Laminin                          0.0516
                             Family                EGF-like domain of nidogen-1         0.089

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSCPOP00000016768; Gene: ENSCPOG00000021618; Transcript: ENSCPOT00000023404
Sequence length: 2570
Comment: pep:known scaffold:cavPor3:scaffold_0:56013451:56039252:-1 gene:ENSCPOG00000021618 transcript:ENSCPOT00000023404 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
MAEPWGRLLLCFLVFWLSGSSFTRGQKVRSKHCDVKTKFITHTPCTSCAVKKWMCPSGWL
LQRFPDRASQNCRYEVQLGDSLMSMNGCSMECWKDVVQKACCPGYWGSQCYATECPGGAE
TPCNGHGTCLDGMAGNGTCVCQENFRGSACQECQDPRRFGTDCQSECSCVHGVCSSGPQG
NGSCLCFAGYTGPRCNQELPACQNLNCPPNSQCSPEAPACRCLPGYTLKDSRCVAPNPCH
PSPCSSIAHCSVSPEGHAQCRCPENYHGDGKVCLPYDPCTANFGGCPRNSTVCVYQKPGK
ATCSCQPGLVSANHNASAGCFAYCFPHSCDKSATCQVTPEGRTRCQCKAGEVGDGRACYG
HLLHEVQKANEMGTLFQRLKVTVTMLEQGCQEIVTTSGPFTVLVPSTVRSCGTQVTLAQQ
LCRQHIIAGQHILEDGGLPSTQRWWTLAGQEITVTFSRTRYTYQYKDQPQQTFIILKANN
IAANGVFHVVTALRWQPPPALPGDLKHTIAQILSSNEAFSRFETILENCGLPSILDGPGP
FTVFVPSNEAVDSLRDGRLIYLFTEGLSKLQELVRYHIFSRGQMTVDKLIAKRRILTMAN
QVLAVNISNEGRILLGPEGIPLRRVDMLAANGVIHMLEGVLLPPTILPILPKHCDEEQHE
IVMGSCVDCQALNTSVCPPNSVKLDIFPKECVYTHSALGFSVLKKGCAHNCNQTITKRGC
CKGFFGPDCTQCPGGFSSPCYGKGNCSDGIRGSGACLCFPDYKGIACHVCSNPNKHGDHC
QEDCGCVHGLCDNRPGSGGVCQHGTCAPGFSGHFCNETVRDCGPPGLAQRCHPHARCISQ
DGVSRCVCLDDFEGDGYSCTRRHPCSQPDRGGCSENAECVPGDLGNHNCTCHRGWSGDGR
VCVPIDECGLDTRGGCHADALCSYVGPGQSRCTCKLGFAGDGYQCSPIDPCRAGNGGCHD
LATCKAVGGGQRVCTCPPHFGGDGFSCYGDIIRELEANAHFSTFYQWFKSAGITLSADSR
VTALVPSDFAIRRLSPENKSFWLQPKMLPYLVRAHFLQGTLSEEDLARLGGQDVATLNPT
MRWEIRNISGRVWVQNASVDVADLLATNGVLHIINQVLLPLRGDVQVGQGVLERLGQVPT
FRLFRELLQHHSLVPQLEAATAYTIFVPTNQSLEAQGNSSSLDADTVLHHVILGEALSME
ALQKGGHRNSMLGPTHWLVFYNHSGQPEVNHMPLEGPFLKGPGYSLFGLSGVLTVGRSRC
MLSHTEAQEKCVSCSRKFRCTQGYQLQDTPRKSCVYRAGLAISRGCSYTCVKKIQVPDCC
PGYFGTLCEPCPRGLGGVCSGHGQCQDRFLGNGECRCHEGFHGTACEMCELGRYGPNCTG
VCNCAHGLCQEGLRGNGSCVCNVGWQGLRCDQKITGPQCEEKCDPNANCMQSSAGAPACV
CAAGYSGNGSYCSEVDPCAHGHGGCSPHASCTKVAPGQRTCTCQDGYTGDGELCQEVNSC
LNHNGGCHIHAECIPTGPQQVSCSCHEGYSGDGIRTCELLDPCSKNNGGCSPYAVCKSTG
DGQRTCTCDAARTVGDGFTCRTRVGLELLRDKNASFFSLHLLEYRELKGDGPFTIFVPQA
DLMTNMSQEELARIRAHRQMVFRYHVVGCRRLWSPDLLEQEYATALSGHSLRFSEREGSI
YINDFARVVSSDHEAVNGVLHFIDQVLLPPDVVYWEPGAVTVPRKNLTITTAADSFGYKT
FSHLVKMAGLLPMLQDASHRPLTMLWPTDAALQALPPDRQAWLYHEDHRDRLAAILRGHV
IRNMEALASDLPNLGLLRTMHGTPISFSCSRGRPGELMVGEDDARIVQRYLAFEGGLAYG
IDQLLEPPGLGARCDRFETRPLRLKICSVCGQEPPCPEGSREEGSTETCWRYPKLWTNAL
LHPFMLGGVWVRPSYWGQPQSLDRGCQRSCVTTIWKPTCCPGHYGSECQACPGGPSSPCN
DHGVCLDGMNGSGQCKCHLGFAGMACELCAPGAFGPQCQACRCMPHGRCDEGLGGSGSCF
CDEGWTGPHCEVQLELQPVCTPPCASQAMCRAGNHCECGLGYEGDGRVCTVADLCQDGHG
GCSEHANCSQVGTVVTCACLANYEGDGWSCRPRNPCADSHRGGCSEHADCLYTGPNTRRC
KCHAGYVGDGLQCLEEAEPPVDRCLGRPSPCHSDAVCTDLHFQEKWAGVFHLQATSGQYG
LNFSEAEAACGAQGAVLASFSQLSAAQQLGLHLCLVGWLANGSAAHPAVFPAADCGDGQV
GVVSLGTRKNLSERWDAYCYRVQDVACRCRDGFVGDGISVCNGKLLDVLAATANFSTFYG
MLLDYANATKRGLDFLGFLDDELTYKTLFVPVNEAFVNNMTLSGPDLELHASNTTFLHTN
ATQGTLLPAHSGLSLVISDVGPDNSSWAPAAPGTVVVRHIIVWDIIAFNGIIHALASPLM
SPPQAGAVIVPESLPVAASVGAVVVIGALLCLVAAALYLRVRGKPAGFGFSTFQAEDDAD
DDFAPWQEGHPTLVSVPNPVFGSHDAVCEPFDDSLLEDDFPDTQRILAVK
Identical sequences ENSCPOP00000016768 10141.ENSCPOP00000016768 ENSCPOP00000016768
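
The Comment field above appears to follow the layout of an Ensembl peptide FASTA header: a `type:status` token, a colon-separated location (coordinate system, assembly, region name, start, end, strand), then `key:value` pairs. The following is a minimal Python sketch that splits it on that assumption. The function name `parse_ensembl_comment` and the dictionary keys for the first two tokens are hypothetical labels chosen for illustration, not names defined by Ensembl or SUPERFAMILY.

```python
def parse_ensembl_comment(comment):
    """Split an Ensembl-style peptide header comment into a dictionary.

    Assumed layout (inferred from the record, not from a specification):
      token 0: 'seq_type:status'
      token 1: 'coord_system:assembly:region:start:end:strand'
      rest:    'key:value' pairs
    """
    tokens = comment.split()
    seq_type, status = tokens[0].split(":", 1)
    coord_system, assembly, region, start, end, strand = tokens[1].split(":")
    record = {
        "seq_type": seq_type,
        "status": status,
        "coord_system": coord_system,
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    # Remaining tokens keep the keys used in the comment itself.
    for token in tokens[2:]:
        key, value = token.split(":", 1)
        record[key] = value
    return record


comment = (
    "pep:known scaffold:cavPor3:scaffold_0:56013451:56039252:-1 "
    "gene:ENSCPOG00000021618 transcript:ENSCPOT00000023404 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)
record = parse_ensembl_comment(comment)
span = record["end"] - record["start"] + 1  # inclusive coordinates assumed
```

For this record, the parsed location is on `scaffold_0` of assembly `cavPor3`, strand -1, and spans 25802 positions if the start and end coordinates are inclusive.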