SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000020731 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000020731
Domain Number 1 Region: 250-471
Classification Level   Classification                                   E-value
Superfamily            Metalloproteases ("zincins"), catalytic domain   4.4e-56
Family                 Reprolysin-like                                  0.00045
 
Domain Number 2 Region: 561-616
Classification Level   Classification        E-value
Superfamily            TSP-1 type 1 repeat   0.000000000000471
Family                 TSP-1 type 1 repeat   0.00034
 
Domain Number 3 Region: 907-963
Classification Level   Classification        E-value
Superfamily            TSP-1 type 1 repeat   0.00000000000929
Family                 TSP-1 type 1 repeat   0.0037
 
Domain Number 4 Region: 846-903
Classification Level   Classification        E-value
Superfamily            TSP-1 type 1 repeat   0.00000000106
Family                 TSP-1 type 1 repeat   0.004
 
Domain Number 5 Region: 1017-1072
Classification Level   Classification        E-value
Superfamily            TSP-1 type 1 repeat   0.0000000235
Family                 TSP-1 type 1 repeat   0.0053
 
Domain Number 6 Region: 967-1020
Classification Level   Classification        E-value
Superfamily            TSP-1 type 1 repeat   0.00000392
Family                 TSP-1 type 1 repeat   0.0052
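The strong hits are listed by significance rather than by position, so the linear domain order is not obvious from the table. A minimal Python sketch that sorts the hits from N-terminus to C-terminus and reports overlapping neighbours follows. The function names are illustrative, and treating the region coordinates as 1-based and inclusive is an assumption, not something this page states.

```python
# Strong hits copied from the assignment table above:
# (domain number, region start, region end, superfamily).
# Assumption: coordinates are 1-based and inclusive.
hits = [
    (1, 250, 471, 'Metalloproteases ("zincins"), catalytic domain'),
    (2, 561, 616, "TSP-1 type 1 repeat"),
    (3, 907, 963, "TSP-1 type 1 repeat"),
    (4, 846, 903, "TSP-1 type 1 repeat"),
    (5, 1017, 1072, "TSP-1 type 1 repeat"),
    (6, 967, 1020, "TSP-1 type 1 repeat"),
]


def order_by_position(hits):
    """Return the hits sorted by region start (N-terminus first)."""
    return sorted(hits, key=lambda h: h[1])


def overlaps(ordered):
    """Return (earlier domain, later domain, shared residues) for adjacent overlapping hits."""
    found = []
    for prev, cur in zip(ordered, ordered[1:]):
        shared = prev[2] - cur[1] + 1  # inclusive coordinates
        if shared > 0:
            found.append((prev[0], cur[0], shared))
    return found


ordered = order_by_position(hits)
for number, start, end, name in ordered:
    print(f"{start:>4}-{end:<4}  domain {number}  {name}")
print("overlapping neighbours:", overlaps(ordered))
```

Under these assumptions the catalytic domain comes first, followed by the five TSP-1 repeats, and only domains 6 and 5 share residues (1017-1020).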
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s)
Protein: ENSGACP00000020731
Gene: ENSGACG00000015707
Transcript: ENSGACT00000020770
Sequence length 1120
Comment pep:novel group:BROADS1:groupXIV:662173:697993:1 gene:ENSGACG00000015707 transcript:ENSGACT00000020770 gene_biotype:protein_coding transcript_biotype:protein_coding
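The comment line packs several `key:value` fields into one string. A short Python sketch for splitting it into a dictionary follows. Reading the `group` field as assembly, region name, start, end and strand follows the usual Ensembl FASTA header layout and is an assumption here. The helper name `parse_comment` is hypothetical.

```python
# Comment string copied from the record above.
comment = (
    "pep:novel group:BROADS1:groupXIV:662173:697993:1 "
    "gene:ENSGACG00000015707 transcript:ENSGACT00000020770 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(text):
    """Split whitespace-separated tokens on their first colon into key/value pairs."""
    fields = {}
    for token in text.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


fields = parse_comment(comment)

# Assumed layout of the location field: assembly:region:start:end:strand.
assembly, region, start, end, strand = fields["group"].split(":")
span = int(end) - int(start) + 1  # genomic span in bases, inclusive

print(fields["gene"], fields["transcript"], fields["gene_biotype"])
print(assembly, region, start, end, strand, span)
```

Splitting on the first colon only keeps the multi-part location value intact, so it can be unpacked separately.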
Sequence
MEILWKTLTWTLSLVVMAASEFQRLSHNSQASLEEFLSYLQHYQVTVPVRVDESGEFLSY
TVKHHRPGRRRRGVPDPLDPDPPHHNPRLFYRLSAYGMHFHLNLTLNPHLVSKHFAVEFW
GRDGLEWRHDAVDNCHYVGSLQNQRGATRVALSNCKGLHGVITTEEEQYLIEPLRNISSS
EWNLEEAQQHVIYKMSAIPSPQERSQELSCGISDLIKSSSPCQFSSPPPPAPQSDGERAN
GTHRPRRSVSSERFVETLVVADKMMVGYHGRKDIEGYILSVMNIVAKLYRDASLGNVVNI
IVTRLIVLTEDQPNLEINHHADKSLDSFCKWQKSIRSHHGDGNSIPENGMAHHDNAVLIT
RYDICTYKNKPCGTLVGLASVAGMCEPERSCSINEDIGLGSAFTIAHEIGHNFGMNHDGI
GNSCGTKGHETAKLMAAHITANTNPFSWSACSKDYITSFLDSGRGTCLDNEPMKRDFLYP
TVAPGQVYDADEQCRFQYGTSSRQCKYGEVCRELWCLSKSNRCVTNSIPAAEGTLCQTGS
IEKGWCYQGECVAFGTWPQSEDGGWGPWSTWGECSRTCGGGVSSSMRHCDSPAPSGGGKY
CLGERKRYRSCNTDACAAGSRDFREKQCADFDLMPFRGKYYNWKPYTGGGVKPCALNCLA
EATTFYTEGSPAVVEGTGCQVDSLDICINGECKHVGCDNVLSSDAKEDRCRVCGGDGSTC
EATEGLFNDSLPRGGYMEVVQIPKGSVHIEIKEVAMSKNYIGRGGGGGDYYINGAWTIDW
PRKFDIAGTAFHYKRPSDEPESLEALGATTEVLVVMVLLQEQNLGIRYKFNVPIQRTGSG
DNEVGFSWHYLPWSECSATCAGGSQKQEVVCKRLDDNSVVQNNYCDPDSKPPENQRDCNT
EPCPPEWFIGDWSECGKTCDGGMRTRTVLCIRKMGPAEEETLEDTHCLTHRPMEREACNN
QSCPPKWVTLDWSECTPKCGPGFKHRIALCKSSDLATFPPAHCPSHNKPPVRIRCSLGRC
PPPRWIPGEWGQCSAQCGLGQQMRTVTCLSYTGQPSGDCAESLRPATMQQCESKCDATPV
SNGDECKDVNKVAYCPLVLKFKFCSRAYFRQMCCRTCQGH
Identical sequences G3PSZ5
ENSGACP00000020731 69293.ENSGACP00000020731 ENSGACP00000020731
