SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000000918 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000000918
Domain Number 1 Region: 1428-1488
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000000752
Family BSTI 0.067
Further Details:      
 
Domain Number 2 Region: 1824-1884
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000000621
Family ATI-like 0.017
Further Details:      
 
Domain Number 3 Region: 653-713
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000067
Family BSTI 0.089
Further Details:      
 
Domain Number 4 Region: 1039-1101
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000142
Family ATI-like 0.05
Further Details:      
 
Domain Number 5 Region: 273-328
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000965
Family BSTI 0.041
Further Details:      
 
Domain Number 6 Region: 1939-1978
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000177
Family EGF-type module 0.0053
Further Details:      
 
Domain Number 7 Region: 1-77
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000000364
Family MAM domain 0.029
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000000918   Gene: ENSGACG00000000704   Transcript: ENSGACT00000000918
Sequence length 2047
Comment pep:known_by_projection scaffold:BROADS1:scaffold_47:32105:76031:-1 gene:ENSGACG00000000704 transcript:ENSGACT00000000918 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MYGSDNQNVLRVLAKRPGSDDEVWKKIGIQSPSWLKGSITVSKQSAQDIAIVFEAQRGFS
SSCDTALDNIVISEGACPVPLEKLLKLILLLMPSTPGPTTPKPPNSEIPLGTVSLNCAKP
STPGPTTPKPSTPGPTTPKPTTPRPTTPGPTTPKPPIFFPPKPSTPGPTTPKPSTPGPTT
PKPTTPRPTTPGPTTPKPPTKPSTPGPTTPKPSTPGPTTPKPTTPRPTTPGPTTPKPPSP
TTPKPSTPGPTTPKPTTPRPTTPGPTTPKPPTPKCPPNSHYEACADPCQETCSGKPPSCG
GPCSESCVCDPGYVLSAGKCVKKTSCGCTHVNGQYYEPGEIIFGDGCSKLCRCAGNYTLE
CVDNSCDPTEECRDVNGVAGCYPKGSQDCVVSGDPHYNTFDKKFFTFMGTCTYTLARTCK
NTTGPWFSVEGKNEERGVPGMSYLRKLYVTVDGITVTLMKAKRTLVNGRRVAFPNSPSPL
ISLSLAGQYVTLQTSFGLRVRWDGSHYAQISVPSSYYDQMCGLCGDYDGNPGNDFTKPDG
TLVGNVNDFGNSWQTKEDEDDSCSPGTKPDPDCDPKLEAEVVKPDKCGKITDPAGPFREC
ISVVDPTPFFQSCVYDMCQFGGQQRVLCDQLQAYTDACQSAGAKVHQWRTPGFCSLECPP
NSSYTLCASSCPETCLGVVASPSCQDVCVEGCECNPGFILSNDKCVSLKDCGCVDTSGTY
HPVGDNWYLEGCEHKCDCHSGGLIQCHNSSCKPTTESCQLKDGEYECLPLGNGICSVSGD
PHYTTFDKHTHHYMGACSYTLSKPCNVSSGSPYFSVDTQNEHRGSSKKVSYVRAVVVNVN
GVTVVLGKGRTVQVNGTVVVPPVTSIGGVKIYLSGKFVVLETSFGLRVRFDGNHHADVTL
PISYNGLLCGMCGNFNGNSKDDNLKPDNTPAANTNELGDSWQVPDPRPDCTNGGGQEDCD
EVVEEEAKKPTSCGMITDPNGSLFKPCHVVVPPEPYFGNCVYDMCATGGQTVALCQAIES
YADLCAAAGVPIPWRNYTFCPLKCPAGSHYDSCSPACAQPSCQQPATPGGSCDLPCVEGC
VCSPGLILSGDKCVPLSECGCTDTDGLYRPVGDSWFTETDCSERCKCNGNNNITCEPWQC
SPAQECKVVEGVLDCHSTGKGVCHVAGDPHYYTFDGVMHTFMGTCTYTLVEVCNTTKVTP
FKIVAKNEERGQPEASYVRSVKVYLPHDTVVELQKSRRVSLNGRRVKTPLSIDLAGAKVI
TSGSYSLLDTNFGLQVKFDGVHHLEITVPGEYFNKLCGMCGNYNHNSSDEYLMPNKKPAK
DVIELGNSWKSDGDSDPGCQPDTRPNIHPNCTAEEEKLYEAQCAGVILSDRFKPCHSLVP
PEAFLGNCIYDLCEYDGMQATLCDNVDAYAQACQSSGVTISWRNSTFCPVPCPANSHYSD
CTPPCPPTCSDLFPIFCHLPSTTCVEGCQCDGGYVLSDSKCVPLDKCGCLGSDGEYHETR
SLIIETIITRTCSCNLGGQITCKDHTCSSNSVCSLDKYGDVYCKPTNFDKCSISGDPHHR
TFDGFTHHFQGPYTYVLTQSHNLQNSLTPFVVRGKNMRRGGNRRVSFLDQMYIDVYGVNV
RFLQKKSVLLNGERVAPPLRPVDGLTITMNSKQVQLTTDFGLTVRFDGNIRGEIILPSTY
KNAVRGLCGNYDGITRNEYMKPDGTVVRDLNNFGESWRVTFWHSVSRREVEADPDSGFET
SDCSQSELNGYNSVAQCGALSDSKGPFAACHATLPPKTYQDDCVFDLCAESGNAALRCAS
YEAYAAACQEAGVKLEPWRQQLDCVLSCDANSTYSPCMSPCPPSCADLAAPSECEATSCM
EGCQCAAGFVMSEGVCVPYTQCGCSVLNRYYTLKEEFVTEDCSQACECTSTGAVCRPKTC
QSGFVCTVYDFKRECYRASPCLSYPCLNGGQCADASNNTYTCQCAEGFEGDNCEVEITPS
GGLETKWIILIAVMVAVVVIIIVTAVACACRQKAKNRKGNMERLSLKSTTVLYTDMDKER
KDKMTTM
Download sequence
Identical sequences G3N6J3
ENSGACP00000000918

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]