SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSTRUP00000040444 from Takifugu rubripes 76_4

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSTRUP00000040444
Domain Number 1 Region: 340-396
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000000471
Family TSP-1 type 1 repeat 0.00035
Further Details:      
 
Domain Number 2 Region: 396-451
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000000497
Family TSP-1 type 1 repeat 0.00046
Further Details:      
 
Domain Number 3 Region: 294-336
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000942
Family TSP-1 type 1 repeat 0.00093
Further Details:      
 
Domain Number 4 Region: 451-506
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000129
Family TSP-1 type 1 repeat 0.00063
Further Details:      
 
Weak hits

Sequence:  ENSTRUP00000040444
Domain Number - Region: 869-1045
Classification Level Classification E-value
Superfamily Family A G protein-coupled receptor-like 0.0156
Family Rhodopsin-like 0.038
Further Details:      
 
Domain Number - Region: 38-92
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 0.0792
Family Spermadhesin, CUB domain 0.005
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSTRUP00000040444   Gene: ENSTRUG00000015830   Transcript: ENSTRUT00000040586
Sequence length 1535
Comment pep:known_by_projection scaffold:FUGU4:scaffold_24:1454161:1527648:1 gene:ENSTRUG00000015830 transcript:ENSTRUT00000040586 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MKAVRNLLIYIFSTYLLVMFGLTGAQDYWCSTLVKGVIYGSYSVTEMFPKNYTNCTWTLE
NPDPTKYSIYLKLYKQDLSCSEYSLLAYQFDHYSHEKISELLKVNESIVYLCDTKNIYVF
LLYDKNFVQLRRVFPYDYNGLTPQKLDEEEKSIVEFLVLNKATPSQFGCQVLCTWLENCL
KLEKGTVETCGIVYTKCTCPQHLGDGESESMLMLNNVVLPLNPQTEGCLSPQLQGGQICN
LSAEVKRPPKEEYGMIGQHTVKSQRPRSVHDTKALQEQAESAKFMAQTGESGAEEWSQWS
SCSVTCGQGSQVRTRTCVSPYGTHCSGPLRESRVCNNTAPCPVHGVWEEWSPWSLCSFTC
GRGHRTRTRMCAPPQHGGRACDGPETQTKLCNIALCPVDGQWQEWSSWSDCSVTCANGTQ
QRTRQCSAAAHGGSECRGHWAESRECHNPDCTANGQWNPWGPWSGCSKSCDGGWQRRARV
CQGAAVTGQQCDGTGEEVRKCSDQRCPAPYEICPEDYAVSMVWRRTPSGELAFNRCPPNA
TGTTSRRCSLDHRGMAFWEQPSYARCITNEFRYLQQSVQGHLAKGQRMLAGDGMSQVTKN
LLDLTQRRNFYAGDLLSSVEILRNVTETFKRASYEPSSDDVQNFFQIISNLLEEENKEKW
EDAQKIYPGAVELMQVIEDFIHIVGLGMKDFHNAYLMTGNLVASIQRLPAVSVMTDINFP
MKGRKGMVDWARNSEDKVVIPKGLFVSQSSLDMEGSPVFILGTVLYKTLGLMLPSPKNHT
VVNSKVIAVTVRPEPKATESQLEIELAHNTNGTMGPYCALWDSTIMNDSWGAWSTKGCKT
VLTDASHTKCLCDRVSTFAILAQQPREITMEYSGVPSVTLIVGCGLSCLALITLAVIYAV
LWRYIRSERSIILLNFCLSIVCSNILILVGQTQTHNAGVCVMTTAFLHFFFLASFCWVLT
EAWQSYMAVTGKVRTRLIRKRFLCLGWGLPALVVAVSMGFTKTKGYGTPLYCWLSLEGGL
LYAFVGPAAAVVLVNMVIGILVFKLVSRDGILDKKLKHRAGQMSEPHTGLTLKCAKCGVV
STTALSATTASNAMASLWSSCGDEFSRHVNPFGAVFPPLNERDSIGGGVSVALALFDLAR
RHVCCAAHSFPLSGVLMSFRLKLFPQLLLHTHTHTHTHTHTHTQAFTDFEKDVDIACRSA
LHKDMGSCRAATITGTLSRISLNDEEEEKAPEGLNYSTLPGNIMSKVIIQQPSALHLPMG
VGDLKEQCMADGNADMRRTVYLCTDDTMRQSEHDMMGRDMEGHPVAAQMMETDYIVMPRA
SAAAVSGSCNTSTLLKDDSKMNITMDTLQHERLMHCKMSPDFSLGPSGMDHMNVNLEQHY
PSAPEQMQNLPFEPRTAVKNFLAEMEESAGLSRSETGSTISMSSLERRKSRYSDLDFEKV
MHTRKRHMELFQELNQKFQTLDRFRDIPNMGTMDKAMPNKNPWESYNPACEYQNYATMNV
LQSDAKDSLEMTPAEWEKCVNLPLDVQEGDFQTEV
Download sequence
Identical sequences H2UU20
ENSTRUP00000040444

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]