SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000019545 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSGACP00000019545

Domain  Region   Classification Level  Classification       E-value
1       11-462   Superfamily           Sema domain          9.81e-140
                 Family                Sema domain          0.000000149
2       568-618  Superfamily           TSP-1 type 1 repeat  0.0000000000000641
                 Family                TSP-1 type 1 repeat  0.00032
3       458-508  Superfamily           Plexin repeat        0.00000000000767
                 Family                Plexin repeat        0.0038
4       756-812  Superfamily           TSP-1 type 1 repeat  0.00000000000942
                 Family                TSP-1 type 1 repeat  0.0013
5       622-674  Superfamily           TSP-1 type 1 repeat  0.000000000458
                 Family                TSP-1 type 1 repeat  0.00078
6       811-861  Superfamily           TSP-1 type 1 repeat  0.00000000445
                 Family                TSP-1 type 1 repeat  0.00063
7       861-905  Superfamily           TSP-1 type 1 repeat  0.0000366
                 Family                TSP-1 type 1 repeat  0.0032
 
Weak hits

Sequence: ENSGACP00000019545

Domain  Region   Classification Level  Classification       E-value
-       512-565  Superfamily           TSP-1 type 1 repeat  0.000706
                 Family                TSP-1 type 1 repeat  0.0021
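
The hits are listed by superfamily E-value, not by position along the sequence. As a minimal sketch (not part of the SUPERFAMILY server or its API), the Python below re-orders the strong and weak hits from N-terminus to C-terminus and reports where adjacent regions share residues. The `Hit` type and the two helper functions are hypothetical names introduced for illustration; the regions and superfamily E-values are copied from the listing on this page.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    """One superfamily-level domain hit (hypothetical helper type)."""
    name: str
    start: int      # first residue of the region, 1-based
    end: int        # last residue of the region, inclusive
    evalue: float   # superfamily E-value as listed on this page
    strong: bool    # True for "Strong hits", False for "Weak hits"


# Values transcribed from the listing, in the order they appear.
HITS = [
    Hit("Sema domain", 11, 462, 9.81e-140, True),
    Hit("TSP-1 type 1 repeat", 568, 618, 0.0000000000000641, True),
    Hit("Plexin repeat", 458, 508, 0.00000000000767, True),
    Hit("TSP-1 type 1 repeat", 756, 812, 0.00000000000942, True),
    Hit("TSP-1 type 1 repeat", 622, 674, 0.000000000458, True),
    Hit("TSP-1 type 1 repeat", 811, 861, 0.00000000445, True),
    Hit("TSP-1 type 1 repeat", 861, 905, 0.0000366, True),
    Hit("TSP-1 type 1 repeat", 512, 565, 0.000706, False),
]


def order_by_position(hits: List[Hit]) -> List[Hit]:
    """Return the hits sorted from N-terminus to C-terminus."""
    return sorted(hits, key=lambda h: (h.start, h.end))


def adjacent_overlaps(ordered: List[Hit]) -> List[Tuple[int, int, int, int, int]]:
    """For each neighbouring pair that shares residues, return
    (left start, left end, right start, right end, shared residue count)."""
    found = []
    for left, right in zip(ordered, ordered[1:]):
        shared = left.end - right.start + 1
        if shared > 0:
            found.append((left.start, left.end, right.start, right.end, shared))
    return found


if __name__ == "__main__":
    ordered = order_by_position(HITS)
    for h in ordered:
        label = "strong" if h.strong else "weak"
        print(f"{h.start:>4}-{h.end:<4} {h.name} ({label}, E={h.evalue:g})")
    for ls, le, rs, re_, n in adjacent_overlaps(ordered):
        print(f"regions {ls}-{le} and {rs}-{re_} share {n} residue(s)")
```

Read in sequence order, the assignment is a Sema domain followed by a Plexin repeat and then a run of TSP-1 type 1 repeats, with the weak hit at 512-565 falling between the Plexin repeat and the first strong TSP-1 repeat.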
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000019545   Gene: ENSGACG00000014799   Transcript: ENSGACT00000019583
Sequence length 1047
Comment pep:novel group:BROADS1:groupI:22834627:22844500:-1 gene:ENSGACG00000014799 transcript:ENSGACT00000019583 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
CSRKEHPVVSYQALRPWMSEFSHPGVKDFSRLAPDLSRNQLIVGARNFLFRLSLSNVSLI
QATEWAPDEDTRRSCQSKGKTEEECQNYVRVLLISGTTLFTCGTNAFTPVCATRQISNFS
RVLDTVNGVARCPYDPRHNSTAMVTERGELYAATVIDFSGRDPVIYRSLGNMPPLRTAQY
NSKWLNEPHFVSVYEIGRFAYFFLRETAVENDCGKVVFSRVARVCKNDMGGRFLLEDTWT
TFTKARLNCSRSGEIPFYYNELQSTFHLPEQDLIYGIFTTNVNSISASAVCAFNLSSITR
AFNGPFRYQENPRTAWLSTPNPIPNFQCGTLEEGGPGGNLTERSLQDAQRLFLMNDVVQP
LTVDPLLTQDNLRFSKLVVDIVQGRDTLYHVMYIGTEYGTILKALSTTNKSLHGCYLEEL
RPLPQGQMGSIKSLQILHNDRSLIVGLDDRLVKIPLERCSSYPTESQCMEARDPYCGWDH
KQKRCTTIEDSSNMNQWTQNITECPVRNMTRDGGFGLWAPWQPCNHGDGDGSVSSCACRS
RSCDGPLARCGGIECEGPLIQVANCSRNGGWTPWSSWGQCSSSCGIGFEVRQRSCNNPSP
RHGGRICVGQGREERLCNEKKPCPTPVSWMAWGPWAHCSAECGGGVHSRTRTCDSGNSCP
GCSMEYKACNLEACPEVRRNTPWTPWMPVNVSQDGSRQEQRFRYTCRALLPDPQQLQLGK
KKTETRFCPNDGSGACQTDTLVDDLVKISGRTLPQPQGVRWGSWETWSSCSQSCSRGFRT
RKRSCSTSEGRTNPGACVGSPVDYQDCNAQPCPVSGSWSCWSSWSQCSSSCGGGYYQRTR
TCGDICIGLHTEEALCSTHACEDWGEWTGWGDCDKGLQHRTRPCGEDQGAEAGLCQGNVT
QSRPCQPREVPVILPGQEDQSCGTFTLFQLVAVGAASFFAAALLSALAYTYCHQLGRPPA
ETGVIHPSTPNHLTCNKRGNATPKNEKYIPMEFKTLNKNNLHVNDETCNHFPSPLPSGNM
FTTTYYPPSLGKYDFHPDSPCRTYMHS
Identical sequences G3PPL1
ENSGACP00000019545 69293.ENSGACP00000019545 ENSGACP00000019545
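
The Comment line in the protein sequence record packs several `key:value` fields into one string. The sketch below shows one way to split it, assuming the second field follows the Ensembl-style location layout `coord_system:assembly:region:start:end:strand`; that interpretation, and the function names `parse_comment` and `parse_location`, are assumptions made for illustration rather than anything this page defines.

```python
# Comment string copied from the protein sequence record.
COMMENT = (
    "pep:novel group:BROADS1:groupI:22834627:22844500:-1 "
    "gene:ENSGACG00000014799 transcript:ENSGACT00000019583 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> dict:
    """Split whitespace-separated tokens at the first colon into key/value pairs."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict:
    """Parse 'assembly:region:start:end:strand' (assumed layout) into typed fields."""
    assembly, region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    # "group" is the coordinate-system key used in this particular record.
    location = parse_location(fields["group"])
    span = location["end"] - location["start"] + 1
    print(fields["gene"], fields["transcript"], fields["gene_biotype"])
    print(location, "genomic span:", span, "bp")
```

The coordinate-system key (`group` here) differs between records, so a more general parser would take the second token positionally instead of looking it up by name.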
