SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000018317 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000018317
Domain Number 1 Region: 648-709
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000000126
Family ATI-like 0.097
Further Details:      
 
Domain Number 2 Region: 294-348
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000343
Family BSTI 0.041
Further Details:      
 
Domain Number 3 Region: 772-829
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000948
Family ATI-like 0.019
Further Details:      
 
Domain Number 4 Region: 1928-1984
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000409
Family ATI-like 0.07
Further Details:      
 
Domain Number 5 Region: 2300-2380
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000356
Family VWC domain 0.062
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000018317
Domain Number - Region: 827-894
Classification Level Classification E-value
Superfamily FnI-like domain 0.000115
Family VWC domain 0.083
Further Details:      
 
Domain Number - Region: 2153-2229
Classification Level Classification E-value
Superfamily FnI-like domain 0.000565
Family VWC domain 0.068
Further Details:      
 
Domain Number - Region: 699-740
Classification Level Classification E-value
Superfamily FnI-like domain 0.00534
Family VWC domain 0.034
Further Details:      
 
Domain Number - Region: 440-512
Classification Level Classification E-value
Superfamily Methyl-coenzyme M reductase subunits 0.0549
Family Methyl-coenzyme M reductase alpha and beta chain N-terminal domain 0.0071
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000018317   Gene: ENSGGOG00000016760   Transcript: ENSGGOT00000023447
Sequence length 2543
Comment pep:novel chromosome:gorGor3.1:12:6127281:6310096:-1 gene:ENSGGOG00000016760 transcript:ENSGGOT00000023447 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MIPARVAGVLLALALILPGTLCAEGTRGRSSMARCSLFGNDFVNTFDGSMYSFAGYCSYL
LAGGCQKRSFSIIGDFQNGKRVSLSVYLGEFFDIHLFVNGTVTQGDQRVSMPYASKGLYL
ETEAGYYKLSGEAYGFVARIDGSGNFQVLLSDRYFNKTCGLCGNFNIFAEDDFMTQEGTL
TSDPYDFANSWALSSGEQWCERVSPPSSSCNISSGEMQKGLWEQCQLLKSTSVFARCHPL
VDPEPFVALCEKTLCECAGGLECACPAFLEYARTCAQEGMVLYGWTDHSACSPVCPAGME
YKQCVSPCARTCQSLHVNEMCQERCVDGCSCPEGQLLDEGLCVESTECPCVHSGKRYPPG
ASLSRDCNTCICRNSQWICSNEECPGECLVTGQSHFKSFDNRYFTFSGICQYLLARDCQD
HSFSIVIETVQCADDRDAVCTRSVTLRLPGLHNSLVKLKHGGGVAMDGQDVQLPLLKGDL
RIQHTVTASVRLSYGEDLQMDWDGRGRLLVKLSPVYAGKTCGLCGNYNGNQGDDFLTPSG
LAEPRVEDFGNAWKLHGDCQDLQKQHSDPCALNPRMTRFSEEACAVLTSPTFEACHRAVS
PLPYLRNCRYDVCSCSDGRECLCGALASYAAACAGRGVRVAWREPGRCELNCPKGQVYLQ
CGTPCNLTCRSLSYPDEECNEACLEGCFCPPGLYMDERGDCVPKAQCPCYYDGEIFQPED
IFSDHHTMCYCEDGFMHCTMSGVPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVCPADN
LRAEGLECAKTCQNYDLECMSMGCVSGCLCPPGMVRHENRCVALERCPCFHQGKEYAPGE
TVKIGCNTCVCRDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYMFPGECQYVLVQDYCGS
NPGTFRILVGNEGCSHPSVKCKKRVTILVEGGEIELFDGEANVKRPMKDETHFEVVESGR
YIILLLGKALSVVWDRHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVEEDPVD
FGNSWRVSSQCADTRKVPLDSSPATCHNNIMKQTMVDSSCRILTSDVFQDCNKLVRTLRV
VAWPGGMDTECDLLTWTLPSCLDMGWTRLLPPHIFLHSVFAFNRFLSPNRFLLFSLKVMT
SLSTGVHILEVGTQERKVRWNPGPSGNMNVVGAIVKVAGCGCHNGEGRCSLLSPWLLTGL
ASGKLAFGEVRRSPQVLLISREHLLKSRSIKHHSLTLLKLENVNSLGRGYGHSCQSQILP
PPSVQTRMNTAWGEFSSKHRKLEIPSKTLPAHTLYAETGTVGRLGVLCQKGMRASSPLPT
SFAGTRAGAAATVHAEAASSVWTEVPQQAGGITWYQATGGHFLGSGRCRLVSFTGSAEGQ
VPPRLRVQRGDGPGPYWSPPHGHLISPVGWQAVPLSGTLAKQSLEIWHGSVGICVCGGAT
ALLQTCGVCLGLGTEPGQYLAHERCSCFFCGGGLGLYIFPLPDVSQNLHKATVLRGNKSM
QDSLSRSLSGFQDLASSPHLPHPWRDLHVNQDNDSVSCSYCHIIAVVGIIIINNRRACLK
NHRMTTSASCPYVSTVNLVQILFRVELTSKTRPGCLSRHLSVAGFVRICMDEDGNEKRPG
DVWTLPDQCHTVTCQPDGQTLLKSHRVNCDRGPRPSCPNSQSPVKVEETCGCRWTCPCVC
TGSSTRHIVTFDGQNFKLTGSCSYVLFQNKEQDLEVILHNGPCSPGARQGCMKSIEVKHS
ALSVELHSDMEVTVNGRLVSVPYVGGNMEVNVYGAIMHEVRFNHLGHIFTFTPQNNEFQL
QLSPKTFASKMYGLCGICDENGANDFMLRDGTVTTDWKTLVQEWTVQRPGQTCQPILEEQ
CLVPDSSHCQVLLLPLFAECHKVLAPATFYAICQQDSCHQERVCEVIASYAHLCRTNGVC
IDWRTPDFCAMSCPPSLVYNHCEHGCPRHCDGNVSSCGDHPSEGCFCPPNKVMLEGSCVP
EEACTQCIGEDGVQHQFLEAWVPDHQPCQICTCLSGRKVNCTTQPCPTAKAPTCGLCEVA
HLRQNADQCCPEYECVCDPVSCDLPPVPHCERGLQPTLTNPGECRPNFTCACRKEECKRV
SPPSCPPHRLPTLRKTQCCDEYECACNCVNSTVSCPLGYLASTATNDCGCTTTTCLPDKV
CVHRSTIYPVGQFWEEGCDVCTCTDMEDVVMGLRVAQCSQKPCEDSCRSGFTYVLHEGEC
CGRCLPSACEVVTGSPRGDSQSSWKSVGSQWASPENPCLISECVRVKEEVFIQQRNVSCP
QLEVPVCPSGFQLSCKTSACCPSCRCERMEACMLNGTVIGPGKTVMIDVCTTCRCMVQVG
VISGFKLECRKTTCNPCPLGYKEENNTGECCGRCLPTACTIQLRGGQIMTLKRDETLQDG
CDTHFCKVNERGEYFWEKRVTGCPPFDEHKCLAEGGKIMKIPGTCCDTCEEPECNDITAR
LQYVKVGSCKSEVEVDIHYCQGKCASKAMYSIDINDVQDQCSCCSPTRTEPMQVALHCTN
GSVVYHEVLNAMECKCSPRKCSK
Download sequence
Identical sequences ENSGGOP00000018317 ENSGGOP00000016375

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]