SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000000905 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000000905
Domain Number 1 Region: 3927-3987
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000000278
Family BSTI 0.05
Further Details:      
 
Domain Number 2 Region: 2152-2210
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000000474
Family BSTI 0.05
Further Details:      
 
Domain Number 3 Region: 741-801
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000311
Family ATI-like 0.063
Further Details:      
 
Domain Number 4 Region: 2546-2606
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000621
Family ATI-like 0.031
Further Details:      
 
Domain Number 5 Region: 1527-1587
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000132
Family ATI-like 0.029
Further Details:      
 
Domain Number 6 Region: 4309-4369
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000164
Family ATI-like 0.023
Further Details:      
 
Domain Number 7 Region: 1131-1191
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000294
Family BSTI 0.047
Further Details:      
 
Domain Number 8 Region: 1946-2004
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000572
Family ATI-like 0.075
Further Details:      
 
Domain Number 9 Region: 2805-2863
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000572
Family ATI-like 0.075
Further Details:      
 
Domain Number 10 Region: 3541-3599
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000605
Family ATI-like 0.075
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000000905   Gene: ENSGGOG00000000914   Transcript: ENSGGOT00000000924
Sequence length 4598
Comment pep:known_by_projection chromosome:gorGor3.1:19:37255072:37338919:-1 gene:ENSGGOG00000000914 transcript:ENSGGOT00000000924 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGALWSWWILWAGATLLWGLTQEASVDLKNTGREEFLTAFLQNYQLAYSKAYPCLLISSL
SESPASVSILSQADNTSKKVTVRPGESVMVNISAKAEMIGSKIFQHAVVIHSDYAISVQA
LNAKPDTAELTLLRPIQALGTEYFVLTPPGTSARNVKEFAVVAGAAGASVSVTLKGSLTF
NGKFYPAGDVLRVTLQPYNVAQLQSSMDLSGSKVTASSPVAVLSGHSCAQKHTTCDHVVE
QLLPTSAWGTHYVVPTLASQSRYDLAFVVASQATKLTYNHGGITGSRGLQAGDVVEFEVR
PSWPLYLSANVGIQVLLFGTGAIRNKVTYDPYLVLIPDVAAYCPAYVVKSVPGCEGMALV
VAQTKAISGLTIDGHAVGAKLTWEAVPGSEFSYAEVELGTADVIHTAEATTNFGLLTFGL
AKAIGYATAADCGRTVLSPAEPSCEGVQCAAGQRCQVVGGKAGCVAESTAVCRAQGDPHY
TTFDGRRYDMMGTCSYTMAELCSEDDTLPAFSVEAKNEHRGSRRVSYVGLVTVRAYSHSV
SLTRGEVGFVLVDNQRSRLPVSLSEGRLRVYQSGPRAVVELVFGLVVTYDWDCQLALSLP
ARFQDQVCGLCGNYNGDPADDFLTPDGALAPDAVEFASSWKLDDGDYLCEDGCQNNCPAC
TPGQAQHYEGDRLCGMLTKLDGPFAVCHDTLDPRPFLEQCVYDLCVVGGERLSLCRGLSA
YAQACLELGISVGDWRSPANCPLSCPTNSRYELCGPACPTSCNGAAAPSNCSGRPCVEGC
VCLPGFVASGGACVPASSCGCTFQGLQLAPGQEVWADELCQRRCTCNGATHQVTCRDTQS
CPAGERCSVQNGLLGCYPDRFGTCQGSGDPHYVSFDGRRFDFMGTCTYLLVGSCGQNAAL
PAFRVLVENEHRGSQTVSYTRAVRVEARGVKVAVRREYPGQVLVDDVLQYLPFQAADGQV
QVFRQGGDAVVRTDFGLTVTYDWNARVTAKVPSSYAEALCGLCGNFNGDPADDLALRDGG
QAANALAFGNSWQEETRPGCGATEPGDCPKLDSLVAQQLQSKNECGILADPKGPFRECHS
KLDPQGAVRDCVYDRCLLPGQSGPLCDALATYAAACQAAGATVHPWRSEELCPLSCPPHS
HYEACSYGCPLSCGDLPVLGGCGSECHEGCVCDEGFVLSGESCLPLASCGCVHQGTYHPP
GQTFYPGPGCDSLCHCQEGGLVSCESSSCGPHEACQPSGGSLGCVAVGSSTCQASGDPHY
TTFDGHRFDFMGTCVYVLAQTCGTRPGLHRFAVLQENVAWGNGRVSVTRVITVQVANFTL
RLEQRQWKVTVNGVDMKLPVVLANGQIRASQHGSDVVIETDFGLRVAYDLVYYVRVTVPG
NYYQLMCGLCGNYNGDPKDDFQKPNGSQAGNANEFGNSWEEVVPDSPCLPPPTCPPGTDG
CNGGGECPPELEKKYQKEEFCGLLSSPTGPLASCHKLVDPQGPLEDCIFDLCLGGGNLSI
LCSNIHAYVSACQAAGGHVEPWRNETFCPMECPQNSHYELCADTCSLGCSALSAPPQCQD
GCAEGCQCDSGFLYNGQACVPIQQCGCYHNGVYYEPEQTVLIDNCQQQCTCHAGKGVVCQ
EHSCKPGQVCQPSRGILSCVTKDPCHGVTCRPQETCKEQGGQGVCLPNYEATCWLWGDPH
YHSFDGRKFDFQGTCNYVLATTGCPGVSTQGLTPFTVTTKNENRGNPAVSYVRVVTVTAL
GTNISIHKDEIGKVRVNGVLTALPVSVADGRISVTQGASKALLVADFGLQVSYDWNWRVD
VTLPSSYHGAVCGLCGNMDRNPNNDQVFPNGTLAPSIPIWGGSWRAPGWDPLCWDECRGS
CPTCPEDRLEQYEGPGFCGPLAPGTGGPFTTCHAHVPPESFFKGCVLDVCMGGGDHDILC
KALASYVAACQAAGVVIEDWRAQVGCEITCPENSHYEVCGPPCPASCPSPAPLTTPAVCE
GPCVEGCQCDAGFVLSADRCVSLNNGCGCWANGTYHEAGSEFWADGTCSQRCRCGPGGGS
LVCTPASCGLGEVCGLLPSGQHGCQPISTAECQAWGDPHYVTLDGHRFDFQGTCEYLLSA
PCHGPPLGAENFTVTVANEHRGSQAVSYTRSVTLQIYNHSLTLSARWPRKLQLQCPAHSH
YELCGDSCPGSCPSLSAPEGCESACREGCVCDAGFVLSGDTCVPVGQCGCLHDDRYYPLG
QTFYPGPGCDSLCHCREGGEVSCEPSSCGPHETCRPSGGSLGCVAVGSSTCQASGDPHYT
TFDGHRFDFMGTCVYVLAQTCGTRPGLHRFAVLQENVAWGNGRVSVTRVITVQVANFTLR
LEQRQWKVTVNGVDMKLPVVLANGQIRASQHGSDVVIETDFGLRVAYDLVYYVRVTVPGN
YYQLMCGLCGNYNGDPKDDFQKPNGSQAGNANEFGNSWEEVVPDSPCLPPPTCPPGTDGC
NGGGECPPELEKKYQKEEFCGLLSSPTGPLASCHKLVDPQGPLEDCIFDLCLGGGNLSIL
CSNIHAYVSACQAAGGHVEPWRNETFCPMECPQNSHYELCADTCSLGCSALSAPLQCPDG
CAEGCQCDSGFLYNGQACVPIQQCGCYHNGAYYEVNGVLTALPVSVADGRISVTQGASKA
LLVADFGLQVSYDWNWRVDVTLPSSYHGAVCGLCGNMDRNPNNDQVFPNGTLAPSIPIWG
GSWRAPGWDPLCWDECRGSCPTCPEDRLEQYEGPGFCGPLAPGTGGPFTTCHAHVPPESF
FKGCVLDVCMGGGDHDILCKALASYVAACQAAGVVIEDWRAQVGCEITCPENSHYEVCGP
PCPASCPSPAPLTTPAVCEGPCVEGCQCDAGFVLSADRCVSLNNGCGCWANGTYHEAGSE
FWADGTCSQRCRCGPGGGSLVCTPASCGLGEVCGLLPSGQHGCQPISTAECQAWGDPHYV
TLDGHRFDFQGTCEYLLSAPCHGPPLGAENFTVTVANEHRGSQAVSYTRSVTLQIYNHSL
TLSARWPRKLQVDGVFVALPFQLDSLLHAHLSGADVVVTTTSGLSLAFDGDSFVRLRVPA
AYAGSLCGLCGNYNQDPADDLKAVGGKPAGWQVGGAQGCGECVSKPCPSPCTPEQQESFG
GQDACGVISATDGPLAPCHGLVPPAQYFQGCLLDACQVQGHPGGLCPAVATYVAACQAAG
AQLGEWRRPDFCPEQTVLIDNCRQQCTCHAGKVVVCQEHSCKPGQVCQPSGGILSCVNKD
PCHGVTCRPQETCKEQGGQGVCLPNYEATCWLWGDPHYHSFDGRKFDFQGTCNYVLATTG
CPGVSTQGLTPFTVTTKNNRGNPAVSYVRVVTVAALGTNISIHKDEIGKVRVNGVLTALP
VSVADGRISVTQGASKALLVADFGLQVSYDWNWRVDVTLPSSYHGAVCGLCGNMDRNPNN
DQVFPNGTLAPSIPIWGGSWRAPGWDPLCWDECRGSCPTCPEDRLEQYEGPFCGPLAPGT
GGPFTTCHAHVPPESFFKGCVLDVCMGGGDHDILCKALASYVAACQAAGVVIEDWRAQVG
CEITCPENSHYEVCGPPCPASCPSPAPLTTPAVCEGPCVEGCQCDAGFVLSADRCVSLNN
GCGCWANGTYHEAGSEFWADGTCSQRCRCGPGGGSLVCTSASCGLGEVCGLLPSGQHGCQ
PISTAECQAWGDPHYVTLDGHRFDFQGTCEYLLSAPCHGPPLGAENFTVTVANEHRGSQA
VSYTRSVTLQIYNHSLTLSARWPRKLQVDGVFVALPFQLDSLLHAHLSGADVVVTTTSGL
SLAFDGDSFVRLRVPAAYAGSLCGLCGNYNQDPADDLKAVGGKPAGWQVGGAQGCGECVS
KPCPSPCTPEQQESFGGPDACGLISATDGPLAPCHGLVPPAQYFQGCLLDACQVQGHPGG
LCPAVAAYVAACQAAGAQLGEWRRPDFCPLQCPAHSHYELCGDSCPVSCPSLSAPEGCES
ACREGCVCDAGFVLSGDTCVPVGQCGCLHDGRYYPLGEVFYPGPECERRCECGPGGHVTC
QEGAACGPHEECRLEDGVQACHAAGCGRCLANGGIHYITLDGRVYDLHGSCSYVLAQVCH
PKPGDEDFSIVLEKNAAGDLQRLLVTVAGQVVSLAQGQQVTVDGEAVALPVAVGHVRVTA
EGRNMVLQTTKGLRLLFDGDAHLLMSIPSPFRGRLCGLCGNFNGNWSDDFVLPNGSAASS
VETFGAAWRAPGSSKGCGEGCGPQGCPVCLAEETAPYESNEACGQLRNPQGPFATCQAVL
SPSEYFRQCIYDLCAQKGDKAFLCRSLAAYTAACQAAGVAVKPWRTDSFCPLQCPAHSHY
SICTRTCQGSCAALSGLTGCTTRCFEGCECDDRFLLSQGVCIPVQDCGCTHDGRYLPVNS
SLLTSDCSERCSCSSSSGLTCQAAGCPPGRVCEVKAEARNCWATRGLCVLSVGANLTTFD
GARGATTSPGVYELSSRCPGLQNTIPWYRVVAEVQICHGKTEAVGQVHIFFQDGMVTLTP
NKGVWVNGLQVDLPAEKLASVSVSRTPDGSLLVHQKAGVQVWLGANGKVAVIVSDDHAGK
LCGACGNFDGDQTNDWHDSQEKPAMEKWRAQDFSPCYG
Download sequence
Identical sequences ENSGGOP00000000905 ENSGGOP00000000905

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]