SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSGGOP00000028733 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000028733
Domain Number 1 Region: 2064-2225
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 5.1e-24
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0049
 
Domain Number 2 Region: 2554-2611
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000000654
Family TSP-1 type 1 repeat 0.0006
 
Domain Number 3 Region: 2827-2883
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000196
Family TSP-1 type 1 repeat 0.00049
 
Domain Number 4 Region: 3255-3303
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000051
Family TSP-1 type 1 repeat 0.00053
 
Domain Number 5 Region: 2717-2767
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000000196
Family TSP-1 type 1 repeat 0.001
 
Domain Number 6 Region: 1270-1333
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000932
Family ATI-like 0.029
 
Domain Number 7 Region: 2985-3035
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000017
Family TSP-1 type 1 repeat 0.001
 
Domain Number 8 Region: 3089-3146
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000491
Family ATI-like 0.044
 
Domain Number 9 Region: 1374-1411
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000072
Family LDL receptor-like module 0.0013
 
Domain Number 10 Region: 1416-1453
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000825
Family LDL receptor-like module 0.0015
 
Domain Number 11 Region: 2464-2500
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000956
Family LDL receptor-like module 0.0014
 
Domain Number 12 Region: 1491-1529
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000103
Family LDL receptor-like module 0.0019
 
Domain Number 13 Region: 1451-1486
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000131
Family LDL receptor-like module 0.0013
 
Domain Number 14 Region: 2389-2428
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000017
Family LDL receptor-like module 0.0018
 
Domain Number 15 Region: 1749-1802
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000366
Family TSP-1 type 1 repeat 0.00077
 
Domain Number 16 Region: 464-526
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000425
Family BSTI 0.041
 
Domain Number 17 Region: 1694-1743
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000000536
Family TSP-1 type 1 repeat 0.0011
 
Domain Number 18 Region: 822-882
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000752
Family ATI-like 0.046
 
Domain Number 19 Region: 1810-1869
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000121
Family BSTI 0.043
 
Domain Number 20 Region: 2233-2271
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000275
Family LDL receptor-like module 0.0025
 
Domain Number 21 Region: 2501-2554
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000327
Family TSP-1 type 1 repeat 0.0043
 
Domain Number 22 Region: 1562-1599
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000654
Family LDL receptor-like module 0.0012
 
Domain Number 23 Region: 2946-3002
Classification Level Classification E-value
Superfamily FnI-like domain 0.000000743
Family VWC domain 0.058
 
Domain Number 24 Region: 1602-1641
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000798
Family LDL receptor-like module 0.0019
 
Domain Number 25 Region: 2883-2948
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000262
Family ATI-like 0.084
 
Domain Number 26 Region: 3318-3367
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000311
Family BSTI 0.03
 
Domain Number 27 Region: 3144-3203
Classification Level Classification E-value
Superfamily FnI-like domain 0.00000534
Family VWC domain 0.03
 
Domain Number 28 Region: 923-977
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000589
Family ATI-like 0.057
 
Domain Number 29 Region: 1906-1964
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000105
Family TSP-1 type 1 repeat 0.0012
 
Domain Number 30 Region: 3043-3084
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.0000116
Family TSP-1 type 1 repeat 0.003
 
Domain Number 31 Region: 2634-2677
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000213
Family ATI-like 0.027
 
Domain Number 32 Region: 975-1016
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000324
Family Fibronectin type I module 0.075
 
Domain Number 33 Region: 1655-1681
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000034
Family LDL receptor-like module 0.0037
 
Domain Number 34 Region: 1961-2024
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000439
Family VWC domain 0.074
 
Domain Number 35 Region: 890-919
Classification Level Classification E-value
Superfamily PMP inhibitors 0.0000889
Family PMP inhibitors 0.0028
 
Weak hits

Sequence:  ENSGGOP00000028733
Domain Number - Region: 2776-2823
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000107
Family TSP-1 type 1 repeat 0.0016
 
Domain Number - Region: 533-561
Classification Level Classification E-value
Superfamily PMP inhibitors 0.00262
Family PMP inhibitors 0.003
 
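The hits above are listed by E-value rather than by position, so the order of domains along the protein is not obvious from the listing. The following is a minimal Python sketch, not part of the SUPERFAMILY site, that parses records in the layout shown on this page and sorts them from N- to C-terminus. The names `DomainHit`, `parse_hits` and `STRONG_EVALUE_CUTOFF` are hypothetical. The cutoff of 1e-4 is an assumption that happens to agree with the values shown here (weakest strong hit 0.0000889, strongest weak hit 0.000107). The page does not state it. The sketch also assumes each region is a single `start-end` range, as in this record.

```python
import re
from dataclasses import dataclass

# Assumed cutoff separating strong from weak hits. It is consistent with the
# E-values listed on this page but is not stated by the page itself.
STRONG_EVALUE_CUTOFF = 1e-4


@dataclass
class DomainHit:
    start: int        # first residue of the region (1-based, as listed)
    end: int          # last residue of the region (inclusive)
    superfamily: str  # superfamily classification name
    evalue: float     # superfamily-level E-value


# One record: the "Domain Number" line, the column header line, and the
# "Superfamily" line. The Family line is ignored in this sketch.
RECORD = re.compile(
    r"Domain Number (?:\d+|-) Region: (\d+)-(\d+)\s*\n"
    r"Classification Level Classification E-value\s*\n"
    r"Superfamily (.+) (\S+)\s*\n"
)


def parse_hits(text):
    """Return all hits found in text, sorted by region start."""
    hits = [
        DomainHit(int(start), int(end), name, float(evalue))
        for start, end, name, evalue in RECORD.findall(text)
    ]
    return sorted(hits, key=lambda hit: hit.start)


# Three records copied from the listing above.
SAMPLE = """\
Domain Number 1 Region: 2064-2225
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 5.1e-24
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0049
Domain Number 16 Region: 464-526
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000000425
Family BSTI 0.041
Domain Number - Region: 533-561
Classification Level Classification E-value
Superfamily PMP inhibitors 0.00262
Family PMP inhibitors 0.003
"""

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        label = "strong" if hit.evalue < STRONG_EVALUE_CUTOFF else "weak"
        print(f"{hit.start}-{hit.end}\t{hit.superfamily}\t{hit.evalue:.3g}\t{label}")
```

Because `float()` accepts both `5.1e-24` and `0.0000000425`, the mixed E-value notation used in the listing needs no special handling.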

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000028733   Gene: ENSGGOG00000034887   Transcript: ENSGGOT00000041025
Sequence length 3392
Comment pep:novel chromosome:gorGor3.1:7:148223868:148261867:1 gene:ENSGGOG00000034887 transcript:ENSGGOT00000041025 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLLPALLFGMARALADGRWCEWTETICVEEEVAPRQEDLVPCARLDHYSRQGWRLDLPWS
GRAGLTRSPEPGLCPIYKPPETQPAKWNRTVRACCPGWGGAHCTEALAKASPEGHCFAMW
QCQLQAGSANASAGSLEECCARPWGRSWWDGSSQACLSCSSRHLPGSASSPALLQPLAGA
VGQLWSQHQRPSATCASWSGFHYRTFDGRHYHFLGRCTYLLAGAADSTWAVHLTPGDRCP
QPGHCQRVTMGPEEVLIQAGNVSVKGQLVPEGQSLLHGLSLQWLGDWLVLSGGLVVVVRL
DRTGSISISVDHELWGQTQGLCGLYNGWPEDDFMEPGGGLAMLAATFGNSWRLPGSEPGC
LDAVEVAQGCDGPLGLTDADVEPGHLRAEAQDVCHQLLEGPFGQCHAQVSPAEYHETCLF
AYCTGAMAGSGQEGRQQAVCATFASYVQACARRHIHIRWRKPGFCERLCPGGQLYSDCVS
LCPPSCEAVGQGEEESCREECVSGCECLRGLFWNGTLCVPAAHCPCYYRRQRYAPGDTVR
QLCNPCVCRDGRWHCAQALCPAECAVGGDGHYLTFDGRSYSFRGGQGCRYSLVQDYVKGQ
LLILLEHGACDAGSCLHAISVSLEDTHIQLRDSGAVLVNGQDVGLPWIGAEGLSVRRASS
AFLLLRWPRAQVLWGLSDPAAYITLDPRHAHQVQGLCGTFTQKQQDDFLTPAGDVETSIA
AFASKFQVAGKGRCPSGDSALLSPCTTHSQRHAFAEAACAILHSSVFQECHRLVDKEPFY
LRCLAAVCGCDPGRDCLCPVLSAYARRCAQEGASPPWRNQTLCPVMCPGGQEYRECAPAC
GQHCREPEDCGELGSCVAGCNCPLGLLWDPEGQCVPPSLCPCQLGARRYAPGSATMKECN
RCICQERGLWNCMARHCPSQRAFCPRELVYAPGACLLTCDSPSASHSCPAGSTDGCVCPP
GTVLLDERCVPPDLCPCHHSGQWYLPNATIQEDCNVCVCRGRQWHCTGQRCSGRCQASGA
PHYVTFDGLAFTYPGACELLVREASGLFTVSAQNLPCGASGLTCTKALAVRLEGTVVHML
RGRAVTVNGVSVTPPKVYTGPGLSLHRAGLFLLLSTRLGLTLLWDGGTRVLVQLSPQFHS
RVAGLCGDFDGDASNDLRSRQGVLEPTAELAAHSWRLSSLCPEPGDLPHPCTMNTHRAVW
ARARCGALLQPLFASCHAEVPPQQHYEWCLYDACGCDSGGNCECLCSAIATYADECARHG
HHVRWRSQELCPLQCEGGQVYEACGPTCPPTCHEQHPESRWHCQVVACVEGCFCPEGTLL
HRGACLEPASCPCEWGHNSFPPGSVLQKDCGNCTCQEGQWHCGGDGGHCEELVPACAEGE
ALCQENGHCVPHGWLCDNRDDCGDGSDEEGCAAPGCGEGQMTCSSGHCLPLALLCDSQDD
CGDGTDEQSCPCPHGLLACANGRCLPPALLCDGHPDCPDTADEESCLGQVTCVPGEVSCV
DGTCLGAIQLCDGVWDCPDGADEGPEHCPLPSLPTPPASTLPGPSPGSLDTASSPLASAS
PAPPCGPFEFRRGSGECTPRGWRCDQEEDCADGRDERGCGGPCAPHHAPCARGPHCVSPE
QLCDGVRQCPDGSDEGPDACARLPALGGPNRTGLPCPEYTCPNGTCIGFQLVCDGQPDCG
RTGRVGPSPEEQGCGAWGPWSPWGPCSQTCGPGGQGQSRRCSPLGLLVLQNCPGPEHQSQ
ACFTAACPVDGEWSAWSPWSVCSEPCRGTMTRQRQCHPPQNGGRTCAALPGGPHSTRQTK
PCPQDSCPNATCSGELMFQPCAPCPLTCDDISGQVTCPPDRPCGSPGCWCPEGQVLGSEG
WCVWPWQCPCLVDGARYWPGQRIKADCRLCICQDGRPRRCRLNPDCTVDCSWSSWSPWAE
CLGPCGSQSIQWSFRSPNNPRPSGRGRQCRGIHRKARRCQTETCEGCEHQGQVHRVGERW
RGGPCRVCQCLHNLTACCSPYCPLGSCPQGWVLVEGTGESCCHCALPGENQTVQPMATPA
AAPAPSPQIRFPSATYILPPPGDPCYSPLGLAGLAEGSLHASSQQLEHPTQAALLGAPTQ
GPSPQGWHAGGDAYAKWHTRPHYLQLDLLQPWNLTGIRVPETGSSIAYASSFSLQFSSNG
LHCHDYRDLLPGILPLPKLFPRNWDDLDPAVWTFGPMVQARFVRVWPHDVHHSDVPLQVE
LLGCDCEPGSPPAPLCPGVGLRCASGECVLRGEPRDGVLDCEDGSDEEGCVLLPEGTGRF
HSTAKTLALSSAQLGQLLQWPREGLAETEHWPPGQESPTSPTETRPVSPGPASGVPHHGE
SVQMVTTTPIPQMEARTLPPGMAAVTVLPPHPVTPATPAGQSVAPGPFPPVQCGPGQMPC
EVLGCVEQAQVCDGREDCLDGSDERHCARNLLMWLPSLPALWAASTVPFMVPTTALPGLP
ASRALCSPSQLSCGSGECLSAERRCDLRPDCQDGSDEDGCVDCVLAPWSVWSSCSRSCGL
GLTFQRQELLRLPLPGGSCPRDRFRSQSCFVQACPVAGAWAMWEAWGPCSVSCGGGHQSR
QRSCVDPPPKNDDAPCPGASQERAPCGLQPCSGGTDCELGHVYVSADLCQKGLVPPCPPS
CLDPKANRSCSGHCVEGCCCPPGLLLHDTRCLPLSECPCLVGEELKWPGVSFLLGNCSQC
VCEKGELLCQPGGCPLPCGWSAWSSWAPCDRSCGSGVRARFRSPSNPPAAWGGAQCKGDR
QELQGCHTVCGTEVLGWTPWTSWSSCSQNCLAPGGGPGWRSRSLCPSPGDSSCPGDATQE
EPCSPPVCPVPSVWGLWAPWSTCSAPCDGGIQTRGRSCSSLAPGATTCPGPHSQTRDCNT
QPCTAQCPENMVFRSAEQCHQEGGPCPRLCLTQGPGIECTGFCAPGCTCPPGLFLHNASC
LPRSQCPCQLYGQLYASGAMARLDSCNNCSCVSGEMACTSERCPVACGWSPWTPWSLCSR
SCNVGIRCCFRAGTAPPAAFGGAECQGPTMEAEFCSLRPCPGPGGEWGPWSPCSVPCGGG
YRNHTRGSGLRSLMEFSTCGLQPCAGPVPGMCPRDKQWLDCAQGPASCAELSAPGGTNQT
CHPGCHCPSGMLLLNNVCVPTQDCPCAHEGHLYPPGSTVVRRCENCSCVSGLIANCSSWP
CAESEPTWSPWTPWSQCSASCGPARRHWRRFCARSPSAVPSTMAPLPLPATPTPLCSGPE
AEEEPCLLQGCDRAGGWGPWGPWSHCSRSCGGGLRSRTRACDQSPPQGLGDYCEGPRAQG
EVCQALPCPVTNCTAIEGAEYSPCGPPCPRSCDDLVHCVWRCQPGCYCPPGQVLSSNGAI
CVQPGHCSCLDMLTGQRHHPGARLARPDGCNH
Identical sequences ENSGGOP00000028733 ENSGGOP00000028733
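The Comment field above packs several values into space-separated `key:value` tokens, with the genomic location itself colon-delimited. The following Python sketch splits it into named fields. The function name `parse_comment` and the location field names (`assembly`, `seq_region`, `start`, `end`, `strand`) are hypothetical labels chosen for illustration, assuming the location token follows the order assembly, sequence region, start, end, strand.

```python
# Comment string copied from the record above.
COMMENT = (
    "pep:novel chromosome:gorGor3.1:7:148223868:148261867:1 "
    "gene:ENSGGOG00000034887 transcript:ENSGGOT00000041025 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split a comment line into a dict of fields.

    Each whitespace-separated token is split at its first colon. The
    'chromosome' token is further split into location parts; the part
    names are assumptions made for this sketch.
    """
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value

    assembly, seq_region, start, end, strand = fields["chromosome"].split(":")
    fields["location"] = {
        "assembly": assembly,
        "seq_region": seq_region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    return fields


if __name__ == "__main__":
    info = parse_comment(COMMENT)
    loc = info["location"]
    # Inclusive coordinates, so the span is end - start + 1.
    span = loc["end"] - loc["start"] + 1
    print(info["gene"], loc["assembly"], loc["seq_region"], span)
```

With the coordinates listed here the span on the genome works out to 38,000 bases, treating both endpoints as inclusive.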
