SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for H3DHK1 from the UniProt 2018_03 genome

Domain assignment details

Strong hits

Sequence: H3DHK1
Domain Number 1 Region: 2295-2507
Classification Level Classification E-value
Superfamily vWA-like 2.23e-62
Family Integrin A (or I) domain 0.00012
 
Domain Number 2 Region: 124-312
Classification Level Classification E-value
Superfamily vWA-like 2.36e-60
Family Integrin A (or I) domain 0.00024
 
Domain Number 3 Region: 1177-1375
Classification Level Classification E-value
Superfamily vWA-like 8.48e-58
Family Integrin A (or I) domain 0.0000726
 
Domain Number 4 Region: 419-611
Classification Level Classification E-value
Superfamily vWA-like 2.64e-56
Family Integrin A (or I) domain 0.00014
 
Domain Number 5 Region: 2524-2729
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.3e-33
Family Laminin G-like module 0.073
 
Domain Number 6 Region: 1937-2106
Classification Level Classification E-value
Superfamily Fibronectin type III 8.78e-26
Family Fibronectin type III 0.00086
 
Domain Number 7 Region: 812-982
Classification Level Classification E-value
Superfamily Fibronectin type III 2.13e-24
Family Fibronectin type III 0.0012
 
Domain Number 8 Region: 628-799
Classification Level Classification E-value
Superfamily Fibronectin type III 3.83e-24
Family Fibronectin type III 0.00094
 
Domain Number 9 Region: 1845-1933
Classification Level Classification E-value
Superfamily Fibronectin type III 5.08e-20
Family Fibronectin type III 0.00069
 
Domain Number 10 Region: 1382-1469
Classification Level Classification E-value
Superfamily Fibronectin type III 2.22e-19
Family Fibronectin type III 0.00055
 
Domain Number 11 Region: 1754-1843
Classification Level Classification E-value
Superfamily Fibronectin type III 1.52e-17
Family Fibronectin type III 0.00097
 
Domain Number 12 Region: 1562-1726
Classification Level Classification E-value
Superfamily Fibronectin type III 2.69e-17
Family Fibronectin type III 0.0023
 
Domain Number 13 Region: 1082-1171
Classification Level Classification E-value
Superfamily Fibronectin type III 3.49e-17
Family Fibronectin type III 0.00059
 
Domain Number 14 Region: 2120-2205
Classification Level Classification E-value
Superfamily Fibronectin type III 8.48e-17
Family Fibronectin type III 0.0018
 
Domain Number 15 Region: 328-416
Classification Level Classification E-value
Superfamily Fibronectin type III 2.36e-16
Family Fibronectin type III 0.0015
 
Domain Number 16 Region: 997-1079
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000000000000112
Family Fibronectin type III 0.0019
 
Domain Number 17 Region: 2208-2295
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000000000000222
Family Fibronectin type III 0.0028
 
Domain Number 18 Region: 23-105
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000000000262
Family Fibronectin type III 0.00072
 
Domain Number 19 Region: 1472-1561
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000000000476
Family Fibronectin type III 0.0018
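
The strong hits above are listed in order of superfamily E-value, not in sequence order. As a rough illustration, the following Python sketch parses blocks in the layout shown on this page and sorts them by region start to read off the N-to-C domain order. The names `Domain`, `parse_domains`, and `SAMPLE` are hypothetical and not part of any SUPERFAMILY tool, and the sketch assumes each region is a single `start-end` range, as is the case for every domain listed here.

```python
import re
from typing import NamedTuple


class Domain(NamedTuple):
    number: int
    start: int   # first residue of the region, as printed on the page
    end: int     # last residue of the region, as printed on the page
    superfamily: str
    superfamily_evalue: float
    family: str
    family_evalue: float


# Assumption: a region is one "start-end" range; split regions are not handled.
HEADER = re.compile(r"Domain Number (\d+) Region: (\d+)-(\d+)")
# The classification name may contain spaces; the E-value is the last token.
ROW = re.compile(r"(Superfamily|Family) (.+) (\S+)$")


def parse_domains(text: str) -> list[Domain]:
    """Collect one Domain per 'Domain Number' block; other lines are ignored."""
    domains = []
    current = {}
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            current = {"ids": tuple(int(g) for g in header.groups())}
            continue
        row = ROW.match(line)
        if row and "ids" in current:
            level, name, evalue = row.groups()
            # float() accepts both notations used on the page (2.36e-60, 0.00024).
            current[level] = (name, float(evalue))
            if "Superfamily" in current and "Family" in current:
                domains.append(
                    Domain(*current["ids"], *current["Superfamily"], *current["Family"])
                )
                current = {}
    return domains


# Three of the blocks from this page, copied as listed.
SAMPLE = """\
Domain Number 2 Region: 124-312
Classification Level Classification E-value
Superfamily vWA-like 2.36e-60
Family Integrin A (or I) domain 0.00024

Domain Number 15 Region: 328-416
Classification Level Classification E-value
Superfamily Fibronectin type III 2.36e-16
Family Fibronectin type III 0.0015

Domain Number 18 Region: 23-105
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000000000262
Family Fibronectin type III 0.00072
"""

domains = parse_domains(SAMPLE)
# Sort by region start to get the order of domains along the sequence.
architecture = [d.superfamily for d in sorted(domains, key=lambda d: d.start)]
print(architecture)
```

Applied to all 19 blocks, the same sort would give the full linear arrangement of vWA-like, Fibronectin type III, and Concanavalin A-like domains along H3DHK1.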
 

Protein sequence

External link(s): H3DHK1
Sequence length: 3092
Comment: (tr|H3DHK1|H3DHK1_TETNG) Collagen, type XII, alpha 1a {ECO:0000313|Ensembl:ENSTNIP00000019995} KW=Complete proteome; Reference proteome OX=99883 OS=nigroviridis). GN= OC=Tetraodon.
Sequence:
MKFGLCLAAVLAALLSSVDAQVEPPSDLKFKILNENSVEMSWRRPSSRIDGFRIQVVSDS
DEAVRDFTLDAYTSVTSITDLTPDLDYSVSINSFDGSEESIPIFGQLTIQSGNSSDRVRR
PTDTIKCSVSAITDLVFLVDGSWSVGRENFKHIRSFIASLAGAFDIGEDKTRVAVVQYST
DTRTEFPLTRYTRRGDLLQAINSLPYKGGNTMTGDAIDYLLQNIFTEAGGSRKSFPKVAM
IITDGKSQDPVEEHARRLRNIGVEIFVLGIKGADEDELREIASTPHSKHMYNVPNFDKIQ
EVQKKIIREVCSGVDEQLSSLVSGEELVEPASNLQVTEIASKSMRVTWDPSLGDVTGYKL
TLNPMLPGMKRQELYTGPTHTSINVRDLSPETVYEIALYALKGLTPSEPIMATERTQPVK
VTTECSLGVDVQADVVLLVDGSYSIGLQNFAKVRAFLEVLVNSFDIGPSKVQISLVQYSR
DPHTEFALNTHHDINAVVRAVRTFPYRGGSTNTGKAMKYVKDKIFVASRGARQNVPRVMV
LITDGKSSDSFKDAATNLRNIDVEIFAVGVKDAVRSELEAIANPPADNHVFEVEDFDAFQ
RISKELTQSICLRIEQELLNIKKRSLLPPTDLQFSEITSRSFRTSWTPPAARVMSYLVRY
RKAEDITGDYISIALPSDATSVVLQHLSPLTAYEVNVFAQYDKGDSFPLSGEETTLEEQG
PVRNLRVSEETTNSFRVSWQAAPGPVIRYRLSYVPLSGTGEILEAQTIEDETSIVLQELF
PITTYRVSVFAEYSTGMGSEMQIDGTTKEVLGAPRDLRVFDETISTMKLAWQPARGNVLQ
YRIVYKPVEGGDRKEISVKGDTTQAVLKNLQPATEYDLFVSAHYTSGVGDPLIGTGTTLE
ELGSPRDLTTRDVTDTSFLVSWVPAPGNVRQYRIKWKSLYAEEAGESTIPGDNTAMVLDG
LTPETRYQVSVSAVYGHGEGQPLNGEETTDISAAAKAIVVSEETERTMKVTWQPAPGNVL
NYRVTYKPKLGGRQLAAKVPGGNTSTVLRRLTALTTYDITVLPVYRSGEGKAREGEGTTL
TPYKGPRNLHTSDSSRTSFRVTWDHAPGDVKGYKVQFHPVGEDIDLGELLVGPYDNTVVL
EELRAGTKYTVSVFGMFDGGESLPLAGEERTTLSDGPDPTPYSPSDVTCKTKAQADIVLL
LDGSWSIGRLNFKTIRTFISRMVEVFDIGPDKVQVGLAQYSGDVKTEWHLNAHPTRESLL
EAVANLPYKGGNTRTGLALNYVLQNNFKENVGMRRNSRKIGVLVTDGKSQDDVHEKAQNL
RNENIELYAVGVKNAEENELRSIASDPDDIHMYNVADFSFLLDIVDNLTNNLCNSVKGPG
GSPEAPTNLVTSDVTHHSFRATWTPPEDPPERYRVEYTSPSGQSRQVYVDGRENTVVLQG
LDPLTEYGVKVFSVVDDESSEPLQGMETTLPLPAVRSMDVYDEQTTTMRVRWEEVKDATG
YILRYDAINATQPTVEQEVRVGRDKTDTQLVRLLPNTAYGISVLALHGESASKPLSDQGV
TLPLPPAGQLRVRDVTHSTMNLDWDAAPGPVEKYLITYKPENGEARELEVGGDVTNKDLD
NLISQTEYSLAVTPIYDDAGPGQPMMGDAITDVVPAPKNLQFSDVTQTSFRATWEHGAPD
VALYRIGWKKVGEDDFQYDILNSDETSYVITDLEIDTRYDVSVTAIYPDEAESEDLLGTE
KTSSLVSKDESTSLPHAPATNLVVYNETITSLNARWNPAPGPVQDYSITYVPTSGGRPQT
TKVSGRKTSVLLPKLTPDTEYSIGVTAVYPKGVSKELTGLGKTKPLGGVRNLQVTDPTTS
TLNVKWDPAEGNVRQYKVNYVPTAGGPEDMVQVPGKTHNTVLKNLQPDTDYTVTVVPVYS
TGEGKPVSENGKTLERSPVRNIEVFNPTTNTLNVRWEAAKGPVQGYRVNYAPMNGARPVQ
SIVVPDTTAFLQQLLPNTDYKVEVVALYSDGEGPAISDTEKTPSVPRSAPRNLQVYNPTT
SSLTVSWEPAEGPVTQYRIAYAPTTGDPIEEYATVPANRNNVVLPNLDPDTPYNIKVTAI
YNDGPGGELEGNGRTLDMLGPRNLRVSDEWYTRFKVSWDPAPSKVNGYKLLYKPKGSTED
YTEVFVGDVVTHQLHDLKPGTTYDLEVLAQYDQGLSQPLDGEGTTLYLNVTGLTTYNVDH
DSFCIRWTPHRAATSYRLKVNPVDPSKNGAQEITVRGSESNNCFTGLSPDTLYNATVYTQ
TPNLEGPGVSVEEKTLVKPTEVPTQPPTPPAPATVPPALDVCKGAKADLVFLIDGSWSIG
DESFNKVIQFVTSMIGAFEVISPNGMQVSLVQYSDDAKTEFKLNTYYNKGIVISALKSVR
YRGGNTKTGIALKHVYEKVFTSDSGMRRNVPKVLVVLTDGRSQDDVKKSAEKLQHSGYSV
FVVGVADVDMTELRIIGSKPSERHVFVVDDYDAFAKIQDNLITFICETATSTCPLIYING
YTTPGFRMLEAFNITDKTFAGMNGVSMEPGSFNSYIAYRVHKDSFINQPTKEIHPEGLPP
SYTIVLLFRLLPDSPSEPFDIWQISDKNNNPEVGVSLNPSSKTITFYNKDTRGEIQKATF
NQEQVKRIFHGSFHKLHITVSPDKVKINVDCQEVAERPIKAANNITLDGYEVLGKMVRSG
GGRRQSATFQLQMFDIICSLSWISRDKCCDLPATRDEAKCPALPHSCTCTQDSIGPPGPS
GPPGGPGSKGPRGDRGEKGNTGSMGPRGEPGLPGAMGPPGPQGPNGLSLPGAPGRQGPKG
DRGDPGQPGVQGSVGQRGPLGPVGPVGARGPPGKEGSSGPRGPPGPMGNPGTPGVPGITG
KPGKPGEPGNPGPVGLKGEKGERGDAASQHMMRSVARQVCEQLINSQMTRIEMMLNQVPS
GYRSNSPGPPGPPGPPGDQGTRGEPGQPGRTGFPGNPGLPGNQGERGLPGEKGERGQAGT
GIRGQRGPSGPPGPPGESRTGPPGSSGSAGPRGPPGRSGTPGVRGPSGSPGYCDSSQCVG
IPYNGQGYTVPNMFLAGGYPPEPEVVPIPVEP
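
To relate a listed region back to the residues it covers, the wrapped sequence lines can be joined and sliced. The sketch below is a minimal illustration under one assumption: that region coordinates are 1-based and inclusive, which this page does not state explicitly. The helper names `join_sequence` and `domain_subsequence` are hypothetical.

```python
def join_sequence(lines):
    """Join wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def domain_subsequence(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# The first two sequence lines from this page (120 residues in total).
first_lines = [
    "MKFGLCLAAVLAALLSSVDAQVEPPSDLKFKILNENSVEMSWRRPSSRIDGFRIQVVSDS",
    "DEAVRDFTLDAYTSVTSITDLTPDLDYSVSINSFDGSEESIPIFGQLTIQSGNSSDRVRR",
]
prefix = join_sequence(first_lines)

# Domain 18 (Fibronectin type III) is listed with region 23-105.
fn3_first = domain_subsequence(prefix, 23, 105)
print(len(fn3_first), fn3_first[:10])
```

Joining all of the sequence lines the same way gives a quick consistency check: the result should be 3092 residues long, matching the sequence length reported above.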
Identical sequences H3DHK1
ENSTNIP00000019995 99883.ENSTNIP00000019995 ENSTNIP00000019995
