SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for H2P914 from UniProt 2018_03 genome


Domain assignment details

Strong hits

Sequence:  H2P914
Domain Number 1 Region: 8-218
Classification Level Classification E-value
Superfamily vWA-like 1.67e-53
Family Integrin A (or I) domain 0.00042
 
Domain Number 2 Region: 628-817
Classification Level Classification E-value
Superfamily vWA-like 9.46e-51
Family Integrin A (or I) domain 0.00088
 
Domain Number 3 Region: 1408-1612
Classification Level Classification E-value
Superfamily vWA-like 1.53e-46
Family Integrin A (or I) domain 0.00064
 
Domain Number 4 Region: 1232-1406
Classification Level Classification E-value
Superfamily vWA-like 2.09e-44
Family Integrin A (or I) domain 0.00085
 
Domain Number 5 Region: 1026-1206
Classification Level Classification E-value
Superfamily vWA-like 2.82e-44
Family Integrin A (or I) domain 0.00097
 
Domain Number 6 Region: 444-608
Classification Level Classification E-value
Superfamily vWA-like 1.91e-43
Family Integrin A (or I) domain 0.00049
 
Domain Number 7 Region: 234-420
Classification Level Classification E-value
Superfamily vWA-like 3.12e-42
Family Integrin A (or I) domain 0.00068
 
Domain Number 8 Region: 1625-1818
Classification Level Classification E-value
Superfamily vWA-like 2.82e-40
Family Integrin A (or I) domain 0.00079
 
Domain Number 9 Region: 836-1012
Classification Level Classification E-value
Superfamily vWA-like 4.03e-39
Family Integrin A (or I) domain 0.0011
 
Domain Number 10 Region: 2615-2805
Classification Level Classification E-value
Superfamily vWA-like 8.73e-28
Family Integrin A (or I) domain 0.0019
 
Domain Number 11 Region: 2396-2587
Classification Level Classification E-value
Superfamily vWA-like 5.42e-27
Family Integrin A (or I) domain 0.017
 
Domain Number 12 Region: 3111-3170
Classification Level Classification E-value
Superfamily BPTI-like 3.26e-21
Family Small Kunitz-type inhibitors & BPTI-like toxins 0.0000818
 
Domain Number 13 Region: 1835-2030
Classification Level Classification E-value
Superfamily vWA-like 5.19e-18
Family Integrin A (or I) domain 0.018
 
Domain Number 14 Region: 2999-3090
Classification Level Classification E-value
Superfamily Fibronectin type III 5.74e-08
Family Fibronectin type III 0.0049
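
The strong hits above are listed in order of superfamily E-value, not in order of position along the sequence. As a minimal sketch (not part of the SUPERFAMILY server), the listed regions can be sorted by start coordinate to read off the N-to-C-terminal domain order and to check that no two regions overlap. The names `DOMAINS`, `by_position`, and `overlaps` are hypothetical, and the coordinates are assumed to be inclusive residue positions.

```python
from collections import Counter

# (domain number, region start, region end, superfamily), copied from the listing above.
DOMAINS = [
    (1, 8, 218, "vWA-like"),
    (2, 628, 817, "vWA-like"),
    (3, 1408, 1612, "vWA-like"),
    (4, 1232, 1406, "vWA-like"),
    (5, 1026, 1206, "vWA-like"),
    (6, 444, 608, "vWA-like"),
    (7, 234, 420, "vWA-like"),
    (8, 1625, 1818, "vWA-like"),
    (9, 836, 1012, "vWA-like"),
    (10, 2615, 2805, "vWA-like"),
    (11, 2396, 2587, "vWA-like"),
    (12, 3111, 3170, "BPTI-like"),
    (13, 1835, 2030, "vWA-like"),
    (14, 2999, 3090, "Fibronectin type III"),
]


def by_position(domains):
    """Return the domains sorted by region start (N- to C-terminal)."""
    return sorted(domains, key=lambda d: d[1])


def overlaps(domains):
    """Return pairs of domain numbers whose regions overlap (inclusive ends assumed)."""
    ordered = by_position(domains)
    return [(a[0], b[0]) for a, b in zip(ordered, ordered[1:]) if b[1] <= a[2]]


order = [d[0] for d in by_position(DOMAINS)]
counts = Counter(d[3] for d in DOMAINS)

for number, start, end, superfamily in by_position(DOMAINS):
    print(f"{start:>4}-{end:<4}  domain {number:<2}  {superfamily}")
print(dict(counts))
```

Read in positional order, the assignments give twelve vWA-like domains followed by one Fibronectin type III domain and one BPTI-like domain at the C-terminal end.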
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) H2P914
Sequence length 3182
Comment (tr|H2P914|H2P914_PONAB) Collagen type VI alpha 3 chain {ECO:0000313|Ensembl:ENSPPYP00000014882} KW=Complete proteome; Reference proteome OX=9601 OS=Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). GN=COL6A3 OC=Catarrhini; Hominidae; Pongo.
Sequence
MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVRE
FLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGK
GLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDA
DEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDS
ADIIFLIDGSNNTGSVNFAVILDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYS
TKAQVLGAVKALGFAGGELANIGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSD
EIRYGVVALKQASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQEKLLPYIV
GVAQRHIVLKPPTIVTQVIEVNKRDIVFLVDGSSALGLANFNAIRDFIAKVIQRLEIGQD
LIQVAVAQYADTVRPEFYFNTHPTKREVITAVRKMKPLDGSALYTGSALDFVRNNLFTSS
AGYRAAEGIPKLLVLITGGKSLDEISQSAQELKRSSIMAFAIGNKGADQAELKEIAFDSS
LVFIPAEFRAAPLQGMLPGFLAPLRTLSGTPEVHANKRDIIFLLDGSANVGKTNFPYVRD
FVMNLVNSLDVGNDNIRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTG
SALSYVHANHFTEAGGSRIHEHVPQLLLLLTAGQSEDSYLQAANALTRAGILTFCVGASQ
ANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILF
LFDGSANLVGQFPVVRDFLYKIIDELDVKPDGTRIAVAQYSDDVKVESRFDEHQSKPEIL
NLVKRMKIKTGKALNLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAGRSSDRVDGPA
SNLKQSGVVPFIFQAKNADPAELEQIVLSPAFILAAESLPKIGDLQPQIVNLLKSVHNGA
PAPVSGEKDVVFLLDGSEGVRSGFPLLKEFVQRVVESLDVGQDRVRVAVVQYSDRTRPEF
YLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLT
ADRSGDDVRNPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV
ISERVTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQSAGPEFQYIRTLIERLVD
YLDVGFDTTRVAVIQFSDDPKVEFLLNAHSSKDEVQNAVQRLRPKGGRQINVGSALEYVS
RNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDDPAVELKQFGVAPFTIARNADQEELVK
ISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAVESDAADIVFL
IDSSEGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAPVL
DAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSRFA
QVIRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATP
APPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGL
VQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTRVGLEHLRVNHFVPEAGSRLD
QRVPQIAFVITGGKSVEDAQDVSLALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVG
NVQELSELSEQVLETLHDAMHETLCPGVTDAAKACNLDVILGFDGSRDQNVFVAQKGFES
KVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPSGPVEAFDFDEYQPEMLEKFRNMRSQ
HPYVLTEDTLKVYQNKFRQSSPDSVKVVIHFTDGADGDLADLHRASENLRQEGVHALILV
GLERVANLERLMHLEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQR
GDRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFP
GEKGEVGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPRGEKGERGDVGIRGDPGNP
GQDSQERGPKGETGDLGPMGVPGRDGVPGGPGETGKNGGFGRRGPPGAKGNKGGSGQPGF
EGEQGTRGAQGPAGPAGPPGLIGEQGISGPRGSGGAAGAPGERGRTGPLGRKGEPGEPGP
KGGIGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGNPGEPGLNGTTGPKGIR
GRRGNSGPPGIVGQKGDPGYPGPAGPKGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFP
TELAFALDTSEGVNQDTFGRMRDVVLSIVNDLTIAESNCPRGARVAVVTYNNEVTTEIRF
ADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARNTFKRVRNGFLMRKVAVFFSNTPT
RASPQLREAVLKLSDAGITPLFLTSQEDRQLINALQINNTAVGHALVLPAGRDLTDFLEN
VLTCHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFVLDSAETTTLFQFNEMKK
YIAYLVRQLDMSPDPKASQHFARVAVVQHAPSESMGNASMPPVKVEFSLTDYGSKEKLVD
FLSRGMTQLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVQEQQLEEAQRVIL
QAKCKGYFFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFV
SSENAFYLSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPAMTTKPVTTM
KPVTTTTKPVTTTTKPVAIVNQPSAKPAAAKPAPVKPAPAKPMAAKPVATKTATVRPPVV
VKPATAAKPVAAKPAAVRPPAAAAAKPVVTKPEAPRPQAAKPAATKPATTKPVVRVSREV
QVFEITENSAKLHWERPEPPSPYFYDLTVTSAHDQSLVLKQNLTVTDRIIGGLLAGQTYH
VAVVCYLRSQVRATYHGSFSTKKSQPPPPQPARSASSSTINLMVSTEPLALTETDICKLP
KDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPVLAKPGVISVM
GT
Identical sequences H2P914
XP_009236555.1.23681 ENSPPYP00000014882 ENSPPYP00000014882 9600.ENSPPYP00000014882
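
To work with the sequence shown above, the wrapped lines need to be joined into one string before a domain region can be extracted. The following is a rough sketch, assuming the region coordinates in the domain listing are 1-based and inclusive (the page does not state this explicitly). The function names `join_sequence` and `slice_region` are hypothetical helpers, and only the first sequence line is used for the demonstration.

```python
def join_sequence(lines):
    """Concatenate wrapped sequence lines, dropping any whitespace."""
    return "".join("".join(line.split()) for line in lines)


def slice_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# First line of the sequence above, split in two to mimic line wrapping.
first_line = "MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVRE"
demo = join_sequence([first_line[:30] + "\n", first_line[30:] + "\n"])

print(len(demo))
print(slice_region(demo, 8, 12))
```

Applied to all sequence lines of this record, the joined string's length can be compared with the stated sequence length of 3182 as a sanity check before slicing out regions such as 8-218.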
