SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for XP_005041121.1.66865 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  XP_005041121.1.66865
Domain Number 1 Region: 755-984
Classification Level Classification E-value
Superfamily vWA-like 3.76e-49
Family Integrin A (or I) domain 0.00033
Further Details:      
 
Domain Number 2 Region: 44-231
Classification Level Classification E-value
Superfamily vWA-like 1.07e-31
Family Integrin A (or I) domain 0.003
Further Details:      
 
Domain Number 3 Region: 1101-1158
Classification Level Classification E-value
Superfamily BPTI-like 1.42e-17
Family Small Kunitz-type inhibitors & BPTI-like toxins 0.0022
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) XP_005041121.1.66865
Sequence length 1159
Comment PREDICTED: collagen alpha-1(XXVIII) chain [Ficedula albicollis]; AA=GCF_000247815.1; RF=representative genome; TAX=59894; STAX=59894; NAME=Ficedula albicollis; AL=Chromosome; RT=Major
Sequence
MRKKCFFFCFMLLTMTTNQIVHGQNRKRGKNIYSPMPKDEGDMCLIDIVFILDSSESAKN
QLFDLQKNFVQNITDSIFQMKPVKSHMYSVKLASMQFSSTVSIDHPFTAWKNVQNFKEKI
NSLGFIGHGTYSYYAVSNATQLFKTESQKRSVKVAFLMTDGVDHPNSPNVQGIATTARNL
GIHFITIGLSKKTVQEEKLRMISGDSSSKHVLCLDDQNLVVGVASELEALLQKQCVRKVC
ECEKGVKGDKGDSGRDGNRGEKGDPGPKGSKGDAQKGDHGEKGEEGAPGYKGDKGERGEC
GPPGTKGEIGLQGSVGPTGPRGPQGIRGDPGQQGPKGIQGNKGEQGPPGPDGPPGPSGIG
EPGPKGAEGLEGKIGLRGPPGIGEPGLPGPPGPAGSPGERGPAGEGIQGQKGEKGSEGVA
GPPGPKGEQVKGDKGEQGQAGPQGPPGPPGIGTQGSRGIQGPQGNPGAKGSQGYGRPGPK
GEPGEPGQKGEAGPPGISSSGLKGDTGLPGSPGEKGVKGEKGLSGKKGEKGDQGPRGPEG
PPGKGVVGQKGDPGEKGSKGVIGLTGLKGPVGPKGDPGERGPPGVPGASVWGPPGPKGDP
GDKGPPGDDGLPGNSIMGPTGAQGPPGPPGPHGAKGDKGGKGEAGPPGPAGPKGPAGPKG
VGDAGPKGDRGIRGPPGLPGPTGWGSMGPKGIMGRNGLPGPPGPPGISIQGDKGEKGNKG
FLGPKGPRGTGLPGQKGDYGDKGDPGSKGAKGEIGDPGPPGPKGVGGRKGEPGLSREDVI
RLINEICGCGIKCRVTPLELVFVIDSSESVGPDNFNIIKTFMKTFIDKVSADHATTRIGV
INFSHRVDLVSSLKQYTGKEYLKSAVDRMSYLGEGTYTASAIQEAIHLFQAARPAVRKVA
VVITDGQTDSRDKERLDAVVRKAHAADIEMFVIGIVQRTDPHYDDFLKEMQLIAADPDEE
HVYQIDDFITLSALENKLITKICENESAVYTREYNILSPSPSQEPEITRKDANSQLPKMT
TSEIDIRPTEGISYTDSPLPSRYTEPLPSAQDHTVTASAHKESILPPVYNVSEIRPLSPA
DNHFLGGTAAENHQPAVWQTSQKNPRCLEPMKPGGCWDYVVKWYYDKNGNSCGQFWYGGC
NGSNNRFETEKECQETCVD
Download sequence
Identical sequences U3K6I6
ENSFALP00000010640 XP_005041121.1.66865

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]