SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSNLEP00000004210 from Nomascus leucogenys 76_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSNLEP00000004210
Domain Number 1 Region: 1463-1657
Classification Level Classification E-value
Superfamily vWA-like 3.18e-46
Family Integrin A (or I) domain 0.001
Further Details:      
 
Domain Number 2 Region: 1269-1460
Classification Level Classification E-value
Superfamily vWA-like 3.81e-43
Family Integrin A (or I) domain 0.0000000114
Further Details:      
 
Domain Number 3 Region: 1649-1871
Classification Level Classification E-value
Superfamily vWA-like 1.4e-39
Family Integrin A (or I) domain 0.0000000409
Further Details:      
 
Domain Number 4 Region: 646-707
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.00000000000141
Family ATI-like 0.097
Further Details:      
 
Domain Number 5 Region: 292-346
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000000409
Family BSTI 0.047
Further Details:      
 
Domain Number 6 Region: 771-827
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000213
Family ATI-like 0.014
Further Details:      
 
Domain Number 7 Region: 2196-2252
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.000000458
Family ATI-like 0.07
Further Details:      
 
Domain Number 8 Region: 1141-1194
Classification Level Classification E-value
Superfamily Serine protease inhibitors 0.0000196
Family ATI-like 0.03
Further Details:      
 
Domain Number 9 Region: 2568-2648
Classification Level Classification E-value
Superfamily FnI-like domain 0.0000764
Family VWC domain 0.062
Further Details:      
 
Weak hits

Sequence:  ENSNLEP00000004210
Domain Number - Region: 825-892
Classification Level Classification E-value
Superfamily FnI-like domain 0.000126
Family VWC domain 0.083
Further Details:      
 
Domain Number - Region: 2421-2497
Classification Level Classification E-value
Superfamily FnI-like domain 0.000586
Family VWC domain 0.068
Further Details:      
 
Domain Number - Region: 697-738
Classification Level Classification E-value
Superfamily FnI-like domain 0.00513
Family VWC domain 0.034
Further Details:      
 
Domain Number - Region: 438-510
Classification Level Classification E-value
Superfamily Methyl-coenzyme M reductase subunits 0.0263
Family Methyl-coenzyme M reductase alpha and beta chain N-terminal domain 0.0065
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSNLEP00000004210   Gene: ENSNLEG00000003335   Transcript: ENSNLET00000004430
Sequence length 2811
Comment pep:known_by_projection supercontig:Nleu1.0:GL397358.1:5916649:6098714:-1 gene:ENSNLEG00000003335 transcript:ENSNLET00000004430 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MIPARFAGVLLALALILPGTLCAEGTRGRSSMARCSLFGSDFINTFDGSMYSFAGYCSYL
LAGDCQKRSFSIIGDFQNGKRVSLSVYLGEFFDIHLFVNGTVTQGDQRVSMPYASKGLYL
ETEAGYSKLSGETYGFVARIDGSGNFQVLLSDRYFNKTCGLCGNFNIFAEDDFTTQEGTL
TSDPYDFANSWALSSGEQWCERASPPSSSCNISSGEMQKDLWEQCQLLSTSVFARCHPLV
PEPFVALCEKTLCECAGGLECTCPAFLEYARTCAQEGMVLYGWTDHSACSPVCPAGMEYK
QCVSPCARTCQSLHINEVCQERCVDGCSCPEGQLLDEGLCVESTECPCMHSGKRYPPGAS
LSRDCNTCICRNSQWICSNEECPGECLVTGQSHFKSFDNRYFTFSGICQYLLARDCQDHS
FSIVIETVQCADDPDAVCTRSVTIRLPGLHNSLVKLKHGGGVAMDGQDVQLPLLKGDLRI
QHTVTASVHLSHGEDLQLDWDGRGRLLVKLSPVYAGKTCGLCGNYNGNQGDDFITPSGLA
EPRVEDFGNAWKLHGNCQDLQKQHSDPCALNPRMTRFSEEACAVLTSPTFEACHRAVSPL
PYLRNCRYDVCSCSDGRECLCGALASYAAACAGRGVRVAWRQPDRCELNCPKGQVYLQCG
TPCNLTCRSLSYPDEECNEACLEGCFCPPGLYMDERGDCVPKAQCPCYYDGEIFQPEDIF
SDHHTMCYCEDGFMHCTMSGVPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVCPADNPR
AEGLECAKTCQNYDLECMSMGCVSGCLCPPGMVRHENRCVALERCPCFHQGKEYAPGETV
KIGCNTCVCRDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYMFPGECQYVLVQDYCGSNP
GTFRVLVGNEGCSHPSVKCKKRVTILVEGGEIELFDGEANVKRPMKDETHFEVVESGRYI
ILLLGKALSVVWDRHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVEEDPVDFG
NSWKVSSQCADTRKVPLDSSPATCHNNIMKQTMVDSSCRILTSDVFQDCNKLVDPEPYLD
VCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTATLCPQSCEERNLRENGYEC
EWRYNSCAPACRVTCQHPEPLACPVQCVEGCHAHCPPGKILDELLQTCVSPEDCPVCEVA
GRRFAPGKKVTLNPSDPEHCQICHCDGVNLTCEACQEPGGLVVPPTDAPVSPTTPYVEDT
WEPPLHDFYCSRLLDLVFLLDGSSRLSEAEFEVLKAFVVDMMERLRISQKWVRVAVVEYH
DGSHAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKYTLFQIFGKIDRPEASRIAL
LLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHANLKQIRLVEKQAPENKAFVLSG
VDELEQQRDEIVSYLCDLAPEAPPPTLPPNMAQVTVGPGLLGVSTLGPKRNSMVLDVAFV
LEGSDKIGEADFNRSKEFMEEVIQRMDVGQDSIHVTVLQYSYMVTVEYPFSEAQSKGDIL
QRVREIRYQGGNRTNTGLALQYLSDHSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGD
IQVVPIGVGPNANVQELERIGWPNAPILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSP
ASDCSQPLDVIFLLDGSSSFPASYFDEMKSFAKAFISKANIGPHLTQVSVLQYGSITTID
VPWNVAPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTSEMHGARPGASKAVVILVTD
VSVDSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGPAGDSNMVKLQRIEDLPTMVT
LGNSFLHKLCSGFVRICMDEDGNEKRPGDVWTLPDQCHTVTCQPDGQTLLKSHRVNCDRG
PRPSCPNSQSPVKVEETCGCRWTCPCVCTGSSTRHIVTFDGQNFKLTGSCSYVLFQNKEQ
DLEVILHNGACSPGARQGCMKSIEVKHSALSVELHSDMEVTVNGRLVSVPYVGGNMEVNV
YGTIMHEVRFNHLGHIFTFTPQNNEFQLQLSPKTFASKTYGLCGICDENGANDFMLRDGT
VTTDWKTLVQEWTVQQPGHTCHPIPEEQCLVPDSSHCQILLLPLFAECHKVLAPATFYAI
CQQDSCHQEQVCGVIASYAHLCRTNGVCVDWRTPDFCAMSCPPSLVYNHCEHGCPRHCDG
NVSSCGDHPSEGCFCPPNKVMLEGSCVPEEACTQCIGEDGVQHQFLEAWVPDHQPCQICT
CLSGRKVNCTTQPCPTAKAPTCGLCEVARLRQNADQCCPEYECVCDPVSCDLPPVPHCEG
GLQPTLTNPGECRPNFTCACRKEECERVSPPSCPPHRLPTLRKTQCCDEYECACNCVNST
VSCPLGYLASTATNDCGCTTTTCLPDKVCVHRSTIYPVGQFWEEGCDVCTCTDMEDAVMG
LRVAQCSQKPCEDSCRSGFTYVLHEGECCGRCLPSACEVVTGSPRGDSQSFWKSVGSQWA
SPENPCLINECVRVKEEVFVQQRNVSCPQLEVPACPSGFQLSCKTSVCCPSCRCERVEAC
MLNGTIIGPGKSVMIDVCTTCRCMVQVGVISGFKLECRKTTCNPCPLGYKEENNTGECCG
RCLPTACTIQLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHKCL
AEGGKIMKIPGTCCDTCEEPECNDITARLQYVKVGSCKSEVEVDIHYCQGKCASKAMYSI
DINDVQDQCSCCSPTRTEPMQVPLHCTNGSVVYHEVLNAMECKCSPRKCSK
Download sequence
Identical sequences ENSNLEP00000004210 ENSNLEP00000004210

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]