SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for jgi|Monbr1|11672|fgenesh1_pg.scaffold_31000029 from Monosiga brevicollis

Domain assignment details

Strong hits

Sequence:  jgi|Monbr1|11672|fgenesh1_pg.scaffold_31000029
Domain | Region | Superfamily | Superfamily E-value | Family | Family E-value
1 | 1877-1968 | Cadherin-like | 0.00000000000614 | Cadherin | 0.0069
2 | 553-608,637-715 | Growth factor receptor domain | 0.000000000113 | Growth factor receptor domain | 0.013
3 | 2288-2371 | Cadherin-like | 0.000000000742 | Cadherin | 0.002
4 | 2374-2447,2497-2549 | Growth factor receptor domain | 0.00000000471 | Growth factor receptor domain | 0.01
5 | 2172-2279 | Cadherin-like | 0.00000000742 | Cadherin | 0.002
6 | 1132-1223 | Cadherin-like | 0.00000002 | Cadherin | 0.01
7 | 828-911 | Cadherin-like | 0.0000000286 | Cadherin | 0.0046
8 | 778-815 | EGF/Laminin | 0.0000000864 | EGF-type module | 0.0095
9 | 1648-1742 | Cadherin-like | 0.000000143 | Cadherin | 0.0042
10 | 2689-2799 | L domain-like | 0.000000327 | L domain | 0.0094
11 | 1238-1329 | Cadherin-like | 0.00000286 | Cadherin | 0.0067
12 | 2856-2896 | EGF/Laminin | 0.00000335 | EGF-type module | 0.016
13 | 2063-2176 | Cadherin-like | 0.00000371 | Cadherin | 0.0059
14 | 1456-1551 | Cadherin-like | 0.00000514 | Cadherin | 0.0084
15 | 4082-4149 | Type I dockerin domain | 0.00000549 | Type I dockerin domain | 0.0064
16 | 1331-1429 | Cadherin-like | 0.0000271 | Cadherin | 0.0063
17 | 2817-2858 | TSP type-3 repeat | 0.0000275 | TSP type-3 repeat | 0.0019
18 | 1755-1865 | Cadherin-like | 0.0000971 | Cadherin | 0.0092
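
Some regions in the strong hits are discontinuous, written as comma-separated segments (for example 553-608,637-715). The following is a minimal sketch of how such region strings could be parsed and the domains ordered along the protein. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_region and region_length are hypothetical and are not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Parse a region string such as '553-608,637-715' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# A few regions copied from the strong hits on this page (domain number -> region).
strong_hits = {
    1: "1877-1968",
    2: "553-608,637-715",
    4: "2374-2447,2497-2549",
    8: "778-815",
}

# Order the domains by the start of their first segment.
ordered = sorted(strong_hits, key=lambda n: parse_region(strong_hits[n])[0][0])

for number in ordered:
    print(number, strong_hits[number], region_length(strong_hits[number]))
```

Sorting by the first segment's start gives the order in which the domains occur in the sequence, which differs from the E-value order used in the listing.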
 
Weak hits

Sequence:  jgi|Monbr1|11672|fgenesh1_pg.scaffold_31000029
Domain | Region | Superfamily | Superfamily E-value | Family | Family E-value
- | 2451-2510 | Complement control module/SCR domain | 0.000236 | Complement control module/SCR domain | 0.0032
- | 126-159,222-282 | Growth factor receptor domain | 0.000251 | Growth factor receptor domain | 0.012
- | 927-1010 | Cadherin-like | 0.000628 | Cadherin | 0.0084
- | 611-647 | Complement control module/SCR domain | 0.00206 | Complement control module/SCR domain | 0.007
- | 1557-1650 | Cadherin-like | 0.00243 | Cadherin | 0.0075
- | 1978-2064 | Cadherin-like | 0.00557 | Cadherin | 0.01
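
On this page every strong hit has a superfamily E-value below 0.0001 and every weak hit has one above it. The sketch below classifies hits with that cutoff. The value 1e-4 is an assumption inferred from the numbers listed here, not a threshold stated on this page, and hit_class is a hypothetical helper name.

```python
# ASSUMED cutoff, inferred from the E-values on this page rather than documented by it.
STRONG_CUTOFF = 1e-4


def hit_class(superfamily_evalue: float, cutoff: float = STRONG_CUTOFF) -> str:
    """Label a hit 'strong' if its superfamily E-value is below the cutoff, else 'weak'."""
    return "strong" if superfamily_evalue < cutoff else "weak"


# (superfamily, region, superfamily E-value), copied from the listings on this page.
hits = [
    ("Cadherin-like", "1755-1865", 0.0000971),
    ("Complement control module/SCR domain", "2451-2510", 0.000236),
    ("Cadherin-like", "1978-2064", 0.00557),
]

labels = [hit_class(evalue) for _, _, evalue in hits]

for (name, region, evalue), label in zip(hits, labels):
    print(f"{region}\t{name}\t{evalue:.3g}\t{label}")
```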
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) jgi|Monbr1|11672|fgenesh1_pg.scaffold_31000029
Sequence length 5741
Sequence
MGLIGAPRVLSTWLVALLLALASAQQLTVSPTATEYYNREAHVNLDLKPLAGFANLSAIT
AIRYGVSDVALADVEVAPNGTSARVRLLIGEIGVTGDDGALFTLTLPNGTTLTSDRFYLY
GDIDGCSDEPCAHGSCLDTPQGFECNCSASGYKGELCNEPVSCGMLPTQTHASYSPAVAE
LSFGQATVVTCAEGYGVFNTTAGAFANVGGSEDLIQRGFRVDCTASARYSNEELACRQID
ECASPLLNDCAELCIDTNGSYTCSCADLQRFFLANDNTSCSDRLAPLPTTLDVRAINDSL
VTQLVVVQDISFAIRPHRLDIRLRLAADSNDTALLRGPLAYEWVDTTLIAVDVSLDLLPL
RSVLFEYHITNLDASPDGASAGGVAVGGPVLLLRHNEDSDDDDIDSPPGTNPEHNVTTCG
DLPCQNSGMCEPMVRYEFDTIPTIANLTSALNLNASAIARSFADLITAEFSPAFTLQAAS
AGPLYGIYAGRQQDLDFQLFLVWLESLDVVLQSTTSSENRSITVDMSTVTLNVRRNGTTG
AVVDDATNAVEPRQFSCRCATGFSGDLCEINVDDCFASACGPGVCLDGVASYACNCSGTG
FEGSRCQTQQTCPTLPMLPHATPNVTEAHVFDVIAYQCDPGYSLDGTGVGDNQILARCTL
AQQVEPEAASCQNVDECAEGACAQLCTDTNGSFLCACTPGYELATNGEDCELVSCGSAPA
LNFSSSPAQGQELFFHDVAVYSCVAGAALDGLVNGSTALNVTCQADGSLAHSGQCLPWDG
CASQPCANEGVCTSTGANFTCACVDGYVGPTCTQDGNLAPTDILLIPGAIPENVVDVVVD
LVAIDPNESDNHSFVLESDSSGVLRVVGSTLRLIRALDHEQADYHTLRLRATDDGPRPKS
TTATVVLRVEDENDPPTALALSTNEVYEAQTDSRYAVASITVSDQDVNDRHNISVDIDWA
SVEAGILYFHQPLDFETASSLVLNFTATDLDGATYSESFIFFVLDVNEPPSTLTLNGTAL
LALSLVVNEEPDPTADVTSLGELGAVDPDAVDQPTAARPVPFEFSLSPASNQACLLDGRT
LQLNASLVDAEAQSSISCFLTATDSANNRLTQRLDIDVRSVNEAPLAMRLVPATALESHP
VGSVLATILVDDVDVGDEARVMLVPINGSQADNAFVEVVPGTRQLRLVAPLDFEHQPQLV
FHVRAVDGGNASLDAVLILSVADVNEAPYNLTLTSDGLPENAVNGTMVGFFTALDDDVGQ
DLTFALTAAQPPAAFRLEGNVLVYSGQGQTPDFEGGANYSVTITVHDLGGLAATMPLSTS
QTFAIQLLDVNEPVTGVRVAPATVPEDALVGAAFAQLVLTDPDFDEQLTILALTSDGPIA
LADPSGALACSRMVAGTRCTATLVITDALDYETQPMLTVVAVVRDAAGVEQELQTDIAVV
DRNDAPIIWGLNQETEALTIPENSFVNTVVATFALIDADAEGNFTCAVVGALNDSSTVPL
RVQGPVVLNATLRLVSTASLDFEAQAAYNFSLACSDGQATTAITLTLQVTNVPEAPFNVR
AAVAAVPESLPVGAIAACLLADDPEGDVLQFEAFSPLFEVQDDCLLLRQPLDHEANATVR
AEVVAIDSTGLVGRGVVVLDVLDTNDDPAIVASSLQLSVAEDAAANTVVGTLVANDPDGD
ALTWSVVNVSAPSLVFGFQGRALVFLGGTLDFETAPEIAVGVQVADGNGGLATAIATIAI
LDAPDAPVIIGLMPSRISEAAVVGTTVTWLSARDEDAGDDIAFSLGGVDAVSFALNASSR
RCDSVDGVAEDDDEATTGSIGGVVCSIGLVLVQPLDFETQTFYTISVRVSDARGASVTEL
LLLRINDANDAPVSISLSSTTVPENSPPGTFIGLVNVQDPDQGQTHTCQVTSSLPAGLIM
QANSVLAVATTVPSFELGTVYSVEITCTDSGLPPQSLTETLTVTVTDVAEPPTGITISHN
SVLENATQGFLIGQLDIIDEDMVDFPIFQLSDGADARFAVRGRSLVRGTTALDYEASTSH
TVELRVVENSLTASFFNLSINVVNVNEPPSSIQLSEAALREDSQAGTVVGRLTATDPDAN
QTHVFSLDSTAGTDPEVTSMFVILGAELIYIGNDTALINFEERTSLTIAVRALDDAAYSV
GAPLALTRLFEISITDVNEPFSNPRFSNTLVPEDIRPGQLIANLLVVDPDQGQSVTCTLI
DSAGGRLVLEGDALLAGPTSFDHETSSTVSISLTCRDDGQPPFTSTSNMTFTVLDVPEAP
RRVVLLGGSVSEAAAPGTVVGVLVAQDDEGNAVSFALGASNEAGPFVVQDVSLVLNGTLD
YEARQHYNLSIVATDSTGLSTEASVWVSVTNEPDCEAGSCLNGATCLDVGANFTCICAPG
FRGRRCELNIDDCSGGPCANGGLCIDGVDNFTCDCTNTGFEGAFCVEPQACAPPPAKPHA
SLLTGDTPRFGDDIAYSCTPGFSADGTPSGANIVHERCTAQGTVSNTSTCLQIDYCVGEP
CANNASCTSEDLRYLCACPSGFTGTNCELPLRCARESPVIITASDQLAALSACLTFDNEL
IIANMTTNFDPAFLTPVRAFRGGLQLVNLRLASSASAAASGEGPQQVEWSASEMKGLQLS
GISAAHNVELVLPNITVLDGSLTLLQVARNVQLAAPQLNRVEGDVEVRATNWTQLTGLDA
LVEVYGSVVLAENLVLRGVGTLDALLVVRGSLEVHNNAALATLMQPANLVSVGEDVLVTG
NNVLRDVNALAALQFYGRFLIIRNNPLLCFIGEHPLSSIRALESGTDAPSLLNNGQDCAS
NRTDADNDGVFDDVDNCVTIANPSQADANANGLGDACDCFPLDPCLNGATCSVDSTQFAC
ACAPTFTGRTCAISTLVPTPRFGASASALLSAAVTRQADDTFAVAAPALVGADLGGQVLA
SAYLAGRAGHSTVSVNSLEATILHLILFDADDASAGNAPYLSFNQRTLRALVTVQDGLGN
ANVSSATIVVQMDNGVTTHRDTCTTSQPSGHCLVTTSVPEAFFAINAPTSVQITAYLLDQ
PQVATASQMFTVEPYPVVGSTDAVALYVPVTSCAPNQTFTAAVLVRTSQSLGAFGLVLTL
DSALSVERADFDVANWQVTTRAVTASGRYVAVGRRMPRSGAAVSAAGETVRLLLLTMRVS
SEAATPTTPTMSLNVTQLLDTSGRSLLSASDLPRRADHFDFEGSNHVGARVLVEEPVVLR
LQARLPRAQLINTARLTGITRRVPLVVLALRSDGLVVELQSGLACATARMAVARTNAGCT
ALEVVGSETAGEAELAVSVALGSLRTTVMARVWFPVSTQLVTSTRDLHPIRNLLDLNCNQ
VYQRARVVVESTFQAGTSSFSVDVTSLAAARLRSSSPAVVAIEANDSPLVAPRVDAVGQS
PGFALVSWNNAAGIVLASEFFFVSESLATVQSIFPAVVTAVEVEASPAEAGPDQSVLIRA
VIRDELLSLGESANIAVSGVFADGARDLSFWSNISLSSLSPSVAWIDGSNQINPLASGVG
DLRAELRSACPANPQIVATGTGPIRVMLPTVVGLQASLAEPIIYTTSTVSTDVVPTTTQL
LISLTLSNGQMRDASRNPAVNARIVDVTPLTSENEPFDDAALRAGVQLARASPVEPFFLS
TTSQAVVPTRVVLEVGLASSPELVQNVSVIVDVLSGLLLTVRPLVSVRADEPTEGFHRMP
LTGHFSALQVHTFLEATAGTRVNITSAEGLSLTSTDNAVTLDGNRVELDSAFSGQSVGLT
ANFSGFAETLELSVEQDPVSVRVVYLSMDSDALVDSAPRDTATTLSVDVELADGALQNNV
VSQGEVLIPDYVTFTSSNPQVLRVNATTGTLALLGTAAFPVTITAAVGQATAELELNANL
SPLVGDIDLGNSATPAISILGPGEVAWISVFANVSGVMMRGFDLTVTSASSSTVSLEGAV
LGRDVTGQFMTAQQTASRVRFGAVVAAVTGGRLEIARVQLRAGSSASAAATTISVTVNDL
FDQDLAIIGAPTPRLAIAGSNVPVVVRSAGSAAAGLVAFLPSTIEEAKQRARRAHTCDES
PPGDVNGDCVFTLRDVAFVNDYAADVLAGIPPPNVTAEVYAEMDVDRDNTITVGDSALLA
DILYGVVDFVRVSTATPNETLQTCATAITAQVLTYDNMPSERPVRLFAVMRGTADLSLEW
GRRDGGCGTNCVVYEFANSSMPGEYVLEFDGSTTTDVSFYYLQIHLNDTNTSYAEQFRIY
APAVASGGSTAAYQLPVVDSEGVPLQFPAGSYGSVERLGSITEPFHLFFYQLQHIYFHFH
LYFHFHLYFHFHLYLHYHFYFHFHFHLHYHFYLHFHFHFHIFFNKLKLLYLYLFLHLYLN
FYFYFYVFIHHNIIHLIILNKLKLFNLHLHFHFYFYLDKLKYQHVNNFICRLSRYCRLWC
PIYRSCARRTHASSLLNGKPSTSTSTSTSTSSSTETVPPFVVEEMVGNFQLRFVGKLNNN
RRPLMFGTSFNNSVRVGRFLGRDKDFCAGVCEGLTACVAFVMYLENDRQACNLLADVGDG
PITTTEPSLSYSWVSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTSTST
SSSTTTTPFVAETVVSGFDIVFKGRLEENPNPRVFSTFMDDDSLIGELNGSPAFCASLCT
RDIGCAGFVLYNQFEQTMCRLLSTIGNESGAMTTMQSYSYARLLQPGTTSSTSTSTSSST
STSTSTSTSTNTITVPFVEPTPPERFELVLQGVVNGNPVPEVFNTTGALLGFLTAASLKQ
CASICLTLNECYGFVVRPTLTSNIQCLLVTRAAVLSAVPDEAPSYSFGRILLPGTTSSTS
TSSSTSSSSSSFTSTSTSTSTSTSTSTSTEQVVVREEVDGYEIVFVGRLSGNPMPERPVQ
VMSVSTWLANYPAPSASGCAEVCDLLADCAGFYFAYEFSTDQSRCRLLSVLDPNQSEYEP
LQSFTYQKHLHHYHYLINHVHHHHLINYLHHHYLINYLHHHYLINYLHHLINYLHHHHLI
NYLHHHYLINYLHHHYLINYLHHHYLVNYLYLYLVNYEDLCAVLCNTIDCGAFTFSPVEN
RCLFVSSDADLLTLVSTDEAIVTYQRAFLPRTTSSTSSSTSTSASSSSTSTSTSTSTSTS
TSTSTSTSTSTPAFLDLVIPPELTERFTIQFEGRVNGEPNPRLFTTVFDEASLVGLVRRS
SVGTCFRVCDDVYACDGFAILESNGGFECRLLSDLGAPEGVPTTIPSYSIAKVRLPATTS
STSSSTSSSTSTSTSTSTSTSTSTSTSSMTSSSSSTSTSTSTSTTTFLPTMDLARFELVA
RGQVEGETMAQRAIVTEENQLGAFMGVSVAQCRSFCLSILECSILQLEAIGDGMFRCVFL
SGEADLEPTMEQSYTYARIFLPGTTTSTSTSTSSSTSSSSSTSTSTSSSTSTSTSTSTMP
FAPRPQIRGYGLRFVGRILNEMSPRRFSTSNDAAAVLADFEATNNEVSECTDFAKAAIPC
VDQLNLELFIIFNIFFFLYELKLKFFDLFLLLDLHFHFLLDQLKHELIHFYFHVWFYLYL
DLDLDLDLDLHFLLCLNLFLYLYLNFYFYVFIHHNVIHLIIFNKLKLFNLHLHFHFVFFL
FLFFHQLQLQPKFIHFHFHFHFHFHFHFHFHFHFHFHLLQLQLQLINFHFHFHLVQLQLF
HFYLHFNLYLYLYLYLYLYLYLYLYLLLFLFFHQLQLQLQL
Identical sequences: A9V9Y4, 81824.JGI11672, jgi|Monbr1|11672|fgenesh1_pg.scaffold_31000029, XP_001749516.1.20067
